Pinned Repositories
ai-documents
docs
This repo contains documentation for the OPEA project
Gaudi-tutorials
Tutorials for running models on first-gen Gaudi and Gaudi2 for training and inference; these are the source files for the tutorials on https://developer.habana.ai/
GenAIComps
GenAI components at the microservice level, plus a GenAI service composer for creating mega-services
GenAIEval
Evaluation, benchmarking, and scorecards, targeting performance (throughput and latency), accuracy on popular evaluation harnesses, safety, and hallucination
GenAIExamples
Generative AI Examples is a collection of GenAI example applications, such as ChatQnA and Copilot, that illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.
intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; apply SOTA compression techniques for LLMs; run LLMs efficiently on Intel platforms ⚡
model_server
A scalable inference server for models optimized with OpenVINO™
neural-speed
An innovative library for efficient LLM inference via low-bit quantization and sparsity
oneAPI-samples
Samples for Intel oneAPI toolkits
xiguiw's Repositories
xiguiw/ai-documents
xiguiw/docs
xiguiw/Gaudi-tutorials
xiguiw/GenAIComps
xiguiw/GenAIEval
xiguiw/GenAIExamples
xiguiw/intel-extension-for-transformers
xiguiw/model_server
xiguiw/neural-speed
xiguiw/oneAPI-samples
xiguiw/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs