Pinned Repositories
ipex-llm
Accelerate local LLM inference and fine-tuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, MiniCPM, etc.) on Intel XPUs (e.g., a local PC with an iGPU and NPU, or discrete GPUs such as Arc, Flex, and Max); integrates seamlessly with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, GraphRAG, DeepSpeed, vLLM, FastChat, Axolotl, etc.
ai-reference-models
Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Intel® Data Center GPUs
oneAPI-samples
Samples for Intel® oneAPI Toolkits
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
ai-documents
GenAIExamples
Generative AI Examples is a collection of GenAI examples, such as ChatQnA and Copilot, that illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.
wangkl2's Repositories
wangkl2/ai-documents
wangkl2/GenAIExamples
Generative AI Examples is a collection of GenAI examples, such as ChatQnA and Copilot, that illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.
wangkl2/inference_results_v2.1
wangkl2/Model-References
Reference models for the Intel® Gaudi® AI Accelerator
wangkl2/models
Model Zoo for Intel® Architecture: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors
wangkl2/oneAPI-samples
Samples for Intel® oneAPI Toolkits