letonghan

Intel CorporationShanghai, Zizhu

Pinned Repositories

intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
Language:Python2.1k 28 166211
neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Language:Python2.2k 33 206257
langchain
🦜🔗 Build context-aware reasoning applications
Language:Jupyter Notebook95.2k 690 7.9k15.4k
AgentGPT
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Language:TypeScript00
GenAIComps
GenAI components at micro-service level; GenAI service composer to create mega-service
Language:Python00
GenAIEval
Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety, and hallucination
Language:Python0 0 00
GenAIExamples
Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.
Language:Shell0 0 02
GenAIInfra
Containerization and cloud native suite for OPEA
Language:Go00
INC
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
Language:Python0 0 00
INC-Performance
Language:Dockerfile0 1 00

letonghan's Repositories

letonghan/GenAIComps
GenAI components at micro-service level; GenAI service composer to create mega-service
Language:Python00
letonghan/GenAIEval
Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety, and hallucination
Language:Python0 0 00
letonghan/GenAIExamples
Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.
Language:Shell0 0 02
letonghan/GenAIInfra
Containerization and cloud native suite for OPEA
Language:Go00
letonghan/INC
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
Language:Python0 0 00
letonghan/INC-Performance
Language:Dockerfile0 1 00

letonghan

Pinned Repositories

intel-extension-for-transformers

neural-compressor

langchain

AgentGPT

GenAIComps

GenAIEval

GenAIExamples

GenAIInfra

INC

INC-Performance

letonghan's Repositories

letonghan/GenAIComps

letonghan/GenAIEval

letonghan/GenAIExamples

letonghan/GenAIInfra

letonghan/INC

letonghan/INC-Performance