Zhenzhong1
ML Engineer, OPEA Contributor, ITREX & NeuralSpeed Developer, Major in HPC & AI, Working in Intel, Graduated from the University of Edinburgh
Pinned Repositories
intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
neural-speed
An innovative library for efficient LLM inference via low-bit quantization
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
GenAIComps
GenAI components at micro-service level; GenAI service composer to create mega-service
GenAIEval
Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety, and hallucination
GenAIExamples
Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.
intel-extension-for-transformers
Extending Hugging Face transformers APIs for Transformer-based models and improve the productivity of inference deployment. With extremely compressed models, the toolkit can greatly improve the inference efficiency on Intel platforms.
onnx
Open standard for machine learning interoperability
ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
sg-d76dd0dd
https://github.com/Zhenzhong1/sg-d76dd0dd
Zhenzhong1's Repositories
Zhenzhong1/ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
Zhenzhong1/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Zhenzhong1/GenAIComps
GenAI components at micro-service level; GenAI service composer to create mega-service
Zhenzhong1/GenAIEval
Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety, and hallucination
Zhenzhong1/GenAIExamples
Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.
Zhenzhong1/intel-extension-for-transformers
Extending Hugging Face transformers APIs for Transformer-based models and improve the productivity of inference deployment. With extremely compressed models, the toolkit can greatly improve the inference efficiency on Intel platforms.
Zhenzhong1/onnx
Open standard for machine learning interoperability
Zhenzhong1/sg-a9c880e4
https://github.com/Zhenzhong1/sg-a9c880e4
Zhenzhong1/sg-bba4e902
Zhenzhong1/sg-d76dd0dd
https://github.com/Zhenzhong1/sg-d76dd0dd