Yanli2190

Pinned Repositories

multimodal_cognitive_ai_llava_mpt
Research work on multimodal cognitive AI
Language: Python · 0 · 0
neural-compressor
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool) aims to provide unified APIs for network compression techniques, such as low-precision quantization, sparsity, pruning, and knowledge distillation, across different deep learning frameworks in pursuit of optimal inference performance.
Language: Python · 0 · 0
ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, MiniCPM, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, GraphRAG, DeepSpeed, vLLM, FastChat, Axolotl, etc.
Language: Python · 6.5k · 1.2k
multimodal_cognitive_ai
Research work on multimodal cognitive AI
Language: Python · 549
intel-extension-for-openxla
Language: C++ · 3811

Yanli2190's Repositories

Yanli2190/multimodal_cognitive_ai_llava_mpt
Research work on multimodal cognitive AI
Yanli2190/neural-compressor
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool) aims to provide unified APIs for network compression techniques, such as low-precision quantization, sparsity, pruning, and knowledge distillation, across different deep learning frameworks in pursuit of optimal inference performance.