mc112611

Pinned Repositories

gdGPT
Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.
Language:Python90 1 88
haystack
:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Language:Python16.7k 133 3.5k1.8k
neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Language:Python2.1k 34 199251
haystack-chinese
:mag: 基于deepsetAI的开源项目haystack进行修改，使其支持中文场景下的任务
Language:Python7 1 11
mc112611
Config files for my GitHub profile.
0 1 00
mojo
The Mojo Programming Language
0 0 00
open-interpreter
OpenAI's Code Interpreter in your terminal, running locally
Language:Python0 0 00
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python0 0 00
SRP
Sturctured pruning algorithm for pruning Transformer
Language:Python291

mc112611's Repositories

mc112611/haystack-chinese
:mag: 基于deepsetAI的开源项目haystack进行修改，使其支持中文场景下的任务
Language:Python7 1 11
mc112611/mc112611
Config files for my GitHub profile.
0 1 00
mc112611/mojo
The Mojo Programming Language
0 0 00
mc112611/open-interpreter
OpenAI's Code Interpreter in your terminal, running locally
Language:Python0 0 00
mc112611/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python0 0 00