jiengup
Code more, Study more, Think more...
Huazhong University of Science and TechnologyWuhan,Hubei,China
jiengup's Stars
ggerganov/llama.cpp
LLM inference in C/C++
0voice/interview_internal_reference
2023年最新总结,阿里,腾讯,百度,美团,头条等技术面试题目,以及答案,专家出题人分析汇总。
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
jmorganca/ollama
Get up and running with Llama 2, Mistral, and other large language models locally.
Sanster/lama-cleaner
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
tracel-ai/burn
Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
mosaicml/composer
Supercharge Your Model Training
FedML-AI/FedML
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
tencentmusic/cube-studio
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式
skyzh/mini-lsm
A tutorial of building an LSM-Tree storage engine in a week!
youngyangyang04/Skiplist-CPP
A tiny KV storage based on skiplist written in C++ language| 使用C++开发,基于跳表实现的轻量级键值数据库🔥🔥 🚀
microsoft/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
tech-conferences/confs.tech
Frontend for https://confs.tech
risinglightdb/risinglight
An educational OLAP database system.
FelixKratz/dotfiles
My personal macOS configuration
zhllxt/asio2
Header only c++ network library, based on asio,support tcp,udp,http,websocket,rpc,ssl,icmp,serial_port,socks5.
harvard-edge/cs249r_book
Collaborative book Machine Learning Systems
MegEngine/InferLLM
a lightweight LLM model inference framework
FelixKratz/SketchyVim
Adds all vim moves and modes to macOS text fields
mindspore-courses/step_into_llm
MindSpore online courses: Step into LLM
Eddie-Wang1120/HPC-Learning-Notes
高性能计算相关知识学习笔记,包含学习笔记和相关知识的代码demo,在持续完善中。 如果有帮助的话请Star一下,对作者帮助很大,谢谢!
FederatedAI/FATE-Serving
A scalable, high-performance serving system for federated learning models
Sunt-ing/stick
:innocent: A PyTorch-like deep learning framework. Just for fun.
mayooot/gpu-docker-api
Easier than K8s to lift and lower the gpu number of docker container and scale capacity size of volume.