nghuyong's Stars
krahets/hello-algo
《Hello 算法》("Hello Algo"): a data structures and algorithms tutorial with animated illustrations and one-click-runnable code. Provides code in Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; an English version is in progress.
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
bentoml/OpenLLM
Run any open-source LLM, such as Llama or Mistral, as an OpenAI-compatible API endpoint in the cloud.
mistralai/mistral-inference
Official inference library for Mistral models
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
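The BPE algorithm that minbpe implements can be summarized as: treat the text as a byte sequence, repeatedly count adjacent token pairs, and merge the most frequent pair into a new token id. The snippet below is an illustrative sketch of that loop, not minbpe's actual API; the function name `train_bpe` is my own.

```python
from collections import Counter

def train_bpe(text: str, num_merges: int):
    """Learn BPE merges over the UTF-8 bytes of `text` (illustrative sketch)."""
    ids = list(text.encode("utf-8"))   # start from raw byte ids 0..255
    merges = {}                        # (a, b) -> new token id
    next_id = 256                      # new tokens get ids above the byte range
    for _ in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))  # count adjacent pairs
        if not pairs:
            break
        pair = max(pairs, key=pairs.get)    # most frequent adjacent pair
        merges[pair] = next_id
        # replace every occurrence of `pair` with the new token id
        out, i = [], 0
        while i < len(ids):
            if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
                out.append(next_id)
                i += 2
            else:
                out.append(ids[i])
                i += 1
        ids = out
        next_id += 1
    return merges, ids
```

On the classic example string "aaabdaaabac", three merges first fuse the frequent "aa" pair, then build longer tokens from it, shrinking the sequence from 11 byte ids to 5 token ids.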
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
gee1k/uPic
📤 uPic is a native, powerful, beautiful, and simple picture and file upload tool for macOS.
DLLXW/baby-llama2-chinese
A repository for pre-training a small-parameter Chinese LLaMA2 from scratch and then applying SFT; a single 24 GB GPU is enough to train a chat-llama2 with basic Chinese question-answering ability.
TigerResearch/TigerBot
TigerBot: A multi-language multi-task LLM
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
microsoft/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
deepseek-ai/DeepSeek-LLM
DeepSeek LLM: Let there be answers
bigscience-workshop/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
SkyworkAI/Skywork
Skywork series models are pre-trained on 3.2 TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc.
bigscience-workshop/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
wangyuxinwhy/uniem
unified embedding model
haonan-li/CMMLU
CMMLU: Measuring massive multitask language understanding in Chinese
IEIT-Yuan/Yuan-2.0
Yuan 2.0 Large Language Model
xverse-ai/XVERSE-13B
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
yangjianxin1/LLMPruner
twang2218/vocab-coverage
Analysis of the Chinese comprehension ability of language models.
FudanNLPLAB/CBook-150K
MD5 links for a Chinese book corpus.
liziniu/ReMax
Code for the paper "ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models".
sh0416/llama-classification
Text classification with Foundation Language Model LLaMA
leogao2/lm_dataformat
xsysigma/TencentLLMEval
TencentLLMEval is a comprehensive and extensive benchmark for the human evaluation of large models, including task trees, standards, data verification methods, and more.
gmftbyGMFTBY/Rep-Dropout
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective