chengtbf's Stars
deepseek-ai/DeepSeek-V3
xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
kyutai-labs/moshi
volcengine/veGiantModel
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
LC044/WeChatMsg
提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手
ShiArthur03/ShiArthur03
triton-lang/triton
Development repository for the Triton language and compiler
SkyworkAI/Skywork-MoE
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
SkyworkAI/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
breezedeus/Pix2Text
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
Ucas-HaoranWei/Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
meta-llama/llama3
The official Meta Llama 3 GitHub site
Sanster/xy-cut
xai-org/grok-1
Grok open release
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
Alice1998/URS
URS Benchmark: Evaluating LLMs on User Reported Scenarios
hendrycks/math
The MATH Dataset (NeurIPS 2021)
THUDM/SwissArmyTransformer
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
niderhoff/nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
mistralai/megablocks-public
GPT-Fathom/GPT-Fathom
GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under aligned settings.
OpenBMB/InfiniteBench
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
Azure/MS-AMP
Microsoft Automatic Mixed Precision Library
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
vectara/hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents