wenzezhang's Stars
freeCodeCamp/freeCodeCamp
freeCodeCamp.org's open-source codebase and curriculum. Learn to code for free.
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
JiuTian-VL/MoME
NVIDIA/audio-flamingo
PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.
espnet/espnet
End-to-End Speech Processing Toolkit
ZhangXInFD/SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
karpathy/LLM101n
LLM101n: Let's build a Storyteller
phlippe/uvadlc_notebooks
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
mosaicml/llm-foundry
LLM training code for Databricks foundation models
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
2noise/ChatTTS
A generative speech model for daily dialogue.
facebookresearch/ssl-data-curation
PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning
mistralai/mistral-finetune
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
pipecat-ai/pipecat
Open Source framework for voice and multimodal conversational AI
hao-ai-lab/Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models
agiresearch/AIOS
AIOS: LLM Agent Operating System
andialbrecht/sqlparse
A non-validating SQL parser module for Python
dataease/dataease
🔥 An open-source BI tool that anyone can use; an open-source alternative to Tableau and FanRuan.
KwaiKEG/KwaiAgents
A generalized information-seeking agent system with Large Language Models (LLMs).
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use, building a fine-tuning platform that researchers can pick up quickly. We welcome open-source enthusiasts to open any meaningful PR on this repo and integrate as many LLM-related technologies as possible.
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents