npujcong's Stars
ggerganov/llama.cpp
LLM inference in C/C++
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
state-spaces/mamba
Mamba SSM architecture
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
openai/guided-diffusion
THUDM/CogVLM
A state-of-the-art-level open visual language model | multimodal pretrained model
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
huggingface/parler-tts
Inference and training library for high-quality TTS models.
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
openai/consistencydecoder
Consistency Distilled Diff VAE
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
OpenLMLab/MOSS-RLHF
Secrets of RLHF in Large Language Models Part I: PPO
MC-E/DragonDiffusion
ICLR 2024 (Spotlight)
X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
tinyzqh/awesome-reinforcement-learning
Learning Resources And Links Of Reinforcement Learning (updating)
glory20h/VoiceLDM
VoiceLDM: Text-to-Speech with Environmental Context
adelacvg/ttts
Train the next generation of TTS systems.
princeton-nlp/CEPE
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
AudiogenAI/agc
Audiogen Codec
seastar105/pflow-encodec
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
danpovey/conditional-flow-matching
KerfuffleV2/llm-samplers
A collection of LLM token samplers in Rust