npujcong's Stars
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
danpovey/conditional-flow-matching
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
seastar105/pflow-encodec
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
MC-E/DragonDiffusion
ICLR 2024 (Spotlight)
princeton-nlp/CEPE
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
huggingface/parler-tts
Inference and training library for high-quality TTS models.
AudiogenAI/agc
Audiogen Codec
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
adelacvg/ttts
Train the next generation of TTS systems.
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
glory20h/VoiceLDM
VoiceLDM: Text-to-Speech with Environmental Context
tinyzqh/awesome-reinforcement-learning
Learning Resources And Links Of Reinforcement Learning (updating)
KerfuffleV2/llm-samplers
A collection of LLM token samplers in Rust
ggerganov/llama.cpp
LLM inference in C/C++
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
OpenLMLab/MOSS-RLHF
MOSS-RLHF
state-spaces/mamba
Mamba SSM architecture
openai/guided-diffusion
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
openai/consistencydecoder
Consistency Distilled Diff VAE