funcwj's Stars
Anduin2017/HowToCook
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
typst/typst
A new markup-based typesetting system that is powerful and easy to learn.
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
meta-llama/llama3
The official Meta Llama 3 GitHub site
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
bytedance/lightseq
LightSeq: A High Performance Library for Sequence Processing and Generation
facebookresearch/flow_matching
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
pytorch/data
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
hollobit/GenAI_LLM_timeline
ChatGPT, GenerativeAI and LLMs Timeline
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
liusongxiang/Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
SuperKogito/SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
floodsung/LLM-with-RL-papers
A collection of LLM with RL papers
danmic/av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
k2-fsa/fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
xv44586/Chinese-instruction-datasets
中文 Instruction tuning datasets
fakufaku/torchiva
Blind source separation with independent vector analysis family of algorithm in torch
wenet-e2e/wesignal
Production first, nn-based on-device signal processing toolkit.
rwth-i6/rasr
The RWTH ASR Toolkit.