funcwj

Speech Recognition & Enhancement & Separation & Generation

ByteDanceBellevue, WA

funcwj's Stars

Anduin2017/HowToCook
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
Language:Dockerfile69.9k 403 6798.9k
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python40.6k 451 3215.2k
typst/typst
A new markup-based typesetting system that is powerful and easy to learn.
Language:Rust38.6k 105 3.1k1.1k
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook37.3k 329 4614.4k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python28.6k 244 2873.3k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python25.8k 203 5732.5k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python16.5k 125 1.3k1.6k
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Language:C++16.1k 252 7.1k3.1k
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.9k 155 3731.1k
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python9.2k 76 612653
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
Language:HTML8.1k 105 442765
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language:Python5.9k 35 552576
volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Language:Python5.7k 40 350560
bytedance/lightseq
LightSeq: A High Performance Library for Sequence Processing and Generation
Language:C++3.3k 56 287330
facebookresearch/flow_matching
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Language:Python2.2k 27 29115
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Language:Python2k 25 52112
pytorch/data
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
Language:Python1.2k 33 526161
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
Language:Python992 42 440231
hollobit/GenAI_LLM_timeline
ChatGPT, GenerativeAI and LLMs Timeline
952 84 459
ddlBoJack/emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python778 16 6158
X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
Language:Python759 23 6675
liusongxiang/Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
474 43 328
SuperKogito/SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
Language:HTML318 15 4044
floodsung/LLM-with-RL-papers
A collection of LLM with RL papers
266 8 310
danmic/av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
205 12 222
k2-fsa/fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
Language:Python141 9 2122
xv44586/Chinese-instruction-datasets
中文 Instruction tuning datasets
129 2 06
fakufaku/torchiva
Blind source separation with independent vector analysis family of algorithm in torch
Language:Python96 5 46
wenet-e2e/wesignal
Production first, nn-based on-device signal processing toolkit.
64 17 73
rwth-i6/rasr
The RWTH ASR Toolkit.
Language:C++55 10 1716

funcwj

funcwj's Stars

Anduin2017/HowToCook

Stability-AI/stablediffusion

typst/typst

suno-ai/bark

meta-llama/llama3

hpcaitech/Open-Sora

Dao-AILab/flash-attention

microsoft/onnxruntime

PKU-YuanGroup/Open-Sora-Plan

facebookresearch/xformers

LianjiaTech/BELLE

OpenRLHF/OpenRLHF

volcengine/verl

bytedance/lightseq

facebookresearch/flow_matching

facebookresearch/chameleon

pytorch/data

lhotse-speech/lhotse

hollobit/GenAI_LLM_timeline

ddlBoJack/emotion2vec

X-LANCE/SLAM-LLM

liusongxiang/Large-Audio-Models

SuperKogito/SER-datasets

floodsung/LLM-with-RL-papers

danmic/av-se

k2-fsa/fast_rnnt

xv44586/Chinese-instruction-datasets

fakufaku/torchiva

wenet-e2e/wesignal

rwth-i6/rasr