BinWang28

Scientist, I2R, A*STAR

I2R, Singapore

BinWang28's Stars

hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language: Python 36.8k 219 5.6k 4.5k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python 27.6k 230 2733.2k
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
Language: Shell 7.5k 42 772460
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language: Python 4.2k 90 1.1k 1.1k
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language: Python 3.8k 41 160341
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language: Python 3.5k 58 71310
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Language: Python 2.1k 47 137158
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Language: Python 1.9k 26 51112
jianfch/stable-ts
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
Language: Python 1.7k 32 280182
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Language: Python 1.3k 33 8791
jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language: Python 924 21 5553
prometheus-eval/prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
Language: Python 828 3 3649
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
794 44 348
NVIDIA/enroot
A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.
Language: Shell 655 25 18097
Yuan-ManX/ai-audio-datasets
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
569 13 140
NVIDIA/pyxis
Container plugin for Slurm Workload Manager
Language: C 303 9 13430
IndoNLP/nusa-crowd
A collaborative project to collect datasets in Indonesian languages.
Language: Jupyter Notebook 263 6 19162
AI4Bharat/IndicTrans2
Translation models for 22 scheduled languages of India
Language: Python 246 10 9568
NVIDIA/audio-flamingo
PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.
Language: Python 216 6 1115
aiverify-foundation/moonshot
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
Language: Python 191 7 3140
AudioLLMs/AudioLLM
Audio Large Language Models
191 9 211
homebrewltd/llama3-s
Llama3.1 learns to Listen
Language: Python 148 5 295
AudioLLMs/AudioBench
AudioBench: A Universal Benchmark for Audio Large Language Models
Language: Python 106 8 11
wsntxxn/AudioCaption
Audio captioning recipe
Language: Python 45 1 85
Labbeti/aac-metrics
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
Language: Python 37 3 113
mulab-mir/muchomusic
MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.
Language: Jupyter Notebook 26 2 21
SeaEval/SeaEval
NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning
Language: Python 24 0 24
openaudiolab/LLaST
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
Language: Python 23 6 11
SeaEval/CRAFT
ACL 2024 Workshop: CRAFT: Extracting and Tuning Cultural Instructions from the Wild
Language: Python 2 0 10
zouxunlong/web_crawl
Language: Python 1 1 01
