anthony-wss's Stars
xai-org/grok-1
Grok open release
karpathy/LLM101n
LLM101n: Let's build a Storyteller
karpathy/llm.c
LLM training in simple, raw C/CUDA
NanmiCoder/MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
state-spaces/mamba
Mamba SSM architecture
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
ixartz/SaaS-Boilerplate
🚀🎉📚 SaaS Boilerplate built with Next.js + Tailwind CSS + Shadcn UI + TypeScript. ⚡️ Full-stack React application with Auth, Multi-tenancy, Roles & Permissions, i18n, Landing Page, DB, Logging, Testing
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
brevdev/notebooks
Collection of notebook guides created by the Brev.dev team!
QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
fixie-ai/ultravox
A fast multimodal LLM for real-time voice
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
FranxYao/Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
YuanGongND/ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
facebookresearch/sound-spaces
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.
tincans-ai/gazelle
Joint speech-language model - respond directly to audio!
YuanGongND/whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
voidful/Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
dynamic-superb/dynamic-superb
The official repository of Dynamic-SUPERB.
dell-research-harvard/AmericanStories
The official Github for the American Stories dataset as in {link}
speechbrain/benchmarks
This repository contains the SpeechBrain Benchmarks
DanielLin94144/DUAL-textless-SQA
Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning" paper.
voidful/dtokenizer
discretize everything into tokens
FarnHua/Prompt-Benchmark
YK-Fu/fairseq_dgslm
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.