Pinned Repositories
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.
AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
annotated_deep_learning_paper_implementations
🧑🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
apps
one benchmark for llm coding
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
Ax
Adaptive Experimentation Platform
lidbox
End-to-end spoken language identification out of the box.
AI-X-King's Repositories
AI-X-King/AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
AI-X-King/apps
one benchmark for llm coding
AI-X-King/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
AI-X-King/brouhaha-vad
AI-X-King/axolotl
Go ahead and axolotl questions
AI-X-King/CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
AI-X-King/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
AI-X-King/data_management_LLM
Collection of training data management explorations for large language models
AI-X-King/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
AI-X-King/DENT_DDSP
AI-X-King/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
AI-X-King/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
AI-X-King/FasterTransformer
Transformer related optimization, including BERT, GPT
AI-X-King/lhotse
Tools for handling speech data in machine learning projects.
AI-X-King/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
AI-X-King/llama.cpp
LLM inference in C/C++
AI-X-King/promptbase
All things prompt engineering
AI-X-King/PSST
Prosodic Speech Segmentation with Transformers
AI-X-King/pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
AI-X-King/pytorch-docker
Pure Pytorch Docker Images.
AI-X-King/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
AI-X-King/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
AI-X-King/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
AI-X-King/sherpa
Streaming and non-streaming ASR server for next-gen Kaldi
AI-X-King/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
AI-X-King/vad-1-without-upload
AI-X-King/VAD-with-adversarial-multi-task-learning
AI-X-King/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
AI-X-King/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
AI-X-King/whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.