LydiaXiaohongLi's Stars
arcee-ai/DistillKit
An Open Source Toolkit For LLM Distillation
kyutai-labs/moshi
SpellcraftAI/oaib
Use the OpenAI Batch tool to make async batch requests to the OpenAI API.
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
2noise/ChatTTS
A generative speech model for daily dialogue.
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
kenantang/petci
PETCI: A Parallel English Translation Dataset of Chinese Idioms
infinilabs/analysis-pinyin
🛵 This Pinyin Analysis plugin is used to do conversion between Chinese characters and Pinyin.
lxyu/pinyin
A simple Python script to translate Chinese to Pinyin based on Mandarin.dat
hsing-wang/Awesome-LLM-MT
Unbabel/COMET
A Neural Framework for MT Evaluation
fe1ixxu/ALMA
State-of-the-art LLM-based translation models.
lyogavin/airllm
AirLLM 70B inference with a single 4GB GPU
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
alirezadir/Machine-Learning-Interviews
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
lorenlugosch/transducer-tutorial
Example code for a neural transducer model.
openspeech-team/openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
unit-mesh/auto-dev
🧙 AutoDev: The AI-powered coding wizard with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
xai-org/grok-1
Grok open release
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
ali-vilab/dreamtalk
Official implementation of the paper "DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models"
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
numz/sd-wav2lip-uhq
Wav2Lip UHQ extension for Automatic1111
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
ajay-sainy/Wav2Lip-GFPGAN
High-quality lip sync