vxltersmith

vxltersmith's Stars

MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Language:Python4.3k716
vxltersmith/align_refine
Language:Python1
vosen/ZLUDA
CUDA on non-NVIDIA GPUs
Language:Rust9.8k637
MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language:Python1.4k136
ControlNet/MARLIN
[CVPR] MARLIN: Masked Autoencoder for facial video Representation LearnINg
Language:Python23220
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Language:Python10.8k2.3k
antgroup/echomimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language:Python3k344
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Language:Python1.3k161
ali-vilab/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Language:Python1.6k199
flashlight/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Language:C++6.4k1k
SamsungLabs/SummaryMixing
This repository implements SummaryMixing, a simpler, faster and much cheaper replacement to self-attention for automatic speech recognition (see: https://arxiv.org/abs/2307.07421). The code is ready to be used with the SpeechBrain toolkit).
Language:Python11111
SJTMusicTeam/Muskits
An opensource music processing toolkit
Language:Python31144
pytorch/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Language:Python22.4k9.5k
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.7k992
huggingface/candle
Minimalist ML framework for Rust
Language:Rust15.9k960
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
Language:C#17.2k4.2k
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Language:Python23.7k2k
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python10.3k982
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language:Jupyter Notebook4.8k643
madebyollin/taesd
Tiny AutoEncoder for Stable Diffusion
Language:Python58328
HKUNLP/reparam-discrete-diffusion
Reparameterized Discrete Diffusion Models for Text Generation
Language:Python903
Stability-AI/generative-models
Generative Models by Stability AI
Language:Python24.7k2.7k
mts-ai/audiogram
Language:Python51
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.
Language:Python817182
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Language:Python12.4k854
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
Language:Python36.8k5.3k
k2-fsa/k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Language:Cuda1.1k215
XPixelGroup/DiffBIR
Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Language:Python3.4k289
ShichenLiu/SoftRas
Project page of paper "Soft Rasterizer: A Differentiable Renderer for Image-based 3D Reasoning"
Language:Python1.2k156
BrianPulfer/PapersReimplementations
Personal short implementations of Machine Learning papers
Language:Jupyter Notebook23353