rsandx

Ottawa, Canada

rsandx's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language: Python 74.6k 606 08.9k
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
Language: Python 35.8k 988 1903.5k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python 30.8k 424 4.2k6.4k
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language: Jupyter Notebook 21.3k 211 3992.2k
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language: Jupyter Notebook 11.2k 144 3741.1k
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language: Python 10.9k 101 372881
PaddlePaddle/PaddleGAN
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
Language: Python 8k 107 3671.2k
openai/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
Language: Python 7.9k 300 2631.4k
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Language: Python 4.7k 71 84355
bryandlee/animegan2-pytorch
PyTorch implementation of AnimeGANv2
Language: Jupyter Notebook 4.4k 59 56642
minivision-ai/photo2cartoon
Portrait cartoonization exploration project (photo-to-cartoon translation project)
Language: Python 4k 82 72772
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language: Python 3.7k 67 105309
williamyang1991/VToonify
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
Language: Jupyter Notebook 3.6k 62 76450
igorprado/react-notification-system
A complete and totally customizable component for notifications in React
Language: JavaScript 2.5k 39 127249
context-labs/autodoc
Experimental toolkit for auto-generating codebase documentation using LLMs
Language: TypeScript 2k 17 34122
ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language: Python 1.9k 28 221548
google-ai-edge/mediapipe-samples
Language: Jupyter Notebook 1.8k 46 205452
volotat/SD-CN-Animation
This script automates video stylization tasks using Stable Diffusion and ControlNet.
Language: Python 811 15 15663
SpeechifyInc/Meta-voicebox
Implementation of Meta-Voicebox: the first generative AI model for speech to generalize across tasks with state-of-the-art performance.
571 85 431
Emotional-Text-to-Speech/dl-for-emo-tts
A summary of our attempts at using Deep Learning approaches for Emotional Text to Speech
Language: Jupyter Notebook 439 10 745
soumik-kanad/diff2lip
Language: Python 334 29 4240
zhw2590582/WFPlayer
WFPlayer.js is an audio waveform generator
Language: JavaScript 301 5 5238
keonlee9420/Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Language: Python 293 4 2047
keonlee9420/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Language: Python 189 7 1727
jmoso13/jukebox-diffusion
Language: Python 107 5 213
binodswain/react-faq-component
React package to render FAQ section
Language: JavaScript 57 4 3110
ganeshmani/react-table-pagination-example
This repo is a demo of React table pagination handling 1 million records from the server
Language: JavaScript 16 3 08
ceramicwhite/IllusionDiffusion
Fork of huggingface.co/spaces/AP123/IllusionDiffusion
Language: Python 9 1 13
leiyi420/MsEmoTTS
4 2 10
fhixa/CodeScribe
CodeScribe - An automated way to describe code
Language: Python 3 1 01
