rave974

rave974's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language: Python 165k 1.6k 2.4k 43.8k
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language: Python 65.3k 545 0 7.6k
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language: Jupyter Notebook 34k 316 4244k
AliaksandrSiarohin/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
Language: Jupyter Notebook 14.4k 351 5303.2k
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language: Jupyter Notebook 12.6k 170 5051.8k
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language: Python 12.1k 135 197830
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language: Jupyter Notebook 11.2k 96 3371.5k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language: Python 11.1k 203 2.2k 2.3k
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language: Python 8.3k 130 1.1k 1.3k
google-deepmind/pysc2
StarCraft II Learning Environment
Language: Python 8k 351 2811.2k
PaddlePaddle/PaddleGAN
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
Language: Python 7.8k 107 3571.2k
XavierXiao/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Language: Jupyter Notebook 7.5k 92 146788
facebookresearch/metaseq
Repo for external large-scale work
Language: Python 6.4k 111 292722
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Jupyter Notebook 5.6k 70 978735
ThoughtfulDev/EagleEye
Stalk your Friends. Find their Instagram, FB and Twitter Profiles using Image Recognition and Reverse Image Search.
Language: Python 4.2k 137 155560
JoePenna/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.
Language: Jupyter Notebook 3.2k 39 107560
yangxy/GPEN
Language: Jupyter Notebook 2.4k 57 176448
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
1.5k 80 7224
bloc97/CrossAttentionControl
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
Language: Jupyter Notebook 1.3k 22 2691
MycroftAI/mimic3
A fast local neural text to speech engine for Mycroft
Language: Python 1k 21 4992
harlanhong/CVPR2022-DaGAN
Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation
Language: Python 956 27 79125
rhasspy/larynx
End to end text to speech system using gruut and onnx
Language: Python 824 23 7149
nepx/halfix
x86 PC emulator that runs both natively and in the browser, via WebAssembly
Language: C 658 19 3585
magic-research/magic-avatar
MagicAvatar: Multimodal Avatar Generation and Animation
617 64 431
taylorlu/Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
Language: Python 461 17 61123
lrusso/VirtualXP
Virtual Machine running on a Web browser
Language: HTML 367 9 0105
AleRapchan/flash-swap-arbitrage-bot
Smart Contract BOT code, running on Ethereum Blockchain, watching for and executing profitable arbitrage opportunities using flash loans and flash swaps.
Language: JavaScript 221 14 798
watzon/fbmdob
Facebook image Metadata Obfuscation server
Language: Vue 157 8 26
Victarry/stable-dreambooth
Dreambooth implementation based on Stable Diffusion with minimal code.
Language: Python 141 4 1121
bycloudai/GPEN-colab
Language: Python 68 2 023