maxilevi
software, business, tennis, engineering, finance, graphics and dogs. 23 years old.
@leastsquares
London
maxilevi's Stars
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Pythagora-io/gpt-pilot
The first real AI developer
xinntao/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
google/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
Acly/krita-ai-diffusion
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
Plachtaa/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
cmusphinx/pocketsphinx
A small speech recognizer
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
OpenPipe/OpenPipe
Turn expensive prompts into cheap fine-tuned models
minimaxir/aitextgen
A robust Python tool for text-based AI training and generation using GPT-2.
huggingface/transfer-learning-conv-ai
🦄 State-of-the-Art Conversational AI with Transfer Learning
varunshenoy/opendream
An extensible, easy-to-use, and portable diffusion web UI 👨‍🎨
microsoft/Llama-2-Onnx
CjangCjengh/vits
VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai
MycroftAI/mimic-recording-studio
Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice with Mimic2
archinetai/surgeon-pytorch
A library to inspect and extract intermediate layers of PyTorch models.
Ads-cmu/WhatsApp-Llama
Finetune a LLM to speak like you based on your WhatsApp Conversations
Baiyuetribe/ncnn-models
awesome AI models with NCNN, and how they were converted ✨✨✨
devjwsong/gpt2-dialogue-generation-pytorch
The PyTorch implementation of fine-tuning GPT-2 (Generative Pre-trained Transformer 2) for dialogue generation.
rhasspy/piper-recording-studio
Local voice recording for creating Piper datasets
Hecate2/sukasuka-vocal-dataset-builder
Sukasuka anime vocal dataset. 1st anime vocal dataset. Extract all audio (vocal) files from video based on .ass subtitle files, then manually label the vocal files by character. Will be used for PITS/VITS/Diffusion text-to-speech/SVC.
danielgrittner/nanoGPT-LoRA
The simplest, fastest repository for training/finetuning medium-sized GPTs with LoRA support.
Arryboom/fmemopen_windows
fmemopen on windows