Pinned Repositories
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
chroma
the AI-native open-source embedding database
Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
datasets-server
Lightweight web API for visualizing and exploring all types of datasets - computer vision, speech, text, and tabular - stored on the Hugging Face Hub
DeepSeek-V3
DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
TARS-AI
xaugmentoolkitx
Convert Compute And Books Into Instruct-Tuning Datasets
xanomanox's Repositories
xanomanox/TARS-AI
xanomanox/xaugmentoolkitx
Convert Compute And Books Into Instruct-Tuning Datasets
xanomanox/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
xanomanox/chroma
the AI-native open-source embedding database
xanomanox/Comprehensive-Tacotron2
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
xanomanox/datasets-server
Lightweight web API for visualizing and exploring all types of datasets - computer vision, speech, text, and tabular - stored on the Hugging Face Hub
xanomanox/DeepSeek-V3
xanomanox/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
xanomanox/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
xanomanox/gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
xanomanox/GPTARS_Interstellar
TARS from Interstellar x ChatGPT
xanomanox/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
xanomanox/llama.cpp
Port of Facebook's LLaMA model in C/C++
xanomanox/radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over of Low Dimensional (F0 and Energy) Speech Attributes.
xanomanox/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
xanomanox/Hunyuan3D-2
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
xanomanox/SMS-AI
A Personalised AI Assistant Inspired by 'Diamond Age, Powered by SMS
xanomanox/stable-diffusion-webui
Stable Diffusion web UI
xanomanox/Tacotron2-PyTorch
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
xanomanox/text-generation-webui
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.
xanomanox/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
xanomanox/whisper
Robust Speech Recognition via Large-Scale Weak Supervision