Fenghuang888

Fenghuang888's Stars

psdwizzard/XTTS-Read-Aloud
This Chrome extension integrates screen reader functionality using the XttS-webui API. Currently in beta and using the XttS Server API backend, it will soon move to AllTalk. It enhances web accessibility with seamless text-to-speech capabilities. Licensed under the MIT License for unrestricted and commercial use
Language:JavaScript142
LinuxDroidMaster/Termux-Desktops
Collection of scripts to launch Desktops with audio in Termux X11 and how to use hardware acceleration
Language:Shell59961
airbnb/knowledge-repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Language:Python5.5k689
jacopotagliabue/you-dont-need-a-bigger-boat
An end-to-end implementation of intent prediction with Metaflow and other cool tools
Language:Python84165
Nerogar/OneTrainer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
Language:Python1.7k142
LarryJane491/Lora-Training-in-Comfy
This custom node lets you train LoRA directly in ComfyUI!
Language:Python37253
LarryJane491/Image-Captioning-in-ComfyUI
Custom nodes for ComfyUI that let the user load a bunch of images and save them with captions (ideal to prepare a database for LORA training)
Language:Python4614
FurkanGozukara/Stable-Diffusion
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, Kaggle, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya, Midjourney, RunPod
Language:Jupyter Notebook2.1k289
karpathy/LLM101n
LLM101n: Let's build a Storyteller
29.4k1.6k
MrForExample/ComfyUI-3D-Pack
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)
Language:Python2.3k228
fofr/cog-consistent-character
Create images of a given character in different poses
Language:Python57256
TMElyralab/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
Language:Python2.2k156
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
26.9k2.2k
pytorch/torchtitan
A native PyTorch Library for large model training
Language:Python2.5k182
deepjavalibrary/djl
An Engine-Agnostic Deep Learning Framework in Java
Language:Java4.1k653
phidatahq/phidata
Build AI Agents with memory, knowledge, tools and reasoning
Language:Python11.6k1.7k
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language:Python30.3k2.8k
google-deepmind/penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
Language:Python1.7k50
Kanaries/pygwalker
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
Language:Python13k674
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
Language:Python1.4k139
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python34.3k3.9k
AIFSH/ComfyUI-GPT_SoVITS
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
Language:Python19616
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Language:Python10.6k2.3k
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language:Python10.4k1k
kijai/ComfyUI-VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Python492
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Language:Jupyter Notebook64.3k32.6k
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Language:Python8.2k824
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Language:Python44.4k5.3k
developersdigest/llm-answer-engine
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper
Language:TypeScript4.6k735
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Jupyter Notebook7.6k741