Fenghuang888's Stars
psdwizzard/XTTS-Read-Aloud
This Chrome extension integrates screen reader functionality using the XTTS-webui API. Currently in beta and using the XTTS Server API backend, it will soon move to AllTalk. It enhances web accessibility with seamless text-to-speech capabilities. Licensed under the MIT License for unrestricted and commercial use.
LinuxDroidMaster/Termux-Desktops
Collection of scripts to launch Desktops with audio in Termux X11 and how to use hardware acceleration
airbnb/knowledge-repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
jacopotagliabue/you-dont-need-a-bigger-boat
An end-to-end implementation of intent prediction with Metaflow and other cool tools
Nerogar/OneTrainer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
LarryJane491/Lora-Training-in-Comfy
This custom node lets you train LoRA directly in ComfyUI!
LarryJane491/Image-Captioning-in-ComfyUI
Custom nodes for ComfyUI that let the user load a bunch of images and save them with captions (ideal to prepare a database for LoRA training)
FurkanGozukara/Stable-Diffusion
FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, Kaggle, NoteBooks, ControlNet, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya, Midjourney
karpathy/LLM101n
LLM101n: Let's build a Storyteller
MrForExample/ComfyUI-3D-Pack
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc.) using cutting-edge algorithms (3DGS, NeRF, etc.)
fofr/cog-consistent-character
Create images of a given character in different poses
TMElyralab/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
pytorch/torchtitan
A native PyTorch Library for large model training
deepjavalibrary/djl
An Engine-Agnostic Deep Learning Framework in Java
phidatahq/phidata
Build AI Agents with memory, knowledge, tools and reasoning
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
google-deepmind/penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
Kanaries/pygwalker
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
RVC-Boss/GPT-SoVITS
1 minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)
AIFSH/ComfyUI-GPT_SoVITS
A ComfyUI custom node for GPT-SoVITS! You can now do voice cloning and TTS in ComfyUI.
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
kijai/ComfyUI-VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
developersdigest/llm-answer-engine
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild