IamQisir

An idiot with a plan can beat a genius without a plan.

The University of Tokyo, Kashiwa, Chiba, Japan

IamQisir's Stars

dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Language: MDX 51.1k 564 2075k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python 37k 218 1.4k 4.2k
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language: Jupyter Notebook 36.4k 330 4464.3k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python 36.1k 298 1.1k 4.4k
typst/typst
A new markup-based typesetting system that is powerful and easy to learn.
Language: Rust 36.1k 99 2.8k 968
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
27.6k 288 432.3k
d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Language: Python 24.2k 419 2964.4k
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language: Jupyter Notebook 13.4k 174 5211.9k
OpenTalker/SadTalker
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language: Python 12.1k 152 8222.2k
zauberzeug/nicegui
Create web-based user interfaces with Python. The nice way.
Language: Python 10.4k 77 1.1k 616
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language: Python 8k 71 3361k
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language: Python 5k 76 199426
holoviz/panel
Panel: The powerful data exploration & web app framework for Python
Language: Python 4.9k 60 3.7k 522
antgroup/echomimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language: Python 3.3k 48 193377
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
Language: Jupyter Notebook 3.2k 49 81427
anyoptimization/pymoo
NSGA2, NSGA3, R-NSGA3, MOEAD, Genetic Algorithms (GA), Differential Evolution (DE), CMAES, PSO
Language: Python 2.3k 29 428401
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Language: Python 1.8k 30 85213
molvqingtai/WebChat
💬 Chat with anyone on any website.
Language: TypeScript 1.3k 6 2383
JohnSnowLabs/nlu
1 line for thousands of State of The Art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
Language: Python 882 22 43131
okld/streamlit-elements
Create a draggable and resizable dashboard in Streamlit, featuring Material UI widgets, Monaco editor (Visual Studio Code), Nivo charts, and more!
Language: TypeScript 744 6 3583
idiap/coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python 696 13 5278
JarodMica/ai-voice-cloning
Language: Python 662 20 139148
initialneil/SplattingAvatar
[CVPR2024] Official implementation of SplattingAvatar.
Language: Python 443 12 4139
reservoirpy/reservoirpy
A simple and flexible code for Reservoir Computing architectures like Echo State Networks
Language: Python 443 17 103109
okld/streamlit-player
A streamlit component to embed video and music players from various websites.
Language: Python 101 2 1122
bouzidanas/streamlit-float
A simple module for fixing the vertical position of Streamlit containers relative to viewport instead of page or content
Language: Python 69 1 310
daswer123/deepspeed-windows-wheels
A collection of compiled wheels for deepspeed built for python 3.10 and 3.11 with support for cuda 11.8 and 12.1 for Windows
50 3 33
gerazov/PySFC
Python implementation of the SFC intonation model.
Language: Python 18 3 22
Bomingmiao/NoiseDiffusion
Noise Diffusion for Enhancing Faithfulness in Text-to-Image Synthesis
Language: Python 4
Nexdata-AI/207-Hours-Japanese-Speaking-English-Speech-Data-by-Mobile-Phone
Japanese Speaking English Speech Dataset
2 1 01
