ham-p

ham-p's Stars

modelscope/KAN-TTS
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
Language:Python49884
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook36.6k4.3k
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
Language:Python12k1.1k
0hq/WebGPT
Run GPT model on the browser with WebGPU. An implementation of GPT inference in less than ~1500 lines of vanilla Javascript.
Language:JavaScript3.7k209
Sharrnah/whispering
Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRChat and Overlays in most Streaming Applications
Language:Python40431
ggerganov/llama.cpp
LLM inference in C/C++
Language:C++70.2k10.1k
innnky/so-vits-svc
基于vits与softvc的歌声音色转换模型
Language:Python3.6k6
skywalker023/sodaverse
🥤🧑🏻‍🚀Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"
Language:Python22313
thesephist/monocle
Universal personal search engine, powered by a full text search algorithm written in pure Ink, indexing Linus's blogs and private note archives, contacts, tweets, and over a decade of journals.
Language:JavaScript1.5k38
promptslab/Promptify
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
Language:Jupyter Notebook3.4k245
mpetazzoni/sse.js
A flexible Server-Sent Events EventSource polyfill for Javascript
Language:JavaScript46390
j2kun/imsdb_download_all_scripts
Download all plaintext scripts from imsdb.com
Language:Python3115
Hiswe/vh-check
mobile vh unit utility
Language:TypeScript43416
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Language:Python833157
NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language:Jupyter Notebook5.1k1.4k
quoth/fastapi-cloud-logging
Language:Python72
bsolomon1124/pycld3
Python3 bindings for the Compact Language Detector v3 (CLD3)
Language:C++1496
saffsd/langid.py
Stand-alone language identification system
Language:Python2.3k320
wooorm/franc
Natural language detection
Language:JavaScript4.2k176
Rezmason/matrix
matrix (web-based green code rain, made with love)
Language:JavaScript3.4k228
chrisguttandin/extendable-media-recorder
An extendable drop-in replacement for the native MediaRecorder.
Language:JavaScript28413
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Language:C++25.5k4k
openai/point-e
Point cloud diffusion for 3D model synthesis
Language:Python6.6k763
xtermjs/xterm.js
A terminal for the web
Language:TypeScript18k1.6k
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python53.1k8.8k
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language:Jupyter Notebook13.4k1.9k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python36.5k4.5k
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Language:Python8.4k739
shirayu/whispering
Streaming transcriber with whisper
Language:Python68653
eladrich/latent-nerf
Official Implementation for "Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures"
Language:Python70151