MrYANG23

Text to Speech, Voice conversion, Singing voice conversion.

Chengdu

MrYANG23's Stars

meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python · 27.1k stars · 3.1k forks
microsoft/RecAI
Bridging LLM and Recommender System.
Language:Jupyter Notebook59154
NaiboWang/EasySpider
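A visual no-code/code-free web crawler/spider. EasySpider (易采集): a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically, without writing code. Also known as ServiceWrapper, an intelligent service wrapping system for web applications.
Language: JavaScript · 36k stars · 4.4k forks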
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Language: TypeScript · 18.7k stars · 1.4k forks
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
Language: Python · 1.4k stars · 139 forks
fatwang2/coze2openai
Turn Coze API into OpenAI
Language:JavaScript570139
jina-ai/reader
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
Language: TypeScript · 7k stars · 549 forks
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Language: Python · 13.4k stars · 1.2k forks
LLM-Red-Team/kimi-free-api
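🚀 Reverse-engineered API for the KIMI AI long-context large language model, for free-use testing (specialty: interpreting and summarizing long texts). Supports high-speed streaming output, agent conversations, web search, long-document interpretation, image OCR, and multi-turn dialogue, with zero-configuration deployment, multi-token support, and automatic cleanup of session traces.
Language: TypeScript · 3.8k stars · 624 forks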
huggingface/dataspeech
Language:Python30346
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
69735
shengxia/RWKV_Role_Playing_API
A Flask-based API for the RWKV_Role_Playing project.
Language:Python303
Suv00m/StyleTTS2-Train
Language:Jupyter Notebook51
XiangLi2022/CM-TTS
[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
Language:Python623
TaoHuUMD/StructLDM
Language:Python1024
tanshuai0219/EDTalk
[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation
Language:Python34832
Vision-CAIR/MiniGPT4-video
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
Language:Python55560
fishaudio/vocoder
Language:Python764
zju3dv/GeneAvatar
Code for "GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image", CVPR 2024
931
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language: Python · 4.3k stars · 314 forks
SerialLain3170/AwesomeAnimeResearch
Papers, repository and other data about anime or manga research. Please let me know if you have information that the list does not include.
1.1k stars · 68 forks
vra/flopth
A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.
Language:Python1199
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language: Python · 22.9k stars · 2.2k forks
myshell-ai/JetMoE
Reaching LLaMA2 Performance with 0.1M Dollars
Language:Python96079
nilsherzig/LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
Language: Go · 5.7k stars · 361 forks
scutcsq/Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)
Language:Python584
Xiaobin-Rong/gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Language:Python21336
apple/pytorch-speech-features
Language:Python849
PantoMatrix/PantoMatrix
PantoMatrix: Co-Speech Talking Head and Gestures Generation
Language:Python992175
princeton-nlp/SWE-agent
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
Language: Python · 13.7k stars · 1.4k forks
