MrYANG23's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
microsoft/RecAI
Bridging LLM and Recommender System.
NaiboWang/EasySpider
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
fatwang2/coze2openai
Turn Coze API into OpenAI
jina-ai/reader
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
LLM-Red-Team/kimi-free-api
🚀 KIMI AI 长文本大模型逆向API白嫖测试【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像OCR、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。
huggingface/dataspeech
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
shengxia/RWKV_Role_Playing_API
一个基于Flask实现的RWKV_Role_Playing项目的API。
Suv00m/StyleTTS2-Train
XiangLi2022/CM-TTS
[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
TaoHuUMD/StructLDM
tanshuai0219/EDTalk
[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation
Vision-CAIR/MiniGPT4-video
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
fishaudio/vocoder
zju3dv/GeneAvatar
Code for "GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image", CVPR 2024
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
SerialLain3170/AwesomeAnimeResearch
Papers, repository and other data about anime or manga research. Please let me know if you have information that the list does not include.
vra/flopth
A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
myshell-ai/JetMoE
Reaching LLaMA2 Performance with 0.1M Dollars
nilsherzig/LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
scutcsq/Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)
Xiaobin-Rong/gtcrn
The official implementation of GTCRN, an ultra-lite speech enhancement model.
apple/pytorch-speech-features
PantoMatrix/PantoMatrix
PantoMatrix: Co-Speech Talking Head and Gestures Generation
princeton-nlp/SWE-agent
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.