DerryChan's Stars
2noise/ChatTTS
A generative speech model for daily dialogue.
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
houtianze/bypy
Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Canner/WrenAI
🤖 Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, dashboards and BI. 📈📊📋🧑💻
Future-House/paper-qa
High accuracy RAG for answering questions from scientific documents with citations
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
MadcowD/ell
A language model programming library.
Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
eseckel/ai-for-grant-writing
A curated list of resources for using LLMs to develop more competitive grant applications.
Dataherald/dataherald
Interact with your SQL database, Natural Language to SQL using LLMs
PaddlePaddle/PARL
A high-performance distributed training framework for Reinforcement Learning
BadToBest/EchoMimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
wasiahmad/Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
megvii-research/megactor
Weizhi-Zhong/IP_LAP
CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors
FuxiVirtualHuman/styletalk
Sxjdwang/TalkLip
kepengxu/PGTFormer
[IJCAI'24] Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer
langzizhixin/wav2lip-576x576
This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital human videos.
Aruen24/wav2lip_288x288_test
xuhongming251/ComfyUI-MuseTalkUtils
MuseTalk ComfyUI Preprocess and Postprocess Nodes
iptop/GFPGAN-for-Video
使用GFPGAN进行视频修复
xiyuan-zhou/ElecBench-a-Power-Dispatch-Evaluation-Benchmark-for-Large-Language-Models