willyzw1221

willyzw1221's Stars

fudan-generative-vision/hallo2
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
Language:Python3.3k463
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python6.4k732
pkunlp-icler/FastV
[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
Language:Python2659
GeorgeLuImmortal/PaDeLLM_NER
Language:Python2
langchain-ai/open-canvas
📃 A better UX for chat, writing content, and coding with LLMs.
Language:TypeScript2.3k333
YoMio-Tech-Inc/GPT-SoVITS2
GPT-SoVITS2
Language:Python17714
HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Language:Python6.9k760
Leymore/ruozhiba
64659
jy0205/Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Language:Python2.2k199
The-Run-Philosophy-Organization/run
润学全球官方指定GITHUB，整理润学宗旨、纲领、理论和各类润之实例；解决为什么润，润去哪里，怎么润三大问题；并成为新**人的核心宗教，核心信念。
31.7k2.6k
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Language:Python2.5k175
LinglongQian/Medical-Graph-RAG
Medical Graph RAG: Graph RAG for the Medical Data
Language:Python1
Gsllchb/Handright
A lightweight Python library for simulating Chinese handwriting
Language:Python2k248
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Language:Python3k269
facebookresearch/sapiens
High-resolution models for human tasks.
Language:Python4.4k240
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python15.8k4.9k
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Language:Python3.3k324
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
Language:Python5.3k440
Arthurzhangsheng/echomimic-all-in-one-package
echomimic免环境安装windows一体包，解压即用|echomimic environment-free installation Windows all-in-one package, ready to use after extraction
12
Azure-Samples/graphrag-accelerator
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
Language:Python1.8k295
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
Language:Python39.4k5.8k
BadToBest/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language:Python2.8k333
jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并支持api调用
Language:Python10.6k1.2k
LayTextLLM/LayTextLLM
Language:Python649
danielgatis/rembg
Rembg is a tool to remove images background
Language:Python16.8k1.9k
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
Language:Java44.4k3.6k
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Language:Python18.7k1.8k
fanmingming/live
✯ 可直连访问的电视/广播图标库与相关工具项目 ✯ 🔕 永久免费直连访问完整开源不断完善的台标支持IPv4/IPv6双栈访问 🔕
Language:JavaScript22.6k3.4k
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
Language:Python2.3k255
KwaiVGI/LivePortrait
Bring portraits to life!
Language:Python12.7k1.3k