zhangxd123's Stars

home-assistant/home-assistant.io
Home Assistant User documentation
Language: HTML · Stars: 5.3k · Forks: 7.4k
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
Language: Python · Stars: 25k · Forks: 3.2k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language: Python · Stars: 36.9k · Forks: 4.5k
KwaiVGI/LivePortrait
Bring portraits to life!
Language: Python · Stars: 13.4k · Forks: 1.4k
Mintplex-Labs/anything-llm
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Language: JavaScript · Stars: 29.3k · Forks: 3k
openai/openai-python
The official Python library for the OpenAI API
Language: Python · Stars: 23.7k · Forks: 3.4k
SocialAI-tianji/Tianji
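Building large language models that understand social etiquette and interpersonal norms | End-to-end tutorial covering prompt engineering, RAG, agents, and fine-tuning
Language: Python · Stars: 1k · Forks: 72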
logan-zou/Chat_with_Datawhale_langchain
Language:Python16729
datawhalechina/so-large-lm
Foundations of large models: the fundamentals of large models in one article
Stars: 3.3k · Forks: 299
datawhalechina/llm-universe
This project is a tutorial on large-model application development for beginner developers. Read online: https://datawhalechina.github.io/llm-universe/
Language: Jupyter Notebook · Stars: 5.1k · Forks: 609
datawhalechina/self-llm
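"A Guide to Using Open-Source Large Models": a tutorial tailored for ** beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment
Language: Jupyter Notebook · Stars: 10.5k · Forks: 1.2k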
datawhalechina/llm-cookbook
An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large-model course series
Language: Jupyter Notebook · Stars: 12.5k · Forks: 1.6k
antgroup/echomimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language: Python · Stars: 3.3k · Forks: 386
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook · Stars: 48.3k · Forks: 5.7k
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Language: Go · Stars: 105k · Forks: 8.4k
leptonai/leptonai
A Pythonic framework to simplify AI service building
Language: Python · Stars: 2.7k · Forks: 173
dockur/windows
Windows inside a Docker container.
Language: Shell · Stars: 31.2k · Forks: 2.1k
AiuniAI/Unique3D
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Language: Python · Stars: 3.2k · Forks: 253
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python · Stars: 9.6k · Forks: 1.3k
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
Language: Python · Stars: 2.3k · Forks: 286
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language: Python · Stars: 27.7k · Forks: 3.2k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language: Jupyter Notebook · Stars: 13.9k · Forks: 1.1k
Zz-ww/SadTalker-Video-Lip-Sync
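This project implements Wav2lip video lip synthesis based on SadTalkers. It generates audio-driven lip shapes from a video file and applies configurable enhancement to the synthesized lip (face) region, improving the clarity of the generated lips. The DAIN deep-learning frame-interpolation algorithm then fills in frames of the generated video, adding motion transitions between synthesized lip frames so that the result looks smoother, more realistic, and more natural.
Language: Python · Stars: 1.9k · Forks: 329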
iperov/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
Language: Python · Stars: 16.7k · Forks: 101
sdbds/AnyDoor-for-windows
Official implementations for paper: Anydoor: zero-shot object-level image customization
Language:Python14616
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
Language: Python · Stars: 4.1k · Forks: 367
PeterL1n/RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Language: Python · Stars: 8.7k · Forks: 1.1k
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
Language: Python · Stars: 11.1k · Forks: 2.3k
ccmusic-database/ccmusic-database.github.io
This is a multi-functional music data sharing platform for academic research. It contains a range of music data, such as sound information for traditional Chinese musical instruments and labeling information for Chinese pop music, all available for free use by MIR researchers.
Language:HTML284
OpenTalker/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language: Python · Stars: 12.1k · Forks: 2.3k