zhangxd123's Stars
home-assistant/home-assistant.io
:blue_book: Home Assistant User documentation
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
KwaiVGI/LivePortrait
Bring portraits to life!
Mintplex-Labs/anything-llm
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
openai/openai-python
The official Python library for the OpenAI API
SocialAI-tianji/Tianji
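Building large language models that understand social etiquette and human relationships | A full-workflow tutorial covering prompt engineering, RAG, agents, and fine-tuning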
logan-zou/Chat_with_Datawhale_langchain
datawhalechina/so-large-lm
Foundations of Large Models: an overview of large-model fundamentals in a single article
datawhalechina/llm-universe
This project is a tutorial on large-model application development for beginner developers. Read it online at: https://datawhalechina.github.io/llm-universe/
datawhalechina/self-llm
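"Open-Source LLM Usage Guide": a tutorial tailored for ** newcomers on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment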
datawhalechina/llm-cookbook
An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large-model course series
antgroup/echomimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
leptonai/leptonai
A Pythonic framework to simplify AI service building
dockur/windows
Windows inside a Docker container.
AiuniAI/Unique3D
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
meta-llama/llama3
The official Meta Llama 3 GitHub site
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Zz-ww/SadTalker-Video-Lip-Sync
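This project implements Wav2Lip-style video lip synthesis based on SadTalker. It generates speech-driven lip shapes from a video file and applies a configurable enhancement to the facial region to sharpen the synthesized lip (face) area and improve the clarity of the generated lips. It uses DAIN, a deep-learning frame-interpolation algorithm, to interpolate frames in the generated video, filling in the motion transitions of the synthesized lips between frames so that they look smoother, more realistic, and more natural.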
iperov/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
sdbds/AnyDoor-for-windows
Official implementations for paper: Anydoor: zero-shot object-level image customization
ali-vilab/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
PeterL1n/RobustVideoMatting
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs
ccmusic-database/ccmusic-database.github.io
This is a multi-functional music data sharing platform for academic research. It contains a range of music data, such as the sound information of traditional Chinese musical instruments and the labeling information for Chinese pop music, all of which is freely available to MIR researchers.
OpenTalker/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation