Lukikay's Stars
RVC-Boss/GPT-SoVITS
Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
facefusion/facefusion
Industry-leading face manipulation platform
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
HqWu-HITCS/Awesome-Chinese-LLM
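A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, covering base models, domain-specific fine-tunes and applications, datasets, and tutorials.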
Ciphey/Ciphey
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
netease-youdao/QAnything
Question and Answer based on Anything.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
betaflight/betaflight
Open Source Flight Controller Firmware
InternLM/InternLM
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
ExpressLRS/ExpressLRS
ESP32/ESP8285-based High-Performance Radio Link for RC applications
esbatmop/MNBVC
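MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, targeting the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even Martian script (火星文). It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing stories, chat logs, and more.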
praydog/UEVR
Universal Unreal Engine VR Mod (4.8 - 5.4)
KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
betaflight/betaflight-configurator
Cross platform configuration tool for the Betaflight firmware
Tony-Tan/CUDA_Freshman
lyhue1991/torchkeras
Pytorch❤️ Keras 😋😋
3DTopia/LGM
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
Duxiaoman-DI/XuanYuan
XuanYuan (轩辕): Du Xiaoman's Chinese financial conversational large language model
THUDM/LongBench
LongBench v2 and LongBench (ACL 2024)
Yuan-ManX/ai-game-devtools
Here we will keep track of the latest AI Game Development Tools, including LLM, Agent, Code, Writer, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥
3DiVi/nuitrack-sdk
Nuitrack™ is a 3D tracking middleware developed by 3DiVi Inc.
LAION-AI/laion-datasets
Descriptions of and pointers to LAION datasets
creativeIKEP/HolisticMotionCapture
HolisticMotionCapture is an application and package that can capture the motion of a person with only a monocular color camera and move the VRM avatar's pose, face, and hands.
PeterH0323/ancient-chat-llm
ancient-chat-llm (古语说): an LLM proficient in Chinese culture