Frankxstt's Stars
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Stability-AI/generative-models
Generative Models by Stability AI
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
IDEA-Research/Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
eseckel/ai-for-grant-writing
A curated list of resources for using LLMs to develop more competitive grant applications.
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
vchoutas/smplx
SMPL-X
open-mmlab/mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
muralianand12345/chatbot-llamaparse
A simple API chatbot that uses LlamaIndex and LlamaParse to read custom PDF data.
zjc062/mind-vis
Code base for MinD-Vis
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
MedARC-AI/fMRI-reconstruction-NSD
fMRI-to-image reconstruction on the NSD dataset.
yhw-yhw/SHOW
This is the codebase for SHOW in Generating Holistic 3D Human Motion from Speech [CVPR2023].
yhw-yhw/TalkSHOW
This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].
caizhongang/SMPLer-X
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
shubham-goel/4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
yohanshin/WHAM
vchoutas/smplify-x
Expressive Body Capture: 3D Hands, Face, and Body from a Single Image
magic-research/PLLaVA
Official repository for the paper PLLaVA
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
seervideodiffusion/SeerVideoLDM
[ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models
YBYBZhang/ControlVideo
[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
guoyww/AnimateDiff
Official implementation of AnimateDiff.
maxin-cn/Latte
The official implementation of Latte: Latent Diffusion Transformer for Video Generation.
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
littlepure2333/MindBridge
[CVPR 2024 Highlight] Official PyTorch implementation of "MindBridge: A Cross-Subject Brain Decoding Framework"
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o.
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app in one click.