Frankxstt

Frankxstt's Stars

hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python21.8k2.1k
Stability-AI/generative-models
Generative Models by Stability AI
Language:Python24.3k2.7k
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python6.4k662
IDEA-Research/Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Language:Jupyter Notebook82565
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language:Python10.2k1k
eseckel/ai-for-grant-writing
A curated list of resources for using LLMs to develop more competitive grant applications.
Language:Python2k259
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
Language:Python1.3k181
vchoutas/smplx
SMPL-X
Language:Python1.8k306
open-mmlab/mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
Language:Python5.7k1.2k
muralianand12345/chatbot-llamaparse
A simple API chatbot that uses LlamaIndex and LlamaParse to read custom PDF data.
Language:Python3
zjc062/mind-vis
Code base for MinD-Vis
Language:Python74391
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python141k26.6k
MedARC-AI/fMRI-reconstruction-NSD
fMRI-to-image reconstruction on the NSD dataset.
Language:Jupyter Notebook29439
yhw-yhw/SHOW
This is the codebase for SHOW in Generating Holistic 3D Human Motion from Speech [CVPR2023],
Language:Python21326
yhw-yhw/TalkSHOW
This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].
Language:Python29627
caizhongang/SMPLer-X
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
Language:Python97769
shubham-goel/4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
Language:Python1.2k115
yohanshin/WHAM
Language:Python66272
vchoutas/smplify-x
Expressive Body Capture: 3D Hands, Face, and Body from a Single Image
Language:Python1.7k336
magic-research/PLLaVA
Official repository for the paper PLLaVA
Language:Python56937
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Language:Python3k248
seervideodiffusion/SeerVideoLDM
[ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models
Language:Python173
YBYBZhang/ControlVideo
[ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
Language:Python76456
guoyww/AnimateDiff
Official implementation of AnimateDiff.
Language:Python10.4k849
maxin-cn/Latte
The official implementation of Latte: Latent Diffusion Transformer for Video Generation.
Language:Python322
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python6.1k539
littlepure2333/MindBridge
[CVPR 2024 Highlight] Official PyTorch implementation of "MindBridge: A Cross-Subject Brain Decoding Framework"
Language:Python675
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
Language:Python6.3k441
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python5.6k440
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
Language:TypeScript75.4k58.9k