Yinhance's Stars
TencentARC/NVComposer
Boosting Generative Novel View Synthesis with Sparse and Unposed Images
modelscope/evalscope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
geoaigroup/awesome-vision-language-models-for-earth-observation
A curated list of awesome vision and language resources for earth observation.
dvlab-research/LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
cambridgeltl/visual-spatial-reasoning
[TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.
TT2TER/autodl_proxy
一个简易的不自动化的autodl部署自己的代理的指南,帮助下载huggingface的模型(鉴于官方学术加速以及hfmirror很不好用)
mseitzer/pytorch-fid
Compute FID scores with PyTorch.
pixegami/claude-3.5-api-tutorial
Simple tutorial project using the Claude 3.5 Sonnet API, showing three simple use-cases.
bytedance/1d-tokenizer
This repo contains the code for 1D tokenizer and generator
dome272/VQGAN-pytorch
Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)
aceliuchanghong/FAQ_Of_LLM_Interview
大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
allenai/objaverse-xl
🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!
wtliao/text2image
Text to Image Generation with Semantic-Spatial Aware GAN
bioinf-jku/TTUR
Two time-scale update rule for training GANs
google/prompt-to-prompt
ElesionKyrie/Extreme-Video-Compression-With-Prediction-Using-Pre-trainded-Diffusion-Models-
mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
jianzhnie/awesome-text-to-video
A Survey on Text-to-Video Generation/Synthesis.
JiaojiaoYe1994/Awesome-DIffusionModels-paper
A curasted list of papers with the topic of Diffusion Models for Multi-Modal
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
TonyLianLong/LLM-groundedVideoDiffusion
[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper
Picsart-AI-Research/Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
lucidrains/video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
ExponentialML/Video-BLIP2-Preprocessor
A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it
qiuyu96/CoDeF
[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
poplpr/EXMODD
semcomm/SwinJSCC