Yinhance

Yinhance's Stars

TencentARC/NVComposer
Boosting Generative Novel View Synthesis with Sparse and Unposed Images
Language:Python441
modelscope/evalscope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
Language:Python30436
geoaigroup/awesome-vision-language-models-for-earth-observation
A curated list of awesome vision and language resources for earth observation.
20116
dvlab-research/LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
Language:Python74745
cambridgeltl/visual-spatial-reasoning
[TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.
Language:Python1078
TT2TER/autodl_proxy
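A simple, non-automated guide to deploying your own proxy on AutoDL to help download Hugging Face models (given that the official academic acceleration and hfmirror work poorly)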
4
mseitzer/pytorch-fid
Compute FID scores with PyTorch.
Language:Python 3.4k 515
pixegami/claude-3.5-api-tutorial
Simple tutorial project using the Claude 3.5 Sonnet API, showing three simple use-cases.
Language:Python229
bytedance/1d-tokenizer
This repo contains the code for 1D tokenizer and generator
Language:Jupyter Notebook59129
dome272/VQGAN-pytorch
Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)
Language:Python46880
aceliuchanghong/FAQ_Of_LLM_Interview
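Interview questions for large-model algorithm roles (with answers): common questions and concept explanations. "large-model interview questions", "algorithm role interviews", "common interview questions", "large-model algorithm interviews", "large-model application fundamentals"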
Language:Jupyter Notebook32118
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
Language:Python 6.6k 461
allenai/objaverse-xl
🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!
Language:Python79649
wtliao/text2image
Text to Image Generation with Semantic-Spatial Aware GAN
Language:Python18034
bioinf-jku/TTUR
Two time-scale update rule for training GANs
Language:Jupyter Notebook862173
google/prompt-to-prompt
Language:Jupyter Notebook 3.2k 300
ElesionKyrie/Extreme-Video-Compression-With-Prediction-Using-Pre-trainded-Diffusion-Models-
Language:Python1347
mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
Language:Python 1.2k 151
jianzhnie/awesome-text-to-video
A Survey on Text-to-Video Generation/Synthesis.
64283
JiaojiaoYe1994/Awesome-DIffusionModels-paper
A curated list of papers on the topic of Diffusion Models for Multi-Modal
232
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Language:Python 3.1k 220
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python 20.7k 2.3k
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
3.6k 205
TonyLianLong/LLM-groundedVideoDiffusion
[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper
Language:Python1327
Picsart-AI-Research/Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
Language:Python 4.1k 353
lucidrains/video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
Language:Python 1.3k 129
ExponentialML/Video-BLIP2-Preprocessor
A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it
Language:Python13318
qiuyu96/CoDeF
[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Language:Python 4.8k 385
poplpr/EXMODD
Language:Jupyter Notebook2
semcomm/SwinJSCC
Language:Python444
