Yaxin9Luo's Stars
ysymyth/awesome-language-agents
List of language agents based on paper "Cognitive Architectures for Language Agents"
Guang000/Awesome-Dataset-Distillation
A curated list of awesome papers on dataset distillation and related applications.
antoyang/VidChapters
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
forhaoliu/language-quantized-autoencoders
Language Quantized AutoEncoders
zhuyiche/llava-phi
sebastianstarke/AI4Animation
Bringing Characters to Life with Computer Brains in Unity
pengsida/learning_research
本人的科研经验
hzwer/WritingAIPaper
Writing AI Conference Papers: A Handbook for Beginners
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
datawhalechina/leedl-tutorial
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
zhoubolei/bolei_awesome_posters
CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!
Hhhhhhao/Noisy-Model-Learning
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
harvard-edge/cs249r_book
Collaborative book Machine Learning Systems
OpenGVLab/De-focus-Attention-Networks
Learning 1D Causal Visual Representation with De-focus Attention Networks
LLaVA-VL/LLaVA-NeXT
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
garrettj403/SciencePlots
Matplotlib styles for scientific plotting
roboflow/notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
meta-llama/llama-models
Utilities intended for use with Llama models.
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
bytedance/1d-tokenizer
This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation
zh460045050/V2L-Tokenizer
karpathy/LLM101n
LLM101n: Let's build a Storyteller
ziqipang/LM4VisualEncoding
[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.