whq2017's Stars
karpathy/LLM101n
LLM101n: Let's build a Storyteller
triton-lang/triton
Development repository for the Triton language and compiler
state-spaces/mamba
Mamba SSM architecture
marimo-team/marimo
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
Lightning-Universe/lightning-bolts
Toolbox of models, callbacks, and datasets for AI/ML researchers.
rmokady/CLIP_prefix_caption
Simple image captioning model
lucidrains/flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
tylin/coco-caption
kelvinxu/arctic-captions
Linfeng-Tang/Image-Fusion
Deep Learning-based Image Fusion: A Survey
cvdfoundation/kinetics-dataset
SunzeY/AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
AlonzoLeeeooo/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
Lightning-AI/dl-fundamentals
Deep Learning Fundamentals -- Code material and exercises
showlab/UniVTG
[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
salaniz/pycocoevalcap
Python 3 support for the MS COCO caption evaluation tools
jiyanggao/TALL
TALL: Temporal Activity Localization via Language Query
IndexFziQ/Diffusion4NLP-Papers
A paper list about diffusion models for natural language processing.
Linfeng-Tang/MSRS
MSRS: Multi-Spectral Road Scenarios for Practical Infrared and Visible Image Fusion
isekai-portal/Link-Context-Learning
nengwp/Lion-vs-Adam
Lion and Adam optimization comparison
shivanichander/tSNE
Visualising High Dimensional Data using tSNE
klb2/review-response-template
LaTeX template for the response to reviewer comments (scientific journal publications)
Alokia/Idempotent-Generative-Network
Idempotent Generative Network's unofficial pytorch implementation
wangyuchi369/LaDiC
[NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?
escorciav/video-utils
utilities to deal with videos ...
whq2024/MFF-GP