klauscc's Stars
aleju/imgaug
Image augmentation for machine learning experiments.
fizyr/keras-retinanet
Keras implementation of RetinaNet object detection.
jcjohnson/fast-neural-style
Feedforward style transfer
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
LLaVA-VL/LLaVA-NeXT
danielegrattarola/spektral
Graph Neural Networks with Keras and TensorFlow 2.
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
SHI-Labs/Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
3rd/image.nvim
🖼️ Bringing images to Neovim.
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
Qualeams/Android-Face-Recognition-with-Deep-Learning-Test-Framework
A face recognition framework for Android devices that can be used to test different face recognition methods.
taokong/RON
RON: Reverse Connection with Objectness Prior Networks for Object Detection, CVPR 2017
OpenGVLab/unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
GT-RIPL/CODA-Prompt
PyTorch code for the CVPR'23 paper: "CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning"
jayleicn/singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
imagegridworth/IG-VLM
KenN7/vim-arsync
vim plugin for async synchronisation of remote files and local files using rsync
klauscc/VindLU
CeeZh/LLoVi
Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"
Ziyang412/VideoTree
Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"
Ziyang412/UCoFiA
PyTorch code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
Media-Smart/vedatad
A single stage temporal action detection toolbox based on PyTorch
klauscc/TALLFormer
ylsung/vl-merging
PyTorch code for the paper "An Empirical Study of Multimodal Model Merging"
klauscc/lipnet-replication
A replication of Google DeepMind's paper "LipNet: End-to-End Sentence-level Lipreading"
SJTUwxz/LoCoNet_ASD
Code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection
amazon-science/stochastic-backpropagation
klauscc/DAM
Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning
suzhoushr/kaldi-shr
I have modified Kaldi to add a CTC loss function, a cosine loss function, and others
SourceLoo/testgit
My first online repository