klauscc

klauscc's Stars

aleju/imgaug
Image augmentation for machine learning experiments.
Language: Python 14.4k 229 5172.4k
fizyr/keras-retinanet
Keras implementation of RetinaNet object detection.
Language: Python 4.4k 123 1.3k 2k
jcjohnson/fast-neural-style
Feedforward style transfer
Language: Lua 4.3k 130 169816
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
3.5k 139 19201
LLaVA-VL/LLaVA-NeXT
Language: Python 2.9k 34 302250
danielegrattarola/spektral
Graph Neural Networks with Keras and Tensorflow 2.
Language: Python 2.4k 45 278334
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language: Python 1.8k 21 69115
SHI-Labs/Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
Language: Python 1.3k 28 3484
3rd/image.nvim
🖼️ Bringing images to Neovim.
Language: Lua 1.1k 11 14450
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
972 25 275
Qualeams/Android-Face-Recognition-with-Deep-Learning-Test-Framework
Face recognition framework for Android devices that can be used to test different face recognition methods.
Language: Java 361 29 56160
taokong/RON
RON: Reverse Connection with Objectness Prior Networks for Object Detection, CVPR 2017
Language: Python 355 28 33134
OpenGVLab/unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Language: Python 295 12 4816
GT-RIPL/CODA-Prompt
PyTorch code for the CVPR'23 paper: "CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning"
Language: Python 131 7 1711
jayleicn/singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
Language: Python 130 2 3014
imagegridworth/IG-VLM
Language: Python 120 4 85
KenN7/vim-arsync
Vim plugin for asynchronous synchronisation of remote and local files using rsync
Language: Vim Script 105 3 1323
klauscc/VindLU
Language: Python 101 4 1111
CeeZh/LLoVi
Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"
Language: Python 83 6 64
Ziyang412/VideoTree
Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"
Language: Python 81 2 93
Ziyang412/UCoFiA
PyTorch code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
Language: Python 61 3 100
Media-Smart/vedatad
A single stage temporal action detection toolbox based on PyTorch
Language: Python 53 4 1814
klauscc/TALLFormer
Language: Python 50 3 133
ylsung/vl-merging
PyTorch code for the paper "An Empirical Study of Multimodal Model Merging"
Language: Python 36 2 20
klauscc/lipnet-replication
A replication of Google DeepMind's paper, LipNet: End-to-End Sentence-level Lipreading
Language: Python 27 7 510
SJTUwxz/LoCoNet_ASD
Code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection
Language: Python 21 1 34
amazon-science/stochastic-backpropagation
17 7 13
klauscc/DAM
Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning
Language: Python 9 1 01
suzhoushr/kaldi-shr
A modified version of Kaldi that adds a CTC loss function, a cosine loss function, and others
Language: C++ 9 3 02
SourceLoo/testgit
My first online repository
1 2 10