julysun98

julysun98's Stars

DmitryRyumin/CVPR-2023-24-Papers
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!
Language:Python33722
Yidadaa/Pytorch-Video-Classification
Make video classification on UCF101 using CNN and RNN based on Pytorch framework.
Language:Python5929
Charmve/computer-vision-in-action
A computer vision closed-loop learning platform where code can be run interactively online. 学习闭环《计算机视觉实战演练：算法与应用》中文电子书、源码、读者交流社区（持续更新中 ...） 📘 在线电子书 https://charmve.github.io/computer-vision-in-action/ 👇项目主页
Language:Jupyter Notebook2.5k369
xaggi/claws_eccv
Project page for the 'CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection', ECCV 2020 paper.
114
kenshohara/3D-ResNets-PyTorch
3D ResNets for Action Recognition (CVPR 2018)
Language:Python3.8k931
KunyuLin/XOV-Action
The first work for cross-domain open-vocabulary action recognition with a benchmark
15
Jingkang50/OpenOOD
Benchmarking Generalized Out-of-Distribution Detection
Language:Python79297
lvchuandong/Awesome-Multi-Camera-3D-Occupancy-Prediction
Awesome papers and code about Multi-Camera 3D Occupancy Prediction, such as TPVFormer, SurroundOcc, PanoOcc, OccFormer, FB-OCC, SelfOcc, COTR. In this repository, you will see the latest 3D occupancy prediction papers and code.
543
ttengwang/Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
1114
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Language:Python2.9k232
extreme-assistant/CVPR2024-Paper-Code-Interpretation
cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集，极市团队整理
12.3k2.3k
lvchuandong/ML3DOP
ML3DOP: A Multi-Camera and LiDAR Dataset for 3D Occupancy Perception
Language:Python23
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python129k25.5k
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook9.2k909
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language:Jupyter Notebook4.5k590
ialhashim/DenseDepth
High Quality Monocular Depth Estimation via Transfer Learning
Language:Jupyter Notebook1.6k357
yuguangnudt/VEC_VAD
Cloze Test Helps: Effective Video Anomaly Detection via Learning to Complete Video Events. Oral paper in ACM Multimedia 2020.
Language:Python9519
vt-le/astnet
This is an official implementation for "Attention-based Residual Autoencoder for Video Anomaly Detection".
Language:Python9614
shafu0x/vehicle-speed-estimation
Vehicle Speed Estimation from Video using Deep Learning and Optical Flow in PyTorch.
Language:Jupyter Notebook10932
cvlab-yonsei/MNAD
An official implementation of "Learning Memory-guided Normality for Anomaly Detection" (CVPR 2020) in PyTorch.
Language:Python32982
ZhenboSong/mono_velocity
A PyTorch implementation of the ICRA 2020 paper 'End-to-end Learning for Inter-Vehicle Distance and Relative Velocity Estimation in ADAS with a Monocular Camera'.
Language:Python4916
hojonathanho/diffusion
Denoising Diffusion Probabilistic Models
Language:Python3.4k345
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language:Python23.9k4.9k
CompVis/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook66.5k10k
IsaacGuan/3D-VAE
A variational autoencoder for volumetric shape generation
Language:Python3310
doublechenching/brats_segmentation-pytorch
3d unet + vae, repoduce brats2018 winner solution
Language:Python11326
Mathux/ACTOR
Official Pytorch implementation of the paper "Action-Conditioned 3D Human Motion Synthesis with Transformer VAE", ICCV 2021
Language:Python36751
wolny/pytorch-3dunet
3D U-Net model for volumetric semantic segmentation written in pytorch
Language:Jupyter Notebook1.9k483
FingerRec/3DNet_Visualization
Pytorch 3DNet attention feature map Visualization by [Cam](https://arxiv.org/abs/1512.04150); C3D, R3D, I3D, MF Net is support now!
Language:Python6317
zhuozhuoweiwei/3D-CNN-based-on-attention-mechanism
本文采用基于注意力机制的卷积神经神经网络模型来实现对阿尔兹海默症疾病的分类。采用3D卷积对图像进行特征获取，通过在卷积中添加注意力机制，重点关注疾病脑图像中的患病区域，从而提高分类模型的实验精度。
Language:Python264