julysun98's Stars
DmitryRyumin/CVPR-2023-24-Papers
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!
Yidadaa/Pytorch-Video-Classification
Video classification on UCF101 using a CNN and an RNN, built on the PyTorch framework.
Charmve/computer-vision-in-action
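A closed-loop computer vision learning platform where code can be run interactively online. Closed-loop learning with the Chinese e-book "Computer Vision in Action: Algorithms and Applications", source code, and a reader community (continuously updated ...) 📘 Online e-book: https://charmve.github.io/computer-vision-in-action/ 👇 Project homepage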
xaggi/claws_eccv
Project page for the 'CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection', ECCV 2020 paper.
kenshohara/3D-ResNets-PyTorch
3D ResNets for Action Recognition (CVPR 2018)
KunyuLin/XOV-Action
The first work for cross-domain open-vocabulary action recognition with a benchmark
Jingkang50/OpenOOD
Benchmarking Generalized Out-of-Distribution Detection
lvchuandong/Awesome-Multi-Camera-3D-Occupancy-Prediction
Awesome papers and code about Multi-Camera 3D Occupancy Prediction, such as TPVFormer, SurroundOcc, PanoOcc, OccFormer, FB-OCC, SelfOcc, COTR. In this repository, you will see the latest 3D occupancy prediction papers and code.
ttengwang/Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
extreme-assistant/CVPR2024-Paper-Code-Interpretation
cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 collection of papers, code, interpretations, and livestreams, compiled by the 极市 team
lvchuandong/ML3DOP
ML3DOP: A Multi-Camera and LiDAR Dataset for 3D Occupancy Perception
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
ialhashim/DenseDepth
High Quality Monocular Depth Estimation via Transfer Learning
yuguangnudt/VEC_VAD
Cloze Test Helps: Effective Video Anomaly Detection via Learning to Complete Video Events. Oral paper in ACM Multimedia 2020.
vt-le/astnet
This is an official implementation for "Attention-based Residual Autoencoder for Video Anomaly Detection".
shafu0x/vehicle-speed-estimation
Vehicle Speed Estimation from Video using Deep Learning and Optical Flow in PyTorch.
cvlab-yonsei/MNAD
An official implementation of "Learning Memory-guided Normality for Anomaly Detection" (CVPR 2020) in PyTorch.
ZhenboSong/mono_velocity
A PyTorch implementation of the ICRA 2020 paper 'End-to-end Learning for Inter-Vehicle Distance and Relative Velocity Estimation in ADAS with a Monocular Camera'.
hojonathanho/diffusion
Denoising Diffusion Probabilistic Models
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
CompVis/stable-diffusion
A latent text-to-image diffusion model
IsaacGuan/3D-VAE
A variational autoencoder for volumetric shape generation
doublechenching/brats_segmentation-pytorch
3D U-Net + VAE, reproducing the BraTS 2018 winning solution
Mathux/ACTOR
Official Pytorch implementation of the paper "Action-Conditioned 3D Human Motion Synthesis with Transformer VAE", ICCV 2021
wolny/pytorch-3dunet
3D U-Net model for volumetric semantic segmentation written in pytorch
FingerRec/3DNet_Visualization
PyTorch 3DNet attention feature map visualization by [CAM](https://arxiv.org/abs/1512.04150); C3D, R3D, I3D, and MF Net are supported now!
zhuozhuoweiwei/3D-CNN-based-on-attention-mechanism
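This project uses an attention-based convolutional neural network to classify Alzheimer's disease. 3D convolutions extract features from the images, and an attention mechanism added to the convolutions focuses on the diseased regions in the brain images, improving the experimental accuracy of the classification model.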