Zhengxl25's Stars
MikeCreken/lanlanInterview
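This repository will contain basic introductions to the major banks and the characteristics of their written tests and interviews. Find this treasure trove and you are not far from landing the job, hmph.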
stoneMo/EZ-VSL
Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)
Snailclimb/JavaGuide
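"Java Learning + Interview Guide": covers the core knowledge that most Java programmers need to master. Preparing for a Java interview? JavaGuide is the first choice!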
open-mmlab/mmcv
OpenMMLab Computer Vision Foundation
Epiphqny/SOLOv2
SOLOv2: Dynamic, Faster and Stronger, achieves 39.5 mAP on COCO test-dev (36 epochs result)
WXinlong/SOLO
SOLO and SOLOv2 for instance segmentation, ECCV 2020 & NeurIPS 2020.
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
shvdiwnkozbw/Multi-Source-Sound-Localization
This repo aims to perform sound localization in complex audiovisual scenes, where there are multiple objects making sounds.
CyC2018/CS-Notes
:books: Essential fundamentals for technical interviews: Leetcode, operating systems, computer networks, system design
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
jasongief/PSP_CVPR_2021
[2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line
FloretCat/CMRAN
Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization, ACM MM 2020
flrngel/TCN-with-attention
Character based Temporal Convolutional Networks + Attention Layer
locuslab/TCN
Sequence modeling benchmarks and temporal convolutional networks
harritaylor/torchvggish
PyTorch port of Google Research's VGGish model used for extracting audio features.
migra1315/ToolBox-ReadMe
YapengTian/AVE-ECCV18
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
migra1315/Medical-Image-Registration-ToolBox
jfzhang95/pytorch-video-recognition
PyTorch implementations of C3D, R3D, and R2Plus1D models for video activity recognition.
facebookarchive/C3D
C3D is a modified version of BVLC caffe to support 3D ConvNets.
andrewowens/multisensory
Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
dexin-wang/slambook
dexin-wang/bullet3
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
dexin-wang/flask_communication_py
dexin-wang/panda_grasp_sim_2
dexin-wang/dynamic-detection
hche11/VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
liyidi/soundnet_localize_sound_source
soundnet and localize sound source
ardasnck/learning_to_localize_sound_source
Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes
hche11/Localizing-Visual-Sounds-the-Hard-Way
Localizing Visual Sounds the Hard Way