Pinned Repositories
adversarial-deep-structural-networks
ISBI2018: Adversarial Deep Structural Networks for Mammographic Mass Segmentation https://arxiv.org/abs/1612.05970
AnatomyNet-for-anatomical-segmentation
AnatomyNet: Deep 3D Squeeze-and-excitation U-Nets for fast and fully automated whole-volume anatomical segmentation
AutoShot
AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023
deep-mil-for-whole-mammogram-classification
Zhu, Wentao, Qi Lou, Yeeleng Scott Vang, and Xiaohui Xie. "Deep Multi-instance Networks with Sparse Label Assignment for Whole Mammogram Classification." MICCAI 2017.
DeepEM-for-Weakly-Supervised-Detection
MICCAI18 DeepEM: Deep 3D ConvNets with EM for Weakly Supervised Pulmonary Nodule Detection
DeepLung
WACV18 paper "DeepLung: Deep 3D Dual Path Nets for Automated Pulmonary Nodule Detection and Classification"
Hierarchical-ELM-Network
IJCNN 2015 Hierarchical extreme learning machine for unsupervised representation learning
protein-cascade-cnn-lstm
Implementation of IJCAI15 cascade cnn and LSTM for protein secondary structure prediction
recurrent-attention-for-QA-SQUAD-based-on-keras
recurrent attention based on keras. Question Answering SQUAD dataset
speechnas
SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification
wentaozhu's Repositories
wentaozhu/DeepLung
WACV18 paper "DeepLung: Deep 3D Dual Path Nets for Automated Pulmonary Nodule Detection and Classification"
wentaozhu/AnatomyNet-for-anatomical-segmentation
AnatomyNet: Deep 3D Squeeze-and-excitation U-Nets for fast and fully automated whole-volume anatomical segmentation
wentaozhu/AutoShot
AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023
wentaozhu/adversarial-deep-structural-networks
ISBI2018: Adversarial Deep Structural Networks for Mammographic Mass Segmentation https://arxiv.org/abs/1612.05970
wentaozhu/speechnas
SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification
wentaozhu/leetcode-master
LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
wentaozhu/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
wentaozhu/asv-subtools
An Open Source Tools for Speaker Recognition
wentaozhu/ccf_2020_qa_match
ccf 2020 qa match competition top1
wentaozhu/CLIP
Contrastive Language-Image Pretraining
wentaozhu/D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
wentaozhu/DAT
Repository of Vision Transformer with Deformable Attention (CVPR2022)
wentaozhu/Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
wentaozhu/Det3D
World's first general purpose 3D object detection codebse.
wentaozhu/Few-shot-NAS
The official repo for Few-Shot Neural Architecture Search (ICML'21 long oral)
wentaozhu/flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
wentaozhu/GEBD
Generic Event Boundary Detection: A Benchmark for Event Segmentation
wentaozhu/machine-learning-systems-design
A booklet on machine learning systems design with exercises
wentaozhu/manning
Repository for the book Grokking Machine Learning, by Manning Editors
wentaozhu/mmt
Multi-Modal Transformer for Video Retrieval
wentaozhu/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
wentaozhu/Q-A-matching-of-real-estate-industry
wentaozhu/TuRBO
wentaozhu/ufom
wentaozhu/UniFormer
[ICLR2022] official implementation of UniFormer
wentaozhu/vision
Datasets, Transforms and Models specific to Computer Vision
wentaozhu/ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
wentaozhu/vit-pytorch-1
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
wentaozhu/voxceleb_trainer
In defence of metric learning for speaker recognition
wentaozhu/wav2tok
Codebase for ICLR' 23 paper- ''Wav2Tok: Deep Sequence Tokenizer for Audio Retrieval"