Pinned Repositories
accessmath-icfhr2018
Lecture Video Summarization by Extracting Handwritten Content from Whiteboards
active-speakers-context
Code for the Active Speakers in Context Paper (CVPR2020)
DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
deeplearning.ai-courses
detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
DocEnTR
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
FOTS.PyTorch
FOTS PyTorch implementation
HAT
arXiv 2022 - Activating More Pixels in Image Super-Resolution Transformer
Light-ASD
The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)
rosebbb's Repositories
rosebbb/accessmath-icfhr2018
Lecture Video Summarization by Extracting Handwritten Content from Whiteboards
rosebbb/active-speakers-context
Code for the Active Speakers in Context Paper (CVPR2020)
rosebbb/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
rosebbb/deeplearning.ai-courses
rosebbb/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
rosebbb/DocEnTR
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
rosebbb/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
rosebbb/FOTS.PyTorch
FOTS PyTorch implementation
rosebbb/HAT
arXiv 2022 - Activating More Pixels in Image Super-Resolution Transformer
rosebbb/Light-ASD
The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)
rosebbb/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
rosebbb/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
rosebbb/PAN.pytorch
An unofficial PyTorch implementation of PAN (PSENet2): Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network
rosebbb/pan_pp.pytorch
Official implementations of PSENet, PAN and PAN++.
rosebbb/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
rosebbb/Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
rosebbb/robin
RObust document image BINarization
rosebbb/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
rosebbb/SPELL
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)
rosebbb/ssd.pytorch
A PyTorch Implementation of Single Shot MultiBox Detector
rosebbb/TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
rosebbb/temp
rosebbb/test
rosebbb/TextFuseNet
A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".
rosebbb/Transfer-Learning-in-keras---custom-data
Implementing Transfer Learning for custom data using VGG-16 and Resnet-50
rosebbb/VGG16_feature_computation
C++ class to get the output of a pre-trained VGG16 network
rosebbb/voxceleb_trainer
In defence of metric learning for speaker recognition
rosebbb/yolov7-face
YOLOv7 face detection with landmarks