TiaRao's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
ultralytics/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
ZLMediaKit/ZLMediaKit
WebRTC/RTSP/RTMP/HTTP/HLS/HTTP-FLV/WebSocket-FLV/HTTP-TS/HTTP-fMP4/WebSocket-TS/WebSocket-fMP4/GB28181/SRT server and client framework based on C++11
XiangLinPro/IT_book
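This project collects over a thousand good, commonly used books that I have read or heard about over the years; the book you are looking for may well be here. It covers most books in the internet industry, along with interview experience and questions. It includes an artificial intelligence series (common deep learning frameworks TensorFlow, pytorch, keras; NLP, machine learning, deep learning, and so on), a big data series (Spark, Hadoop, Scala, kafka, etc.), and a programmer essentials series (C, C++, java, data structures, linux, design patterns, databases, and so on).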
open-mmlab/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
jason718/awesome-self-supervised-learning
A curated list of awesome self-supervised methods
HRNet/HRNet-Semantic-Segmentation
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
junfu1115/DANet
Dual Attention Network for Scene Segmentation (CVPR2019)
mcordts/cityscapesScripts
README and scripts for the Cityscapes Dataset
lucidrains/reformer-pytorch
Reformer, the efficient Transformer, in Pytorch
NVIDIA/semantic-segmentation
Nvidia Semantic Segmentation monorepo
Smerity/sha-rnn
Single Headed Attention RNN - "Stop thinking with your head"
openseg-group/OCNet.pytorch
Please choose the openseg.pytorch project for the updated code that achieves SOTA on 6 benchmarks!
ShiftMediaProject/FFmpeg
Unofficial FFmpeg with added custom native Visual Studio project build tools. FFmpeg: A complete, cross-platform solution to record, convert and stream audio and video.
kaituoxu/Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
JDAI-CV/centerX
This repo is implemented based on detectron2 and centernet
hangzhaomit/Sound-of-Pixels
Codebase for ECCV18 "The Sound of Pixels"
lucidrains/routing-transformer
Fully featured implementation of Routing Transformer
kaiyuyue/cgnl-network.pytorch
Compact Generalized Non-local Network (NIPS 2018)
nryant/dscore
Diarization scoring tools.
AnyiRao/SceneSeg
Codebase for CVPR2020 "A Local-to-Global Approach to Multi-modal Movie Scene Segmentation"
YapengTian/AVE-ECCV18
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
wenguanwang/DHF1K
Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)
georgesterpu/avsr-tf1
Audio-Visual Speech Recognition using Sequence to Sequence Models
goldbattle/pytorch_unet
PyTorch U-Net on Cityscapes Dataset
atsiami/STAViS
Spatio-Temporal AudioVisual Saliency Network
qingzwang/AudioVisualCrowdCounting