bigchou
@ntu_aiailabRoom 542, CSIE Building, National Taiwan University No. 1, Sec. 4, Roosevelt Road, Da’an Dist.
bigchou's Stars
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
juanmc2005/diart
A python package to build AI-powered real-time audio applications
egruttadauria98/SSpaVAlDo
juanmc2005/rttm-viewer
Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
speechio/chinese_text_normalization
Chinese text normalization for speech processing
marcovwu/knowledge-distillation
sekilab/VehicleOrientationDataset
The vehicle orientation dataset is a large-scale dataset containing more than one million annotations for vehicle detection with simultaneous orientation classification using a standard object detection network.
CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
lucidrains/lion-pytorch
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
oshadajay/CeyMo
CeyMo: See More on Roads - A Novel Benchmark Dataset for Road Marking Detection (IEEE/CVF WACV 2022)
achen353/Taiwanese-Traffic-Object-Detection
Training and fine-tuning YOLOv4 Tiny on custom object detection dataset for Taiwanese traffic
2gunsu/monocon-pytorch
Unofficial Pytorch Implementation for MonoCon(AAAI, 2022)
Rock-100/MonoDet
[ICCV21 & WACV23] Monocular 3D Object Detection for Automonous Driving
Tai-Wang/Depth-from-Motion
[ECCV 2022 oral] Monocular 3D Object Detection with Depth from Motion
hisfog/SfMNeXt-Impl
[AAAI 2024] Official implementation of "SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation", and more.
fatemehkarimii/LightDepth
LightDepth: A Resource Efficient Depth Estimation Approach for Dealing with Ground Truth Sparsity via Curriculum Learning
CEWu/PTNL
[ICCV 2023] Official repository of paper titled "Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?"
utiasSTARS/liegroups
Python implementation of SO2, SE2, SO3, and SE3 matrix Lie groups using numpy or pytorch
aliyun/dro-sfm
NaiboWang/EasySpider
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
LeonLok/Multi-Camera-Live-Object-Tracking
Multi-camera live traffic and object counting with YOLO v4, Deep SORT, and Flask.
isl-org/ZoeDepth
Metric depth estimation from a single image
Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
TemugeB/python_stereo_camera_calibrate
Stereo camera calibration with python and openCV