Pinned Repositories
ASR-based-KWS
QbE Keyword Spotting System based on ASR
bark-voice-cloning
Personal customization of some bark-voice-cloning implementations
Diff-VC
Diffusion Model for Voice Conversion
Document-Scanner
Simple Document Scanner using Semantic Segmentation
handwritten-ocr
My personal implementation of SVTR model for handwritten OCR
KWS-BCResnet
Keyword Spotting using BCResNet and Arcface Loss
pytorch-ml-utils
Some utility functions / decorators / modules related to Pytorch to help speed up coding
Speaker-Verification-TDNN
tflite-yamnet-audio-classification
Yamnet model using tflite_model_maker with esc-50 dataset
yolo8-tracking-counting-speed_estimation
Tracking, counting and speed estimation using yolo8
trinhtuanvubk's Repositories
trinhtuanvubk/Diff-VC
Diffusion Model for Voice Conversion
trinhtuanvubk/handwritten-ocr
My personal implementation of SVTR model for handwritten OCR
trinhtuanvubk/yolo-ncnn-cpp
everything to infer yolo with ncnn and cpp
trinhtuanvubk/finetune-wav2vec2
trinhtuanvubk/Paraphrasing-Generation-T5
Training paraphasing using huggingface T5
trinhtuanvubk/VITS2-TTS
trinhtuanvubk/ConvNextV2-Classification
trinhtuanvubk/Document-Scanner
Simple Document Scanner using Semantic Segmentation
trinhtuanvubk/Hubert-Training
trinhtuanvubk/Image-Colorizer-Pix2Pix
trinhtuanvubk/MEDIAR
(NeurIPS 2022 CellSeg Challenge - 1st Winner) Open source code for "MEDIAR: Harmony of Data-Centric and Model-Centric for Multi-Modality Microscopy"
trinhtuanvubk/my-wedding-page
trinhtuanvubk/OneFormer3D
My personal customization of OneFormer3D
trinhtuanvubk/pytorch-ml-utils
Some utility functions / decorators / modules related to Pytorch to help speed up coding
trinhtuanvubk/TiktokAutoUploader
Automatically Edits Videos and Uploads to Tiktok with CLI, Requests not Selenium.
trinhtuanvubk/Speaker-Verification-TDNN
trinhtuanvubk/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
trinhtuanvubk/bytetrack-yolo-ncnn-cpp
trinhtuanvubk/Custom-SyncTalk
My personal cusomization of SyncTalk
trinhtuanvubk/Custom-Wav2Lip-GFPGan
trinhtuanvubk/microscope-seg
trinhtuanvubk/mlops-tutorials
trinhtuanvubk/musicgen
trinhtuanvubk/ocr-screen
trinhtuanvubk/Pointcept
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
trinhtuanvubk/QA-Triton-Pipeline
trinhtuanvubk/SAM3D
trinhtuanvubk/trinhtuanvubk.github.io
trinhtuanvubk/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
trinhtuanvubk/Wav2Vec2-Classification