Pinned Repositories
cesium
An open-source JavaScript library for world-class 3D globes and maps :earth_americas:
CLIP
Contrastive Language-Image Pretraining
colmap
COLMAP - Structure-from-Motion and Multi-View Stereo
Coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
CUDA_Freshman
epa-gop-pykaldi
EpaDB
EpaDB: a database of non native English speech by Spanish speakers from Argentina intended for
Getting-Things-Done-with-Pytorch
Jupyter Notebook tutorials on solving real-world problems with Machine Learning & Deep Learning using PyTorch. Topics: Face detection with Detectron 2, Time Series anomaly detection with LSTM Autoencoders, Object Detection with YOLO v5, Build your first Neural Network, Time Series forecasting for Coronavirus daily cases, Sentiment Analysis with BERT.
gopt
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
donggangj's Repositories
donggangj/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
donggangj/PrivacyPilot
A simple and easy-to-use local LLM plugin that helps you efficiently handle local private data.
donggangj/py3dtilers
Tilers accepting various input formats (OBJ, 3DCity databases, GeoJson, IFC) and producing 3DTiles tilesets.
donggangj/CLIP
Contrastive Language-Image Pretraining
donggangj/cesium
An open-source JavaScript library for world-class 3D globes and maps :earth_americas:
donggangj/WebODM
User-friendly, commercial-grade software for processing aerial imagery. 🛩
donggangj/openvino
OpenVINO
donggangj/lang-seg
Language-Driven Semantic Segmentation
donggangj/ZoeDepth
Metric depth estimation from a single image
donggangj/colmap
COLMAP - Structure-from-Motion and Multi-View Stereo
donggangj/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
donggangj/Coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
donggangj/NeuS
Code release for NeuS
donggangj/llvm
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
donggangj/VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
donggangj/epa-gop-pykaldi
donggangj/speech-synthesis-paper
List of speech synthesis papers.
donggangj/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
donggangj/gopt
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
donggangj/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
donggangj/VRT
VRT: A Video Restoration Transformer (official repository)
donggangj/TimeSformer
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
donggangj/mtcnn
MTCNN face detection implementation for TensorFlow, as a PIP package.
donggangj/EpaDB
EpaDB: a database of non native English speech by Spanish speakers from Argentina intended for
donggangj/Getting-Things-Done-with-Pytorch
Jupyter Notebook tutorials on solving real-world problems with Machine Learning & Deep Learning using PyTorch. Topics: Face detection with Detectron 2, Time Series anomaly detection with LSTM Autoencoders, Object Detection with YOLO v5, Build your first Neural Network, Time Series forecasting for Coronavirus daily cases, Sentiment Analysis with BERT.
donggangj/CUDA_Freshman