k-washi's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
modularml/mojo
The Mojo Programming Language
DioxusLabs/dioxus
Fullstack GUI library for web, desktop, mobile, and more.
LukeMathWalker/zero-to-production
Code for "Zero To Production In Rust", a book on API development using Rust.
deepchecks/deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
HKUST-Aerial-Robotics/VINS-Fusion
An optimization-based multi-sensor state estimator
Rikorose/DeepFilterNet
Noise supression using deep filtering
ZrrSkywalker/Personalize-SAM
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
fwilliams/point-cloud-utils
An easy-to-use Python library for processing and manipulating 3D point clouds and meshes.
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Ki6an/fastT5
⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
CAIC-AD/YOLOPv2
YOLOPv2: Better, Faster, Stronger for Panoptic driving Perception
cuiaiyu/dressing-in-order
(ICCV'21) Official code of "Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing" by Aiyu Cui, Daniel McKee and Svetlana Lazebnik
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
kartben/artificial-nose
Instructions, source code, and misc. resources needed for building a Tiny ML-powered artificial nose.
OATML/RHO-Loss
maggiez0138/Swin-Transformer-TensorRT
This project aims to explore the deployment of Swin-Transformer based on TensorRT, including the test results of FP16 and INT8.
ncsoft/avocodo
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
shivammehta25/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
hunto/LightViT
Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"
ZM-Zhou/SMDE-Pytorch
The repository is to build a fair environment where the Self-supervised Monocular Depth Estimation (SMDE) methods could be evaluated and developed.
hcy71o/TransferTTS
TransferTTS (Zero-Shot learning of VITS)
xingyuuchen/tri-depth
[WACV 2023] Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem
tonnetonne814/SiFi-VITS2-44100-Ja
DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.
sarulab-speech/xvector_jtubespeech
xvector model on jtubespeech
AngusDujw/SAF
timqqt/Synthinel
Repository for Synthinel dataset. It is presented in WACV 2020
VOICEVOX/pyopenjtalk
テキスト音声合成ライブラリのpyopenjtalkのVOICEVOX用fork版です
k-washi/ml-exp-env
機械学習実験環境
serre-lab/Adversarial-Alignment
Scaling-up deep neural networks to improve their performance on ImageNet makes them more tolerant to adversarial attacks, but successful attacks on these models are misaligned with human perception.