a2819z

a2819z's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python68.6k 575 08.1k
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python52.3k 939 1.1k8.7k
gyoogle/tech-interview-for-developer
👶🏻 신입 개발자 전공 지식 & 기술 면접 백과사전 📖
Language:Java14.5k 143 433.3k
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Language:Python12k 170 233815
khangich/machine-learning-interview
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
9.1k 215 41.5k
zhanymkanov/fastapi-best-practices
FastAPI Best Practices and Conventions we used at our startup
8.8k 126 35671
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Language:Python7.6k 81 152756
WooVictory/Ready-For-Tech-Interview
💻 신입 개발자로서 지식을 쌓기 위해 공부하는 공간 👨‍💻
4.5k 39 7527
andrewekhalel/MLQuestions
Machine Learning and Computer Vision Engineer - Technical Interview Questions
2.9k 27 2488
resemble-ai/Resemblyzer
A python package to analyze and compare voices with deep learning
Language:Python2.7k 73 82424
haoheliu/AudioLDM2
Text-to-Audio/Music Generation
Language:Python2.3k 45 70177
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Language:Jupyter Notebook2.2k 41 56147
SeanNaren/deepspeech.pytorch
Speech Recognition using DeepSpeech2.
Language:Python2.1k 51 503620
boost-devs/ai-tech-interview
👩‍💻👨‍💻 AI 엔지니어 기술 면접 스터디 (⭐️ 1k+)
1.8k 16 0442
ELS-RD/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Language:Jupyter Notebook1.5k 29 17494
Jamie-Stirling/RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
Language:Python1.2k 13 26100
gnobitab/InstaFlow
:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
Language:Python1.1k 43 2636
zsyOAOA/ResShift
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)
Language:Python871 15 9148
naver-ai/DenseDiffusion
Official Pytorch Implementation of DenseDiffusion (ICCV 2023)
Language:Jupyter Notebook475 11 1933
subinium/Misc-Cheatsheet
대학원 생활을 하며 사용하는 작고 소중한 코딩팁 (linux 명령어 등)
Language:Vim script380 5 038
qkraudghgh/coding-interview
취업 준비를 위해 공부한 내용을 정리하는 레포
Language:JavaScript375 7 144
hutaiHang/Faster-Diffusion
[NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"
Language:Python287 9 1419
YuanGongND/cav-mae
Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".
Language:Python223 5 2923
TiankaiHang/Min-SNR-Diffusion-Training
[ICCV 2023] Efficient Diffusion Training via Min-SNR Weighting Strategy
Language:Python216 3 96
pcb9382/StereoAlgorithms
Stereo Algorithms (Include:CREStereo,RAFT-Stereo,Hitnet,FastACVNet_plus,Stereo Transformers,RealtimeStereo,DistDepth) with TensorRT,ORT,OpenVINO
Language:C++177 5 221
OscarXZQ/weight-selection
Language:Python166 4 612
winddori2002/TriAAN-VC
TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion
Language:Python143 7 2213
curryjung/InjectFusion_official
Language:Python115 3 112
ibaiGorordo/ONNX-FastACVNet-Depth-Estimation
Python scripts performing stereo depth estimation using the Fast-ACVNet model in ONNX.
Language:Python39 3 14
vinceecws/Monodepth
PyTorch implementation of Unsupervised Monocular Depth Estimation with Left-Right Consistency
Language:Python24 2 05