okankop

Technical University of Munich

okankop's Stars

karpathy/LLM101n
LLM101n: Let's build a Storyteller
31k 2.6k 01.7k
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
Language: Python 30.4k 222 2643k
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language: Cuda 25k 252 1412.8k
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Language: Python 9.1k 56 5481.5k
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language: Python 7.2k 50 222550
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language: Python 6.8k 75 248999
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language: Python 6.7k 45 83597
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language: Python 5.1k 77 205445
facebookresearch/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
Language: Python 2.7k 31 62263
PRIS-CV/DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
Language: Jupyter Notebook 2k 34 44226
Pointcept/Pointcept
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
Language: Python 1.8k 20 359194
haoheliu/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Language: Python 1.2k 30 61127
facebookresearch/hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
Language: Python 939 17 3747
NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
Language: Python 849 19 2545
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
Language: Python 763 71 14112
wyf0912/SinSR
[CVPR 2024] SinSR: Diffusion-Based Image Super-Resolution in a Single Step
Language: Python 381 12 3123
brentspell/hifi-gan-bwe
Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.
Language: Python 209 9 926
slp-rl/aero
This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)
Language: Python 208 6 3027
xiongyihui/tdoa
TDOA based on GCC-PHAT
Language: Python 175 6 865
rishikksh20/HiFiplusplus-pytorch
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
Language: Python 154 12 619
JusperLee/SPMamba
Language: Python 143 3 2119
idiap/acoustic-simulator
Implementation of audio degradation processes
Language: Python 101 15 236
NXTProduct/TUNet
Language: Python 54 1 516
tum-traffic-dataset/tum-traffic-dataset-dev-kit
TUM Traffic Dataset Development Kit
Language: Python 54 2 137
tteepe/EarlyBird
Official Code for "EarlyBird: Early-Fusion for Multi-View Tracking in the Bird's Eye View"
Language: Python 44 1 53
idiap/nnsslm
Neural Network based Sound Source Localization Models
Language: Python 33 6 29
tteepe/TrackTacular
Official Code for "Lifting Multi-View Detection and Tracking to the Bird’s Eye View"
Language: Python 32 3 115
Martlgap/livefaceidapp
Simple Live Face Recognition Streamlit App
Language: Python 29 2 411
Blueblue4/IoU-AwareCalibration
Code to reproduce the experiments described in "Do We Still Need Non-Maximum Suppression? Accurate Confidence Estimates and Implicit Duplication Modeling with IoU-Aware Calibration" (https://arxiv.org/pdf/2309.03110.pdf)
Language: Python 14 1 21
teo-sl/Audio-Super-Resolution-ViT
This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.
Language: Jupyter Notebook 13 1 02
