edward-io's Stars
wkentaro/yolo-world-onnx
ONNX models of YOLO-World (an open-vocabulary object detection).
iShohei220/adopt
Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"
lucidrains/pi-zero-pytorch
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
mirage-project/mirage
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
Tencent/DepthCrafter
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
martin-marek/hdr-plus-swift
📸Night mode on any camera. Based on HDR+.
qiuk2/AAR
[Official Implementation] Acoustic Autoregressive Modeling 🔥
bishopdynamics/superbird-tool
Cross-Platform Spotify Car Thing (superbird) hacking toolkit
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
bpc-clone/bpc_updates
AlexanderKoch-Koch/low_cost_robot
kevinzakka/mjctrl
Minimal, clean, single-file implementations of common robotics controllers in MuJoCo.
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
chsasank/device-benchmarks
Benchmarks of different devices I have come across
cifkao/html-midi-player
🎹 Play and display MIDI files on the web
MZehren/ADTOF
Additional material for the paper ADTOF: A large dataset of non-synthetic music for automatic drum transcription
spfrommer/torchexplorer
Interactively inspect module inputs, outputs, parameters, and gradients.
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
haoheliu/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
jthickstun/anticipation
Anticipatory Autoregressive Models
PCrnjak/PAROL6-Desktop-robot-arm
BOM, STL files and instructions for PAROL6 3D printed robot arm
Audio-AGI/AudioSep
Official implementation of "Separate Anything You Describe"
WangXuan95/FPGA-USB-Device
An FPGA-based USB 1.1 (full-speed) device core to implement USB-serial, USB-camera, USB-audio, USB-hid, etc. It requires only 3 FPGA common IOs rather than additional chips. 基于FPGA的USB 1.1 (full-speed) device端控制器,可实现USB串口、USB摄像头、USB音频、U盘、USB键盘等设备,只需要3个FPGA普通IO,而不需要额外的接口芯片。
simplefoc/Arduino-FOC
Arduino FOC for BLDC and Stepper motors - Arduino Based Field Oriented Control Algorithm Library
Shiriluz/Word-As-Image
descriptinc/audiotools
Object-oriented handling of audio data, with GPU-powered augmentations, and more.
Geomitron/Bridge
A rhythm game chart searching and downloading tool.