edward-io

edward-io's Stars

wkentaro/yolo-world-onnx
ONNX models of YOLO-World (an open-vocabulary object detection model).
Language:Python152
iShohei220/adopt
Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"
Language:Jupyter Notebook37519
lucidrains/pi-zero-pytorch
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
Language:Python1807
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python7.7k962
mirage-project/mirage
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
Language:C++66438
Tencent/DepthCrafter
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Language:Python1k49
martin-marek/hdr-plus-swift
📸Night mode on any camera. Based on HDR+.
Language:Swift21011
qiuk2/AAR
[Official Implementation] Acoustic Autoregressive Modeling 🔥
Language:Python575
bishopdynamics/superbird-tool
Cross-Platform Spotify Car Thing (superbird) hacking toolkit
Language:Python17213
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Language:Python65847
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Language:Python2.7k190
bpc-clone/bpc_updates
98536
AlexanderKoch-Koch/low_cost_robot
Language:Python3.1k267
kevinzakka/mjctrl
Minimal, clean, single-file implementations of common robotics controllers in MuJoCo.
Language:Python25014
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Language:Python1.4k149
chsasank/device-benchmarks
Benchmarks of different devices I have come across
Language:Python168
cifkao/html-midi-player
🎹 Play and display MIDI files on the web
Language:TypeScript69363
MZehren/ADTOF
Additional material for the paper ADTOF: A large dataset of non-synthetic music for automatic drum transcription
Language:Python484
spfrommer/torchexplorer
Interactively inspect module inputs, outputs, parameters, and gradients.
Language:Python31723
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
Language:Python24.9k3.6k
yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Language:Python5k424
haoheliu/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Language:Python1.2k118
jthickstun/anticipation
Anticipatory Autoregressive Models
Language:Python15329
PCrnjak/PAROL6-Desktop-robot-arm
BOM, STL files and instructions for PAROL6 3D printed robot arm
Language:HTML1.3k139
Audio-AGI/AudioSep
Official implementation of "Separate Anything You Describe"
Language:Python1.6k118
WangXuan95/FPGA-USB-Device
An FPGA-based USB 1.1 (full-speed) device core to implement USB-serial, USB-camera, USB-audio, USB-hid (e.g. keyboard), USB flash drive, etc. It requires only 3 FPGA common IOs rather than additional interface chips.
Language:Verilog623103
simplefoc/Arduino-FOC
Arduino FOC for BLDC and Stepper motors - Arduino Based Field Oriented Control Algorithm Library
Language:C++2.1k538
Shiriluz/Word-As-Image
Language:Python1.1k83
descriptinc/audiotools
Object-oriented handling of audio data, with GPU-powered augmentations, and more.
Language:Python24343
Geomitron/Bridge
A rhythm game chart searching and downloading tool.
Language:TypeScript18427