Pinned Repositories
Invasion
python 3.7,pygame, 街机游戏
libresampy
librosa
a c++ implementation for librosa writing in python
mir_tools
my mir eval tools for piano transcription and singing transcription or any other tools related to music
MusicYOLO
MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.
myfrp
My frp service for intranet penetration
onsets_and_frames_tensorrt
This repository is a tensorrt deployment of the onsets and frames model, which is implemented using pytorch.
SSVD-v2.0
SSVD-v2.0 is a sight-sing transcription dataset
transition-aware
transition-aware piano transcription model
UMind
This is a simple mind mapping software
xk-wang's Repositories
xk-wang/myfrp
My frp service for intranet penetration
xk-wang/asap-dataset
A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.
xk-wang/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
xk-wang/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
xk-wang/book
a repository for sharing book
xk-wang/CS_Offer
后台开发基础知识总结(春招/秋招)
xk-wang/http_server_cpp
C++的http服务器,简单好用
xk-wang/jetson-inference
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
xk-wang/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
xk-wang/learning-spark
Example code from Learning Spark book
xk-wang/libresample
Real-time library for sample rate conversion
xk-wang/libsamplerate
An audio Sample Rate Conversion library
xk-wang/linear-attention-transformer
Transformer based on a variant of attention that is linear complexity in respect to sequence length
xk-wang/MIDIVisualizer
A small MIDI visualizer tool, using OpenGL
xk-wang/MMSP2021-Audio2ScoreAlignment
Audio-to-Score Alignment Using Deep Automatic Music Transcription
xk-wang/moonlight
Optical music recognition in TensorFlow
xk-wang/NMT
基于seq2seq的机器翻译模型
xk-wang/onnx-simplifier
Simplify your onnx model
xk-wang/polyphonic-omr
Code used in research that led to the paper "An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition" (ISMIR 2021)
xk-wang/PSPNet
Pyramid Scene Parsing Network, CVPR2017.
xk-wang/segmenter
[ICCV2021] Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation
xk-wang/snail
xk-wang/sparse-analytic-filters
Code for the paper "Learning Sparse Analytic Filters for Piano Transcription".
xk-wang/tensorRT_Pro
C++ library based on tensorrt integration
xk-wang/torch2trt
An easy to use PyTorch to TensorRT converter
xk-wang/transformer
xk-wang/ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
xk-wang/vit-search
Code for "Searching for Efficient Multi-Stage Vision Transformers"
xk-wang/volo
VOLO: Vision Outlooker for Visual Recognition
xk-wang/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).