xk-wang

Stay hungry stay foolish!

@itec-hust
Wuhan, China

Pinned Repositories

Invasion
Python 3.7, pygame, arcade game
Language: Python 6 2 00
libresampy
1 2 00
librosa
A C++ implementation of librosa, which is written in Python
Language: Makefile 17 2 11
mir_tools
My MIR evaluation tools for piano transcription and singing transcription, plus other music-related tools
Language: Python 1 1 00
MusicYOLO
The MusicYOLO framework uses the YOLOX object detection model to locate notes in the spectrogram.
Language: Jupyter Notebook 10 2 31
myfrp
My frp service for intranet penetration
Language: C++ 8 2 01
onsets_and_frames_tensorrt
This repository is a TensorRT deployment of the Onsets and Frames model, which is implemented in PyTorch.
Language: C++ 8 2 03
SSVD-v2.0
SSVD-v2.0 is a sight-singing transcription dataset
7 2 00
transition-aware
Transition-aware piano transcription model
Language: Python 2 2 10
UMind
A simple mind-mapping application
Language: Java 3 2 00

xk-wang's Repositories

xk-wang/myfrp
My frp service for intranet penetration
Language: C++ 8 2 01
xk-wang/asap-dataset
A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.
xk-wang/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
xk-wang/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
xk-wang/book
A repository for sharing books
xk-wang/CS_Offer
A summary of backend development fundamentals (for spring/autumn campus recruitment)
1
xk-wang/http_server_cpp
An HTTP server written in C++, simple and easy to use
1
xk-wang/jetson-inference
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
xk-wang/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
xk-wang/learning-spark
Example code from Learning Spark book
xk-wang/libresample
Real-time library for sample rate conversion
Language: C
xk-wang/libsamplerate
An audio Sample Rate Conversion library
xk-wang/linear-attention-transformer
Transformer based on a variant of attention that has linear complexity with respect to sequence length
xk-wang/MIDIVisualizer
A small MIDI visualizer tool, using OpenGL
xk-wang/MMSP2021-Audio2ScoreAlignment
Audio-to-Score Alignment Using Deep Automatic Music Transcription
xk-wang/moonlight
Optical music recognition in TensorFlow
xk-wang/NMT
A machine translation model based on seq2seq
xk-wang/onnx-simplifier
Simplify your ONNX model
xk-wang/polyphonic-omr
Code used in research that led to the paper "An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition" (ISMIR 2021)
xk-wang/PSPNet
Pyramid Scene Parsing Network, CVPR2017.
xk-wang/segmenter
[ICCV2021] Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation
xk-wang/snail
Language: C++
xk-wang/sparse-analytic-filters
Code for the paper "Learning Sparse Analytic Filters for Piano Transcription".
xk-wang/tensorRT_Pro
C++ library based on TensorRT integration
Language: C++
xk-wang/torch2trt
An easy to use PyTorch to TensorRT converter
xk-wang/transformer
xk-wang/ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
xk-wang/vit-search
Code for "Searching for Efficient Multi-Stage Vision Transformers"
xk-wang/volo
VOLO: Vision Outlooker for Visual Recognition
xk-wang/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).