georgid

I bring technology closer to the society by creating innovative, AI-enabled, software solutions - mainly in the domains of music, speech and natural language

Music Technology GroupBarcelona

Pinned Repositories

AlignmentDuration
Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.
Language:Python56 5 626
AlignmentEvaluation
Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if token is word, phrase, note, section etc.) User for the evaluation of the MIREX Lyrics-to-audio challenge
Language:Python18 5 86
chorus-vocal-covers
A heuristic approach to the detection of choruses in vocal cover versions. Done at the WiMIR workshop at ISMIR 2019 https://docs.google.com/presentation/d/1WYXxChgo8DI_NknyndPdcOuxlhAL_428Olg2fWaVy6U/edit#slide=id.g6f4f88d309_0_57
Language:Python3 3 02
ENST-drums-dataset
the dataset used in the paper https://drive.google.com/file/d/0B4bIMgQlCAuqdGVRbVNNbzJfeUU/view
Language:Shell5 3 00
HMMDuration
Python Hidden Markov Models framework. Adapted for computationally optimal Viterbi forced alignment. Added Explicit Duration model
Language:Python6 4 21
htkModelParser
Parses models created by the HTK Toolkit (http://htk.eng.cam.ac.uk/) as text files into Python class. It enables then various operations with the models like visualization and comparison.
Language:Python6 3 01
lakh_vocal_segments_dataset
singing voice with annotations of vocal onsets, based on the matched MIDI from http://colinraffel.com/projects/lmd/
Language:Jupyter Notebook16 4 01
Lyrics2AudioAligner
lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping
Language:Python14 5 313
pypYIN
python pYIN
Language:Python8 3 44
vocal-detection
Language:Python4 2 12

georgid's Repositories

georgid/AlignmentDuration
Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.
Language:Python56 5 626
georgid/AlignmentEvaluation
Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if token is word, phrase, note, section etc.) User for the evaluation of the MIREX Lyrics-to-audio challenge
Language:Python18 5 86
georgid/lakh_vocal_segments_dataset
singing voice with annotations of vocal onsets, based on the matched MIDI from http://colinraffel.com/projects/lmd/
Language:Jupyter Notebook16 4 01
georgid/Lyrics2AudioAligner
lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping
Language:Python14 5 313
georgid/pypYIN
python pYIN
Language:Python8 3 44
georgid/chorus-vocal-covers
A heuristic approach to the detection of choruses in vocal cover versions. Done at the WiMIR workshop at ISMIR 2019 https://docs.google.com/presentation/d/1WYXxChgo8DI_NknyndPdcOuxlhAL_428Olg2fWaVy6U/edit#slide=id.g6f4f88d309_0_57
Language:Python3 3 02
georgid/otmm_vocal_segments_dataset
Manual annotations of audio segments that correspond to sections from score with singing voice present
Language:Python3 5 10
georgid/SourceFilterContoursMelody
Melody extraction based on source-filter modelling
Language:Python3 2 13
georgid/music_hack_sofia
Examples of extracting acoustic features with essentia
Language:Jupyter Notebook2 2 0
georgid/meowify
Language:Python1 2 0
georgid/mfcc-htk-an-librosa
Reproduce the htk-type of MFCC features using the essentia framework. The MFCC extracted with essentia are compared to these extracted with htk these extracted with librosa
Language:Jupyter Notebook1 3 0
georgid/PhDThesis
The data needed to generate my phd thesis
Language:TeX1 2 0
georgid/Position-DBN-HMM-Lyrics
query by textual lyrics in audio with HHMM model of section positions using Viterbi
Language:MATLAB1 2 31
georgid/tune_puzzle
Tune Puzzle
Language:Swift1 3 82
georgid/align-magix
making a few web apps for testing
Language:JavaScript2 0
georgid/Curation_Users
Language:Python1 0
georgid/docker
My Docker scripts and Dockerfile for several frameworks.
Language:Go2 0
georgid/dunya
The Dunya music browser. Developed using Django 2
Language:Python1 0
georgid/englishMLP2turkish
scripts to create mapping from English phoneme models as feed forward network multilayer perceptron network onto a GMM turksih phoneme model
Language:Python2 01
georgid/essentia
C++ library of algorithms to extract features from audio files, including Python bindings.
Language:Jupyter Notebook3 01
georgid/IMFCC-visualization
Example of the inverse MFCC essentia feature
Language:Jupyter Notebook2 01
georgid/madmom
Python audio and music signal processing library. This is a fork adding support for synchronous tracking of vocal note onsets and metrical position in bar. The model used is Dynamic Bayesian Networks.
Language:Python2 0
georgid/makam_acapella
acapella recordings of Makam
Language:Python3 0
georgid/msaf
Music Structure Analysis Framework
Language:Python1 0
georgid/pdnn
PDNN: A Python Toolkit for Deep Learning. http://www.cs.cmu.edu/~ymiao/pdnntk.html
Language:Python3 0
georgid/publications_PhD
latex/lyx code and figures to reproduce the papers of my research http://mtg.upf.edu/biblio/author/810 that are the basis for my PhD http://compmusic.upf.edu/phd-thesis-georgi
Language:TeX2 0
georgid/searchByLyricsEval
evaluation scripts for search by lyrics (a.k.a. keyphrase spotting)
Language:Python2 0
georgid/SINGmasterAndrioidWithGUI
sing master exrecise mode with complete GUI
Language:Java2 81
georgid/turkish_makam_section_dataset
The section test dataset for classical Ottoman-Turkish makam music
Language:TeX2 0
georgid/your_first_neural_network
Language:Jupyter Notebook2 0