georgid/AlignmentDuration
Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.
PythonAGPL-3.0
Issues
- 0
- 2
Implement viterbi in c++
#58 opened by georgid - 8
handle non-unicode characters
#61 opened by georgid - 0
refactor FeatureExtractor.loadMFCCs
#53 opened by georgid - 0
Reduce code by refining LyricsWithModels
#49 opened by georgid - 0
- 0
- 0
how to handle words not in dictionary
#55 opened by georgid - 0
- 1
reduce code not used
#59 opened by georgid - 0
- 0
test WITH_SECTION_ANNO = False with new MLP model
#41 opened by georgid - 1
have the same special token for silence in all phoneme sets. Now it is REST for Mandarin and '' for English
#50 opened by georgid - 1
reduce essentia code for loading the audio
#54 opened by georgid - 1
reduce dependency on essentia
#60 opened by georgid - 0
- 1
reduce dependecy on htkmfc
#44 opened by georgid - 0
- 1
remove hard coded sampling rate
#51 opened by georgid - 2
- 1
missing words from dictionary
#45 opened by georgid - 0
reduce code redundancy for computing timstamps
#52 opened by georgid - 0
Move this file to dir for_english
#48 opened by georgid - 0
optimize code for expnasion to syllables
#47 opened by georgid - 0
Add Mandarin Char to mandarinSyllable class
#46 opened by georgid - 1
reducnant Lyric() constructor call
#43 opened by georgid - 1
reduce dependency on htk and scikit learn
#40 opened by georgid - 3
concatenate textGrid data.
#32 opened by georgid - 0
- 1
when WITH_SHORT_PAUSES = 1
#36 opened by georgid - 0
merge SectionLinkMakam.loadSmallAudioFragment and SectionLinkMakam.loadSmallAudioFragmentOracle
#35 opened by georgid - 1
means, covars, weights
#34 opened by georgid - 1
detectedTokenList with flag DETECTION_TOKEN_LEVEL ='words' has Word object and not the string. So there is a problem at json.dump() in LyircsAligner
#31 opened by georgid - 0
improve roganization of MusicXML Parser: use ScoreSection class in MusicXMLParser
#29 opened by georgid - 0
reduce code of constructTransMAtrix
#28 opened by georgid - 0
- put reading pitch as input in lyricsalign not in align.FeatureExtractor.loadMFCCs
#27 opened by georgid - 3
Doesn't install if cython not available
#22 opened by alastair - 0
- 0
- 0
refine and commit LyricAligner
#24 opened by georgid - 0
hmm.Path.Path.
#23 opened by georgid - 1
dont trim file,
#19 opened by georgid - 0
in LyricsWithModels assign small deviation to phonemes and the given by as parameter one to vowels
#21 opened by georgid - 2
- 4
- 0
optimize: do observation prob for a given feature vector for all phonemes in alphabet, not for all phonemes in phonemeNetwork
#18 opened by georgid - 0
REDUCE CODE: check if we can replace some of the code in SymbTrPrser.syllable2LyricsOneSection with code from LyricsParsing.expandlyrics2WordList.
#15 opened by georgid - 0
make consistent. phoneme2states
#17 opened by georgid - 0
- 0