Issues
KeyError in collators.py line 164
#198 opened by kroll-software - 1
Trailing Bars are not appended into REMI token sequence if last note is multibar length
#200 opened by Mintas - 5
KeyError in midi_tokenizer.py line 1712
#197 opened by kroll-software - 2
Using pitch bend in Octuple tokenizer
#196 opened by Neptune-S-777 - 6
Environment Issue (Nanobind Error)
#190 opened by MHZ9825 - 4
REMI with rests does not produce Bar tokens if rest is at the end of a bar
#189 opened by mister-magpie - 12
Error running the 2nd example in readme
#157 opened by MikeMpapa - 5
after tokenizing with trained tokenizer, the "tokens" array contains original tokens
#166 opened by theglassofwater - 5
Duration token meaning in REMI
#174 opened by PRamoneda - 3
Defining Custom Tokens
#171 opened by MikeMpapa - 6
Cannot install it
#178 opened by shawn120 - 20
Error in training tokenizer "UnicodeDecodeError: 'utf-8' codec can't decode byte 0xf0 in position 0: invalid continuation byte"
#159 opened by ojus1 - 5
The Example_HuggingFace_Mistral_Transformer.ipynb notebook gives a ValueError when running trainer.train()
#163 opened by briane412 - 14
Installation Error with miditok
#155 opened by LiuZH-19 - 2
Add the possibility to train tokenizer with other token aggregation techniques (WordPiece, Unigram...)
#154 opened by Natooz - 4
Time signature type issue when using from_dict()
#151 opened by JLenzy - 31
Slow Performance of `tokenize_midi_dataset` Function
#147 opened by Kinyugo - 3
cannot import DatasetTok - Error with version 3.0.2
#158 opened by seb-son - 8
ERROR:model.generate(input_tokens.ids, max_length=200)-------AttributeError: 'list' object has no attribute 'ids'
#86 opened by geng-lee - 15
KeyError: 'Chord_ukn4'
#77 opened by VDT5702 - 4
Data Preparation Issues
#126 opened by laceyp99 - 1
Special token question
#140 opened by oiabtt - 11
Docs and code bug
#135 opened by oiabtt - 0
Support huggingface AutoTokenizer
#127 opened by oiabtt - 4
[Enhancement] Versioning documentation
#133 opened by leleogere - 4
Need Help! I run the Full_Example_HuggingFace_GPT2_Transformer.ipynb but the output is none
#128 opened by sunyrain - 59
Add a faster midi parsing backend
#112 opened by Yikai-Liao - 1
Incorrect Placement of Time Signatures
#131 opened by EterDelta - 6
Parallelizing tokenization? (enhancement)
#102 opened by drscotthawley - 11
Raise IndexError in Tokenizer
#104 opened by feiyuehchen - 3
What exactly is beat_res?
#117 opened by parneyw - 2
Add ability to add custom labels to DatasetTok
#78 opened by leleogere - 24
When using REMI for tokenization with use_time_signatures=True, many duplicate measures can be encoded.
#74 opened by Chunyuan-Li - 5
Design of program tokens in MIDILike tokenization
#71 opened by caenopy - 3
MIDILike needs note events sorted
#69 opened by caenopy - 2