[dataset] How to prepare OtterHD datasets?
yxchng opened this issue · 4 comments
yxchng commented
[dataset] How to prepare OtterHD datasets?
Luodian commented
It's described here
https://github.com/Luodian/Otter/blob/main/docs/mimicit_format.md
yxchng commented
I don't quite understand. I don't see any mention of datasets like M3IT there. Isn't OtterHD using more data than mimicit?
peiliu0408 commented
mark
Luodian commented
I don't quite understand. I don't see any mention of datasets like M3IT there. Isn't OtterHD using more data than mimicit?
you can check the markdown file which describes the format, basically you can easily convert any dataset into this format. That's what we do on our cluster, we have 40+ converted datasets like M3IT, SVIT, PoliteFlamingo...
I could gradually updated them to the folder.
I've already updated some files here.