facebookresearch/stopes
A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.
PythonMIT
Issues
- 2
error when prepare_data
#47 opened by ZeroneBo - 0
Runtime Error when running Demo
#67 opened by rumourscape - 1
attribution of LLMs
#65 opened by Wafaa014 - 2
- 2
[VSim ECAPA] What $MODEL_PATH should be used when using the ECAPA model for speaker similarity evaluation?
#64 opened by Poeroz - 2
Missing file and functions
#59 opened by mct10 - 1
Bug in tokenizer for Tibetian Language
#50 opened by asusdisciple - 1
the list index overflow
#60 opened by zhenghuawang6 - 0
- 1
NLLB mined data?
#56 opened by gordicaleksa - 1
- 4
- 0
Fix: Refactor the seamlisten project Documentation (installation) for Mac and Windows User
#51 opened by david-wagih - 2
[seamlisten] download button
#41 opened by Celebio - 1
[seamlisten] add initial backend test
#35 opened by Celebio - 1
[seamlisten] play on paste
#36 opened by Celebio - 2
[seamlisten] remove double slash comments in css
#43 opened by Celebio - 1
[seamlisten] add initial frontend test
#38 opened by Celebio - 0
[seamlisten] The context region is not grayed
#42 opened by Celebio - 0
- 0
[seamlisten] ability to sort/filter by score
#40 opened by Celebio - 0
- 0
[seamlisten] bug: padding is added to segments without the grey area to indicate it
#34 opened by Celebio - 0
[seamlisten] internal server error for some data
#33 opened by Celebio - 0
[seamlisten] Explore folders
#32 opened by Celebio - 0
[seamlisten] Improve audio fetching
#31 opened by Celebio - 1
Some small typo in README.md file
#28 opened by myaxxxxx - 9
Prepare new data for NLLB-200
#24 opened by ibtiRaj - 1
Minor bug in ALTI's code
#25 opened by gegallego - 1
Using `stopes` for filtering instead of mining
#23 opened by ZenBel - 4
spm-200 dictionary duplicate error
#19 opened by edchengg - 4
- 2
- 12
- 4
How to create training data through pipeline
#14 opened by b3y0nd - 5
Using stopes with an unseen language
#16 opened by sete-nay - 0
CUDA error while running on cpus
#8 opened by mingzi151 - 6
- 5
Which pipeline is used specifically to preprocess input to NLLB model for inference?
#4 opened by pluiez - 9