Issues
🐛 [BUG] - quality___language___fasttext_filter function is not utilizing the 'subset' column's text values for filtering
#54 opened by 41ow1ives - 0
🚀 [REQUEST] - Refactor MinhashLSH Dedup
#50 opened by p-idx - 0
🚀 [REQUEST] - Load data from warc
#49 opened by jordane95 - 2
📝 [Docs] - Guides to use Spark Job
#44 opened by Taekyoon - 0
Create Coding Guidelines
#10 opened by 41ow1ives - 0
Support jupyter notebook while using aws.
#18 opened by 41ow1ives - 0
🚨 [Security] Critical vulnerable issue based on the result of static source code analysis and dependency review
#38 opened by 41ow1ives - 0
Add CONTRIBUTING.md
#1 opened by 41ow1ives - 3
📝 [Docs] - Docs seem to be misleading
#29 opened by seonWKim - 1
Change convention using ___ (three underscores)
#17 opened by 41ow1ives - 1
📝 [Docs] - Wrong example in docstring
#15 opened by 41ow1ives - 0
📝 [Docs] - Add API doc
#12 opened by 41ow1ives - 0
Improve README.md
#2 opened by 41ow1ives - 0
🐛 [BUG] - Error while using polyglot code
#20 opened by 41ow1ives - 0
Add more default environment settings.
#19 opened by 41ow1ives - 0
📝 [Docs] - change example data from test code
#16 opened by 41ow1ives - 1
Add Issue/PR templates
#7 opened by 41ow1ives