CogStack/CogStack-Pipeline
Distributed, fault tolerant batch processing for Natural Language Applications and Search, using remote partitioning
Java
NOASSERTION
Issues
- 0
Add PDF Table Extraction using Tabula
#37 opened by afolarin - 1
Add support for PDF Form Parsing
#23 opened by afolarin - 0
De-Identification
#28 opened by afolarin - 2
ElasticsearchRest Client will fail silently if index contains invalid character
#32 opened by jstuczyn - 0
- 0
Log metrics on Binary Doc conversion
#9 opened by afolarin - 0
- 1
Unable to view links on confluence
#83 opened by torrybr - 0
fix: read from filesystem or object-store
#78 opened by afolarin - 6
Cogstack docker download issues
#38 opened by polyglot-v - 3
OCR Test failure
#4 opened by jstuczyn - 4
Test LSTM OCR Engine in Tesseract
#30 opened by afolarin - 4
- 1
Refactor the build process
#40 opened by afolarin - 1
Refactor Integration and acceptance tests
#45 opened by yatharthranjan - 1
- 0
add Nginx proxy to the stack for basic Auth
#41 opened by afolarin - 6
Tika_deid not working since ES Upgrade
#35 opened by AMohabeer - 1
- 5
Unable to index Docman Documents
#34 opened by AMohabeer - 1
- 0
Outdated Tika Dependencies
#3 opened by jstuczyn - 1
Etiquette note
#1 opened by RichJackson - 1
Post-processing of bio yodie result
#26 opened by afolarin - 1
- 0
[Feature] Support PDF export for more file types
#21 opened by afolarin - 1
- 1
Log metrics on Binary Doc conversion
#15 opened by afolarin - 1
[Feature] Detect Media Type for binary documents
#16 opened by afolarin - 1