/document-index

Index text data from GeoCore records

Primary language: Python

document-index

High-level description

Use AWS Lambda functions to extract and organize text data from the GeoCore options list. Document indexing involves processing various document types, extracting their text, and storing it in a DynamoDB table (typically as key-value pairs) to support fast, efficient search. The indexed document types include .docx, PDF, plain text, and HTML.
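
The indexing flow described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function names (`extract_text`, `build_item`, `index_document`, `get_table`) and the item attributes (`record_id`, `url`, `doc_type`, `text`, `sha256`) are assumptions made for the example, and only HTML and plain text are handled here because .docx and PDF extraction need third-party libraries.

```python
import hashlib
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect visible text from HTML, skipping <script> and <style> bodies."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style elements.
        if self._skip_depth == 0 and data.strip():
            self.parts.append(data.strip())


def extract_text(content, doc_type):
    """Return whitespace-normalized text for a supported document type."""
    if doc_type == "html":
        parser = _TextExtractor()
        parser.feed(content)
        parser.close()
        return " ".join(parser.parts)
    if doc_type == "text":
        return " ".join(content.split())
    # .docx and PDF would need extra libraries; left out of this sketch.
    raise ValueError(f"unsupported document type: {doc_type}")


def build_item(record_id, url, content, doc_type):
    """Build a key-value item (hypothetical schema) for one document."""
    text = extract_text(content, doc_type)
    return {
        "record_id": record_id,
        "url": url,
        "doc_type": doc_type,
        "text": text,
        "sha256": hashlib.sha256(text.encode("utf-8")).hexdigest(),
    }


def index_document(table, record_id, url, content, doc_type):
    """Write the item to any object exposing put_item(Item=...)."""
    item = build_item(record_id, url, content, doc_type)
    table.put_item(Item=item)
    return item


def get_table(table_name):
    """Return a DynamoDB Table resource; the table name is an assumption."""
    import boto3  # imported lazily so the helpers above run without AWS

    return boto3.resource("dynamodb").Table(table_name)
```

Inside a Lambda handler, this could be called as `index_document(get_table("my-index-table"), ...)`, where the table name is a placeholder. DynamoDB limits each item to 400 KB, so long documents may need to be truncated or split across several items.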

Deploy

To deploy to AWS Lambda, use Cloud9 and the Serverless Application Model Command Line Interface (SAM CLI).