marklogic-community/demo-cat

Making attachments searchable

grtjn opened this issue · 2 comments

grtjn commented

Use a post-commit trigger on attachments to scrape HTML out of them, and save the results as memos inside the demo JSON data. That makes them (a) searchable and (b) displayable. Use the filename as the title. Start with xdmp:document-filter, but in a second stage see whether some formatting can be preserved from Word docs and PDFs.
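
To make the idea concrete, here is a rough sketch of the transformation step only, written in Python for illustration rather than as the XQuery trigger module itself. It assumes the filter step hands back an XHTML string, and it invents the memo field names (`title`, `body`, `source`) and the function name `attachment_to_memo`; none of these come from the demo-cat code.

```python
import posixpath
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping <head>, <script>, and <style>."""

    SKIP = {"head", "script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text found outside the skipped elements.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def attachment_to_memo(uri, filtered_xhtml):
    """Build a memo dict from an attachment URI and its filtered XHTML.

    The field names are hypothetical; the filename becomes the title,
    as proposed in this issue.
    """
    parser = _TextExtractor()
    parser.feed(filtered_xhtml)
    parser.close()
    return {
        "title": posixpath.basename(uri),
        "body": "\n".join(parser.chunks),
        "source": uri,
    }
```

Flattening to plain text like this covers the "searchable" half; keeping some of the markup for display would be the second-stage work mentioned above.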

Taking this issue.

grtjn commented

PR #270 was merged, but it might be worth taking a closer look at the PDF conversion.