Sprinklr: Create document search item for Assets
Closed this issue · 0 comments
Create a document to "static files" as pdf, docx, etc.. You can create this document when in same time that was created the Article and get the "linkedAssets" from Article json and create the new jobitem to index to Turing. If possible, create all static file jobitems that are included at "content" > "markUpText" (Suggestion: Use the Jsoup API)
Need read the document and convert the document to plain text using the /api/ocr service and save the value to "text" attribute. The publication date and created date will be the current date. (date = new Date()).
Check if the Sprinklr API Document has some API that get more information about the Asset. https://developer.sprinklr.com/docs/read/API_Overview
The fields will be:
id - sprinklr + File name.
title - File Name
publication_date - new Date()
modification_date - new Date()
url - "linkedAssets" > "assetId" , or the src attribute for "content" > "markUpText"
text - File Content in Plain Text
filesize - File Size
extension - File Extension
abstract - TBD?
source-apps - ["Sprinklr"]
type - Static File
tags - Equals Article (TBD - Customer needs send more information how obtain these values)
categories - Equals Article (TBD - Customer needs send more information how obtain these values)
The new fields can created manually in Turing Console, or can create a JobItem Definition about all field that the current content is using. Reference: TurSNJobItem.setSpecs( List)