AttributeExtraction insurance product information extraction JSON TYPE GOT make attr_get_file clean to replace invalid character and prepare for json type of mongoDB