/opennxmlparser

Parser for NXML biomedical article files, produced by the National Library of Medicine (NLM). This repo extends the NLM parser provided for TREC CDS.

Primary LanguageJava

opennxmlparser

Parser for NXML biomedical article files, produced by the National Library of Medicine (NLM). This repo extends the NLM parser provided for TREC CDS.

Example usage

import gov.nih.nlm.iti.driver.Driver;

import gov.nih.nlm.iti.text.model.Document;

...

Document document = Driver.getDocInfo("123123.nxml");

System.out.println(document.getFullText());