This code uses for training recurrent neural network models for normalizing free-text descriptions of causes of death into icd-10 codes. Code written in python 2.7
Installing required libraries:
pip install -r requirements.txt
After installing libraries nltk punkt model must be downloaded in python interpreter: import nltk nltk.download('punkt')
Usage:
python predict.py --data path/to/training/data \
--embeddings path/to/word2vec/binary
--embeddings_dim embedding_vector_size
--encoder_size encoder_rnn_size
--decoder_size decoder_rnn_size
--use_sim indicates that similarities vector must be used
--dropout_rate drate to use after on similarities vector
Citing:
KFU at CLEF eHealth 2017 Task 1: ICD-10 Coding of English Death Certificates with Recurrent Neural Networks Z Miftakhutdinov, E Tutubalina - 2017
http://ceur-ws.org/Vol-1866/paper_64.pdf
BibTex:
@inproceedings{
miftakhutdinov2017kfu,
title={KFU at CLEF eHealth 2017 Task 1: ICD-10 Coding of English Death Certificates with Recurrent Neural Networks},
author={Miftakhutdinov, Z and Tutubalina, Elena},
year={2017},
organization={CLEF}
}