
Source-free domain adaptation in clinical temporal expression extraction

Primary language: Python. License: Apache License 2.0 (Apache-2.0).

Source-free-domain-adaptation

Data sharing restrictions are common in NLP datasets. For example, Twitter policies do not allow sharing of tweet text, though tweet IDs may be shared. Such restrictions are even more common in clinical NLP, where patient health information must be protected, and annotations over health text, when released at all, often require signing complex data use agreements.

The goal is to develop an accurate system for a target domain when annotations exist for a related source domain but cannot be distributed. Instead of annotated training data, participants are given a model trained on those annotations.
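To make the setting concrete, here is a minimal sketch of one possible way to use a distributed model without its training data: pseudo-labeling unlabeled target text with the model's confident predictions. This is an illustration only, not the method used by the baselines. The function `toy_source_model`, its scores, the threshold, and the example tokens are all hypothetical stand-ins; a real source model would be a trained classifier.

```python
from typing import Callable, List, Tuple


def toy_source_model(token: str) -> float:
    """Hypothetical stand-in for the distributed source model.

    Returns a made-up probability that `token` belongs to a time expression.
    """
    scores = {"today": 0.95, "yesterday": 0.90, "monday": 0.85, "weekly": 0.60}
    return scores.get(token.lower(), 0.05)


def pseudo_label(
    tokens: List[str],
    model: Callable[[str], float],
    threshold: float = 0.8,
) -> List[Tuple[str, int]]:
    """Keep only tokens the model labels confidently, in either direction."""
    labeled = []
    for tok in tokens:
        p = model(tok)
        if p >= threshold:
            labeled.append((tok, 1))  # confidently a time expression
        elif p <= 1 - threshold:
            labeled.append((tok, 0))  # confidently not a time expression
        # Uncertain tokens are skipped rather than given a noisy label.
    return labeled


# Unlabeled target-domain text (invented for illustration).
target_tokens = ["Patient", "seen", "yesterday", "for", "weekly", "infusion"]
pseudo = pseudo_label(target_tokens, toy_source_model)
print(pseudo)
```

The pseudo-labeled tokens could then serve as training data for adapting a model to the target domain. Note that no source annotations are touched at any point: only the model and the unlabeled target text are needed.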

Currently, baseline models and data are provided in line with https://github.com/Machine-Learning-for-Medical-Language/source-free-domain-adaptation.git