Analyzing-domain-similarity

This repo introduces some procedures to analyze similarity between domains to select BERT pre-training data