QuiltCleaner: A Jupyter Notebook repository from DeepMicroscopy

Welcome to the QuiltCleaner repository. We labeled 1% of the QUILT-1M dataset for common image impurities that would degrade image generation in a text-conditional image synthesis setting. We provide predictions for the remaining 99% of the dataset. Additionally, we provide text-image alignment scores computed with the CONCH vision-language model.

Paper (accepted for MIDL 2024): Aubreville et al.: Model-based Cleaning of the QUILT-1M Pathology Dataset for Text-Conditional Image Synthesis

Base dataset

These annotations and predictions complement the QUILT-1M dataset (Ikezogwo et al., NeurIPS 2023). See their repository for instructions on how to retrieve it.

How to use

The annotations are provided in the following files:

train_annotations.csv: Training set (70%)
val_annotations.csv: Validation set (15%, used for model selection)
test_annotations.csv: Held-out test set (15%)
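
As a quick sanity check after cloning, the three files can be read and their relative sizes compared against the 70/15/15 split stated above. The following is a minimal sketch, not part of the repository: the helper names `load_splits` and `split_fractions` are hypothetical, and because the column layout of the CSV files is not documented here, rows are kept as plain dictionaries keyed by whatever header each file provides.

```python
import csv
from pathlib import Path

# File names as listed in this README. The columns are not documented here,
# so no column names are assumed.
SPLIT_FILES = {
    "train": "train_annotations.csv",
    "val": "val_annotations.csv",
    "test": "test_annotations.csv",
}


def load_splits(repo_root="."):
    """Read each annotation CSV into a list of row dicts (hypothetical helper)."""
    splits = {}
    for name, filename in SPLIT_FILES.items():
        path = Path(repo_root) / filename
        with open(path, newline="", encoding="utf-8") as handle:
            splits[name] = list(csv.DictReader(handle))
    return splits


def split_fractions(splits):
    """Return the share of rows per split (hypothetical helper)."""
    total = sum(len(rows) for rows in splits.values())
    if total == 0:
        return {name: 0.0 for name in splits}
    return {name: len(rows) / total for name, rows in splits.items()}


# Example usage from the repository root:
#   for name, share in split_fractions(load_splits(".")).items():
#       print(f"{name}: {share:.1%}")
```

The printed shares should come out close to 70%, 15%, and 15% if the files were downloaded completely.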

You will need to download the QUILT-1M dataset separately, as it cannot be provided in this repository for licensing reasons. Place all files that were annotated into the images folder of this repository. You will then be able to train your own QuiltCleaner using the provided notebook.
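
Before training, it can help to confirm that every annotated image has actually been copied into the images folder. The sketch below is one way to do that under stated assumptions: the function `missing_images` is hypothetical, the default column name `image_path` is a placeholder (check the CSV header for the real name), and the images are assumed to sit directly in the folder rather than in subdirectories.

```python
import csv
from pathlib import Path


def missing_images(annotation_csv, images_dir="images", filename_column="image_path"):
    """List annotated file names that are absent from images_dir.

    filename_column is a hypothetical default, not the documented column name.
    """
    # Collect the names of all files directly inside the images folder.
    present = {p.name for p in Path(images_dir).iterdir() if p.is_file()}
    missing = []
    with open(annotation_csv, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        if filename_column not in (reader.fieldnames or []):
            raise KeyError(
                f"column {filename_column!r} not found; available: {reader.fieldnames}"
            )
        for row in reader:
            # Compare by base name only, in case the CSV stores a relative path.
            name = Path(row[filename_column]).name
            if name not in present:
                missing.append(name)
    return missing


# Example usage (the column name must be adapted to the actual CSV header):
#   print(missing_images("train_annotations.csv", "images", "image_path"))
```

An empty list means all annotated files in that split were found.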

Citation

@inproceedings{aubreville2024modelbased,
      title={Model-based Cleaning of the QUILT-1M Pathology Dataset for Text-Conditional Image Synthesis},
      author={Marc Aubreville and Jonathan Ganz and Jonas Ammeling and Christopher C. Kaltenecker and Christof A. Bertram},
      booktitle={Medical Imaging with Deep Learning},
      url={https://openreview.net/forum?id=m7wYKrUjzV},
      year={2024},
      eprint={2404.07676},
      archivePrefix={arXiv},
}
