liuquande/SAML

Dataset splits

JunMa11 opened this issue · 5 comments

Dear @liuquande ,

Thanks for sharing the great work.

Could you please share the data split files in Table 2?

For example, in each Site, which cases are used as training/testing.

image

image

Best,
Jun

Hi Jun,

Thanks for the interest.

We use all the data from each site for training for testing.
Taking Site A as target domain for instance, we will use all data from site B-F for training and all data from site A for testing.

Hi @liuquande,

Thanks for your reply very much.

Taking Site A as target domain for instance, we will use all data from site B-F for training and all data from site A for testing.

This is the intra-site setting, right?

Q1. How about the training and testing data in DeepAll setting?

with some outlier cases excluded to provide general internal performance on each site

Q2. What are these outlier cases in each site?

Looking forward to your reply:)

Kindest regards,
Jun

Hi Jun,

Taking Site A as target domain for instance, we will use all data from site B-F for training and all data from site A for testing.

This denote the DeepAll setting actually, and the Intra-site setting denote training and testing on the same site.

For Q2, we notice that in Intra-site setting, sometimes the model developed on Site X may not perform well on certain testing case of Site X (with Dice less than 20% if I remembered correctly). We think the reason could be the distribution of that particular testing case may not fit well with the learned data distribution at Site X, and regard cases like that as outlier cases.

Hi @liuquande ,

Got it. Thanks for your kind reply very much.

Could you share which cases are exclueded?
Looking forwart to your reply.