cpystan/WSI-VQA

Clarification on WSI Selection from TCGA Cases


Dear authors,

Thank you for sharing the WSI-VQA dataset and code. I have a question regarding the dataset usage.
In TCGA, each case may have multiple WSIs (whole slide images), but in the dataset's JSON files, only the case names are provided without specifying the WSI filenames. Could you please clarify which specific WSIs were used from each case? How were the WSIs selected for inclusion in the dataset?

I appreciate your help and look forward to your response!

Good question. We use the slides whose filenames contain 'DX' (which marks a diagnostic slide). When a patient has several DX slides, we use 'DX1' by default.
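For anyone reproducing this selection, here is a minimal sketch of the rule described above. It is not the authors' code. It assumes the standard TCGA slide filename layout (for example `TCGA-A1-A0SB-01Z-00-DX1.<uuid>.svs`), where the first 12 characters are the case ID. The function name `select_dx_slides` is hypothetical, and falling back to the lowest-numbered DX slide when a case has no DX1 is an assumption, not something the reply confirms.

```python
import re
from collections import defaultdict

# Captures the DX index in a TCGA slide filename, e.g. "-DX1." -> 1.
DX_PATTERN = re.compile(r"-DX(\d+)\.")


def select_dx_slides(filenames):
    """Return {case_id: filename}, keeping one diagnostic (DX) slide per case.

    Hypothetical helper: prefers DX1; if a case has no DX1, it falls back to
    the lowest-numbered DX slide (an assumption, not confirmed by the authors).
    """
    by_case = defaultdict(list)
    for name in filenames:
        match = DX_PATTERN.search(name)
        if match is None:
            continue  # not a diagnostic slide (e.g. TS/BS frozen sections)
        case_id = name[:12]  # assumption: name starts with "TCGA-XX-XXXX"
        by_case[case_id].append((int(match.group(1)), name))
    # min() on (index, name) tuples picks the lowest DX index for each case.
    return {case: min(slides)[1] for case, slides in by_case.items()}
```

The returned keys can then be matched against the case names in the dataset's JSON files.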