Order of individuals

Question

Order of individuals

sophiekj opened this issue 2 years ago · 2 comments

Hello,

Thank you for sharing this program!

I could not find in your Github description of the output Qhat file with the ancestry proportions whether or not the order of sample/individual names is conserved from the input Plink files.

I read your publication, which states that "For our PSD simulations, we performed hierarchical clustering with complete linkage on a Euclidean distance matrix calculated from the true admixture proportion matrix (Q) to obtain the order of samples".

Does this mean there is no true way of retaining the sample names order, and it must be inferred mathematically? How exactly should this be done, if so?

Thank you,
Sophie

Answer 1 · 2022-07-05T21:53:21.000Z

Hi Sophie,

Thanks for your interest in our software.

The order of sample/individuals is indeed conserved from the PLINK files (i.e. same order as FAM file).

The statement you were referring to is for aligning the populations from different unsupervised runs from SCOPE or other methods.

Best,

Alec

Answer 2 · 2022-07-05T21:54:25.000Z

Ah wonderful, thank you! I appreciate the clarification.