marbl/CHM13

More info on the centromeric satellite masking BED file

guilhermesena1 opened this issue · 3 comments

Hello,

Thank you very much for all the effort in this historic project!

We have some sequencing data which requires a particularly complex analysis in centromeric regions. We are using the centromeric satellite masked BED file, but we could not find too much information regarding how these regions were inferred. Is it based on sequence patterns? Staining? Exactly how were the boundaries defined?

We would be extremely grateful if any references on the construction of these regions could be provided. Thank you!

Hello,

Please use the complete cen/sat annotation. The hor (alpha HOR), dhor (diverged alpha) and mon (monomers) may be the ones you would be interested to look at for the centromeric annotation.

Detailed methods on how the cen/sats were defined is described in Altemose et al.

Best,
Arang

Thank you so much for the quick response! This definitely addresses our question.