ambiguity in clonotype assignment
lincoln-harris opened this issue · 7 comments
Hi
@sdarmanis and I have been digging into the sequence level alignment of cells that are assigned to the same clonogroup and have been noticing some strange things. It seems that the CDR3 sequence doesnt have to be a perfect match for two cells to be assigned to the same clonogroup. For example, in this figure all of these cells have been assigned to the same clonogroup, yet the bottom 3 have very different CDR3 sequence than the rest (apologies for the poor resolution)
Why is Tracer assigning these to the same clonogroup?
Thanks
Lincoln
yep, output folder is here
https://github.com/czbiohub/sclung_adeno/tree/master/TCR_analysis/filtered_TCRAB_summary
its a private repo but i just gave you read access, i think
Thanks a lot. So maybe a worthwhile modification is to define a clonogroup as only those cells that share an A and B? Or otherwise accept that when dealing with clonogroups of this size, youre going to see some messiness.
No problem.
I think that the best way to do it would be to have an option that sets the level of stringency required for assigning cells to clonotypes as they are reported in the summary.
Ideally this would be coupled to better reporting and visualisation to make it easier to assess the structure of the clonotypes and how tenuously cells are connected to them.