Teichlab/tracer

ambiguity in clonotype assignment

lincoln-harris opened this issue · 7 comments

Hi
@sdarmanis and I have been digging into the sequence level alignment of cells that are assigned to the same clonogroup and have been noticing some strange things. It seems that the CDR3 sequence doesnt have to be a perfect match for two cells to be assigned to the same clonogroup. For example, in this figure all of these cells have been assigned to the same clonogroup, yet the bottom 3 have very different CDR3 sequence than the rest (apologies for the poor resolution)
screen shot 2018-11-30 at 1 33 42 pm
Why is Tracer assigning these to the same clonogroup?
Thanks
Lincoln

yep, output folder is here
https://github.com/czbiohub/sclung_adeno/tree/master/TCR_analysis/filtered_TCRAB_summary
its a private repo but i just gave you read access, i think

yep, its clonogroup 1, so the cell names are:

screen shot 2018-11-30 at 2 57 44 pm

The odd ones out are K20_B003659, E22_B003659 and G21_B000883. Im noticing that even the flanking V and J regions are different within these 'odd' cells.
Thanks a lot!

Thanks a lot. So maybe a worthwhile modification is to define a clonogroup as only those cells that share an A and B? Or otherwise accept that when dealing with clonogroups of this size, youre going to see some messiness.

No problem.

I think that the best way to do it would be to have an option that sets the level of stringency required for assigning cells to clonotypes as they are reported in the summary.

Ideally this would be coupled to better reporting and visualisation to make it easier to assess the structure of the clonotypes and how tenuously cells are connected to them.