cmbi/hssp

residue 124 in 1sx2 is skipped

Closed this issue · 6 comments

When generating the dssp file for pdb entry 1sx2, residues 123 and 124 are ignored. This probably has to do with the fact that residue 123 only has one alternate location with identifier B. This should however, not result in residues 123 and 124 missing.

@tonyelewis suggests the following:

From what I can see, it looks like the problem is to do with interaction
between AltLoc handling and next-residue detection. In more detail:
  - the atoms in 124 get ignored
  - because at that point, firstAltLoc is set to 'B'
  - because firstAltLoc doesn't get reset at the start of residue 124
  - because the start of residue 124 doesn't get detected
  - because, at that point, atoms.empty()
  - because all atoms in 123 have been skipped

Thanks for opening this issue. I'll submit a pull request soon.

@cbaakman : Please may I ask you to change my contact on the original description from my email to @tonyelewis? Thanks very much.

Thanks very much.

For information... the problem can be replicated using just residues A:120, A:123, A:124 and A:125 and can be simplified down to the backbone atoms:

ATOM   1032  N   MET A 120      23.127   5.241  -5.234  1.00  9.79           N  
ATOM   1033  CA  MET A 120      24.061   4.468  -6.042  1.00  9.61           C  
ATOM   1034  C   MET A 120      24.811   3.451  -5.187  1.00  9.38           C  
ATOM   1035  O   MET A 120      25.054   2.306  -5.620  1.00 10.06           O  
ATOM   1065  N  BGLN A 123      22.567   0.603  -4.409  1.00 11.11           N  
ATOM   1066  CA BGLN A 123      22.431  -0.200  -5.625  1.00 11.15           C  
ATOM   1067  C  BGLN A 123      23.706  -0.950  -5.973  1.00 10.67           C  
ATOM   1068  O  BGLN A 123      23.706  -1.700  -6.955  1.00 11.70           O  
ATOM   1074  N  ALYS A 124      24.772  -0.666  -5.315  0.64  9.24           N  
ATOM   1075  N  BLYS A 124      24.774  -0.724  -5.226  0.36 10.75           N  
ATOM   1076  CA ALYS A 124      26.058  -1.322  -5.576  0.64  9.02           C  
ATOM   1077  CA BLYS A 124      26.116  -1.237  -5.480  0.36 10.92           C  
ATOM   1078  C  ALYS A 124      26.612  -0.964  -6.962  0.64  9.77           C  
ATOM   1079  C  BLYS A 124      26.599  -0.944  -6.904  0.36  9.69           C  
ATOM   1080  O  ALYS A 124      27.304  -1.768  -7.599  0.64 11.47           O  
ATOM   1081  O  BLYS A 124      27.229  -1.776  -7.565  0.36 11.86           O  
ATOM   1092  N   ARG A 125      26.324   0.274  -7.353  1.00  9.58           N  
ATOM   1093  CA  ARG A 125      26.849   0.839  -8.587  1.00 10.21           C  
ATOM   1094  C   ARG A 125      28.105   1.626  -8.216  1.00  9.67           C  
ATOM   1095  O   ARG A 125      28.075   2.852  -8.074  1.00 10.39           O  

In this comment, I describe the results of running the proposed fix in #80 across a 2015 snapshot of the PDB. I include details of the 29 PDBs that the proposed fix affects...

In some of these cases, the proposed fix restores more residues than in 1sx2 (eg, 4 residues in 3f9s, the whole of chain E in 3adm).

Thanks.