Figure 7. (A) Target T1169, a mosquito protein relevant
to pathogen transmission (PDB:8FJP) with four evaluation units defined:
D1: 1-345; D2: 1302-2735; D3: 378-699,1223-1301; D4: 700-1222. (B) Parsing of SGS1 into domains as suggested by the authors of
the structure (ref. 28). (C) Top HHsearch hits showing similarity of the query sequence
to known folds in two regions: 395-670 (the intermediate domain between the
two beta-propellers; see panel B) and 1718-2735 (the region after the
lectin-CRD domain and up to the TM domain).
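The evaluation-unit definitions in panel A use a compact residue-range notation in which a discontinuous unit such as D3 is written as comma-separated segments. The following is a minimal Python sketch, not part of the assessment pipeline, of how such definitions could be parsed and checked for overlaps. The helper names (`parse_ranges`, `residues`, `eu_sizes`, `overlapping_pairs`) are hypothetical and introduced only for illustration; the ranges are copied from the caption.

```python
from itertools import combinations

# Evaluation units of T1169 exactly as listed in the Figure 7A caption.
T1169_EUS = {
    "D1": "1-345",
    "D2": "1302-2735",
    "D3": "378-699,1223-1301",
    "D4": "700-1222",
}


def parse_ranges(spec):
    """Parse '378-699,1223-1301' into a list of inclusive (start, end) tuples."""
    segments = []
    for part in spec.split(","):
        start, end = part.strip().split("-")
        segments.append((int(start), int(end)))
    return segments


def residues(spec):
    """Return the set of residue numbers covered by a range specification."""
    covered = set()
    for start, end in parse_ranges(spec):
        covered.update(range(start, end + 1))  # ranges are inclusive
    return covered


def eu_sizes(eus):
    """Map each evaluation unit name to its number of residues."""
    return {name: len(residues(spec)) for name, spec in eus.items()}


def overlapping_pairs(eus):
    """Return the pairs of evaluation units that share at least one residue."""
    return [
        (a, b)
        for a, b in combinations(sorted(eus), 2)
        if residues(eus[a]) & residues(eus[b])
    ]


if __name__ == "__main__":
    print(eu_sizes(T1169_EUS))
    print(overlapping_pairs(T1169_EUS))
```

With the ranges above, the four units do not overlap, and the discontinuous unit D3 is handled as the union of its two segments.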
3.1.4 | Targets that were split into more EUs than suggested by Grishin plots
Two targets that the domain parsers suggested were single-domain (T1137s2
and T1137s3) were each split into two domains for consistency with the other
subunits of the same heteromeric complex. Target H1137 (PDB: 8FEF) is a
hetero 9-mer in which six subunits form an intertwined obligatory
complex. The split agrees with the results of template searches and with
the splits of the other related subunits.
Another target, T1125, was split into six domains instead of the five suggested
by the domain parsers. In this target, the C-terminal region penetrates
the N-terminal part, forming one structural domain, but predictors were
unable to model the circular fold of the protein. Thus, for the
evaluation, the N-terminal domain (#1) and the C-terminal domain (#6) were
considered separately.
3.1.5 | Domain swaps
Four targets in CASP15 included domains involved in domain swaps: T1109,
T1113, T1120 and T1176. Target T1120 was discussed above (Section 3.1.3). The
remaining three targets were un-swapped, and models were evaluated
against both the swapped and un-swapped versions of the targets. For T1109
and T1113, models scored higher against the original (swapped) version,
and thus the original targets were used for the final evaluation; for
T1176, the evaluation scores were higher for the un-swapped version, and
that version was used as the target (T1176-D9: A1-138 + B139-170).
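The selection rule described above can be illustrated with a short Python sketch. It makes two assumptions that are not stated in the text: that "scored higher" is summarized by the mean of per-model scores, and that a tie keeps the original (swapped) target. The function names `parse_chain_segments` and `choose_reference` are hypothetical, and the scores in the usage lines are placeholders, not CASP15 results.

```python
import re
from statistics import mean


def parse_chain_segments(spec):
    """Parse 'A1-138 + B139-170' into a list of (chain, start, end) tuples."""
    segments = []
    for part in spec.split("+"):
        match = re.fullmatch(r"\s*([A-Za-z])(\d+)-(\d+)\s*", part)
        if match is None:
            raise ValueError(f"unrecognized segment: {part!r}")
        chain, start, end = match.groups()
        segments.append((chain, int(start), int(end)))
    return segments


def choose_reference(scores_swapped, scores_unswapped):
    """Pick the target version against which the models score higher.

    Assumption: versions are compared by their mean model score, and a tie
    keeps the original (swapped) target.
    """
    if mean(scores_swapped) >= mean(scores_unswapped):
        return "swapped"
    return "un-swapped"


if __name__ == "__main__":
    # Un-swapped T1176-D9 definition as given in the text.
    print(parse_chain_segments("A1-138 + B139-170"))
    # Placeholder scores for illustration only.
    print(choose_reference([0.40, 0.50], [0.70, 0.80]))
```

In this notation the un-swapped T1176-D9 unit combines residues 1-138 of chain A with residues 139-170 of chain B, covering 170 residue positions in total.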