FIGURE 2. Network representation for 142 complete protein sequences similar to PURases linked by 6419 edges. The protein sequences depicted here were selected by clustering at a threshold of 90% sequence identity. Edges (links) were selected at a threshold of 60% global sequence similarity, without defining a core domain region. Nodes are coloured according to their annotated source organisms, with Proteobacteria in blue and unknown bacteria in white. The network on the left represents sequences with an N-terminal lid and a C-terminal β-sandwich domain and contains 127 nodes connected by 6314 edges. Diamonds represent sequences originating from the genusPseudomonas (from the class Gammaproteobacteria). The network on the right represents sequences similar to carboxylesterases and contains 15 nodes connected by 105 edges. Squares represent sequences originating from the class of Betaproteobacteria. See Methods section for more details on the network layout.