Conserved domains on  [gi|585168931|ref|XP_006736522|]

galectin-3 [Leptonychotes weddellii]

Protein Classification

galectin family protein (domain architecture ID 10049222)

galectin family protein may exclusively bind beta-galactosides such as lactose in a manner independent of metal ions


List of domain hits

Name Accession Description Interval E-value
GLECT cd00070
Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as ...
152-279 3.41e-52

Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.



Pssm-ID: 238025  Cd Length: 127  Bit Score: 166.66  E-value: 3.41e-52
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931 152 PYDLPLPGGIMPRMLITILGTVKPNANRLALDFKRGN-DVAFHFNPRFSEDnkrVIVCNTKLDNIWGKEERQATFPFESG 230
Cdd:cd00070    1 PYKLPLPGGLKPGSTLTVKGRVLPNAKRFSINLGTGSsDIALHFNPRFDEN---VIVRNSFLNGNWGPEERSGGFPFQPG 77
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*....
gi 585168931 231 KPFKIQVLVESDHFKVAVNDAHLLQYNHRMKnLQEISKLGISGDIDLTS 279
Cdd:cd00070   78 QPFELTILVEEDKFQIFVNGQHFFSFPHRLP-LESIDYLSINGDVSLTS 125
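The summary line of each hit (Pssm-ID, Cd Length, Bit Score, E-value) is regular enough to read mechanically. The following is a minimal Python sketch, not part of the CD-Search output or of any NCBI tool: the names `SUMMARY_RE`, `parse_summary`, and `model_coverage` are hypothetical, and the pattern assumes the exact field order seen in the summary lines of this report. The coverage helper takes the first and last model positions from the `Cdd:` row labels (1 and 125 for cd00070 above).

```python
import re

# Assumed layout, taken from this report only:
# "Pssm-ID: <int> [optional flag]  Cd Length: <int>  Bit Score: <float>  E-value: <float>"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one hit summary line as a dict."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "evalue": float(match["evalue"]),
    }


def model_coverage(model_from, model_to, cd_length):
    """Fraction of the domain model spanned by the alignment (1-based, inclusive)."""
    return (model_to - model_from + 1) / cd_length


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 238025  Cd Length: 127  Bit Score: 166.66  E-value: 3.41e-52"
    )
    print(hit)
    # cd00070 is aligned from model position 1 to 125 out of 127.
    print(f"model coverage: {model_coverage(1, 125, hit['cd_length']):.3f}")
```

High coverage of a short model, as for cd00070 here, is one informal way to tell a whole-domain match from a partial match to a long multi-domain model such as PRK07764.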
PRK07764 super family cl35613
DNA polymerase III subunits gamma and tau; Validated
13-148 3.78e-10

DNA polymerase III subunits gamma and tau; Validated


The actual alignment was detected with superfamily member PRK07764:

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 60.38  E-value: 3.78e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  13 GSGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQA 92
Cdd:PRK07764 591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP 670
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 585168931  93 PPGAYPGPTAPayPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGP 148
Cdd:PRK07764 671 AKAGGAAPAAP--PPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQA 724
 
Gal-bind_lectin smart00908
Galactoside-binding lectin; Animal lectins display a wide variety of architectures. They are ...
158-279 1.80e-51

Galactoside-binding lectin; Animal lectins display a wide variety of architectures. They are classified according to the carbohydrate-recognition domain (CRD), of which there are two main types, S-type and C-type. Galectins (previously S-lectins) bind exclusively beta-galactosides like lactose. They do not require metal ions for activity. Galectins are found predominantly, but not exclusively, in mammals. Their function is unclear. They are developmentally regulated and may be involved in differentiation, cellular regulation and tissue construction.


Pssm-ID: 214904  Cd Length: 122  Bit Score: 164.69  E-value: 1.80e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   158 PGGIMPRMLITILGTVKPNANRLALDFKRG--NDVAFHFNPRFSEdnkRVIVCNTKLDNIWGKEERQATFPFESGKPFKI 235
Cdd:smart00908   1 PGGLSPGSSITIRGIVLPDAKRFSINLQCGpnADIALHFNPRFDE---GTIVRNSKQNGKWGKEERSGGFPFQPGQPFEL 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 585168931   236 QVLVESDHFKVAVNDAHLLQYNHRMKnLQEISKLGISGDIDLTS 279
Cdd:smart00908  78 EILVEEDEFKVAVNGQHFLEFPHRLP-LESIDTLEISGDVQLTS 120
Gal-bind_lectin pfam00337
Galactoside-binding lectin; This family contains galactoside binding lectins. The family also ...
158-279 5.50e-46

Galactoside-binding lectin; This family contains galactoside binding lectins. The family also includes enzymes such as human eosinophil lysophospholipase (EC:3.1.1.5).


Pssm-ID: 459768  Cd Length: 124  Bit Score: 150.87  E-value: 5.50e-46
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  158 PGGIMPRMLITILGTVKPNANRLALDFKRG----NDVAFHFNPRFSEDnkrVIVCNTKLDNIWGKEERQATFPFESGKPF 233
Cdd:pfam00337   1 PGGLQPGSSLTIKGIVLPDAQRFSINLQTGvgpsDDIALHFNPRFDEN---VIVRNSRQNGQWGQEEREGGFPFQPGQPF 77
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 585168931  234 KIQVLVESDHFKVAVNDAHLLQYNHRMKNlQEISKLGISGDIDLTS 279
Cdd:pfam00337  78 ELTILVGDDHFKIYVNGQHFTTFKHRLPP-EDIDALQVRGDVKLTS 122
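The paired `gi` and `Cdd:` rows can be compared column by column to estimate how similar the query is to a domain model. The sketch below is illustrative only and is not how CD-Search computes its bit scores or E-values; `alignment_identity` is a hypothetical helper. It assumes both rows are the same length, that `-` marks a gap, and that lowercase query residues sit opposite gaps, as they do in the blocks of this report.

```python
def alignment_identity(query_row, subject_row):
    """Count identical and aligned (gap-free) columns of two alignment rows.

    Returns a tuple (identical, aligned). Columns where either row has a
    gap character '-' are skipped. Comparison ignores letter case.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row.upper(), subject_row.upper()):
        if q == "-" or s == "-":
            continue  # gap column: not counted as aligned
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Second block of the pfam00337 alignment above (query residues 234-279).
    query = "KIQVLVESDHFKVAVNDAHLLQYNHRMKNlQEISKLGISGDIDLTS"
    model = "ELTILVGDDHFKIYVNGQHFTTFKHRLPP-EDIDALQVRGDVKLTS"
    identical, aligned = alignment_identity(query, model)
    print(f"{identical} identical of {aligned} aligned columns "
          f"({100.0 * identical / aligned:.1f}% identity)")
```

Identity alone can mislead for the proline- and glycine-rich N-terminal region (residues 13-148): repetitive, low-complexity sequence matches many unrelated models weakly, which is consistent with the much higher E-values reported for those hits.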
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
20-133 2.46e-07

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 51.87  E-value: 2.46e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   20 QGWPGPWGNQPAGGYPGAsypGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYpg 99
Cdd:pfam03157 439 QGQQPGQGQQPGQEQPGQ---GQQPGQGQQGQQPGQPEQGQQPGQGQPGYYPTSPQQSGQGQQLGQWQQQGQGQPGYY-- 513
                          90       100       110
                  ....*....|....*....|....*....|....
gi 585168931  100 PTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAP 133
Cdd:pfam03157 514 PTSPLQPGQGQPGYYPTSPQQPGQGQQLGQLQQP 547
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
15-143 9.73e-05

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 43.36  E-value: 9.73e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  15 GNPNPQGWPGPWGNQPAGGYPGASYPgayPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPP 94
Cdd:NF038329 162 GPAGPQGEAGPQGPAGKDGEAGAKGP---AGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPD 238
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
gi 585168931  95 GAyPGPTAPAYP-GPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFG 143
Cdd:NF038329 239 GD-PGPTGEDGPqGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAG 287
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
19-130 1.47e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are, respectively, expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.87  E-value: 1.47e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   19 PQGWPGPWGNQpaggYPGASYPGAYPGQAPPGSYPGQapPGGYPGQAPpgaypgqVPPGGYPGQAPPGAYPGQAPPGAYP 98
Cdd:TIGR01628 380 PRMRQLPMGSP----MGGAMGQPPYYGQGPQQQFNGQ--PLGWPRMSM-------MPTPMGPGGPLRPNGLAPMNAVRAP 446
                          90       100       110
                  ....*....|....*....|....*....|..
gi 585168931   99 GPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQP 130
Cdd:TIGR01628 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLP 478
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
15-148 1.55e-04

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 42.97  E-value: 1.55e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  15 GNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPG--GYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQA 92
Cdd:NF038329 198 GETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGptGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKD 277
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 585168931  93 PPGAYPGPTAP-------AYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGP 148
Cdd:NF038329 278 GERGPVGPAGKdgqngkdGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
15-148 2.18e-04

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 42.20  E-value: 2.18e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  15 GNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPG-QAP 93
Cdd:NF038329 132 GEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGpAGP 211
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 585168931  94 PGA--------YPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGiPAGP 148
Cdd:NF038329 212 AGPdgeagpagEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAG-PDGP 273
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
13-154 3.87e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 41.55  E-value: 3.87e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  13 GSGNPNPQGWPGPWGNQ----PAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAY 88
Cdd:COG5164   35 STRPAGNTGGTRPAQNQgsttPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDG 114
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 585168931  89 PGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGiPAGPLTVPYD 154
Cdd:COG5164  115 GATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTTPPG-PGGSTTPPDD 179
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
56-148 1.66e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 39.60  E-value: 1.66e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  56 APPGGYPGQAPPGAYPGQVPPGGYPGQAPPGayPGQAPPGAYPGPTAPAYPGPSAPgahpgqpsgpgaYPPPGQPSAPGA 135
Cdd:NF041121  19 AAPPSPEGPAPTAASQPATPPPPAAPPSPPG--DPPEPPAPEPAPLPAPYPGSLAP------------PPPPPPGPAGAA 84
                         90
                 ....*....|...
gi 585168931 136 HPAAGPFGIPAGP 148
Cdd:NF041121  85 PGAALPVRVPAPP 97
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
46-150 2.01e-03

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 39.47  E-value: 2.01e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  46 QAPPGSYPGQAPpggYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQ-APPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAY 124
Cdd:cd23959  143 QTAPVTPFGQLP---MFGQHPPPAKPLPAAAAAQQSSASPGEVASPfASGTVSASPFATATDTAPSSGAPDGFPAEASAP 219
                         90       100
                 ....*....|....*....|....*.
gi 585168931 125 PPPGQPSAPGAHPAAGPFGIPAGPLT 150
Cdd:cd23959  220 SPFAAPASAASFPAAPVANGEAATPT 245
 
GLECT smart00276
Galectin; Galectin - galactose-binding lectin
153-281 1.26e-48

Galectin; Galectin - galactose-binding lectin


Pssm-ID: 214596  Cd Length: 128  Bit Score: 157.77  E-value: 1.26e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   153 YDLPLPGGIMPRMLITILGTVKPNANRLALDF-KRGNDVAFHFNPRFSEDnkrVIVCNTKLDNIWGKEERQATFPFESGK 231
Cdd:smart00276   1 FTLPIPGGLKPGQTLTVRGIVLPDAKRFSINLlTGGDDIALHFNPRFNEN---KIVCNSKLNGSWGSEEREGGFPFQPGQ 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 585168931   232 PFKIQVLVESDHFKVAVNDAHLLQYNHRMKnLQEISKLGISGDIDLTSAS 281
Cdd:smart00276  78 PFDLTIIVQPDHFQIFVNGVHITTFPHRLP-LESIDYLSINGDVQLTSVS 126
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
10-165 6.96e-10

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 59.61  E-value: 6.96e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  10 ALSGSGNPNPQGW-PGPWGNQPAGGYPGASYPGAYPGQAPPGS-------YPGQAPPGGYPGQAPPGAYPGQVPPGGYPG 81
Cdd:PRK07764 594 AAGGEGPPAPASSgPPEEAARPAAPAAPAAPAAPAPAGAAAAPaeasaapAPGVAAPEHHPKHVAVPDASDGGDGWPAKA 673
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  82 QAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLPLPGGI 161
Cdd:PRK07764 674 GGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAG 753

                 ....
gi 585168931 162 MPRM 165
Cdd:PRK07764 754 APAQ 757
PHA03247 PHA03247
large tegument protein UL36; Provisional
17-163 2.38e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.03  E-value: 2.38e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   17 PNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGqvPPGGYPGQAPPGAYPGQA--PP 94
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG--PPAPAPPAAPAAGPPRRLtrPA 2787
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 585168931   95 GAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHP--AAGPFGIPAGPLTVPYDLPLPGGIMP 163
Cdd:PHA03247 2788 VASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPptSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
30-159 8.11e-09

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 56.04  E-value: 8.11e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  30 PAGGYPGASYPGAYPGQAPPGSYPGQAPPggypgQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPS 109
Cdd:PRK12323 387 PAAAAPAAAAPAPAAPPAAPAAAPAAAAA-----ARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAA 461
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
gi 585168931 110 APGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPlTVPYDLPLPG 159
Cdd:PRK12323 462 ARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWE-ELPPEFASPA 510
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
31-151 9.14e-09

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 56.15  E-value: 9.14e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  31 AGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGgyPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSA 110
Cdd:PRK07764 390 GAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPA--PAPAPPSPAGNAPAGGAPSPPPAAAPSAQPA 467
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 585168931 111 PGAHPGQPSGPGAYPPPGQPSAPGAHPAA-GPFGIPAGPLTV 151
Cdd:PRK07764 468 PAPAAAPEPTAAPAPAPPAAPAPAAAPAApAAPAAPAGADDA 509
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
21-141 2.01e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.99  E-value: 2.01e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  21 GWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPgQVPPGGYPGQAPPGAYPGQAPPGAYPGP 100
Cdd:PRK07764 659 VPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATP-PAGQADDPAAQPPQAAQGASAPSPAADD 737
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|.
gi 585168931 101 TAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGP 141
Cdd:PRK07764 738 PVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
31-141 3.43e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.22  E-value: 3.43e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  31 AGGYPGASYPGAYPGQAPPGSYPGQ---APPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPAYPG 107
Cdd:PRK07764 588 VGPAPGAAGGEGPPAPASSGPPEEAarpAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGD 667
                         90       100       110
                 ....*....|....*....|....*....|....
gi 585168931 108 PSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGP 141
Cdd:PRK07764 668 GWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQP 701
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-171 4.66e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.84  E-value: 4.66e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   9 DALSGSGNPNPQGWPGPWGnQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAY 88
Cdd:PRK07764 652 HHPKHVAVPDASDGGDGWP-AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASA 730
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  89 PGQAPPGAYPGPTAPAY-PGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLPLPGGIMPRMLI 167
Cdd:PRK07764 731 PSPAADDPVPLPPEPDDpPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAMELLE 810

                 ....
gi 585168931 168 TILG 171
Cdd:PRK07764 811 EELG 814
dnaA PRK14086
chromosomal replication initiator protein DnaA;
17-152 7.01e-08

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 53.29  E-value: 7.01e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  17 PNPQGWP-GPWGNQPAGGYPG---ASYPGAYPGqaPPGSYPGQAPPGGYPGQAPPGA----YPGQVPPGGYPGQAPP--- 85
Cdd:PRK14086 128 DRPPGLPrQDQLPTARPAYPAyqqRPEPGAWPR--AADDYGWQQQRLGFPPRAPYASpasyAPEQERDREPYDAGRPeyd 205
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  86 ---GAYPGQAPPGAYPGPTAPAYPGPsAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVP 152
Cdd:PRK14086 206 qrrRDYDHPRPDWDRPRRDRTDRPEP-PPGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGP 274
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-148 1.28e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 52.68  E-value: 1.28e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   9 DALSGSGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQA--PPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPG 86
Cdd:PRK07764 619 AAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVavPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAP 698
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 585168931  87 AYPGQAPPGAYPGPTAPAYPGPSAPGAH---PGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGP 148
Cdd:PRK07764 699 AQPAPAPAATPPAGQADDPAAQPPQAAQgasAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPA 763
PHA03378 PHA03378
EBNA-3B; Provisional
30-180 1.67e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 52.38  E-value: 1.67e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  30 PAGGYPGASYPG-AYPGQAPPGsyPGQAPPGGYPGQAPPGAYPGQV-PPGGYPGQA-PPGAYPGQA-PPGAYPGPTAPay 105
Cdd:PHA03378 691 PGTMQPPPRAPTpMRPPAAPPG--RAQRPAAATGRARPPAAAPGRArPPAAAPGRArPPAAAPGRArPPAAAPGRARP-- 766
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 585168931 106 pgpsaPGAHPGQPSgpgAYPPPGQPSAPGAHPAAGPfgipaGPLTVPYDLPLPGGIMPRMLITILGTVKPNANRL 180
Cdd:PHA03378 767 -----PAAAPGAPT---PQPPPQAPPAPQQRPRGAP-----TPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQL 828
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
18-142 1.87e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 51.91  E-value: 1.87e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  18 NPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAY 97
Cdd:PRK07764 386 GVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQ 465
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*
gi 585168931  98 PGPtAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPF 142
Cdd:PRK07764 466 PAP-APAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDA 509
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
20-133 2.46e-07

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 51.87  E-value: 2.46e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   20 QGWPGPWGNQPAGGYPGAsypGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYpg 99
Cdd:pfam03157 439 QGQQPGQGQQPGQEQPGQ---GQQPGQGQQGQQPGQPEQGQQPGQGQPGYYPTSPQQSGQGQQLGQWQQQGQGQPGYY-- 513
                          90       100       110
                  ....*....|....*....|....*....|....
gi 585168931  100 PTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAP 133
Cdd:pfam03157 514 PTSPLQPGQGQPGYYPTSPQQPGQGQQLGQLQQP 547
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
25-141 3.02e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 51.25  E-value: 3.02e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  25 PWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPggyPGQAPPGayPGQAPPGAYPGPTAPA 104
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAP---PAAAPPA--PVAAPAAAAPAAAPAA 440
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....
gi 585168931 105 YPGPSAPGAHPGQPSGPGAY-------PPPGQPSAPGAHPAAGP 141
Cdd:PRK14951 441 APAAVALAPAPPAQAAPETVaipvrvaPEPAVASAAPAPAAAPA 484
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
45-146 3.14e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 51.22  E-value: 3.14e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  45 GQAPPGSYPGQAPPGGYPGQAPPgayPGQVPPGGYP---GQAPPGAYPGQAPPGAYPGPTAP---AYPGPSAPGAHP-GQ 117
Cdd:PRK14959 377 GASAPSGSAAEGPASGGAATIPT---PGTQGPQGTApaaGMTPSSAAPATPAPSAAPSPRVPwddAPPAPPRSGIPPrPA 453
                         90       100       110
                 ....*....|....*....|....*....|...
gi 585168931 118 PSGPGAYPPPGQP----SAPGAHPAAGPFGIPA 146
Cdd:PRK14959 454 PRMPEASPVPGAPdsvaSASDAPPTLGDPSDTA 486
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
11-139 3.19e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 51.53  E-value: 3.19e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  11 LSGSGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPggyPGQAPPGAYPGQVPPGGYPGQAPPGAYPG 90
Cdd:PRK07764 387 VAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPA---PAPAPAPPSPAGNAPAGGAPSPPPAAAPS 463
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*....
gi 585168931  91 QAPPgayPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAA 139
Cdd:PRK07764 464 AQPA---PAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDA 509
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
29-141 3.76e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 51.22  E-value: 3.76e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  29 QPAGGypGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPG------QAPPgAYPGPTA 102
Cdd:PRK14959 372 RPSGG--GASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSprvpwdDAPP-APPRSGI 448
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*
gi 585168931 103 PAYPGPSAPGAH--PGQP----SGPGAYPPPGQPSAPGAHPAAGP 141
Cdd:PRK14959 449 PPRPAPRMPEASpvPGAPdsvaSASDAPPTLGDPSDTAEHTPSGP 493
PHA03247 PHA03247
large tegument protein UL36; Provisional
40-156 4.78e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 4.78e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   40 PGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGyPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPS 119
Cdd:PHA03247 2686 RAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPG-PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA 2764
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 585168931  120 GPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLP 156
Cdd:PHA03247 2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP 2801
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-141 6.09e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.37  E-value: 6.09e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   9 DALSGSGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAY 88
Cdd:PRK07764 642 APAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP 721
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....
gi 585168931  89 PGQAPPGAYPGP-TAPAYPGPSAPGAHPGqPSGPGAYPPPGQPSAPGAHPAAGP 141
Cdd:PRK07764 722 PQAAQGASAPSPaADDPVPLPPEPDDPPD-PAGAPAQPPPPPAPAPAAAPAAAP 774
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
17-160 7.22e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.37  E-value: 7.22e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  17 PNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQA--PPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPP 94
Cdd:PRK07764 618 PAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVavPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAA 697
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 585168931  95 GAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLPLPGG 160
Cdd:PRK07764 698 PAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPA 763
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
42-160 7.97e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.98  E-value: 7.97e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  42 AYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVP-PGGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSG 120
Cdd:PRK07764 586 AVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAaPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDG 665
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|
gi 585168931 121 PGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLPLPGG 160
Cdd:PRK07764 666 GDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAP 705
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
12-148 8.12e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.98  E-value: 8.12e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  12 SGSGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQ 91
Cdd:PRK07764 636 PAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQAD 715
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 585168931  92 APPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGP 148
Cdd:PRK07764 716 DPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAA 772
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
56-150 1.12e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.60  E-value: 1.12e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  56 APPGGYPGQAPPGAYPGQVPPggypgQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGA 135
Cdd:PRK07764 388 AGGAGAPAAAAPSAAAAAPAA-----APAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAP 462
                         90
                 ....*....|....*
gi 585168931 136 HPAAGPFGIPAGPLT 150
Cdd:PRK07764 463 SAQPAPAPAAAPEPT 477
dnaA PRK14086
chromosomal replication initiator protein DnaA;
17-159 1.62e-06

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 49.05  E-value: 1.62e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  17 PNPQGWPGPWGNQPAG-GYPGASYPGAYPGQAPPGSYPGQA-PPGGYPGQAP---PGAYPG-------QVPPGGYPGQAP 84
Cdd:PRK14086 100 PHARRTSEPELPRPGRrPYEGYGGPRADDRPPGLPRQDQLPtARPAYPAYQQrpePGAWPRaaddygwQQQRLGFPPRAP 179
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  85 PGA----YPGQA----PPGAYPGPTAPAYPGPSAPGAH---PGQPSGPGAYPPPGQPSAP-GAHPAAGPFGIPAGPLTVP 152
Cdd:PRK14086 180 YASpasyAPEQErdrePYDAGRPEYDQRRRDYDHPRPDwdrPRRDRTDRPEPPPGAGHVHrGGPGPPERDDAPVVPIRPS 259

                 ....*..
gi 585168931 153 YDLPLPG 159
Cdd:PRK14086 260 APGPLAA 266
PHA03247 PHA03247
large tegument protein UL36; Provisional
17-157 6.04e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 6.04e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   17 PNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPG---------SYPGQAPPGGYPGQAP-PGAYPGQV---------PPG 77
Cdd:PHA03247 2625 DPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPgrvsrprraRRLGRAAQASSPPQRPrRRAARPTVgsltsladpPPP 2704
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   78 GYPGQAPPGAYPGQ--APPGAYPG----PTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGA-HPAAGPFGIPAGPLT 150
Cdd:PHA03247 2705 PPTPEPAPHALVSAtpLPPGPAAArqasPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPApAPPAAPAAGPPRRLT 2784

                  ....*..
gi 585168931  151 VPYDLPL 157
Cdd:PHA03247 2785 RPAVASL 2791
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
44-158 6.63e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.18  E-value: 6.63e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  44 PGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGA 123
Cdd:PRK12323 365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 585168931 124 YPPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLPLP 158
Cdd:PRK12323 445 GGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA 479
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
71-152 7.12e-06

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 47.19  E-value: 7.12e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   71 PGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLT 150
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEV 117

                  ..
gi 585168931  151 VP 152
Cdd:PRK12270  118 TP 119
PHA03378 PHA03378
EBNA-3B; Provisional
16-160 8.87e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.98  E-value: 8.87e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  16 NPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQA-PP 94
Cdd:PHA03378 618 TSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMqPP 697
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 585168931  95 GAYPGPTAPAYPGPSA---PGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAG---PLTVPYDLPLPGG 160
Cdd:PHA03378 698 PRAPTPMRPPAAPPGRaqrPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRarpPAAAPGRARPPAA 769
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
44-173 8.94e-06

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 46.63  E-value: 8.94e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  44 PGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPP-GGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPgqPSGPG 122
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAaAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAP--AAAPA 443
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|...
gi 585168931 123 AYPPPGQPSAPGA-HPAAGPFGIPAGP-LTVPYDLPLPGGIMPRMLITILGTV 173
Cdd:PRK14951 444 AVALAPAPPAQAApETVAIPVRVAPEPaVASAAPAPAAAPAAARLTPTEEGDV 496
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
23-148 1.04e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.79  E-value: 1.04e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  23 PGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTA 102
Cdd:PRK12323 397 PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAA 476
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*.
gi 585168931 103 PAYPGPSAPGAHPgqpsGPGAYPPPGQPSAPGAHPAAGPFGIPAGP 148
Cdd:PRK12323 477 AAAPARAAPAAAP----APADDDPPPWEELPPEFASPAPAQPDAAP 518
PHA03247 PHA03247
large tegument protein UL36; Provisional
19-159 1.09e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 1.09e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   19 PQGWPGPWGNQPAGGYPGASYPgaypgqAPPGSYPGQAPPGGYPG-QAPPG-AYPGQVPPGGYPGQAPPGAYPGQAPPGa 96
Cdd:PHA03247 2680 PQRPRRRAARPTVGSLTSLADP------PPPPPTPEPAPHALVSAtPLPPGpAAARQASPALPAAPAPPAVPAGPATPG- 2752
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 585168931   97 ypGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLPLPG 159
Cdd:PHA03247 2753 --GPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA 2813
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
31-152 1.64e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 46.00  E-value: 1.64e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  31 AGGYPGASYPGAYPGQAPPgsyPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQ---APPGAYPGQAPPGAYPGPTAPAYPG 107
Cdd:PRK07003 364 GGGAPGGGVPARVAGAVPA---PGARAAAAVGASAVPAVTAVTGAAGAALAPkaaAAAAATRAEAPPAAPAPPATADRGD 440
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*
gi 585168931 108 PSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVP 152
Cdd:PRK07003 441 DAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAP 485
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
29-163 2.14e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.64  E-value: 2.14e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  29 QPAGGYPGASYPGAYPG----QAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPA 104
Cdd:PRK12323 364 RPGQSGGGAGPATAAAApvaqPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARG 443
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931 105 YPGPSAPGAHPGQPSGPGAYPPPGQP-SAPGAHPAAGPFGIPAGPLTVPYDLPLPGGIMP 163
Cdd:PRK12323 444 PGGAPAPAPAPAAAPAAAARPAAAGPrPVAAAAAAAPARAAPAAAPAPADDDPPPWEELP 503
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
30-148 2.74e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.25  E-value: 2.74e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  30 PAGGYPGASYPGAYPGQAP-PGSYPGQAPPGGYPGQAP-------PGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPT 101
Cdd:PRK12323 440 SARGPGGAPAPAPAPAAAPaAAARPAAAGPRPVAAAAAaaparaaPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPA 519
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*..
gi 585168931 102 APAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGP 148
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP 566
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
6-141 3.27e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.98  E-value: 3.27e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   6 SLNDALSGSGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPP 85
Cdd:PRK07764 663 SDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLP 742
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 585168931  86 GAYPGQAPPGAYPGPTAPayPGPSAPGAHPGQPSGPGAyPPPGQPSAPGAHPAAGP 141
Cdd:PRK07764 743 PEPDDPPDPAGAPAQPPP--PPAPAPAAAPAAAPPPSP-PSEEEEMAEDDAPSMDD 795
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
48-164 4.06e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.87  E-value: 4.06e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  48 PPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPP 127
Cdd:PRK12323 365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 585168931 128 GQPSAPGAHPAAGPFGIPAGPLTVPYDLPLPGGIMPR 164
Cdd:PRK12323 445 GGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPA 481
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
44-143 5.83e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 44.50  E-value: 5.83e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   44 PGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGgypgQAPPGAYPGQAPPGayPGPTAPAYPGPSAPGAHPGqpsgpgA 123
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPA----PAPPAAAAPAAPPK--PAAAAAAAAAPAAPPAAAA------A 105
                          90       100
                  ....*....|....*....|
gi 585168931  124 YPPPGQPSAPGAHPAAGPFG 143
Cdd:PRK12270  106 AAPAAAAVEDEVTPLRGAAA 125
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
16-163 6.54e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 6.54e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   16 NPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPgqVPPGGYPGQAPPG---AYPGQA 92
Cdd:pfam03154 244 SPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFP--LTPQSSQSQVPPGpspAAPGQS 321
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 585168931   93 PPGAYPGPTAPAYPGPSAPGAHPGQP---SGPGAYPPPGQPSAPGAHPAAG---PFGIPAGPLTVPYDLPLPGGIMP 163
Cdd:pfam03154 322 QQRIHTPPSQSQLQSQQPPREQPLPPaplSMPHIKPPPTTPIPQLPNPQSHkhpPHLSGPSPFQMNSNLPPPPALKP 398
PHA03247 PHA03247
large tegument protein UL36; Provisional
14-158 9.22e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 9.22e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   14 SGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGypGQAPPGAYPGQAP 93
Cdd:PHA03247 2790 SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG--SVAPGGDVRRRPP 2867
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 585168931   94 PGAYPG-PTAPAYP-GPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLPLP 158
Cdd:PHA03247 2868 SRSPAAkPAAPARPpVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
PHA03378 PHA03378
EBNA-3B; Provisional
18-152 9.72e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.90  E-value: 9.72e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  18 NPQGWPGPwgNQPaggyPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQV-PPGGYPGQA-PPGAYPGQA-PP 94
Cdd:PHA03378 644 NVLVFPTP--HQP----PQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMqPPPRAPTPMrPPAAPPGRAqRP 717
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 585168931  95 GAYPGPTAP--AYPGPS-APGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVP 152
Cdd:PHA03378 718 AAATGRARPpaAAPGRArPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQP 778
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
15-143 9.73e-05

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2, streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 43.36  E-value: 9.73e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  15 GNPNPQGWPGPWGNQPAGGYPGASYPgayPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPP 94
Cdd:NF038329 162 GPAGPQGEAGPQGPAGKDGEAGAKGP---AGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPD 238
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
gi 585168931  95 GAyPGPTAPAYP-GPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFG 143
Cdd:NF038329 239 GD-PGPTGEDGPqGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAG 287
PHA03247 PHA03247
large tegument protein UL36; Provisional
29-158 9.99e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 9.99e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   29 QPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPgqAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPAY--- 105
Cdd:PHA03247 2594 QSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPP--PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARrlg 2671
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 585168931  106 --PGPSAPGAHPGQPSGPGAY---------PPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLPLP 158
Cdd:PHA03247 2672 raAQASSPPQRPRRRAARPTVgsltsladpPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL 2735
PHA03247 PHA03247
large tegument protein UL36; Provisional
17-148 1.23e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 1.23e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   17 PNPQgwPGPWGNQPAGGyPGASYPGAYPGQA-------PPGSYPGQAPPGGYPGQAPPgayPGQVPPGGYP-GQAPPGAY 88
Cdd:PHA03247 2569 PPPR--PAPRPSEPAVT-SRARRPDAPPQSArprapvdDRGDPRGPAPPSPLPPDTHA---PDPPPPSPSPaANEPDPHP 2642
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   89 PGQAPPGAYPgPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGP 148
Cdd:PHA03247 2643 PPTVPPPERP-RDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP 2701
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
19-130 1.47e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are, respectively, expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.87  E-value: 1.47e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   19 PQGWPGPWGNQpaggYPGASYPGAYPGQAPPGSYPGQapPGGYPGQAPpgaypgqVPPGGYPGQAPPGAYPGQAPPGAYP 98
Cdd:TIGR01628 380 PRMRQLPMGSP----MGGAMGQPPYYGQGPQQQFNGQ--PLGWPRMSM-------MPTPMGPGGPLRPNGLAPMNAVRAP 446
                          90       100       110
                  ....*....|....*....|....*....|..
gi 585168931   99 GPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQP 130
Cdd:TIGR01628 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLP 478
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
15-148 1.55e-04

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 42.97  E-value: 1.55e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  15 GNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPG--GYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQA 92
Cdd:NF038329 198 GETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGptGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKD 277
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 585168931  93 PPGAYPGPTAP-------AYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGP 148
Cdd:NF038329 278 GERGPVGPAGKdgqngkdGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
14-117 1.61e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are, respectively, expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.87  E-value: 1.61e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   14 SGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQA--PPGSYPG-QAPPGGYPGQAPPGAYPGQVPPGgyPGQAPPGAYPg 90
Cdd:TIGR01628 387 MGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSmmPTPMGPGgPLRPNGLAPMNAVRAPSRNAQNA--AQKPPMQPVM- 463
                          90       100
                  ....*....|....*....|....*..
gi 585168931   91 qAPPGAYPGPTAPAYPGPSAPGAHPGQ 117
Cdd:TIGR01628 464 -YPPNYQSLPLSQDLPQPQSTASQGGQ 489
PHA02682 PHA02682
ORF080 virion core protein; Provisional
29-141 1.73e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 42.16  E-value: 1.73e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  29 QPAGGYPGASYPgayPGQAPPGSYPGQAPPGGYPGQAPPGAYPGqVPPGGYPGQAPPGAYPGQA------PPGAYPGPTA 102
Cdd:PHA02682  75 RPSGQSPLAPSP---ACAAPAPACPACAPAAPAPAVTCPAPAPA-CPPATAPTCPPPAVCPAPArpapacPPSTRQCPPA 150
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|...
gi 585168931 103 PAYPGPS-APGAHPG---QPSGPGAYPPPGQPSAPGAhPAAGP 141
Cdd:PHA02682 151 PPLPTPKpAPAAKPIflhNQLPPPDYPAASCPTIETA-PAASP 192
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
15-148 2.18e-04

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 42.20  E-value: 2.18e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  15 GNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPG-QAP 93
Cdd:NF038329 132 GEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGpAGP 211
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 585168931  94 PGA--------YPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGiPAGP 148
Cdd:NF038329 212 AGPdgeagpagEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAG-PDGP 273
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
40-141 2.36e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 42.30  E-value: 2.36e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   40 PGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPS 119
Cdd:pfam09606 336 QGGQVVALGGLNHLETWNPGNFGGLGANPMQRGQPGMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPQGPGSQPPQSH 415
                          90       100
                  ....*....|....*....|..
gi 585168931  120 GPGAYPPPGQPSAPGAHPAAGP 141
Cdd:pfam09606 416 PGGMIPSPALIPSPSPQMSQQP 437
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
44-165 2.54e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.45  E-value: 2.54e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   44 PGQAPPGSYPGQA-PPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPAypgPSAPGAHPGQPSGPG 122
Cdd:pfam03154 253 TQPPPPSQVSPQPlPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPS---PAAPGQSQQRIHTPP 329
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 585168931  123 AYPPPGQPSAPGAHPaagpfgIPAGPLTVPYDLPLPGGIMPRM 165
Cdd:pfam03154 330 SQSQLQSQQPPREQP------LPPAPLSMPHIKPPPTTPIPQL 366
PHA03264 PHA03264
envelope glycoprotein D; Provisional
41-147 2.85e-04

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 41.91  E-value: 2.85e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  41 GAYPGQAPPGSYPgqAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHP-GQPS 119
Cdd:PHA03264 263 GYEPPPAPSGGSP--APPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPKPGPPRPAPDADRPeGWPS 340
                         90       100
                 ....*....|....*....|....*...
gi 585168931 120 GPGAYPPPGQPSAPGAhPAAGPFGIPAG 147
Cdd:PHA03264 341 LEAITFPPPTPATPAV-PRARPVIVGTG 367
PHA03247 PHA03247
large tegument protein UL36; Provisional
24-164 3.04e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 3.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   24 GPWGNQPAGGYPGASYPGAYPGQAPP--GSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYP--G 99
Cdd:PHA03247 2603 DDRGDPRGPAPPSPLPPDTHAPDPPPpsPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPpqR 2682
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 585168931  100 PTAPAYPGPSAPGAH----PGQPSGPGAYPPPGQPSAPGA-HPAAGPFGIPAGPLT-----VPYDLPLPGGIMPR 164
Cdd:PHA03247 2683 PRRRAARPTVGSLTSladpPPPPPTPEPAPHALVSATPLPpGPAAARQASPALPAApappaVPAGPATPGGPARP 2757
PHA03247 PHA03247
large tegument protein UL36; Provisional
33-139 3.15e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 3.15e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   33 GYPGASYPGAYPGQ-----APPGS---------YPGQAPPGGYPGQ-APPGAYPGQVPPGGYPGQAPPGAYPGQAP-PGA 96
Cdd:PHA03247  341 PRPRQHYPLGFPKRrrptwTPPSSledlsagrhHPKRASLPTRKRRsARHAATPFARGPGGDDQTRPAAPVPASVPtPAP 420
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 585168931   97 YPGP-TAPAYPGPSAPGAHPGQPSGPgAYPPPGQPSAPGAHPAA 139
Cdd:PHA03247  421 TPVPaSAPPPPATPLPSAEPGSDDGP-APPPERQPPAPATEPAP 463
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
13-154 3.87e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 41.55  E-value: 3.87e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  13 GSGNPNPQGWPGPWGNQ----PAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAY 88
Cdd:COG5164   35 STRPAGNTGGTRPAQNQgsttPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDG 114
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 585168931  89 PGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGiPAGPLTVPYD 154
Cdd:COG5164  115 GATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTTPPG-PGGSTTPPDD 179
dnaA PRK14086
chromosomal replication initiator protein DnaA;
30-137 4.45e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 41.35  E-value: 4.45e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  30 PAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPgQAPPgAYPGQAPPGAyPGPtapaYPGPS 109
Cdd:PRK14086  90 PSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLP-TARP-AYPAYQQRPE-PGA----WPRAA 162
                         90       100
                 ....*....|....*....|....*...
gi 585168931 110 APGAHPGQPSGPGAYPPPGQPSAPGAHP 137
Cdd:PRK14086 163 DDYGWQQQRLGFPPRAPYASPASYAPEQ 190
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
23-122 4.50e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.51  E-value: 4.50e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  23 PGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQ--APPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGP 100
Cdd:PRK07764 412 PAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPagNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPA 491
                         90       100
                 ....*....|....*....|..
gi 585168931 101 TAPAypGPSAPGAHPGQPSGPG 122
Cdd:PRK07764 492 AAPA--APAAPAAPAGADDAAT 511
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
47-155 5.02e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 41.27  E-value: 5.02e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  47 APPGSYPGQAPPGGYPGQAPPgayPGQVPPGGYPGQAPPGaypgQAPPGayPGPTAPAYPGPSAPGAHPGQPSGPGAYPP 126
Cdd:PRK14965 381 APAPPSAAWGAPTPAAPAAPP---PAAAPPVPPAAPARPA----AARPA--PAPAPPAAAAPPARSADPAAAASAGDRWR 451
                         90       100
                 ....*....|....*....|....*....
gi 585168931 127 PGQPSAPGAHPAAGPFGIPAGPLTVPYDL 155
Cdd:PRK14965 452 AFVAFVKGKKPALGASLEQGSPLGVSAGL 480
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
14-146 5.55e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.31  E-value: 5.55e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   14 SGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGgyPGQAPPGAYPGQAP 93
Cdd:PHA03307   97 PASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPA--AVASDAASSRQAAL 174
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 585168931   94 PGAYPGPTAPAYPGPSAPgaHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPA 146
Cdd:PHA03307  175 PLSSPEETARAPSSPPAE--PPPSTPPAAASPRPPRRSSPISASASSPAPAPG 225
dnaA PRK14086
chromosomal replication initiator protein DnaA;
57-153 5.61e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 41.35  E-value: 5.61e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  57 PPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQA----PPGAYPGPTAPAYPgPSAPGAHpgQPSGPGAYPPPGQPSA 132
Cdd:PRK14086  90 PSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRaddrPPGLPRQDQLPTAR-PAYPAYQ--QRPEPGAWPRAADDYG 166
                         90       100
                 ....*....|....*....|.
gi 585168931 133 PGAHPAAGPFGIPAGPLTVPY 153
Cdd:PRK14086 167 WQQQRLGFPPRAPYASPASYA 187
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
54-185 7.37e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 40.91  E-value: 7.37e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  54 GQAPPGGYPGQAPPGAYPGQVPPGgypgQAPPGAYPGQAPPGAYPGPTAPAypGPSAPGAHPGQPSGPGAYPPPGQPSAP 133
Cdd:PRK14971 367 DDASGGRGPKQHIKPVFTQPAAAP----QPSAAAAASPSPSQSSAAAQPSA--PQSATQPAGTPPTVSVDPPAAVPVNPP 440
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 585168931 134 GAHPAAgpfgipAGPLTVPYDLPLPGGIMPRMLITILGTVKPNANRLALDFK 185
Cdd:PRK14971 441 STAPQA------VRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIK 486
PHA03247 PHA03247
large tegument protein UL36; Provisional
17-164 8.45e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.08  E-value: 8.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   17 PNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYP----GQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYP-GQ 91
Cdd:PHA03247 2748 PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTrpavASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPaGP 2827
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   92 APPGAYPGPTAPAYP-GPSAPGAHPGQPSGPG--------AYPPPGQPSAPgAHPAAGPFGIPA-GPLTVPYDLPLPGGI 161
Cdd:PHA03247 2828 LPPPTSAQPTAPPPPpGPPPPSLPLGGSVAPGgdvrrrppSRSPAAKPAAP-ARPPVRRLARPAvSRSTESFALPPDQPE 2906

                  ...
gi 585168931  162 MPR 164
Cdd:PHA03247 2907 RPP 2909
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
9-161 8.97e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 40.51  E-value: 8.97e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   9 DALSGSGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAY 88
Cdd:COG3469   64 TAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSS 143
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 585168931  89 PGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLPLPGGI 161
Cdd:COG3469  144 AGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHV 216
PHA03247 PHA03247
large tegument protein UL36; Provisional
35-140 9.07e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.69  E-value: 9.07e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   35 PGASYPGAYPGQAPPGSYPGQAPPGGYPGQAP----PGAYPGQVPPGGYPGQAPPGAYPGQAP-PGAYPGPTAPAYPGPS 109
Cdd:PHA03247  376 KRASLPTRKRRSARHAATPFARGPGGDDQTRPaapvPASVPTPAPTPVPASAPPPPATPLPSAePGSDDGPAPPPERQPP 455
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 585168931  110 APGAHPGQPSGPGAYP----------PPGQPSAP-----GAHPAAG 140
Cdd:PHA03247  456 APATEPAPDDPDDATRkaldalrerrPPEPPGADlaellGRHPDTA 501
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
36-113 1.06e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 40.50  E-value: 1.06e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 585168931  36 GASYPGAYPGQAPPGSYPGQAPPGGYPgqaPPGAYPGQVPPGGYPGQAPPGAyPGQAPPGAYPGPTAPAYPGPSAPGA 113
Cdd:PRK14965 380 GAPAPPSAAWGAPTPAAPAAPPPAAAP---PVPPAAPARPAAARPAPAPAPP-AAAAPPARSADPAAAASAGDRWRAF 453
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
17-139 1.15e-03

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 40.32  E-value: 1.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   17 PNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGG---YPGQAPPGAYPGQVPPGGYPGQappGAYPGQAP 93
Cdd:pfam03157 167 PTSPQQSGQRQQPGQGQQLRQGQQGQQSGQGQPGYYPTSSQQPGqlqQTGQGQQGQQPERGQQGQQPGQ---GQQPGQGQ 243
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 585168931   94 PGAYPGptAPAYPGPSAPGAHPGQPSGPGAYPPPGQpSAPGAHPAA 139
Cdd:pfam03157 244 QGQQPG--QPQQLGQGQQGYYPISPQQPRQWQQSGQ-GQQGYYPTS 286
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
83-158 1.20e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.35  E-value: 1.20e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 585168931  83 APPGAYPGQAPPGAYPGPTAPAyPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLPLP 158
Cdd:PRK07764 388 AGGAGAPAAAAPSAAAAAPAAA-PAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAP 462
PHA03247 PHA03247
large tegument protein UL36; Provisional
25-163 1.23e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.31  E-value: 1.23e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   25 PWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGypGQAPPGAYPGQVPPGGYPGQAPPGAYP-------GQAPPGAY 97
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG--SVAPGGDVRRRPPSRSPAAKPAAPARPpvrrlarPAVSRSTE 2896
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 585168931   98 PGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPfgIPAGPLTVPYDLPLPGGIMP 163
Cdd:PHA03247 2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ--PPLAPTTDPAGAGEPSGAVP 2960
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
10-124 1.29e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 40.05  E-value: 1.29e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  10 ALSGSGNPNPQGW-------PGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGgypGQAPPGAYPGQVPPGGYPGQ 82
Cdd:PRK14959 380 APSGSAAEGPASGgaatiptPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPW---DDAPPAPPRSGIPPRPAPRM 456
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 585168931  83 APPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHpgQPSGPGAY 124
Cdd:PRK14959 457 PEASPVPGAPDSVASASDAPPTLGDPSDTAEH--TPSGPRTW 496
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
11-140 1.35e-03

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 38.87  E-value: 1.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   11 LSGSGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPG 90
Cdd:pfam15240  32 ISEEEGQSQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPQGPPPQGGPRPPPGKPQGPPPQGGNQ 111
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 585168931   91 QA---PPGAYPGPTAPAYPGPSAPGAHPGQPSgpgayPPPGQPSAPGAHPAAG 140
Cdd:pfam15240 112 QQgppPPGKPQGPPPQGGGPPPQGGNQQGPPP-----PPPGNPQGPPQRPPQP 159
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
56-148 1.66e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 39.60  E-value: 1.66e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  56 APPGGYPGQAPPGAYPGQVPPGGYPGQAPPGayPGQAPPGAYPGPTAPAYPGPSAPgahpgqpsgpgaYPPPGQPSAPGA 135
Cdd:NF041121  19 AAPPSPEGPAPTAASQPATPPPPAAPPSPPG--DPPEPPAPEPAPLPAPYPGSLAP------------PPPPPPGPAGAA 84
                         90
                 ....*....|...
gi 585168931 136 HPAAGPFGIPAGP 148
Cdd:NF041121  85 PGAALPVRVPAPP 97
PHA03247 PHA03247
large tegument protein UL36; Provisional
10-158 1.68e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 39.92  E-value: 1.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   10 ALSGSGNPNPQGWPGPWGNQPAGGYPGASYPGAYPgqAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYP 89
Cdd:PHA03247 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARP--PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 585168931   90 GQAPPGAYPGPTA-------PAYPGPSAPGAHPGQPSGPgAYPPPGQPSAPGAHPAAGPF---GIPAGPLTVPYDLPLP 158
Cdd:PHA03247 2804 PADPPAAVLAPAAalppaasPAGPLPPPTSAQPTAPPPP-PGPPPPSLPLGGSVAPGGDVrrrPPSRSPAAKPAAPARP 2881
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
23-140 1.95e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 39.41  E-value: 1.95e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  23 PGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPggypgqAPPGAYPgqvppggypgqAPPGAYPgqAPPGAYPGPTA 102
Cdd:PRK14950 364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPP------KEPVRET-----------ATPPPVP--PRPVAPPVPHT 424
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 585168931 103 PAYPGPSAPGAHPgQPSGPGAYPPPGQPSAPGAHPAAG 140
Cdd:PRK14950 425 PESAPKLTRAAIP-VDEKPKYTPPAPPKEEEKALIADG 461
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
46-150 2.01e-03

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 39.47  E-value: 2.01e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  46 QAPPGSYPGQAPpggYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQ-APPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAY 124
Cdd:cd23959  143 QTAPVTPFGQLP---MFGQHPPPAKPLPAAAAAQQSSASPGEVASPfASGTVSASPFATATDTAPSSGAPDGFPAEASAP 219
                         90       100
                 ....*....|....*....|....*.
gi 585168931 125 PPPGQPSAPGAHPAAGPFGIPAGPLT 150
Cdd:cd23959  220 SPFAAPASAASFPAAPVANGEAATPT 245
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
46-133 2.07e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 39.46  E-value: 2.07e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  46 QAPPGSYPGQAPPGGyPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYP 125
Cdd:PRK07994 365 LPEPEVPPQSAAPAA-SAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSEP 443

                 ....*...
gi 585168931 126 PPGQPSAP 133
Cdd:PRK07994 444 AAASRARP 451
PHA03291 PHA03291
envelope glycoprotein I; Provisional
23-125 2.11e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 39.17  E-value: 2.11e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  23 PGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTA 102
Cdd:PHA03291 174 APPLGEGSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEA 253
                         90       100
                 ....*....|....*....|...
gi 585168931 103 PAYPGPSAPGAHPGQPSGPGAYP 125
Cdd:PHA03291 254 EGTPAPPTPGGGEAPPANATPAP 276
PRK10263 PRK10263
DNA translocase FtsK; Provisional
13-141 2.19e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 39.68  E-value: 2.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   13 GSGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAP--------------PGSYPGQAPPGGYPGQAPPGAYPGQVPPGG 78
Cdd:PRK10263  365 GPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPvqpqqpyyapaaeqPAQQPYYAPAPEQPAQQPYYAPAPEQPVAG 444
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 585168931   79 YPGQAPPGAYPGQAPPGAYPGPTAPAyPGPSAPGAHPGQPSGPGAYPPPgQPSAPGAHPAAGP 141
Cdd:PRK10263  445 NAWQAEEQQSTFAPQSTYQTEQTYQQ-PAAQEPLYQQPQPVEQQPVVEP-EPVVEETKPARPP 505
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
13-150 2.23e-03

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 39.24  E-value: 2.23e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  13 GSGNPNPQGWPGPWGNQ----PAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAY 88
Cdd:COG5164    8 KTGPSDPGGVTTPAGSQgstkPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQ 87
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 585168931  89 PGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLT 150
Cdd:COG5164   88 GGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGS 149
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
17-148 2.36e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 39.38  E-value: 2.36e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   17 PNPQGWPGPWGNQPAGGYPGASYPGAyPGQAPPGSYPGQAPPGGYPG--QAPPGAYPGQVPPGGYPGQAPPGAYPGQAPP 94
Cdd:PHA03307   71 PPPGPGTEAPANESRSTPTWSLSTLA-PASPAREGSPTPPGPSSPDPppPTPPPASPPPSPAPDLSEMLRPVGSPGPPPA 149
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 585168931   95 GAYPGPTAPAYPGPSAPGAHPG------------QPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGP 148
Cdd:PHA03307  150 ASPPAAGASPAAVASDAASSRQaalplsspeetaRAPSSPPAEPPPSTPPAAASPRPPRRSSPISA 215
dnaA PRK14086
chromosomal replication initiator protein DnaA;
53-164 2.94e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 39.04  E-value: 2.94e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  53 PGQAPPGGYPGQAPPGAYPGQVPPGGYPGQA--PPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPG------AY 124
Cdd:PRK14086  95 PAPPPPHARRTSEPELPRPGRRPYEGYGGPRadDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADDYGwqqqrlGF 174
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|.
gi 585168931 125 PPPGQPSAPGAH-PAAGPFGIPAGPLTVPYDLPLPGGIMPR 164
Cdd:PRK14086 175 PPRAPYASPASYaPEQERDREPYDAGRPEYDQRRRDYDHPR 215
PHA03247 PHA03247
large tegument protein UL36; Provisional
10-159 3.03e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 39.15  E-value: 3.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   10 ALSGSGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPP-------------GSYPGQAPPGGYPGQAPPGAYPGQVPP 76
Cdd:PHA03247 2491 AAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHprmltwirgleelASDDAGDPPPPLPPAAPPAAPDRSVPP 2570
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   77 GgYPGQAPPGaypgqapPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPfgiPAGPLTVPYDLP 156
Cdd:PHA03247 2571 P-RPAPRPSE-------PAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPP---PPSPSPAANEPD 2639

                  ...
gi 585168931  157 LPG 159
Cdd:PHA03247 2640 PHP 2642
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
27-114 3.15e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 39.10  E-value: 3.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   27 GNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPggypgqAPPGAYPGQAPPGAYPGPTAPAYP 106
Cdd:PRK12270   35 DYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP------KPAAAAAAAAAPAAPPAAAAAAAP 108

                  ....*...
gi 585168931  107 GPSAPGAH 114
Cdd:PRK12270  109 AAAAVEDE 116
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
8-108 3.18e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 38.99  E-value: 3.18e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   8 NDALSGSGNPNPQGWPGPWGNQPAGGY--PGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPgayPGQVPPGGyPGQAPP 85
Cdd:PRK14971 364 QKGDDASGGRGPKQHIKPVFTQPAAAPqpSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPP---TVSVDPPA-AVPVNP 439
                         90       100
                 ....*....|....*....|...
gi 585168931  86 gayPGQAPPGAYPGPTAPAYPGP 108
Cdd:PRK14971 440 ---PSTAPQAVRPAQFKEEKKIP 459
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
14-163 4.25e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 38.59  E-value: 4.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   14 SGNPNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAP-------------PGGYPGQAPPGAYPGQ-VPPGGY 79
Cdd:pfam03154 178 SGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTqstaaphtliqqtPTLHPQRLPSPHPPLQpMTQPPP 257
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   80 PGQAPPGAYP-----GQAPPGAYP---GPTAPAYPGPSAPGAHPGQpSGPGAYPPPGQPSAPG---AHPAAGPFGIPAGP 148
Cdd:pfam03154 258 PSQVSPQPLPqpslhGQMPPMPHSlqtGPSHMQHPVPPQPFPLTPQ-SSQSQVPPGPSPAAPGqsqQRIHTPPSQSQLQS 336
                         170
                  ....*....|....*
gi 585168931  149 LTVPYDLPLPGGIMP 163
Cdd:pfam03154 337 QQPPREQPLPPAPLS 351
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
35-141 4.40e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 38.61  E-value: 4.40e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   35 PGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPG---AYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTA---PAYPGP 108
Cdd:PHA03307   49 ELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANesrSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPtppPASPPP 128
                          90       100       110
                  ....*....|....*....|....*....|...
gi 585168931  109 SAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGP 141
Cdd:PHA03307  129 SPAPDLSEMLRPVGSPGPPPAASPPAAGASPAA 161
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
20-133 5.27e-03

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 38.39  E-value: 5.27e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   20 QGWPGPWGNQPAggYPGASYPGAYPG---------QAPPGSYPGQAPPGGYPGQAPPGAYPGQVPPGGYPGQAPPGAYPG 90
Cdd:pfam03157 507 QGQPGYYPTSPL--QPGQGQPGYYPTspqqpgqgqQLGQLQQPTQGQQGQQSGQGQQGQQPGQGQQGQQPGQGQQGQQPG 584
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 585168931   91 QAP-PGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAP 133
Cdd:pfam03157 585 QGQqPGQGQPGYYPTSPQQSGQGQQPGQWQQPGQGQPGYYPTSS 628
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
76-160 6.05e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 38.04  E-value: 6.05e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  76 PGGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPgaypPPGQPSAPGAHPAAGPFGIPAGPLTVPYDL 155
Cdd:PRK07764 390 GAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAP----APAPAPPSPAGNAPAGGAPSPPPAAAPSAQ 465

                 ....*
gi 585168931 156 PLPGG 160
Cdd:PRK07764 466 PAPAP 470
PHA02682 PHA02682
ORF080 virion core protein; Provisional
64-156 6.20e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 37.53  E-value: 6.20e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  64 QAPPGAYPGQVPPggyPGQAPPGAYPGQAPPGAYPGPTAPAyPGPSAPgahpgqPSGPGAYPPPGQPSAPGAHPAAGPFG 143
Cdd:PHA02682  74 QRPSGQSPLAPSP---ACAAPAPACPACAPAAPAPAVTCPA-PAPACP------PATAPTCPPPAVCPAPARPAPACPPS 143
                         90
                 ....*....|....*..
gi 585168931 144 I----PAGPLTVPYDLP 156
Cdd:PHA02682 144 TrqcpPAPPLPTPKPAP 160
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
40-152 6.20e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 37.93  E-value: 6.20e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  40 PGAYPGQAPPGSYPGQAPPggyPGQAPPGAYPGQVPPGGYPGQAP--PGAYPGQAPPGAYPG---PTAPAYPGPSAPGAH 114
Cdd:PRK12323 441 ARGPGGAPAPAPAPAAAPA---AAARPAAAGPRPVAAAAAAAPARaaPAAAPAPADDDPPPWeelPPEFASPAPAQPDAA 517
                         90       100       110
                 ....*....|....*....|....*....|....*....
gi 585168931 115 P-GQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVP 152
Cdd:PRK12323 518 PaGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAA 556
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
25-149 6.93e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 37.74  E-value: 6.93e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  25 PWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQ---APPGAYPGQVPPGGYPGQAP---PGAYPGQAPPGAYP 98
Cdd:COG5180  329 PRPGQPTERPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEqgaPRPGSSGGDGAPFQPPNGAPqpgLGRRGAPGPPMGAG 408
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|.
gi 585168931  99 GPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPL 149
Cdd:COG5180  409 DLVQAALDGGGRETASLGGAAGGAGQGPKADFVPGDAESVSGPAGLADQAG 459
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
62-164 7.34e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 37.66  E-value: 7.34e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  62 PGQAPPGAYPGQVPPGGYPGQAPPGAYP-GQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAG 140
Cdd:PRK07764 588 VGPAPGAAGGEGPPAPASSGPPEEAARPaAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGD 667
                         90       100
                 ....*....|....*....|....
gi 585168931 141 PFGIPAGPLTVPYDLPLPGGIMPR 164
Cdd:PRK07764 668 GWPAKAGGAAPAAPPPAPAPAAPA 691
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
77-141 8.11e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 37.41  E-value: 8.11e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 585168931  77 GGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGP 141
Cdd:PRK14965 380 GAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADPAAAA 444
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
92-159 8.60e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 37.66  E-value: 8.60e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 585168931  92 APPGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLTVPYDLPLPG 159
Cdd:PRK07764 388 AGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPS 455
PHA03169 PHA03169
hypothetical protein; Provisional
23-143 9.42e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 37.26  E-value: 9.42e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931  23 PGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPPG------AYPGQVPPGGYPGQAPPGAYPGQAP--- 93
Cdd:PHA03169 105 PSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAppeshnPSPNQQPSSFLQPSHEDSPEEPEPPtse 184
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 585168931  94 -----PGAYPGPTAPAYPGPSAPGAHPGQPSGPGAYPPPGQPSAPGAHPAAGPFG 143
Cdd:PHA03169 185 pepdsPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTE 239
PRK13729 PRK13729
conjugal transfer pilus assembly protein TraB; Provisional
62-127 9.96e-03

conjugal transfer pilus assembly protein TraB; Provisional


Pssm-ID: 184281 [Multi-domain]  Cd Length: 475  Bit Score: 37.11  E-value: 9.96e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 585168931  62 PGQAPPGAYPGQVPPGGYPGQAPPGAYPGQAPPGAYPGPTAPAYPGPSA--PGAHPGQPSGPGAYPPP 127
Cdd:PRK13729 123 LGANPVTATGEPVPQMPASPPGPEGEPQPGNTPVSFPPQGSVAVPPPTAfyPGNGVTPPPQVTYQSVP 190
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
17-150 9.97e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 37.44  E-value: 9.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 585168931   17 PNPQGWPGPWGNQPAGGYPGASYPGAYPGQAPPGSYPGQAPPGGYPGQAPP--------GAYPGQVPPGGYPGQ-APPGA 87
Cdd:pfam03154 177 QSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPhtliqqtpTLHPQRLPSPHPPLQpMTQPP 256
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 585168931   88 YPGQAPPGAYPgptAPAYPGPSAPGAHPGQpSGPGAYPPPGQPSAPGAHPAAGPFGIPAGPLT 150
Cdd:pfam03154 257 PPSQVSPQPLP---QPSLHGQMPPMPHSLQ-TGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSP 315
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
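
Each hit in this report carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and every hit listed falls at or below the 0.01 E-value threshold given in the search parameters. The following is a minimal Python sketch, not part of the NCBI output, for reading those summary lines from a saved text copy of the page. The names `SUMMARY_RE`, `parse_summary`, and `passes_threshold` are hypothetical, and treating the threshold as inclusive (`<=`) is an assumption.

```python
import re

# Matches the per-hit summary line as it appears in this text report.
# The "[Multi-domain]" flag is optional; some hits omit it.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumed inclusive comparison against the report's E-value threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 184281 [Multi-domain]  Cd Length: 475  Bit Score: 37.11  E-value: 9.96e-03"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

The same pattern could be applied line by line to a saved copy of the report to collect all hits and sort them by bit score or E-value.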

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.