|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
46-413 |
5.24e-10 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 63.42 E-value: 5.24e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 46 PPQASLSIPVSRGLPQQSSPQQLLSLQGLHSTSLLNGPMLQRALLLQQLQGLDQFAMPPATYDGASLTMPTATLGNLRAF 125
Cdd:PHA03247 2619 PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL 2698
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 126 NVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPP--------------PVGVPINPSQLNHSGRNTQKQARTPSST 191
Cdd:PHA03247 2699 ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPaapappavpagpatPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 192 TPNRKTVPledREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPE-PEPFETLEPPAKRCRSSEESTEKGPTGQP 270
Cdd:PHA03247 2779 PPRRLTRP---AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 271 QARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQAlqiQTQPkllrQAQTQTSPEHLAPQQDQVPTQAQSQEQTSEK 350
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR---STES----FALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 357527439 351 TQDQPQTWPQGSVPPPEQAsGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDRSREASA 413
Cdd:PHA03247 2929 PQPPPPPPPRPQPPLAPTT-DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
|
| zf-C2H2_jaz |
pfam12171 |
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ... |
576-600 |
1.66e-05 |
|
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.
Pssm-ID: 432381 [Multi-domain] Cd Length: 27 Bit Score: 42.16 E-value: 1.66e-05
|
| ZnF_U1 |
smart00451 |
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ... |
692-725 |
1.57e-03 |
|
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Pssm-ID: 197732 [Multi-domain] Cd Length: 35 Bit Score: 36.85 E-value: 1.57e-03
10 20 30
....*....|....*....|....*....|....
gi 357527439 692 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 725
Cdd:smart00451 3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
|
|
| GIY-YIG_SF super family |
cl15257 |
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large ... |
545-612 |
2.65e-03 |
|
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions. The actual alignment was detected with superfamily member cd10442:
Pssm-ID: 472790 Cd Length: 92 Bit Score: 37.73 E-value: 2.65e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 545 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 612
Cdd:cd10442 6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
46-413 |
5.24e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 63.42 E-value: 5.24e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 46 PPQASLSIPVSRGLPQQSSPQQLLSLQGLHSTSLLNGPMLQRALLLQQLQGLDQFAMPPATYDGASLTMPTATLGNLRAF 125
Cdd:PHA03247 2619 PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL 2698
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 126 NVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPP--------------PVGVPINPSQLNHSGRNTQKQARTPSST 191
Cdd:PHA03247 2699 ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPaapappavpagpatPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 192 TPNRKTVPledREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPE-PEPFETLEPPAKRCRSSEESTEKGPTGQP 270
Cdd:PHA03247 2779 PPRRLTRP---AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 271 QARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQAlqiQTQPkllrQAQTQTSPEHLAPQQDQVPTQAQSQEQTSEK 350
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR---STES----FALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 357527439 351 TQDQPQTWPQGSVPPPEQAsGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDRSREASA 413
Cdd:PHA03247 2929 PQPPPPPPPRPQPPLAPTT-DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
|
| zf-C2H2_jaz |
pfam12171 |
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ... |
576-600 |
1.66e-05 |
|
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.
Pssm-ID: 432381 [Multi-domain] Cd Length: 27 Bit Score: 42.16 E-value: 1.66e-05
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
156-407 |
7.23e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.10 E-value: 7.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 156 RQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPNRKTvPLEDREDPtegsEEATELQMDTcedqdSL--VGPDSM 233
Cdd:pfam09770 101 RFNRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRT-GYEKYKEP----EPIPDLQVDA-----SLwgVAPKKA 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 234 LSEPQVPEPEPFETLEPPAKRCRSSEESTEKGPTGQPQARVQPQT--------QMTAPKQTQTPDRLPEPPEVQMLPRIQ 305
Cdd:pfam09770 171 AAPAPAPQPAAQPASLPAPSRKMMSLEEVEAAMRAQAKKPAQQPApapaqppaAPPAQQAQQQQQFPPQIQQQQQPQQQP 250
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 306 PQALQIQTQPKLLRQAQTQTSPEhLAPQQDQVPTQAQSQEQTSEKTQDQP---------QTWPQGSVPPPEQASGPACAT 376
Cdd:pfam09770 251 QQPQQHPGQGHPVTILQRPQSPQ-PDPAQPSIQPQAQQFHQQPPPVPVQPtqilqnpnrLSAARVGYPQNPQPGVQPAPA 329
|
250 260 270
....*....|....*....|....*....|.
gi 357527439 377 EPQLSSHAAEAGSDPDKALPEPVSAQSSEDR 407
Cdd:pfam09770 330 HQAHRQQGSFGRQAPIITHPQQLAQLSEEEK 360
|
|
| ZnF_U1 |
smart00451 |
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ... |
692-725 |
1.57e-03 |
|
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Pssm-ID: 197732 [Multi-domain] Cd Length: 35 Bit Score: 36.85 E-value: 1.57e-03
10 20 30
....*....|....*....|....*....|....
gi 357527439 692 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 725
Cdd:smart00451 3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
|
|
| GIY-YIG_PLEs |
cd10442 |
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This ... |
545-612 |
2.65e-03 |
|
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This model corresponds to the EN domain of PLEs that contains the catalytic module of the GIY-YIG endonucleases of group I bacterial/organellar introns, as well as bacterial UvrC DNA repair proteins. It can cleave DNA with low nucleotide sequence specificity. However, the PLEs EN domain is distinct from other GIY-YIG endonucleases by the presence of a well-conserved CCHH motif (CX(2-7)CX(33-39)HX(3-5)H, X can be any residue). The role of the CCHH motif has not yet been identified. Penelope-like elements (PLEs) represent a novel class of eukaryotic retroelements, which do not belong to either long terminal repeat (LTR) retrotransposons or non-LTR retrotransposons (often called LINEs), but instead form a sister clade to telomerase reverse transcriptases (TERTs), highly specialized non-mobile reverse transcriptases (RTs) which are responsible for the addition of telomeric repeats to the ends of eukaryotic chromosomes. The single open reading frame (ORF) encoded by PLE consists of two principal domains, RT domain and endonuclease (EN) domain, joined by a linker region of variable length. Both domains are functionally active.
Pssm-ID: 198389 Cd Length: 92 Bit Score: 37.73 E-value: 2.65e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 545 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 612
Cdd:cd10442 6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
|
|
|
|
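The CCHH motif quoted in the GIY-YIG_PLEs description above, CX(2-7)CX(33-39)HX(3-5)H, can be read as a regular expression. The following is a minimal sketch, assuming that X(m-n) means "between m and n residues of any kind"; the names `CCHH_MOTIF` and `find_cchh` are hypothetical and not part of any NCBI tool, and the test sequence is synthetic rather than taken from a real protein.

```python
import re

# Regex reading of CX(2-7)CX(33-39)HX(3-5)H, where X is any residue.
# Assumption: X(m-n) denotes m to n arbitrary residues.
CCHH_MOTIF = re.compile(r"C.{2,7}C.{33,39}H.{3,5}H")


def find_cchh(seq: str):
    """Return (start, end, match) for non-overlapping motif hits.

    Coordinates are 1-based and inclusive, matching the interval
    convention used in the hit tables.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in CCHH_MOTIF.finditer(seq.upper())
    ]


# Synthetic sequence built to satisfy the pattern (illustration only).
synthetic = "MK" + "C" + "AA" + "C" + "G" * 35 + "H" + "LLL" + "H" + "EE"

for start, end, hit in find_cchh(synthetic):
    print(f"{start}-{end}: {hit}")
```

A pattern hit of this kind only indicates that the spacing of the Cys and His residues is compatible with the motif; it is not a substitute for the profile-based alignment and E-value reported in the tables.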
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
46-413 |
5.24e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 63.42 E-value: 5.24e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 46 PPQASLSIPVSRGLPQQSSPQQLLSLQGLHSTSLLNGPMLQRALLLQQLQGLDQFAMPPATYDGASLTMPTATLGNLRAF 125
Cdd:PHA03247 2619 PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL 2698
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 126 NVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPP--------------PVGVPINPSQLNHSGRNTQKQARTPSST 191
Cdd:PHA03247 2699 ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPaapappavpagpatPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 192 TPNRKTVPledREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPE-PEPFETLEPPAKRCRSSEESTEKGPTGQP 270
Cdd:PHA03247 2779 PPRRLTRP---AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 271 QARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQAlqiQTQPkllrQAQTQTSPEHLAPQQDQVPTQAQSQEQTSEK 350
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR---STES----FALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 357527439 351 TQDQPQTWPQGSVPPPEQAsGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDRSREASA 413
Cdd:PHA03247 2929 PQPPPPPPPRPQPPLAPTT-DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
47-419 |
1.78e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 58.41 E-value: 1.78e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 47 PQASLSIPVSRGLPQQSSPQQLLSlqglhSTSLLNGPMLQRALLLQQLQGLDQFAMPPATYDGASLT---MPTATLGNLR 123
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVS-----ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPArpaRPPTTAGPPA 2768
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 124 AFNVTAPSLAAPSLTPPQMVTPnLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSS--TTPNRKTVPLE 201
Cdd:PHA03247 2769 PAPPAAPAAGPPRRLTRPAVAS-LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAqpTAPPPPPGPPP 2847
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 202 DREdPTEGSEEATelqmdtcedqdslvGPDSMLSEPQVPEPEPFETLEPPAKRCRSSEESTEKGPTGQPQARVQPQTQMT 281
Cdd:PHA03247 2848 PSL-PLGGSVAPG--------------GDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQ 2912
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 282 APKQTQTPDRLPEPPEVQMLPRIQPQAlQIQTQPKLLRQAQTQTSPE-------HLAPQQDQVPTQAQSQEQTSEKTqdq 354
Cdd:PHA03247 2913 APPPPQPQPQPPPPPQPQPPPPPPPRP-QPPLAPTTDPAGAGEPSGAvpqpwlgALVPGRVAVPRFRVPQPAPSREA--- 2988
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 357527439 355 pqtwPQGSVPPPEQASGPAcatepqLSSHAAEAGSDPDKAlPEPVS-----------AQSSEDRSREASAGGLDLG 419
Cdd:PHA03247 2989 ----PASSTPPLTGHSLSR------VSSWASSLALHEETD-PPPVSlkqtlwppddtEDSDADSLFDSDSERSDLE 3053
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
139-401 |
7.46e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 7.46e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 139 PPQMVTPNLQqffPQATRQSLLGPPPVGVPINPSQlnhsgrnTQKQARTPSSTTPNRKTVPLEDREDPtEGSEEATELQM 218
Cdd:PHA03247 2551 PPPPLPPAAP---PAAPDRSVPPPRPAPRPSEPAV-------TSRARRPDAPPQSARPRAPVDDRGDP-RGPAPPSPLPP 2619
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 219 DTCEDQDSLVGPDSMLSEPQVPEPEPFETLEPPAK-----RCRSSEESTEKGPTGQPQARVQPQTQMTAPKQTQTPDRLP 293
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDdpapgRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLA 2699
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 294 EPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPE-HLAPQQDQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGP 372
Cdd:PHA03247 2700 DPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAlPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
|
250 260 270
....*....|....*....|....*....|
gi 357527439 373 A-CATEPQLSSHAAEAGSDPDKALPEPVSA 401
Cdd:PHA03247 2780 PrRLTRPAVASLSESRESLPSPWDPADPPA 2809
|
|
| zf-C2H2_jaz |
pfam12171 |
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ... |
576-600 |
1.66e-05 |
|
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.
Pssm-ID: 432381 [Multi-domain] Cd Length: 27 Bit Score: 42.16 E-value: 1.66e-05
|
| PRK10927 |
PRK10927 |
cell division protein FtsN; |
240-407 |
1.83e-05 |
|
cell division protein FtsN;
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 47.37 E-value: 1.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 240 PEPEP----FETLEPPAKRCRSSEESTEKGPTGQPQARVQPQTQMTAPKQT---QTPDRLPEPPEVQMLPRIQPQALQIQ 312
Cdd:PRK10927 77 PKPEErwryIKELESRQPGVRAPTEPSAGGEVKTPEQLTPEQRQLLEQMQAdmrQQPTQLVEVPWNEQTPEQRQQTLQRQ 156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 313 TQpkllRQAQTQTSPEHLAPQQDQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQ--ASGPACATEPQLSSHAAEAGSD 390
Cdd:PRK10927 157 RQ----AQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQdlLQTPAHTTAQSKPQQAAPVTRA 232
|
170
....*....|....*..
gi 357527439 391 PDKalPEPVSAQSSEDR 407
Cdd:PRK10927 233 ADA--PKPTAEKKDERR 247
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
194-407 |
2.54e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 44.69 E-value: 2.54e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 194 NRKTVPLEDREDPTEGSEEATE----LQMDTCEDQDSLVGPDSMLSEPQVPEPEPfETLEP-----PAKRCRSSEESTEK 264
Cdd:PRK10263 297 NRATQPEYDEYDPLLNGAPITEpvavAAAATTATQSWAAPVEPVTQTPPVASVDV-PPAQPtvawqPVPGPQTGEPVIAP 375
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 265 GPTGQPQA--RVQPQTQMTAPKQTqtpdrlPEPPEVqmlPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVPTQAQ 342
Cdd:PRK10263 376 APEGYPQQsqYAQPAVQYNEPLQQ------PVQPQQ---PYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNA 446
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 357527439 343 SQeqtsekTQDQPQTWPQGSVPPPEQASGPACATEPQlsSHAAEAGSDPDKALPEPVSAQSSEDR 407
Cdd:PRK10263 447 WQ------AEEQQSTFAPQSTYQTEQTYQQPAAQEPL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
229-378 |
5.17e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.92 E-value: 5.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 229 GPDSMLSEPQV-PEPEPFETLEPPAKrcrsseestekgpTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQ 307
Cdd:PRK10263 739 GPHEPLFTPIVePVQQPQQPVAPQQQ-------------YQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQ 805
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 357527439 308 ALQIQTQPKLLRQAQTQTSPEHLAPQQDQVPTQAQSQEQ-----TSEKTQDQPQTWPQGSVPPPEQASGPACATEP 378
Cdd:PRK10263 806 QPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTllhplLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVEP 881
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
156-407 |
7.23e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.10 E-value: 7.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 156 RQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPNRKTvPLEDREDPtegsEEATELQMDTcedqdSL--VGPDSM 233
Cdd:pfam09770 101 RFNRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRT-GYEKYKEP----EPIPDLQVDA-----SLwgVAPKKA 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 234 LSEPQVPEPEPFETLEPPAKRCRSSEESTEKGPTGQPQARVQPQT--------QMTAPKQTQTPDRLPEPPEVQMLPRIQ 305
Cdd:pfam09770 171 AAPAPAPQPAAQPASLPAPSRKMMSLEEVEAAMRAQAKKPAQQPApapaqppaAPPAQQAQQQQQFPPQIQQQQQPQQQP 250
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 306 PQALQIQTQPKLLRQAQTQTSPEhLAPQQDQVPTQAQSQEQTSEKTQDQP---------QTWPQGSVPPPEQASGPACAT 376
Cdd:pfam09770 251 QQPQQHPGQGHPVTILQRPQSPQ-PDPAQPSIQPQAQQFHQQPPPVPVQPtqilqnpnrLSAARVGYPQNPQPGVQPAPA 329
|
250 260 270
....*....|....*....|....*....|.
gi 357527439 377 EPQLSSHAAEAGSDPDKALPEPVSAQSSEDR 407
Cdd:pfam09770 330 HQAHRQQGSFGRQAPIITHPQQLAQLSEEEK 360
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
130-391 |
1.45e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 1.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 130 PSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSG-------RNTQKQARTPSSTTPNRKTVPLED 202
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPApgrvsrpRRARRLGRAAQASSPPQRPRRRAA 2688
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 203 RedPTegseeatelqmdtcedqdslVGPDSMLSEPQVPEPEPfETLEPPAKRCRSSEESTEKGPTGQPQARVQPQTQM-- 280
Cdd:PHA03247 2689 R--PT--------------------VGSLTSLADPPPPPPTP-EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAvp 2745
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 281 TAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQT-QPKLLRQAQTQTSPEHLAPQQDQVPTQAQSQEQTSEKTQDQPQTwP 359
Cdd:PHA03247 2746 AGPATPGGPARPARPPTTAGPPAPAPPAAPAAGpPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAS-P 2824
|
250 260 270
....*....|....*....|....*....|....
gi 357527439 360 QGSVPPPEQA--SGPACATEPQLSSHAAEAGSDP 391
Cdd:PHA03247 2825 AGPLPPPTSAqpTAPPPPPGPPPPSLPLGGSVAP 2858
|
|
| ZnF_U1 |
smart00451 |
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ... |
692-725 |
1.57e-03 |
|
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Pssm-ID: 197732 [Multi-domain] Cd Length: 35 Bit Score: 36.85 E-value: 1.57e-03
10 20 30
....*....|....*....|....*....|....
gi 357527439 692 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 725
Cdd:smart00451 3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
|
|
| GIY-YIG_PLEs |
cd10442 |
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This ... |
545-612 |
2.65e-03 |
|
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This model corresponds to the EN domain of PLEs that contains the catalytic module of the GIY-YIG endonucleases of group I bacterial/organellar introns, as well as bacterial UvrC DNA repair proteins. It can cleave DNA with low nucleotide sequence specificity. However, the PLEs EN domain is distinct from other GIY-YIG endonucleases by the presence of a well-conserved CCHH motif (CX(2-7)CX(33-39)HX(3-5)H, X can be any residue). The role of the CCHH motif has not yet been identified. Penelope-like elements (PLEs) represent a novel class of eukaryotic retroelements, which do not belong to either long terminal repeat (LTR) retrotransposons or non-LTR retrotransposons (often called LINEs), but instead form a sister clade to telomerase reverse transcriptases (TERTs), highly specialized non-mobile reverse transcriptases (RTs) which are responsible for the addition of telomeric repeats to the ends of eukaryotic chromosomes. The single open reading frame (ORF) encoded by PLE consists of two principal domains, RT domain and endonuclease (EN) domain, joined by a linker region of variable length. Both domains are functionally active.
Pssm-ID: 198389 Cd Length: 92 Bit Score: 37.73 E-value: 2.65e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 545 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 612
Cdd:cd10442 6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
296-407 |
2.82e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.22 E-value: 2.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 296 PEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVPTQAQSQEQTSEKTQDQPQTwPQGSVPPPEQASGPaca 375
Cdd:PRK10263 751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA-PQPQYQQPQQPVAP--- 826
|
90 100 110
....*....|....*....|....*....|..
gi 357527439 376 tEPQLSSHAAEAGSDPDKALPEPVSAQSSEDR 407
Cdd:PRK10263 827 -QPQYQQPQQPVAPQPQDTLLHPLLMRNGDSR 857
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
229-417 |
3.33e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 41.12 E-value: 3.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 229 GPDSMLSEPQVPEPEPFETLEPPAKRCR------SSEESTEKGPTGQPQARVQPQTQMTAPKQT---QTPDRLPEPPEVQ 299
Cdd:PRK07764 602 APASSGPPEEAARPAAPAAPAAPAAPAPagaaaaPAEASAAPAPGVAAPEHHPKHVAVPDASDGgdgWPAKAGGAAPAAP 681
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 300 MLPRIQPQALQIQTQPKllRQAQTQTSPEHLAPQQDQVPTQAQSQEQTsektQDQPQTWPQGSVPPPEQASGPACATEPQ 379
Cdd:PRK07764 682 PPAPAPAAPAAPAGAAP--AQPAPAPAATPPAGQADDPAAQPPQAAQG----ASAPSPAADDPVPLPPEPDDPPDPAGAP 755
|
170 180 190
....*....|....*....|....*....|....*....
gi 357527439 380 LSSHAAEAGSDPDKALPEP-VSAQSSEDRSREASAGGLD 417
Cdd:PRK07764 756 AQPPPPPAPAPAAAPAAAPpPSPPSEEEEMAEDDAPSMD 794
|
|
| PRK12757 |
PRK12757 |
cell division protein FtsN; Provisional |
266-373 |
6.10e-03 |
|
cell division protein FtsN; Provisional
Pssm-ID: 237191 [Multi-domain] Cd Length: 256 Bit Score: 39.26 E-value: 6.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 266 PTGQPQARVQPQ---------TQMTAPKQtQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPehlAPQQDQ 336
Cdd:PRK12757 68 PSAGGEVNSPTQltdeqrqllEQMQADMR-QQPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQPPATTAQP---QPVTPP 143
|
90 100 110
....*....|....*....|....*....|....*..
gi 357527439 337 VPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPA 373
Cdd:PRK12757 144 RQTTAPVQPQTPAPVRTQPAAPVTQAVEAPKVEAEKE 180
|
|
| PRK10927 |
PRK10927 |
cell division protein FtsN; |
249-369 |
6.71e-03 |
|
cell division protein FtsN;
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 39.28 E-value: 6.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 249 EPPAKRCRSSEESTEKGPTGQPQARVQPQTQMTAPKQTQTpdRLPEPPEVQMLPRIQPQALQIQTQPKLLrQAQTQTSPE 328
Cdd:PRK10927 144 QTPEQRQQTLQRQRQAQQLAEQQRLAQQSRTTEQSWQQQT--RTSQAAPVQAQPRQSKPASTQQPYQDLL-QTPAHTTAQ 220
|
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 357527439 329 HLAPQQDQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQA 369
Cdd:PRK10927 221 SKPQQAAPVTRAADAPKPTAEKKDERRWMVQCGSFRGAEQA 261
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
260-395 |
8.16e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 39.58 E-value: 8.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 357527439 260 ESTEKGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQAlqiQTQPKLLRQAQTQTSPEHLAPQQDQVPT 339
Cdd:PRK07764 379 ERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAA---APQPAPAPAPAPAPPSPAGNAPAGGAPS 455
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 357527439 340 QAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATEPQLSSH-AAEAGSDPDKAL 395
Cdd:PRK07764 456 PPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAApAAPAGADDAATL 512
|
|
|