| Name | Accession | Description | Interval | E-value |
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 173-1173 | 0e+00 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 1090.96 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 173 GKHSMRTRRSRGSgqMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLK 252
Cdd:pfam03154 1 GKHSMRTRRSRGS--MSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLK 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 253 STKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 332
Cdd:pfam03154 79 SAKRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 333 DSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPT 412
Cdd:pfam03154 159 DSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPT 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 413 LHPPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPA 489
Cdd:pfam03154 237 LHPQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPA 316
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 490 AAHP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPAL 568
Cdd:pfam03154 317 APGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL 396
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 569 KPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPIT 647
Cdd:pfam03154 397 KPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPIT 476
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 648 PPSCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHAS 727
Cdd:pfam03154 477 PPSGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 728 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqK 807
Cdd:pfam03154 555 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-K 633
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 808 ASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAY 887
Cdd:pfam03154 634 ASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAY 713
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 888 HMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASF 967
Cdd:pfam03154 714 HMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASF 793
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 968 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 1047
Cdd:pfam03154 794 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 873
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 1048 SAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQA 1127
Cdd:pfam03154 874 SGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQA 945
|
970 980 990 1000
....*....|....*....|....*....|....*....|....*.
gi 1907157157 1128 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1173
Cdd:pfam03154 946 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
|
|
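Each hit carries a statistics line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal sketch of how such a line could be parsed into typed fields. The function name `parse_stats` and the dictionary keys are hypothetical choices for illustration, and the pattern assumes the exact field order shown in this listing.

```python
import re

# Hypothetical pattern; it assumes the field order seen in this listing:
# Pssm-ID, optional [bracketed tag], Cd Length, Bit Score, E-value.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\]\s*)?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_stats(line):
    """Parse one statistics line into a dict, or return None if it does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "kind": m["kind"],                # e.g. "Multi-domain"; None if the tag is absent
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),   # "0e+00" parses to 0.0
    }
```

Parsed this way, hits can be sorted or filtered by `e_value` or `bit_score` without further string handling.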
| SANT_MTA3_like | cd11661 | Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ... | 7-45 | 3.91e-18 |
|
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.
Pssm-ID: 212559 [Multi-domain] Cd Length: 46 Bit Score: 78.81 E-value: 3.91e-18
10 20 30
....*....|....*....|....*....|....*....
gi 1907157157 7 KRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 45
Cdd:cd11661 8 KLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
|
|
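The pairwise blocks in this listing put the query (`gi ...`) above the domain model (`Cdd:...`), one column per aligned position. The sketch below computes percent identity from two such rows. It assumes that `-` marks a gap and that lowercase letters mark residues outside the aligned block, so both are skipped. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query, subject):
    """Percent of aligned columns in which both rows carry the same residue.

    Assumption: '-' is a gap and lowercase letters are unaligned residues;
    columns containing either are excluded from the count.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # not an aligned column
        aligned += 1
        if q == s:
            identical += 1
    return 100.0 * identical / aligned if aligned else 0.0


# Rows taken from the SANT_MTA3_like (cd11661) alignment above.
query_row = "KRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK"
model_row = "KLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK"
identity = percent_identity(query_row, model_row)
```

For multi-block alignments such as the Atrophin-1 hit, the rows of each block would need to be concatenated before calling the function.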
| ZnF_GATA | smart00401 | zinc finger binding to DNA consensus sequence [AT]GATA[AG]; | 108-157 | 2.03e-15 |
|
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
Pssm-ID: 214648 [Multi-domain] Cd Length: 52 Bit Score: 71.30 E-value: 2.03e-15
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 1907157157 108 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 157
Cdd:smart00401 2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
|
|
| ZnF_GATA | cd00202 | Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] ... | 111-165 | 2.46e-15 |
|
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C
Pssm-ID: 238123 [Multi-domain] Cd Length: 54 Bit Score: 71.25 E-value: 2.46e-15
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 1907157157 111 ACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGELPPIEKPvDPPPFMFKPVK 165
Cdd:cd00202 1 ACSNCGTTTTPLWRRGPSGGSTLCNACGLYWKKHGVMRPLSKR-KKDQIKRRNRK 54
|
|
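The cd00202 description gives the zinc-finger consensus topology as C-X(2)-C-X(17)-C-X(2)-C, where X(n) stands for n residues of any type. As a rough sketch, that notation can be translated into a regular expression and scanned against a protein sequence. The function name `find_c4_fingers` and its `spacer` parameter are hypothetical. The loop length is a parameter because individual sequences may not follow the stated spacing exactly.

```python
import re


def find_c4_fingers(seq, spacer=17):
    """Return (start, matched_text) pairs for C-X(2)-C-X(spacer)-C-X(2)-C in seq.

    Start positions are 0-based. Lowercase input is upper-cased first.
    Only non-overlapping matches are reported.
    """
    pattern = re.compile(r"C.{2}C.{%d}C.{2}C" % spacer)
    return [(m.start(), m.group()) for m in pattern.finditer(seq.upper())]


# Synthetic example: two flanking residues, then a finger with a 17-residue loop.
example = "MK" + "CAAC" + "G" * 17 + "CAAC" + "K"
hits = find_c4_fingers(example)
```

A match only shows that the cysteine spacing fits the topology. It is not evidence of zinc binding or of a functional GATA-type finger.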
| GATA | pfam00320 | GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This ... | 112-147 | 9.37e-12 |
|
GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
Pssm-ID: 425605 [Multi-domain] Cd Length: 36 Bit Score: 60.41 E-value: 9.37e-12
10 20 30
....*....|....*....|....*....|....*.
gi 1907157157 112 CRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL 147
Cdd:pfam00320 1 CSNCGTTKTPLWRRGPNGNRTLCNACGLYYKKKGLK 36
|
|
| MSCRAMM_ClfA | NF033609 | MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... | 150-336 | 1.94e-09 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 62.23 E-value: 1.94e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 150 IEKPVDPP----PFMFKPVKEEDDGL----SGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSP 221
Cdd:NF033609 539 IDKPVVPEqpdePGEIEPIPEDSDSDpgsdSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDS 618
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 222 SAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRS 299
Cdd:NF033609 619 ASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDS 698
|
170 180 190
....*....|....*....|....*....|....*..
gi 1907157157 300 VNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 336
Cdd:NF033609 699 DSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 734
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 183-632 | 3.96e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.49 E-value: 3.96e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 183 RGSGQMSTLRSGRKKQPTSPDGRASPINE--DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREK 260
Cdd:PHA03247 2576 RPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD 2655
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 261 VASDTEDTDRITSKKTKTqeiSRPNSPSEGEGESSDSRSVNDEGSS-----DPKDIDQDNRSTSPSIPSPQDNESDSDSS 335
Cdd:PHA03247 2656 PAPGRVSRPRRARRLGRA---AQASSPPQRPRRRAARPTVGSLTSLadpppPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 336 AQQQMLQAQPPAlqaPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHP 415
Cdd:PHA03247 2733 PALPAAPAPPAV---PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES------LPSPWD 2803
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 416 PRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFG-LPSQPSQGQGPLGPSPAAAHPH 494
Cdd:PHA03247 2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdVRRRPPSRSPAAKPAAPARPPV 2883
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 495 STIQLPA-SQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPALKPLSS 573
Cdd:PHA03247 2884 RRLARPAvSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPW 2963
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907157157 574 LSTHHPPSAHPPPLQLmPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP 632
Cdd:PHA03247 2964 LGALVPGRVAVPRFRV-PQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPP 3021
|
|
| MSCRAMM_ClfA | NF033609 | MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... | 207-367 | 1.01e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.91 E-value: 1.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 207 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPN 285
Cdd:NF033609 716 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 795
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 286 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPS-IPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPP 363
Cdd:NF033609 796 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP 875
|
....
gi 1907157157 364 GTPQ 367
Cdd:NF033609 876 NSPK 879
|
|
| MSCRAMM_ClfA | NF033609 | MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... | 185-336 | 1.52e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 46.06 E-value: 1.52e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 185 SGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASD 264
Cdd:NF033609 628 SDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 707
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907157157 265 TE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 336
Cdd:NF033609 708 SDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
|
|
| MSCRAMM_ClfA | NF033609 | MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... | 207-336 | 2.23e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 45.67 E-value: 2.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 207 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTeDTDRITSKKTKTQEISRPNS 286
Cdd:NF033609 674 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDS 752
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 1907157157 287 PSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 336
Cdd:NF033609 753 DSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
|
|
|
| PAT1 | pfam09770 | Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... | 339-505 | 1.04e-09 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 62.75 E-value: 1.04e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 339 QM-LQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQgspatsQPPNQTQSTVAPAAHTHIQQAPtlhppr 417
Cdd:pfam09770 202 AMrAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ------QQPQQPQQHPGQGHPVTILQRP------ 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 418 lpsphpplQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQtgpLLQHP-------------GPPQPFGLPSQPSQGQGPL 484
Cdd:pfam09770 270 --------QSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQ---ILQNPnrlsaarvgypqnPQPGVQPAPAHQAHRQQGS 338
|
170 180
....*....|....*....|...
gi 1907157157 485 --GPSPAAAHPHSTIQLPASQSA 505
Cdd:pfam09770 339 fgRQAPIITHPQQLAQLSEEEKA 361
|
|
| PRK07764 | PRK07764 | DNA polymerase III subunits gamma and tau; Validated | 323-493 | 8.38e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 53.45 E-value: 8.38e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 323 PSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPA 402
Cdd:PRK07764 597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGA 676
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 403 AHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLH------SQGPPGPHSLQTGPLLQHPG-PPQPFGLPS 475
Cdd:PRK07764 677 APAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQppqaaqGASAPSPAADDPVPLPPEPDdPPDPAGAPA 756
|
170
....*....|....*...
gi 1907157157 476 QPSQGQGPLGPSPAAAHP 493
Cdd:PRK07764 757 QPPPPPAPAPAAAPAAAP 774
|
|
| PRK07764 | PRK07764 | DNA polymerase III subunits gamma and tau; Validated | 341-493 | 9.29e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 53.45 E-value: 9.29e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 341 LQAQPPALQAPSGAASAPSTAPPGTPQL-------PTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTL 413
Cdd:PRK07764 580 GDWQVEAVVGPAPGAAGGEGPPAPASSGppeeaarPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAV 659
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 414 HPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGP-PQPFGLPSQPSQGQGPLGPSPAAAH 492
Cdd:PRK07764 660 PDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQaDDPAAQPPQAAQGASAPSPAADDPV 739
|
.
gi 1907157157 493 P 493
Cdd:PRK07764 740 P 740
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 271-637 | 1.14e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.40 E-value: 1.14e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 271 ITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSAQQQMLQAQPPALQA 350
Cdd:PHA03247 2582 VTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPS-PAANEPDPHPPPTVPPPERPRDDPAPGR 2660
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 351 PSGAASAPSTAPPGTPQLPTQGPTPSAtAVPPQGSPATSQPPNQTQSTVAPAahthiqqaPTLHPPRLPsphpplqpmTA 430
Cdd:PHA03247 2661 VSRPRRARRLGRAAQASSPPQRPRRRA-ARPTVGSLTSLADPPPPPPTPEPA--------PHALVSATP---------LP 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 431 PPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQSALQPQQ 510
Cdd:PHA03247 2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPW 2802
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 511 PPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNA------------NLPPPPALKPLSSLSTHH 578
Cdd:PHA03247 2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggsvapggdvrrRPPSRSPAAKPAAPARPP 2882
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 579 PPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP-FPQHP 637
Cdd:PHA03247 2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPpRPQPP 2942
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
315-744 |
1.60e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.63 E-value: 1.60e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 315 NRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSG------AASAPSTAPPGTPQLPTQGPTPSATAV-PPQGSPA 387
Cdd:PHA03247 2565 DRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDdrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANePDPHPPP 2644
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 388 TSQPPNQTQSTVAPAA-----HTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHS---LQTG 459
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRvsrprRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSatpLPPG 2724
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 460 PLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHStiqlPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLP 539
Cdd:PHA03247 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS 2800
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 540 APQAHKHPPHLSGPSPFSLNANLPPPPALKPLSSLSTHHPPSAHPPPLQL--------------MPQSQPLPSSPAQPPG 605
Cdd:PHA03247 2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLplggsvapggdvrrRPPSRSPAAKPAAPAR 2880
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 606 lTQSQSLPPPAASHPTTGLHQVPSQSPFPQHPfvPGGPPPITPPSCPPTSTPPAGPSSSSQPPCSAAVSSGGSVPGAPSC 685
Cdd:PHA03247 2881 -PPVRRLARPAVSRSTESFALPPDQPERPPQP--QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907157157 686 PLPAVQikeeaLDEAEEPESPPPPPRSPSPEPTVvdtPSHASQSARFYKHLDRGYNSCA 744
Cdd:PHA03247 2958 AVPQPW-----LGALVPGRVAVPRFRVPQPAPSR---EAPASSTPPLTGHSLSRVSSWA 3008
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
330-485 |
3.71e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 51.14 E-value: 3.71e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 330 SDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPAtsqPPNQTQSTVAPAAhthiQQ 409
Cdd:PRK07764 367 ASDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPA---PAAAPQPAPAPAP----AP 439
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907157157 410 APTlhpPRLPSPHPPLQPMTAPPSQSSAQPHPQPslhsQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLG 485
Cdd:PRK07764 440 APP---SPAGNAPAGGAPSPPPAAAPSAQPAPAP----AAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
207-367 |
1.01e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.91 E-value: 1.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 207 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPN 285
Cdd:NF033609 716 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 795
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 286 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPS-IPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPP 363
Cdd:NF033609 796 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP 875
|
....
gi 1907157157 364 GTPQ 367
Cdd:NF033609 876 NSPK 879
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
343-461 |
2.04e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 48.56 E-value: 2.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 343 AQPPAlqAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHPPRLPSPH 422
Cdd:PRK14951 382 ARPEA--AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAA------APAAVALAPAPPA 453
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 1907157157 423 PPLQPMTAPPSQSSAQPH-PQPSLHSQGPPGPHSLQTGPL 461
Cdd:PRK14951 454 QAAPETVAIPVRVAPEPAvASAAPAPAAAPAAARLTPTEE 493
|
|
| PLN02967 |
PLN02967 |
kinase |
163-294 |
2.29e-05 |
|
kinase
Pssm-ID: 215521 [Multi-domain] Cd Length: 581 Bit Score: 48.50 E-value: 2.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 163 PVKEEDDGLSGKHSMRTRRSRgsgqmstlRSGRKKQPTSPDGRASPINEDIRssgrNSPSAASTSSNDSKAETVKKSA-- 240
Cdd:PLN02967 57 AVDEEPDENGAVSKKKPTRSV--------KRATKKTVVEISEPLEEGSELVV----NEDAALDKESKKTPRRTRRKAAaa 124
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1907157157 241 -KKVKEEAASPLKSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGES 294
Cdd:PLN02967 125 sSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEEL 179
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ |
324-405 |
2.62e-05 |
|
cytoskeleton protein RodZ
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 47.71 E-value: 2.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 324 SPQDNES---DSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVA 400
Cdd:PRK10856 155 SQNSGQSvplDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGA 234
|
....*
gi 1907157157 401 PAAHT 405
Cdd:PRK10856 235 APLPT 239
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
147-560 |
2.66e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.78 E-value: 2.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 147 LPPIEKPVDPPPFMFKPVKEEDDGLSGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAAST 226
Cdd:PHA03247 2617 LPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLT 2696
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 227 SSNDSKAETVKKSAKKVKEEAASPL---KSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDE 303
Cdd:PHA03247 2697 SLADPPPPPPTPEPAPHALVSATPLppgPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPA 2776
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 304 GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTA---PPGTPQLPTQGPTPSATAV 380
Cdd:PHA03247 2777 AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAqptAPPPPPGPPPPSLPLGGSV 2856
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 381 PPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPH-SLQTG 459
Cdd:PHA03247 2857 APGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQpPPPPP 2936
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 460 PLLQHPGPPQPFGLPSQPSQGQGP-----------------LGPSPAAA-------------HPHSTIQLPASQSALQPQ 509
Cdd:PHA03247 2937 PRPQPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrvavprfRVPQPAPSreapasstppltgHSLSRVSSWASSLALHEE 3016
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 510 QPPRE-----------------QPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLS------------GPSPFSLNA 560
Cdd:PHA03247 3017 TDPPPvslkqtlwppddtedsdADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPeagarespssqfGPPPLSANA 3096
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
316-457 |
4.21e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 47.72 E-value: 4.21e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 316 RSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTP-------SATAVPPQGSPAT 388
Cdd:pfam09770 204 RAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVtilqrpqSPQPDPAQPSIQP 283
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907157157 389 SQPPNQTQSTVAPAAHTHIQQAPTLhpPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPP---GPHSLQ 457
Cdd:pfam09770 284 QAQQFHQQPPPVPVQPTQILQNPNR--LSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPiitHPQQLA 353
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
272-453 |
5.30e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 47.67 E-value: 5.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 272 TSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSA--QQQMLQAQPPALQ 349
Cdd:PRK07764 600 PPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDggDGWPAKAGGAAPA 679
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 350 APSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMT 429
Cdd:PRK07764 680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPP 759
|
170 180
....*....|....*....|....
gi 1907157157 430 APPSQSSAQPHPQPSLHSQGPPGP 453
Cdd:PRK07764 760 PPPAPAPAAAPAAAPPPSPPSEEE 783
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
349-499 |
6.12e-05 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 47.39 E-value: 6.12e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 349 QAPSGAASAPSTAPPGTPQlptQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPM 428
Cdd:PRK10263 738 DGPHEPLFTPIVEPVQQPQ---QPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ 814
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907157157 429 TAPPS-QSSAQPHPQPSLHSQGP-PGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQL 499
Cdd:PRK10263 815 PQYQQpQQPVAPQPQYQQPQQPVaPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVEPVDTFAL 887
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
298-403 |
1.08e-04 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 46.81 E-value: 1.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 298 RSVNDEGSSDPK--DIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPgTPQLPTQGPTP 375
Cdd:PRK12270 17 QYLADPNSVDPSwrEFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP-KPAAAAAAAAA 95
|
90 100 110
....*....|....*....|....*....|.
gi 1907157157 376 SATAVPPQGSPATSQPPNQTQSTV---APAA 403
Cdd:PRK12270 96 PAAPPAAAAAAAPAAAAVEDEVTPlrgAAAA 126
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
185-336 |
1.52e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 46.06 E-value: 1.52e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 185 SGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASD 264
Cdd:NF033609 628 SDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 707
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907157157 265 TE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 336
Cdd:NF033609 708 SDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau |
343-554 |
2.21e-04 |
|
DNA polymerase III subunit gamma/tau
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 45.64 E-value: 2.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 343 AQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPnqtqstvAPAAHTHIQQAPTLHPPrlpsph 422
Cdd:PRK12323 379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP-------APEALAAARQASARGPG------ 445
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 423 pplqPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPllqhPGPPQPFGLPSQPSQGQGP---LGPSPAAAHPHSTIQL 499
Cdd:PRK12323 446 ----GAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA----PARAAPAAAPAPADDDPPPweeLPPEFASPAPAQPDAA 517
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 1907157157 500 PASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPS 554
Cdd:PRK12323 518 PAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
207-336 |
2.23e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 45.67 E-value: 2.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 207 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTeDTDRITSKKTKTQEISRPNS 286
Cdd:NF033609 674 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDS 752
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 1907157157 287 PSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 336
Cdd:NF033609 753 DSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
|
|
| PRK14949 |
PRK14949 |
DNA polymerase III subunits gamma and tau; Provisional |
230-400 |
3.18e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237863 [Multi-domain] Cd Length: 944 Bit Score: 45.10 E-value: 3.18e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 230 DSKAETVKKSAKKVKEEAAsPLKSTKRQREKVASDTEdtdriTSKKTKTQEISRPNSPSEGEGESSDSRSVNdEGSSDPK 309
Cdd:PRK14949 630 SPKEGDGKKSSADRKPKTP-PSRAPPASLSKPASSPD-----ASQTSASFDLDPDFELATHQSVPEAALASG-SAPAPPP 702
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 310 DIDQDNRstsPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATS 389
Cdd:PRK14949 703 VPDPYDR---PPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQ 779
|
170
....*....|.
gi 1907157157 390 QPPNQTQSTVA 400
Cdd:PRK14949 780 DTELNLVLLSS 790
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
342-462 |
4.71e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 44.32 E-value: 4.71e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 342 QAQPPALQAPSGAASAPSTAPPGTPQLPTQGPT---PSATAVPPQGSPATSQPPNqtQSTVAPAAHTHIQQAPTLHPPRL 418
Cdd:PRK14951 387 AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAaapPAPVAAPAAAAPAAAPAAA--PAAVALAPAPPAQAAPETVAIPV 464
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1907157157 419 PSPHPPLQPMTAPPSqssaQPHPQPSLHSQGPPGPHSLQTGPLL 462
Cdd:PRK14951 465 RVAPEPAVASAAPAP----AAAPAAARLTPTEEGDVWHATVQQL 504
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
323-483 |
6.85e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 43.70 E-value: 6.85e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 323 PSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQgspATSQPPNQTQSTVAPA 402
Cdd:PRK07994 368 PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQ---LQRAQGATKAKKSEPA 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 403 AHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQG 482
Cdd:PRK07994 445 AASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPELAAKLAAEAIERD 524
|
.
gi 1907157157 483 P 483
Cdd:PRK07994 525 P 525
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau |
298-410 |
9.14e-04 |
|
DNA polymerase III subunit gamma/tau
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 43.23 E-value: 9.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 298 RSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAlqAPSGAASAPSTAPPGTPQLPTQGPTPSA 377
Cdd:PRK14971 363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPS--APQSATQPAGTPPTVSVDPPAAVPVNPP 440
|
90 100 110
....*....|....*....|....*....|....*..
gi 1907157157 378 TAVPPQGSPATSQPPNQ----TQSTVAPAAHTHIQQA 410
Cdd:PRK14971 441 STAPQAVRPAQFKEEKKipvsKVSSLGPSTLRPIQEK 477
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
142-454 |
1.23e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 43.14 E-value: 1.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 142 KKYGELPPIEK---------PVDPPPFMFKPVKEEDDGLSGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASpined 212
Cdd:PTZ00449 491 KSKKKLAPIEEedsdkhdepPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAK----- 565
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 213 irssgRNSPSAASTSSNDSKAETVKKSAKKVKEEaasplKSTKRQRekvaSDTEDTDRITSKKTKTQEI----SRPNSPS 288
Cdd:PTZ00449 566 -----EHKPSKIPTLSKKPEFPKDPKHPKDPEEP-----KKPKRPR----SAQRPTRPKSPKLPELLDIpkspKRPESPK 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 289 EGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIP-SPQDNES--DSDSSAQQQMLQAQPPALQAPSGAASAPSTAP--P 363
Cdd:PTZ00449 632 SPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPfDPKFKEKfyDDYLDAAAKSKETKTTVVLDESFESILKETLPetP 711
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 364 GTP-----QLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQP----MTAPPSQ 434
Cdd:PTZ00449 712 GTPfttprPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEdihaETGEPDE 791
|
330 340
....*....|....*....|
gi 1907157157 435 SSAQPHpQPSLHSQGPPGPH 454
Cdd:PTZ00449 792 AMKRPD-SPSEHEDKPPGDH 810
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA |
316-487 |
1.48e-03 |
|
chromosomal replication initiator protein DnaA
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 42.89 E-value: 1.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 316 RSTSPSIPSPQDNESDSDSSAQQQMLQAQP--PALQAPSGAASAPSTAPPGTPQLPTQGPTPSAtavpPQGSPATSQPPn 393
Cdd:PRK14086 115 RRPYEGYGGPRADDRPPGLPRQDQLPTARPayPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRA----PYASPASYAPE- 189
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 394 qtQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQpmtaPPSQSSAQPHPQPS---LHSQGPPGPHSLQTGPLLQHPGPPQP 470
Cdd:PRK14086 190 --QERDREPYDAGRPEYDQRRRDYDHPRPDWDR----PRRDRTDRPEPPPGaghVHRGGPGPPERDDAPVVPIRPSAPGP 263
|
170
....*....|....*..
gi 1907157157 471 FglPSQPSQGQGPLGPS 487
Cdd:PRK14086 264 L--AAQPAPAPGPGEPT 278
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
332-555 |
2.11e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 42.33 E-value: 2.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 332 SDSSAQQQML-QAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATA------------------------VPPQGSP 386
Cdd:pfam09770 92 SDAIEEEQVRfNRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVrtgyekykepepipdlqvdaslwgVAPKKAA 171
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 387 ATSQPPnqtqsTVAPAAHTHIQQAPTLHPPRLPSPHPPLQpmTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPG 466
Cdd:pfam09770 172 APAPAP-----QPAAQPASLPAPSRKMMSLEEVEAAMRAQ--AKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 467 PPQPFGLPSQPSQGQGPlgpsPAAAHPHSTI-QLPASQSALQPQQPPREQPLPPAPLAMPHI---------KPPPTTPIP 536
Cdd:pfam09770 245 QPQQQPQQPQQHPGQGH----PVTILQRPQSpQPDPAQPSIQPQAQQFHQQPPPVPVQPTQIlqnpnrlsaARVGYPQNP 320
|
250
....*....|....*....
gi 1907157157 537 QLPAPQAHKHPPHLSGPSP 555
Cdd:pfam09770 321 QPGVQPAPAHQAHRQQGSF 339
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
340-470 |
2.17e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 42.01 E-value: 2.17e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 340 MLQAQPP-ALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqQAPTLHPPRL 418
Cdd:PRK14951 361 LLAFKPAaAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP-----VAAPAAAAPA 435
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1907157157 419 PSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLlqHPGPPQP 470
Cdd:PRK14951 436 AAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPA--PAAAPAA 485
|
|
| PRK10927 |
PRK10927 |
cell division protein FtsN |
244-470 |
2.53e-03 |
|
cell division protein FtsN
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 41.59 E-value: 2.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 244 KEEAASPLKSTKRQREKVASDTEDTDR-ITSKKTKTQEISRPNSPSEGeGESSDSRSVNDEGSSDPKDIDQDNRSTSPSI 322
Cdd:PRK10927 58 KKEESETLQSQKVTGNGLPPKPEERWRyIKELESRQPGVRAPTEPSAG-GEVKTPEQLTPEQRQLLEQMQADMRQQPTQL 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 323 PSPQDNESDSDSsaQQQMLQAQPpalQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPatsQPPNQTQStvapa 402
Cdd:PRK10927 137 VEVPWNEQTPEQ--RQQTLQRQR---QAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQP---RQSKPAST----- 203
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907157157 403 ahthiqQAPtlhpprlpsphppLQPMTAPPSQSSAQPHPQpslhsqgppgphslQTGPLLQHPGPPQP 470
Cdd:PRK10927 204 ------QQP-------------YQDLLQTPAHTTAQSKPQ--------------QAAPVTRAADAPKP 238
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
343-432 |
2.82e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 41.72 E-value: 2.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 343 AQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSAtavPPQGSPATsqPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPH 422
Cdd:PRK14950 366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK---EPVRETAT--PPPVPPRPVAPPVPHTPESAPKLTRAAIPVDE 440
|
90
....*....|
gi 1907157157 423 PPLQPMTAPP 432
Cdd:PRK14950 441 KPKYTPPAPP 450
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
346-691 |
2.93e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 2.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 346 PALQAPSGAASApsTAPPGTPQlPTQGPTPSATAVPPQGSPATSQPPNQTQ-STVAPAAHTHIQQAPTLHPPRLPSPHPP 424
Cdd:PHA03247 2478 PVYRRPAEARFP--FAAGAAPD-PGGGGPPDPDAPPAPSRLAPAILPDEPVgEPVHPRMLTWIRGLEELASDDAGDPPPP 2554
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 425 LQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAH--------PHST 496
Cdd:PHA03247 2555 LPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHapdppppsPSPA 2634
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 497 IQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGP-SPFSLNANLPPPPALKPLSSLS 575
Cdd:PHA03247 2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTvGSLTSLADPPPPPPTPEPAPHA 2714
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 576 THHPPSAHPPPLQLMPQSQPLPSSPAQP--------PGLTQSQSLPPPAASHPTTGLHQVPSQSPFPQHPfvpGGPPPIT 647
Cdd:PHA03247 2715 LVSATPLPPGPAAARQASPALPAAPAPPavpagpatPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT---RPAVASL 2791
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 1907157157 648 PPSCPPTSTPPAGPSSSSQPPCSAAVSSGGSVPGAPSCPLPAVQ 691
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ 2835
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau |
318-500 |
3.03e-03 |
|
DNA polymerase III subunit gamma/tau
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.79 E-value: 3.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 318 TSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQ----GSPATSQPPN 393
Cdd:PRK12323 400 AAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAaagpRPVAAAAAAA 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 394 QTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQT---GPLLQHPGPPQP 470
Cdd:PRK12323 480 PARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPApaaAPAPRAAAATEP 559
|
170 180 190
....*....|....*....|....*....|
gi 1907157157 471 FGLPSQPSQGQGPLGPSPAAAHPHSTIQLP 500
Cdd:PRK12323 560 VVAPRPPRASASGLPDMFDGDWPALAARLP 589
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
344-432 |
3.39e-03 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 41.80 E-value: 3.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 344 QPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPqGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHP 423
Cdd:PRK12270 37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPP-AAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
|
....*....
gi 1907157157 424 PLQPMTAPP 432
Cdd:PRK12270 116 EVTPLRGAA 124
|
|
| PRK08581 |
PRK08581 |
amidase domain-containing protein |
200-462 |
3.44e-03 |
|
amidase domain-containing protein
Pssm-ID: 236304 [Multi-domain] Cd Length: 619 Bit Score: 41.70 E-value: 3.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 200 TSPDGRASPINEDIRSSGRNSPSAASTSsNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITS---KKT 276
Cdd:PRK08581 21 TSPTAYADDPQKDSTAKTTSHDSKKSND-DETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDfiyKNL 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 277 KTQEISRPNSPSEGEGESSDSRSVNDEGSSDpKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAL----QAPS 352
Cdd:PRK08581 100 PQTNINQLLTKNKYDDNYSLTTLIQNLFNLN-SDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKadnqKAPS 178
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 353 GAASAPST-------APPGTPQLPTQGPTPSATAVPPQGS--------------------PATSQPPNQTQSTVAPAAHT 405
Cdd:PRK08581 179 SNNTKPSTsnkqpnsPKPTQPNQSNSQPASDDTANQKSSSkdnqsmsdsaldsildqyseDAKKTQKDYASQSKKDKTET 258
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 1907157157 406 HIQQAPTLHPPRLPSPhpplqpmTAPPSQSSAQPHPQPSLHSQgppgpHSLQTGPLL 462
Cdd:PRK08581 259 SNTKNPQLPTQDELKH-------KSKPAQSFENDVNQSNTRST-----SLFETGPSL 303
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
352-467 |
4.47e-03 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 41.25 E-value: 4.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 352 SGAASAPSTAPpgTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTlhpprlPSPHPPLQPMTAP 431
Cdd:PHA03269 17 LIIANLNTNIP--IPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPT------PAASEKFDPAPAP 88
|
90 100 110
....*....|....*....|....*....|....*...
gi 1907157157 432 PSQSSAQPHPQ--PSLHSQGPPGPHSLQTGPLLQHPGP 467
Cdd:PHA03269 89 HQAASRAPDPAvaPQLAAAPKPDAAEAFTSAAQAHEAP 126
|
|
| PTZ00108 |
PTZ00108 |
DNA topoisomerase 2-like protein; Provisional |
141-315 |
4.72e-03 |
|
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain] Cd Length: 1388 Bit Score: 41.19 E-value: 4.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 141 FKKYGELPPIEKPVDPPPFMFKPVKEEDDglsgKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNS 220
Cdd:PTZ00108 1223 SDQEDDEEQKTKPKKSSVKRLKSKKNNSS----KSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNG 1298
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 221 PSAASTSSNDSKAETVKKSAKKVKEeaasPLKSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEgESSDSRSV 300
Cdd:PTZ00108 1299 GSKPSSPTKKKVKKRLEGSLAALKK----KKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSS-EDDDDSEV 1373
|
170
....*....|....*
gi 1907157157 301 NDEGSSDPKDIDQDN 315
Cdd:PTZ00108 1374 DDSEDEDDEDDEDDD 1388
|
|
| PRK14949 |
PRK14949 |
DNA polymerase III subunits gamma and tau; Provisional |
197-506 |
5.62e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237863 [Multi-domain] Cd Length: 944 Bit Score: 40.86 E-value: 5.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 197 KQPTSPDGRASPINEDIRSSgrNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITSKKT 276
Cdd:PRK14949 473 EASSSLDADNSAVPEQIDST--AEQSVVNPSVTDTQVDDTSASNNSAADNTVDDNYSAEDTLESNGLDEGDYAQDSAPLD 550
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 277 KTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNrSTSPSIPSPQDNESDSDSS--------AQQQMLQAQPPAL 348
Cdd:PRK14949 551 AYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPS-SQSLSPISAVTTAAASLADddildavlAARDSLLSDLDAL 629
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 349 QAPSGAASAPSTA--PPGTPQLPTQGPTPSATAVPP--QGSPATSQPPNQTQSTV--APAAHTHIQQAPTLHPPRLPSPH 422
Cdd:PRK14949 630 SPKEGDGKKSSADrkPKTPPSRAPPASLSKPASSPDasQTSASFDLDPDFELATHqsVPEAALASGSAPAPPPVPDPYDR 709
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 423 PplqPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLlqHPGPPQPFGLPSQPSQGQGPlGPSPAAAHPHSTIQLPAS 502
Cdd:PRK14949 710 P---PWEEAPEVASANDGPNNAAEGNLSESVEDASNSEL--QAVEQQATHQPQVQAEAQSP-ASTTALTQTSSEVQDTEL 783
|
....
gi 1907157157 503 QSAL 506
Cdd:PRK14949 784 NLVL 787
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
331-436 |
6.55e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 40.53 E-value: 6.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 331 DSDSSAQQQMLQAQPPALQAPSgAASAPSTAPPGTPQLPTQGPTPSatavPPQGSPATSQPPNQTQSTVAPAAHTHIQQA 410
Cdd:PRK14971 366 GDDASGGRGPKQHIKPVFTQPA-AAPQPSAAAAASPSPSQSSAAAQ----PSAPQSATQPAGTPPTVSVDPPAAVPVNPP 440
|
90 100
....*....|....*....|....*.
gi 1907157157 411 PTLHPPRLPSPHPPLQPMtaPPSQSS 436
Cdd:PRK14971 441 STAPQAVRPAQFKEEKKI--PVSKVS 464
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
368-505 |
7.00e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 40.74 E-value: 7.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 368 LPTQGPTPSATAVPPQGSPAtSQPPNQTQSTVAPAAhthiqqaptlhpprlpsphpplqpMTAPPSQSSAQPHPQPSLHS 447
Cdd:PRK07764 385 LGVAGGAGAPAAAAPSAAAA-APAAAPAPAAAAPAA------------------------AAAPAPAAAPQPAPAPAPAP 439
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907157157 448 QGPPGPHSLQTGPLLQHP----GPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQSA 505
Cdd:PRK07764 440 APPSPAGNAPAGGAPSPPpaaaPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPA 501
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
342-476 |
7.89e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 40.44 E-value: 7.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 342 QAQPPALQAPSGAAS--APSTAPPGTPQLPTQGPTPsatAVPPQGSPATSQPPNQTQSTVAP--AAHTHIQQAPTLHPPR 417
Cdd:PHA03378 688 QWAPGTMQPPPRAPTpmRPPAAPPGRAQRPAAATGR---ARPPAAAPGRARPPAAAPGRARPpaAAPGRARPPAAAPGRA 764
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907157157 418 LPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQ 476
Cdd:PHA03378 765 RPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQ 823
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
286-502 |
7.92e-03 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 39.92 E-value: 7.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 286 SPSEGEGESSDSRSVNDEGSSDpkdiDQDNRSTspsiPSP-QDNESDSDSSAQQQMlqAQPPALQAPSGAASAPstAPPG 364
Cdd:PRK10905 24 STSSSDQTASGEKSIDLAGNAT----DQANGVQ----PAPgTTSAEQTAGNTQQDV--SLPPISSTPTQGQTPV--ATDG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 365 TPQLPTQG------------------------PTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPS 420
Cdd:PRK10905 92 QQRVEVQGdlnnaltqpqnqqqlnnvavnstlPTEPATVAPVRNGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQAT 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 421 PHPPLQPMTAPP--SQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQ 498
Cdd:PRK10905 172 AKTEPKPVAQTPkrTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYTLQ 251
|
....
gi 1907157157 499 LPAS 502
Cdd:PRK10905 252 LSSS 255
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
351-496 |
8.53e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 40.23 E-value: 8.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 351 PSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSphpplqpmta 430
Cdd:PRK07994 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQ---------- 430
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907157157 431 PPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHST 496
Cdd:PRK07994 431 RAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVAT 496
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
375-495 |
8.71e-03 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 40.10 E-value: 8.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 375 PSATAVPPQGSPATSQPPNQtqstvAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPH 454
Cdd:PHA03269 23 NTNIPIPELHTSAATQKPDP-----APAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPD 97
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1907157157 455 SLQTGPLLQHPGPPQPFGLPSQPSQGQGPL---------GPSPAAAHPHS 495
Cdd:PHA03269 98 PAVAPQLAAAPKPDAAEAFTSAAQAHEAPAdagtsaaskKPDPAAHTQHS 147
|
|
| PTZ00395 |
PTZ00395 |
Sec24-related protein; Provisional |
195-488 |
9.41e-03 |
|
Sec24-related protein; Provisional
Pssm-ID: 185594 [Multi-domain] Cd Length: 1560 Bit Score: 40.44 E-value: 9.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 195 RKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPlkstkrqrekvaSDTEDTDRITSK 274
Cdd:PTZ00395 267 RGASSAAESGYAHHRGSNIASHTPNDNIMHAANNPLNNTNDAQRNAIQGDLVRGAP------------NDKNSFDRGNEK 334
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 275 KTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSipSPQDNESDSDSSAQQQmlqAQPPALQAPSGA 354
Cdd:PTZ00395 335 TYQIYGGFHDGSPNAASAGAPFNGLGNQADGGHINQVHPDARGAWAG--GPHSNASYNCAAYSNA---AQSNAAQSNAGF 409
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 355 ASAPSTAPPGTPQlPTQGPTPSATAV--PPQGSPATSQPPN-QTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAP 431
Cdd:PTZ00395 410 SNAGYSNPGNSNP-GYNNAPNSNTPYnnPPNSNTPYSNPPNsNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAY 488
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 1907157157 432 PSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGlpSQPSQGQGPLGPSP 488
Cdd:PTZ00395 489 QHRAANQPAANLPTANQPAANNFHGAAGNSVGNPFASRPFG--SAPYGGNAATTADP 543
|
|
|