Conserved domains on  [gi|217330641|ref|NP_060439|]

protein IWS1 homolog isoform 1 [Homo sapiens]


List of domain hits

Name Accession Description Interval E-value
TFIIS_I super family cl00146
N-terminal domain (domain I) of transcription elongation factor S-II (TFIIS); similar to a ...
528-792 3.78e-29

N-terminal domain (domain I) of transcription elongation factor S-II (TFIIS); similar to a domain found in elongin A and CRSP70; likely to be involved in transcription; domain I from TFIIS interacts with RNA polymerase II holoenzyme


The actual alignment was detected with superfamily member COG5139:

Pssm-ID: 469629  Cd Length: 397  Bit Score: 120.57  E-value: 3.78e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 528 DFEMMLQRKKSMSGKRRRNRDGGTFISDADDVVSAMIVKMNEAAEEDRQLNNQKKPALKKLTLLPAVVMHLKKQDLKETF 607
Cdd:COG5139  126 ELGDTGDRQLKAPAASRARRKEDLLEQTVDEISLRLKKRMQDAAKKDNANNLEGRPATGKIKNLPEVSDVLMKKALQDTI 205
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 608 IDSGVMSAIKEWLSPLPDRSLPALKIREELLKILQELPsVSQETLKHSGIGRAVMYLYKHPKESRSNKDMAGKLINEWSR 687
Cdd:COG5139  206 LDNNILDSVRGWLEPLPDKSLPNIKIQKSLLDVLKTLP-IHTEHLVESGVGRIVYFYTISKKEEKEVRRSAKALVQEWTR 284
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 688 PIFGLTSNYKGmTREEREQRDLEQMPQRRRMNSTGGQTpRRDLEKVLTGEEKALRPGDPGFCARARV---PMPSNKDYVV 764
Cdd:COG5139  285 PIIKPSGNYRD-KRIMQLEFDSEKLRKKSVMDSAKNRK-KKSSGEDPTSRGSSVQTLYEQAAARRNRaaaPAQTTTDYKY 362
                        250       260       270
                 ....*....|....*....|....*....|....*
gi 217330641 765 RP-------KWNVEMESSRFQATSKKGISRLDKQM 792
Cdd:COG5139  363 APvsnlsavPTNARAVGVGSTLNNSEMYKRLTSRL 397
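Each hit above carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value). A minimal sketch for reading that line into a structured record follows; the `HitSummary` class and `parse_hit_summary` function are hypothetical names introduced for illustration, not part of any NCBI tool, and the pattern assumes the field order shown on this page.

```python
import re
from dataclasses import dataclass


@dataclass
class HitSummary:
    """Fields from one 'Pssm-ID: ... E-value: ...' summary line (hypothetical record)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the field order seen on this page; the [Multi-domain] tag is optional.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_hit_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if it does not match."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["bits"]),
        e_value=float(m["evalue"]),
    )


if __name__ == "__main__":
    line = "Pssm-ID: 469629  Cd Length: 397  Bit Score: 120.57  E-value: 3.78e-29"
    print(parse_hit_summary(line))
```

Keeping the E-value as a float makes it straightforward to sort hits or filter them against a chosen significance threshold.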
MSCRAMM_ClfA super family cl41352
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
102-442 6.25e-08

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


The actual alignment was detected with superfamily member NF033609:

Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 56.46  E-value: 6.25e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 102 EPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGhASDSENeDVGKHPASDSEIEELQKSPASDSETEDALKPQ 181
Cdd:NF033609 555 EPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDS-ASDSDS-ASDSDSASDSDSASDSDSASDSDSASDSDSAS 632
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 182 ISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQvSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRHQAS 261
Cdd:NF033609 633 DSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 711
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 262 DSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPA---SDSETEDASRHKQKPESDDDSDRENK 338
Cdd:NF033609 712 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSD 791
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 339 GEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVLSDSEDEEKASAKKSRVVSDAD 418
Cdd:NF033609 792 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNN 871
                        330       340
                 ....*....|....*....|....
gi 217330641 419 DSDSDAVSDKSGKREKTIASDSEE 442
Cdd:NF033609 872 VVPPNSPKNGTNASNKNEAKDSKE 895
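The ClfA description above names a long run of Ser-Asp dipeptide repeats as a feature of the model. As a rough sketch of how such runs could be measured in an alignment row, the hypothetical helper `longest_dipeptide_run` below strips gap characters, ignores case, and reports the longest uninterrupted stretch of a given dipeptide. The two rows are copied from the alignment block covering query residues 182-261.

```python
import re


def longest_dipeptide_run(seq: str, dipeptide: str = "SD") -> int:
    """Return the largest number of consecutive copies of `dipeptide` in `seq`.

    Gap characters ('-') are removed and case is ignored, so alignment rows
    can be passed in directly.
    """
    cleaned = seq.replace("-", "").upper()
    pattern = f"(?:{re.escape(dipeptide.upper())})+"
    runs = re.findall(pattern, cleaned)
    return max((len(run) // len(dipeptide) for run in runs), default=0)


if __name__ == "__main__":
    # Rows copied from the alignment block for query residues 182-261.
    query_row = ("ISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQvSDSESEEPPRHQ"
                 "ASDSENEELPKPRISDSESEDPPRHQAS")
    model_row = ("DSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDS"
                 "DSDSDSDSDSDSDSDSDSDSDSDSDSD")
    print("query longest SD run:", longest_dipeptide_run(query_row))
    print("model longest SD run:", longest_dipeptide_run(model_row))
```

Comparing the two numbers gives a quick sense of how much of the match rests on the repetitive, serine- and acidic-rich composition of the query rather than on a shared domain architecture.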
PTZ00482 super family cl27491
membrane-attack complex/perforin (MACPF) Superfamily; Provisional
9-176 6.76e-03

membrane-attack complex/perforin (MACPF) Superfamily; Provisional


The actual alignment was detected with superfamily member PTZ00482:

Pssm-ID: 240433 [Multi-domain]  Cd Length: 844  Bit Score: 40.23  E-value: 6.76e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   9 DQSDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREdglpkghhvtDSENDEPLNlNASDSESEELH 88
Cdd:PTZ00482  88 DDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANN----------DQTNDFDQD-DSSNSQTDQGL 156
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  89 RQKDSDSESEERAEPPASDSENE-DVNQHGSDSESEETRKLPGSDSENEELLNghaSDSENEDVGkhpASDSEIEELQKS 167
Cdd:PTZ00482 157 KQSVNLSSAEKLIEEKKGQTENTfKFYNFGNDGEEAAAKDGGKSKSSDPGPLN---DSDGQGDDG---DPESAEEDKAAS 230

                 ....*....
gi 217330641 168 PASDSETED 176
Cdd:PTZ00482 231 NTRAAYTKA 239
 
Name Accession Description Interval E-value
COG5139 COG5139
Uncharacterized conserved protein [Function unknown];
528-792 3.78e-29

Uncharacterized conserved protein [Function unknown];


Pssm-ID: 227468  Cd Length: 397  Bit Score: 120.57  E-value: 3.78e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 528 DFEMMLQRKKSMSGKRRRNRDGGTFISDADDVVSAMIVKMNEAAEEDRQLNNQKKPALKKLTLLPAVVMHLKKQDLKETF 607
Cdd:COG5139  126 ELGDTGDRQLKAPAASRARRKEDLLEQTVDEISLRLKKRMQDAAKKDNANNLEGRPATGKIKNLPEVSDVLMKKALQDTI 205
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 608 IDSGVMSAIKEWLSPLPDRSLPALKIREELLKILQELPsVSQETLKHSGIGRAVMYLYKHPKESRSNKDMAGKLINEWSR 687
Cdd:COG5139  206 LDNNILDSVRGWLEPLPDKSLPNIKIQKSLLDVLKTLP-IHTEHLVESGVGRIVYFYTISKKEEKEVRRSAKALVQEWTR 284
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 688 PIFGLTSNYKGmTREEREQRDLEQMPQRRRMNSTGGQTpRRDLEKVLTGEEKALRPGDPGFCARARV---PMPSNKDYVV 764
Cdd:COG5139  285 PIIKPSGNYRD-KRIMQLEFDSEKLRKKSVMDSAKNRK-KKSSGEDPTSRGSSVQTLYEQAAARRNRaaaPAQTTTDYKY 362
                        250       260       270
                 ....*....|....*....|....*....|....*
gi 217330641 765 RP-------KWNVEMESSRFQATSKKGISRLDKQM 792
Cdd:COG5139  363 APvsnlsavPTNARAVGVGSTLNNSEMYKRLTSRL 397
Med26 pfam08711
TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is ...
636-689 3.44e-12

TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is conserved from plants to fungi to humans, with the number and representation of individual subunits varying with species [1-2]. It is arranged into four different sections, a core, a head, a tail and a kinase-activity part, and the number of subunits within each of these is what varies with species. Overall, Mediator regulates the transcriptional activity of RNA polymerase II but it would appear that each of the four different sections has a slightly different function. Mediator exists in two major forms in human cells: a smaller form that interacts strongly with pol II and activates transcription, and a large form that does not interact strongly with pol II and does not directly activate transcription. Notably, the 'small' and 'large' Mediator complexes differ in their subunit composition: the Med26 subunit preferentially associates with the small, active complex, whereas cdk8, cyclin C, Med12 and Med13 associate with the large Mediator complex. This family includes the C-terminal region of a number of eukaryotic hypothetical proteins which are homologous to the Saccharomyces cerevisiae protein IWS1. IWS1 is known to be a Pol II transcription elongation factor and interacts with Spt6 and Spt5.


Pssm-ID: 462573 [Multi-domain]  Cd Length: 52  Bit Score: 61.76  E-value: 3.44e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 217330641  636 ELLKILQELPsVSQETLKHSGIGRAVMYLYKHPkESRSNKDMAGKLINEWSRPI 689
Cdd:pfam08711   1 KLLKKLEKLP-VTLELLKSTGIGKVVNKLRKHK-ENPEIKKLAKELVKKWKRLV 52
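To put a number on how closely the query tracks the pfam08711 model in the alignment above, one option is to count identical residues over the aligned columns. The sketch below is illustrative only: `alignment_identity` is a hypothetical helper, and it assumes that `-` marks a gap and that lowercase query letters are residues outside the model's aligned columns (here they always sit opposite a gap, so they are skipped).

```python
def alignment_identity(query_row: str, subject_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Columns where either row has a gap ('-') are skipped; comparison is
    case-insensitive.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            continue  # gap column: not part of the aligned region
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, aligned


# Rows copied from the pfam08711 alignment (query residues 636-689).
QUERY = "ELLKILQELPsVSQETLKHSGIGRAVMYLYKHPkESRSNKDMAGKLINEWSRPI"
MODEL = "KLLKKLEKLP-VTLELLKSTGIGKVVNKLRKHK-ENPEIKKLAKELVKKWKRLV"

if __name__ == "__main__":
    identical, aligned = alignment_identity(QUERY, MODEL)
    print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

Percent identity is only a coarse summary; the bit score and E-value reported for the hit come from the position-specific scoring matrix, not from simple identity counts.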
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
102-442 6.25e-08

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 56.46  E-value: 6.25e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 102 EPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGhASDSENeDVGKHPASDSEIEELQKSPASDSETEDALKPQ 181
Cdd:NF033609 555 EPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDS-ASDSDS-ASDSDSASDSDSASDSDSASDSDSASDSDSAS 632
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 182 ISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQvSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRHQAS 261
Cdd:NF033609 633 DSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 711
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 262 DSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPA---SDSETEDASRHKQKPESDDDSDRENK 338
Cdd:NF033609 712 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSD 791
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 339 GEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVLSDSEDEEKASAKKSRVVSDAD 418
Cdd:NF033609 792 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNN 871
                        330       340
                 ....*....|....*....|....
gi 217330641 419 DSDSDAVSDKSGKREKTIASDSEE 442
Cdd:NF033609 872 VVPPNSPKNGTNASNKNEAKDSKE 895
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
2-316 7.33e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.99  E-value: 7.33e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   2 DSEYYSGDQSDDGGATPVQDERDSGSDgEDDVNEQHSGSDTGSVERHSENETSDRE-DGLPKGHHVTDSENDEPLNLNAS 80
Cdd:NF033609 583 GSDSTSDSGSDSASDSDSASDSDSASD-SDSASDSDSASDSDSASDSDSASDSDSAsDSDSDSDSDSDSDSDSDSDSDSD 661
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  81 DSESEELHRQKDSDSESEERAEPPA-SDSENEDVNQHGSDSESEETRKlpgSDSENEELLNGHASDSENEDVGKHPASDS 159
Cdd:NF033609 662 SDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDS 738
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 160 EIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEElPKPQVSDSESEEPPRHQASDSEN 239
Cdd:NF033609 739 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSD 817
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 217330641 240 EELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSE 316
Cdd:NF033609 818 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK 894
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
2-295 8.55e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.99  E-value: 8.55e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   2 DSEYYSGDQSDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASD 81
Cdd:NF033609 607 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 686
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  82 SESEELHRQKDSDSESEERAEpPASDSENEDVNQHGSDSESE-ETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSE 160
Cdd:NF033609 687 DSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 765
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 161 IEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQvSDSESEEPPRHQASDSENE 240
Cdd:NF033609 766 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDS 844
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 217330641 241 ELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSEnEELP 295
Cdd:NF033609 845 DSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK-EPLP 898
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
11-329 2.74e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.33  E-value: 2.74e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   11 SDDGGATPVqderdSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNA--SDSESEELH 88
Cdd:PHA03307   28 PGDAADDLL-----SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSlsTLAPASPAR 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   89 RQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEI----EEL 164
Cdd:PHA03307  103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPlsspEET 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  165 QKSPASDSETEDALKPQISDSESEEPPRHQASDSeneePPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEElPK 244
Cdd:PHA03307  183 ARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS----ASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEC-PL 257
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  245 PRIS--------------DSESEDPPRHQASDSENEELPKP--------------RISDSESEDPPRNQASDSENEELPK 296
Cdd:PHA03307  258 PRPApitlptriweasgwNGPSSRPGPASSSSSPRERSPSPspsspgsgpapsspRASSSSSSSRESSSSSTSSSSESSR 337
                         330       340       350
                  ....*....|....*....|....*....|....
gi 217330641  297 PR-VSDSESEGPQKGPASDSETEDASRHKQKPES 329
Cdd:PHA03307  338 GAaVSPGPSPSRSPSPSRPPPPADPSSPRKRPRP 371
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
2-338 4.24e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 50.68  E-value: 4.24e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   2 DSEYYSGDQSDDGGATPVQDERDSGSDgEDDVNEQHSGSDTGSVERHSENETSDrEDGLPKGHHVTDSENDEPLNLNASD 81
Cdd:NF033609 577 DSGSDSGSDSTSDSGSDSASDSDSASD-SDSASDSDSASDSDSASDSDSASDSD-SASDSDSASDSDSDSDSDSDSDSDS 654
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  82 SESEELHRQKDSDSESEERAEppaSDSENEDVNQHGSDSESEetrklpgSDSENEELLNGHASDSENEDVGKHPASDSEI 161
Cdd:NF033609 655 DSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSD-------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 724
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 162 EELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEElPKPQVSDSESEEPPRHQASDSENEE 241
Cdd:NF033609 725 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSD 803
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 242 LPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEG-----PQKGPASDSE 316
Cdd:NF033609 804 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSnnnvvPPNSPKNGTN 883
                        330       340
                 ....*....|....*....|..
gi 217330641 317 TEDASRHKQKPESDDDSDRENK 338
Cdd:NF033609 884 ASNKNEAKDSKEPLPDTGSEDE 905
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
201-528 5.38e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 50.29  E-value: 5.38e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 201 EEPPKPRMSDSESEELPKPQVSDSEseePPRHQASDSENEELPKPRISDSESeDPPRHQASDSENEELPKpriSDSESED 280
Cdd:NF033609 540 DKPVVPEQPDEPGEIEPIPEDSDSD---PGSDSGSDSSNSDSGSDSGSDSTS-DSGSDSASDSDSASDSD---SASDSDS 612
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 281 PPRNQASDSENEELPKPRVSDSESEGPQKGpASDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKF 360
Cdd:NF033609 613 ASDSDSASDSDSASDSDSASDSDSASDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 361 HSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVLSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDS 440
Cdd:NF033609 692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 771
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 441 EEEAGKELSDKKNEEKDLFGSDSESGNEEENLIADIFGESGDEEEEEFTGFNQEDLEEEKGETQVKEAEDSDSDDNIKRG 520
Cdd:NF033609 772 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 851

                 ....*...
gi 217330641 521 KHMDFLSD 528
Cdd:NF033609 852 SDSDSESD 859
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
151-471 1.15e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.14  E-value: 1.15e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 151 VGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESeelpkpqVSDSESEEPP 230
Cdd:NF033609 544 VPEQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDS-------ASDSDSASDS 616
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 231 RHQASDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKG 310
Cdd:NF033609 617 DSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 696
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 311 PA-SDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKEGEEEKv 389
Cdd:NF033609 697 DSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD- 775
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 390 AKRKAAVLSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKEL-SDKKNEEKDLFGSDSESGNE 468
Cdd:NF033609 776 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSD 855

                 ...
gi 217330641 469 EEN 471
Cdd:NF033609 856 SES 858
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
2-203 1.86e-05

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 48.45  E-value: 1.86e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641     2 DSEYYSGDQSDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASD 81
Cdd:TIGR00927  698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKED 777
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641    82 SESEELHRQKDSDSESEERAEPP--ASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDvgkhpasDS 159
Cdd:TIGR00927  778 EDEGEIQAGEDGEMKGDEGAEGKveHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQD-------EK 850
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 217330641   160 EIEELQKSPASDSETEDALKPQISDSESEEPPRHQaSDSENEEP 203
Cdd:TIGR00927  851 GVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEE-EEEENEEP 893
Ebola_NP pfam05505
Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These ...
42-293 2.91e-05

Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These proteins are responsible for encapsidation of genomic RNA. It has been found that nucleoprotein DNA vaccines can offer protection from the virus.


Pssm-ID: 398905  Cd Length: 717  Bit Score: 47.81  E-value: 2.91e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   42 TGSVERHSENETSdredglpkGHHVTDSENDEPLNLNASDSESEelhrQKDSDSESEERAEPPASDSENEdvNQHGSDSE 121
Cdd:pfam05505 388 TEAITAASLPKTS--------GHYDDDDDIPFPGPINDDDNPGH----QDDDPTDSQDTTIPDVVVDPDD--GSYGEYQS 453
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  122 SEETrklpGSDSENEELLNGhaSDSENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPR--HQASDSE 199
Cdd:pfam05505 454 YSEN----GMNAPDDLVLLN--EDEDDLEDTKPVPNRSTKGGQQKNSQKGQHIEGRQTQSRPIQNVPGPHRtiHHASAPL 527
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  200 NEEPPKPRMSDSESEELPKPQvsdseSEEPPRHQASDSENEELPkPRISDSESED-------------PPRHQASDSENE 266
Cdd:pfam05505 528 TDNDRRNEPSGSTSPRMLTPI-----NEEADPLDDADDETSSLP-PLESDDEEQDrdgtsnrtptvapPAPVYRDHSEKK 601
                         250       260
                  ....*....|....*....|....*..
gi 217330641  267 ELPKPRISDSESEDPPRNQASDSENEE 293
Cdd:pfam05505 602 ELPQDEQQDQDHTQEARNQDSDNTQSE 628
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
253-530 5.08e-03

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 40.66  E-value: 5.08e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 253 EDPPRHQASDSENEELPKPRISDSeseDPPRNQASDSENEELPKPRVSDSESEgpqKGPASDSETEDASRHKQKPESDDD 332
Cdd:NF033609 540 DKPVVPEQPDEPGEIEPIPEDSDS---DPGSDSGSDSSNSDSGSDSGSDSTSD---SGSDSASDSDSASDSDSASDSDSA 613
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 333 SDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKegeeekvakrkaavlSDSEDEEKASAKKSR 412
Cdd:NF033609 614 SDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSD---------------SDSDSDSDSDSDSDS 678
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 413 VVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKElSDKKNEEKDLFGSDSESGNEEENLIADIFGESGDEEEEEFTGFN 492
Cdd:NF033609 679 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 757
                        250       260       270
                 ....*....|....*....|....*....|....*...
gi 217330641 493 QEDLEEEKGETQVKEAEDSDSDDNIKRGKHMDFLSDFE 530
Cdd:NF033609 758 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 795
PTZ00482 PTZ00482
membrane-attack complex/perforin (MACPF) Superfamily; Provisional
9-176 6.76e-03

membrane-attack complex/perforin (MACPF) Superfamily; Provisional


Pssm-ID: 240433 [Multi-domain]  Cd Length: 844  Bit Score: 40.23  E-value: 6.76e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   9 DQSDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREdglpkghhvtDSENDEPLNlNASDSESEELH 88
Cdd:PTZ00482  88 DDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANN----------DQTNDFDQD-DSSNSQTDQGL 156
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  89 RQKDSDSESEERAEPPASDSENE-DVNQHGSDSESEETRKLPGSDSENEELLNghaSDSENEDVGkhpASDSEIEELQKS 167
Cdd:PTZ00482 157 KQSVNLSSAEKLIEEKKGQTENTfKFYNFGNDGEEAAAKDGGKSKSSDPGPLN---DSDGQGDDG---DPESAEEDKAAS 230

                 ....*....
gi 217330641 168 PASDSETED 176
Cdd:PTZ00482 231 NTRAAYTKA 239
 
Name Accession Description Interval E-value
COG5139 COG5139
Uncharacterized conserved protein [Function unknown];
528-792 3.78e-29

Uncharacterized conserved protein [Function unknown];


Pssm-ID: 227468  Cd Length: 397  Bit Score: 120.57  E-value: 3.78e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 528 DFEMMLQRKKSMSGKRRRNRDGGTFISDADDVVSAMIVKMNEAAEEDRQLNNQKKPALKKLTLLPAVVMHLKKQDLKETF 607
Cdd:COG5139  126 ELGDTGDRQLKAPAASRARRKEDLLEQTVDEISLRLKKRMQDAAKKDNANNLEGRPATGKIKNLPEVSDVLMKKALQDTI 205
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 608 IDSGVMSAIKEWLSPLPDRSLPALKIREELLKILQELPsVSQETLKHSGIGRAVMYLYKHPKESRSNKDMAGKLINEWSR 687
Cdd:COG5139  206 LDNNILDSVRGWLEPLPDKSLPNIKIQKSLLDVLKTLP-IHTEHLVESGVGRIVYFYTISKKEEKEVRRSAKALVQEWTR 284
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 688 PIFGLTSNYKGmTREEREQRDLEQMPQRRRMNSTGGQTpRRDLEKVLTGEEKALRPGDPGFCARARV---PMPSNKDYVV 764
Cdd:COG5139  285 PIIKPSGNYRD-KRIMQLEFDSEKLRKKSVMDSAKNRK-KKSSGEDPTSRGSSVQTLYEQAAARRNRaaaPAQTTTDYKY 362
                        250       260       270
                 ....*....|....*....|....*....|....*
gi 217330641 765 RP-------KWNVEMESSRFQATSKKGISRLDKQM 792
Cdd:COG5139  363 APvsnlsavPTNARAVGVGSTLNNSEMYKRLTSRL 397
Med26 pfam08711
TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is ...
636-689 3.44e-12

TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is conserved from plants to fungi to humans - the number and representation of individual subunits varying with species [1-2]. It is arranged into four different sections, a core, a head, a tail and a kinase-activity part, and the number of subunits within each of these is what varies with species. Overall, Mediator regulates the transcriptional activity of RNA polymerase II but it would appear that each of the four different sections has a slightly different function. Mediator exists in two major forms in human cells: a smaller form that interacts strongly with pol II and activates transcription, and a large form that does not interact strongly with pol II and does not directly activate transcription. Notably, the 'small' and 'large' Mediator complexes differ in their subunit composition: the Med26 subunit preferentially associates with the small, active complex, whereas cdk8, cyclin C, Med12 and Med13 associate with the large Mediator complex. This family includes the C-terminal region of a number of eukaryotic hypothetical proteins which are homologous to the Saccharomyces cerevisiae protein IWS1. IWS1 is known to be a Pol II transcription elongation factor and interacts with Spt6 and Spt5.


Pssm-ID: 462573 [Multi-domain]  Cd Length: 52  Bit Score: 61.76  E-value: 3.44e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 217330641  636 ELLKILQELPsVSQETLKHSGIGRAVMYLYKHPkESRSNKDMAGKLINEWSRPI 689
Cdd:pfam08711   1 KLLKKLEKLP-VTLELLKSTGIGKVVNKLRKHK-ENPEIKKLAKELVKKWKRLV 52
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
102-442 6.25e-08

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
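
The "long run of Ser-Asp dipeptide repeats" mentioned above can be measured directly on a sequence row copied from the alignments on this page. The following is a minimal sketch, not part of the CDD output: the function name longest_dipeptide_run is hypothetical, and it assumes the row is pasted as a plain string in which "-" marks an alignment gap.

```python
import re


def longest_dipeptide_run(sequence: str, dipeptide: str = "SD") -> int:
    """Return the largest number of back-to-back copies of `dipeptide`."""
    # Drop alignment gap characters and normalise case before scanning
    # (lowercase letters in these alignments mark unaligned residues).
    residues = sequence.replace("-", "").upper()
    pattern = re.compile(f"(?:{re.escape(dipeptide.upper())})+")
    # Each match is one uninterrupted run; convert its length to a repeat count.
    runs = [len(m.group(0)) // len(dipeptide) for m in pattern.finditer(residues)]
    return max(runs, default=0)


# Start of one NF033609 row from the alignments on this page.
fragment = "DSDSASDSDSDSDSDSDSDSD"
print(longest_dipeptide_run(fragment))
```

Removing gaps before scanning joins residues that are adjacent in the ungapped sequence; whether that is the desired behaviour depends on whether the ungapped sequence or the aligned columns are of interest.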


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 56.46  E-value: 6.25e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 102 EPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGhASDSENeDVGKHPASDSEIEELQKSPASDSETEDALKPQ 181
Cdd:NF033609 555 EPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDS-ASDSDS-ASDSDSASDSDSASDSDSASDSDSASDSDSAS 632
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 182 ISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQvSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRHQAS 261
Cdd:NF033609 633 DSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 711
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 262 DSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPA---SDSETEDASRHKQKPESDDDSDRENK 338
Cdd:NF033609 712 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSD 791
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 339 GEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVLSDSEDEEKASAKKSRVVSDAD 418
Cdd:NF033609 792 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNN 871
                        330       340
                 ....*....|....*....|....
gi 217330641 419 DSDSDAVSDKSGKREKTIASDSEE 442
Cdd:NF033609 872 VVPPNSPKNGTNASNKNEAKDSKE 895
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
2-316 7.33e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.99  E-value: 7.33e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   2 DSEYYSGDQSDDGGATPVQDERDSGSDgEDDVNEQHSGSDTGSVERHSENETSDRE-DGLPKGHHVTDSENDEPLNLNAS 80
Cdd:NF033609 583 GSDSTSDSGSDSASDSDSASDSDSASD-SDSASDSDSASDSDSASDSDSASDSDSAsDSDSDSDSDSDSDSDSDSDSDSD 661
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  81 DSESEELHRQKDSDSESEERAEPPA-SDSENEDVNQHGSDSESEETRKlpgSDSENEELLNGHASDSENEDVGKHPASDS 159
Cdd:NF033609 662 SDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDS 738
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 160 EIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEElPKPQVSDSESEEPPRHQASDSEN 239
Cdd:NF033609 739 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSD 817
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 217330641 240 EELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSE 316
Cdd:NF033609 818 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK 894
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
2-295 8.55e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.99  E-value: 8.55e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   2 DSEYYSGDQSDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASD 81
Cdd:NF033609 607 ASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 686
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  82 SESEELHRQKDSDSESEERAEpPASDSENEDVNQHGSDSESE-ETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSE 160
Cdd:NF033609 687 DSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 765
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 161 IEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQvSDSESEEPPRHQASDSENE 240
Cdd:NF033609 766 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDS 844
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 217330641 241 ELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSEnEELP 295
Cdd:NF033609 845 DSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK-EPLP 898
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
11-329 2.74e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.33  E-value: 2.74e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   11 SDDGGATPVqderdSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNA--SDSESEELH 88
Cdd:PHA03307   28 PGDAADDLL-----SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSlsTLAPASPAR 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   89 RQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEI----EEL 164
Cdd:PHA03307  103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPlsspEET 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  165 QKSPASDSETEDALKPQISDSESEEPPRHQASDSeneePPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEElPK 244
Cdd:PHA03307  183 ARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS----ASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEC-PL 257
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  245 PRIS--------------DSESEDPPRHQASDSENEELPKP--------------RISDSESEDPPRNQASDSENEELPK 296
Cdd:PHA03307  258 PRPApitlptriweasgwNGPSSRPGPASSSSSPRERSPSPspsspgsgpapsspRASSSSSSSRESSSSSTSSSSESSR 337
                         330       340       350
                  ....*....|....*....|....*....|....
gi 217330641  297 PR-VSDSESEGPQKGPASDSETEDASRHKQKPES 329
Cdd:PHA03307  338 GAaVSPGPSPSRSPSPSRPPPPADPSSPRKRPRP 371
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
2-338 4.24e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 50.68  E-value: 4.24e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   2 DSEYYSGDQSDDGGATPVQDERDSGSDgEDDVNEQHSGSDTGSVERHSENETSDrEDGLPKGHHVTDSENDEPLNLNASD 81
Cdd:NF033609 577 DSGSDSGSDSTSDSGSDSASDSDSASD-SDSASDSDSASDSDSASDSDSASDSD-SASDSDSASDSDSDSDSDSDSDSDS 654
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  82 SESEELHRQKDSDSESEERAEppaSDSENEDVNQHGSDSESEetrklpgSDSENEELLNGHASDSENEDVGKHPASDSEI 161
Cdd:NF033609 655 DSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSD-------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 724
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 162 EELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEElPKPQVSDSESEEPPRHQASDSENEE 241
Cdd:NF033609 725 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSD 803
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 242 LPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEG-----PQKGPASDSE 316
Cdd:NF033609 804 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSnnnvvPPNSPKNGTN 883
                        330       340
                 ....*....|....*....|..
gi 217330641 317 TEDASRHKQKPESDDDSDRENK 338
Cdd:NF033609 884 ASNKNEAKDSKEPLPDTGSEDE 905
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
201-528 5.38e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 50.29  E-value: 5.38e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 201 EEPPKPRMSDSESEELPKPQVSDSEseePPRHQASDSENEELPKPRISDSESeDPPRHQASDSENEELPKpriSDSESED 280
Cdd:NF033609 540 DKPVVPEQPDEPGEIEPIPEDSDSD---PGSDSGSDSSNSDSGSDSGSDSTS-DSGSDSASDSDSASDSD---SASDSDS 612
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 281 PPRNQASDSENEELPKPRVSDSESEGPQKGpASDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKF 360
Cdd:NF033609 613 ASDSDSASDSDSASDSDSASDSDSASDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 361 HSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVLSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDS 440
Cdd:NF033609 692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 771
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 441 EEEAGKELSDKKNEEKDLFGSDSESGNEEENLIADIFGESGDEEEEEFTGFNQEDLEEEKGETQVKEAEDSDSDDNIKRG 520
Cdd:NF033609 772 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 851

                 ....*...
gi 217330641 521 KHMDFLSD 528
Cdd:NF033609 852 SDSDSESD 859
PRK08581 PRK08581
amidase domain-containing protein;
79-352 1.07e-05

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 49.02  E-value: 1.07e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  79 ASDSESEELHRQKDSDSESEEraeppasDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSEN--EDVGKHPA 156
Cdd:PRK08581  28 DDPQKDSTAKTTSHDSKKSND-------DETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNiiDFIYKNLP 100
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 157 SDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpQVSDSESEEPPRHQASD 236
Cdd:PRK08581 101 QTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIK-----NDTDTQSSKQDKADNQK 175
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 237 SENEELPKPRISDSESEDPPRHQASDSEneelpkpriSDSESEDPPRNQASDSENEEL---PKPRVSDSESEGPQKGPAS 313
Cdd:PRK08581 176 APSSNNTKPSTSNKQPNSPKPTQPNQSN---------SQPASDDTANQKSSSKDNQSMsdsALDSILDQYSEDAKKTQKD 246
                        250       260       270
                 ....*....|....*....|....*....|....*....
gi 217330641 314 DSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSD 352
Cdd:PRK08581 247 YASQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFEND 285
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
151-471 1.15e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.14  E-value: 1.15e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 151 VGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESeelpkpqVSDSESEEPP 230
Cdd:NF033609 544 VPEQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDS-------ASDSDSASDS 616
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 231 RHQASDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKG 310
Cdd:NF033609 617 DSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 696
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 311 PA-SDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKEGEEEKv 389
Cdd:NF033609 697 DSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD- 775
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 390 AKRKAAVLSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKEL-SDKKNEEKDLFGSDSESGNE 468
Cdd:NF033609 776 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSD 855

                 ...
gi 217330641 469 EEN 471
Cdd:NF033609 856 SES 858
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
91-333 1.15e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.40  E-value: 1.15e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   91 KDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEellNGHASDSENEDvGKHPASDSEieelqkSPAS 170
Cdd:PHA03307   59 AAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREG---SPTPPGPSSPD-PPPPTPPPA------SPPP 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  171 DSETeDALKPQISDSESEEPPRHQASDSENEEPPKPR-------------MSDSESEELPKPQVSDSESEEPPRHQASDS 237
Cdd:PHA03307  129 SPAP-DLSEMLRPVGSPGPPPAASPPAAGASPAAVASdaassrqaalplsSPEETARAPSSPPAEPPPSTPPAAASPRPP 207
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  238 EneelPKPRISDSESED---PPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDS----ESEGPQKG 310
Cdd:PHA03307  208 R----RSSPISASASSPapaPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEasgwNGPSSRPG 283
                         250       260
                  ....*....|....*....|...
gi 217330641  311 PASDSETEDASRHKQKPESDDDS 333
Cdd:PHA03307  284 PASSSSSPRERSPSPSPSSPGSG 306
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
50-328 1.50e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 48.92  E-value: 1.50e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  50 ENETSDREDGLPKGHHVTDSENDEPlnlnaSDSESEELHRQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLP 129
Cdd:PTZ00449 500 EEEDSDKHDEPPEGPEASGLPPKAP-----GDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPT 574
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 130 GSDSENEELLNGHASDSENEDVGKHPASDS--------EIEELQKSPASDSETEDALKPQisdseSEEPPRHQASDSENE 201
Cdd:PTZ00449 575 LSKKPEFPKDPKHPKDPEEPKKPKRPRSAQrptrpkspKLPELLDIPKSPKRPESPKSPK-----RPPPPQRPSSPERPE 649
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 202 EPPKPRMSDS-ESEELP-----KPQVSDSESEEPPRHQASDSeNEELPKPRISDSESEDPPRHQASDSENEELPKPRISD 275
Cdd:PTZ00449 650 GPKIIKSPKPpKSPKPPfdpkfKEKFYDDYLDAAAKSKETKT-TVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRD 728
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 217330641 276 SES-----EDPPRNQASDSENEELP---KPRVSDSESEGPQKG-PASDSETEDASRHKQKPE 328
Cdd:PTZ00449 729 EEFpfepiGDPDAEQPDDIEFFTPPeeeRTFFHETPADTPLPDiLAEEFKEEDIHAETGEPD 790
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
2-203 1.86e-05

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 48.45  E-value: 1.86e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641     2 DSEYYSGDQSDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASD 81
Cdd:TIGR00927  698 EIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKED 777
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641    82 SESEELHRQKDSDSESEERAEPP--ASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDvgkhpasDS 159
Cdd:TIGR00927  778 EDEGEIQAGEDGEMKGDEGAEGKveHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQD-------EK 850
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 217330641   160 EIEELQKSPASDSETEDALKPQISDSESEEPPRHQaSDSENEEP 203
Cdd:TIGR00927  851 GVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEE-EEEENEEP 893
PTZ00121 PTZ00121
MAEBL; Provisional
83-456 2.09e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 48.60  E-value: 2.09e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   83 ESEELHRQKDSDSESEERAEPPASDSENEDvnqhGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEIE 162
Cdd:PTZ00121 1392 KADEAKKKAEEDKKKADELKKAAAAKKKAD----EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  163 ELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSEN--- 239
Cdd:PTZ00121 1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEkkk 1547
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  240 -EELPKPR-ISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSET 317
Cdd:PTZ00121 1548 aDELKKAEeLKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK 1627
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  318 EDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVL 397
Cdd:PTZ00121 1628 AEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEEL 1707
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 217330641  398 SDSEDEEKASAKKSRVVSDADDSDSDAVSDKS--GKREKTIASDSEEEAGKELSDKKNEEK 456
Cdd:PTZ00121 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAeeDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768
Ebola_NP pfam05505
Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These ...
42-293 2.91e-05

Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These proteins are responsible for encapsidation of genomic RNA. It has been found that nucleoprotein DNA vaccines can offer protection from the virus.


Pssm-ID: 398905  Cd Length: 717  Bit Score: 47.81  E-value: 2.91e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   42 TGSVERHSENETSdredglpkGHHVTDSENDEPLNLNASDSESEelhrQKDSDSESEERAEPPASDSENEdvNQHGSDSE 121
Cdd:pfam05505 388 TEAITAASLPKTS--------GHYDDDDDIPFPGPINDDDNPGH----QDDDPTDSQDTTIPDVVVDPDD--GSYGEYQS 453
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  122 SEETrklpGSDSENEELLNGhaSDSENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPR--HQASDSE 199
Cdd:pfam05505 454 YSEN----GMNAPDDLVLLN--EDEDDLEDTKPVPNRSTKGGQQKNSQKGQHIEGRQTQSRPIQNVPGPHRtiHHASAPL 527
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  200 NEEPPKPRMSDSESEELPKPQvsdseSEEPPRHQASDSENEELPkPRISDSESED-------------PPRHQASDSENE 266
Cdd:pfam05505 528 TDNDRRNEPSGSTSPRMLTPI-----NEEADPLDDADDETSSLP-PLESDDEEQDrdgtsnrtptvapPAPVYRDHSEKK 601
                         250       260
                  ....*....|....*....|....*..
gi 217330641  267 ELPKPRISDSESEDPPRNQASDSENEE 293
Cdd:pfam05505 602 ELPQDEQQDQDHTQEARNQDSDNTQSE 628
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
77-336 3.00e-05

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 47.73  E-value: 3.00e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   77 LNASDSESEELHRQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPA 156
Cdd:PTZ00108 1134 LDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKP 1213
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  157 SDSEIEELQKSPASDSETEDALKpqiSDSESEEPPRHQASDSENEEPPKPRMSDSESEELPK--PQVSDSESEEPPrhqa 234
Cdd:PTZ00108 1214 DNKKSNSSGSDQEDDEEQKTKPK---KSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKnaPKRVSAVQYSPP---- 1286
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  235 sdSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASD 314
Cdd:PTZ00108 1287 --PPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSS 1364
                         250       260
                  ....*....|....*....|..
gi 217330641  315 SETEDASRHKQKPESDDDSDRE 336
Cdd:PTZ00108 1365 SEDDDDSEVDDSEDEDDEDDED 1386
PHA03321 PHA03321
tegument protein VP11/12; Provisional
187-343 3.09e-05

tegument protein VP11/12; Provisional


Pssm-ID: 223041 [Multi-domain]  Cd Length: 694  Bit Score: 47.65  E-value: 3.09e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 187 SEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRhqaSDSENE 266
Cdd:PHA03321 427 SRQPPGAPAPRRDNDPPPPPRARPGSTPACARRARAQRARDAGPEYVDPLGALRRLPAGAAPPPEPAAAPS---PATYYT 503
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 267 EL--PKPRIsdsesedPPRNQASDSENEELPKPRVSDSESEGP-------QKGPASDSETEDASRHKQK-PESDDDSDRE 336
Cdd:PHA03321 504 RMggGPPRL-------PPRNRATETLRPDWGPPAAAPPEQMEDpylepddDRFDRRDGAAAAATSHPREaPAPDDDPIYE 576

                 ....*..
gi 217330641 337 NKGEDTE 343
Cdd:PHA03321 577 GVSDSEE 583
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
115-374 2.75e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 44.60  E-value: 2.75e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   115 QHGSDSESEETRKLPGSDSENEELLNGHAS-DSENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRH 193
Cdd:TIGR00927  639 EHTGERTGEEGERPTEAEGENGEESGGEAEqEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHE 718
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   194 QASDSENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRHQAsdSENEELpkpri 273
Cdd:TIGR00927  719 GETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQA--GEDGEM----- 791
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   274 sdsESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSETEDAsrhKQKPESDDDSDRENKGEDTEMQNDSFHSDS 353
Cdd:TIGR00927  792 ---KGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQ---ELNAENQGEAKQDEKGVDGGGGSDGGDSEE 865
                          250       260
                   ....*....|....*....|.
gi 217330641   354 HMDRKKFHSSDSEEEEHKKQK 374
Cdd:TIGR00927  866 EEEEEEEEEEEEEEEEEEEEE 886
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
134-341 2.84e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 44.31  E-value: 2.84e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 134 ENEELLNGHASDSENEDVGKHPASDSEIEELQKSPASDSEtedALKPqiSDSESEEPPRHQAsdsENEEPPKPRMSDSES 213
Cdd:PRK08691 373 ENTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASA---AAMP--SEGKTAGPVSNQE---NNDVPPWEDAPDEAQ 444
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 214 EELPKPQVSD------SESEEPPRHQ-----ASDSENE----ELPKPR-ISDSESEDPPRHQASDSENEELPKPRISDSE 277
Cdd:PRK08691 445 TAAGTAQTSAksiqtaSEAETPPENQvsknkAADNETDaplsEVPSENpIQATPNDEAVETETFAHEAPAEPFYGYGFPD 524
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 217330641 278 SEDPPRnqasdsENEELPKPrvsDSESEGPQKGPASDSETEDASRHKQKPESDDDSDRENKGED 341
Cdd:PRK08691 525 NDCPPE------DGAEIPPP---DWEHAAPADTAGGGADEEAEAGGIGGNNTPSAPPPEFSTEN 579
ECM1 pfam05782
Extracellular matrix protein 1 (ECM1); This family consists of several eukaryotic ...
203-312 3.73e-04

Extracellular matrix protein 1 (ECM1); This family consists of several eukaryotic extracellular matrix protein 1 (ECM1) sequences. ECM1 has been shown to regulate endochondral bone formation, stimulate the proliferation of endothelial cells and induce angiogenesis. Mutations in the ECM1 gene can cause lipoid proteinosis, a disorder which causes generalized thickening of skin, mucosae and certain viscera. Classical features include beaded eyelid papules and laryngeal infiltration leading to hoarseness.


Pssm-ID: 461739  Cd Length: 518  Bit Score: 44.06  E-value: 3.73e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  203 PPKPR---MSDSESEELPKPQVSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESE 279
Cdd:pfam05782   9 PPQTRglpVDHPDTSQHDPPFEGQSEVQPPPSQEAIPVQEEELPPPQLPVEKKVDPPLPQEAIPLQEELPPPQLPIEQKE 88
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 217330641  280 -DPPRNQASD----SENEELPKPRVSDSESEGPQKGPA 312
Cdd:pfam05782  89 iDPPFPQQEEitpsKQREEKPAPLVGQGHPEPESWNPA 126
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
240-515 4.42e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.83  E-value: 4.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   240 EELPKPRISDSESEDPPRHQASDSENE-ELPKPRISDSESEdPPRNQASDSENE---ELPKPRVSDSESEGPQKGPASDS 315
Cdd:TIGR00927  628 GDLSKGDVAEAEHTGERTGEEGERPTEaEGENGEESGGEAE-QEGETETKGENEsegEIPAERKGEQEGEGEIEAKEADH 706
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   316 ETEDASRHKQ-----KPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDekegeeekva 390
Cdd:TIGR00927  707 KGETEAEEVEhegetEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKE---------- 776
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   391 krkaavlSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKELSDKKNEEKDLFGSDSESGNEEE 470
Cdd:TIGR00927  777 -------DEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDE 849
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 217330641   471 NLIADifGESGDEEEEEFTGFNQEDLEEEKGETQVKEAEDSDSDD 515
Cdd:TIGR00927  850 KGVDG--GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
PHA03169 PHA03169
hypothetical protein; Provisional
148-334 6.95e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 43.04  E-value: 6.95e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 148 NEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESE 227
Cdd:PHA03169  49 PAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSP 128
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 228 EpprhqaSDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGP 307
Cdd:PHA03169 129 E------SPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSP 202
                        170       180
                 ....*....|....*....|....*..
gi 217330641 308 QKGPASDSETEDASRHKQKPESDDDSD 334
Cdd:PHA03169 203 PPQSPPDEPGEPQSPTPQQAPSPNTQQ 229
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
98-263 7.42e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 43.04  E-value: 7.42e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  98 EERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEIEElqKSPASDSETE-D 176
Cdd:PRK13108 293 DEALEREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGE--STPAVEETSEaD 370
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 177 ALKPQISDSESEEPPRHQASDS-ENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEElpkPRISDSESEDP 255
Cdd:PRK13108 371 IEREQPGDLAGQAPAAHQVDAEaASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAG---PGDDPAEPDGI 447

                 ....*...
gi 217330641 256 PRHQASDS 263
Cdd:PRK13108 448 RRQDDFSS 455
PHA03247 PHA03247
large tegument protein UL36; Provisional
165-345 8.45e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 8.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  165 QKSPASDSETEDALKPQISDSESEEPPRHQASDSE---NEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQA--SDSEN 239
Cdd:PHA03247 2864 RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFalpPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPprPQPPL 2943
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  240 EELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDP-PRNQASDSENEELPKPRVSDSES-----EGPQKGPAS 313
Cdd:PHA03247 2944 APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSrEAPASSTPPLTGHSLSRVSSWASslalhEETDPPPVS 3023
                         170       180       190
                  ....*....|....*....|....*....|..
gi 217330641  314 DSETEDASRHKQkpESDDDSDRENKGEDTEMQ 345
Cdd:PHA03247 3024 LKQTLWPPDDTE--DSDADSLFDSDSERSDLE 3053
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
14-267 8.88e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.06  E-value: 8.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641    14 GGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSESEELHRQKDS 93
Cdd:TIGR00927  642 GERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGET 721
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641    94 DSESEERAEPPASDSENEDVNQHG-SDSESEETRKLPGSDSENEELLNGHASDSENEDVGK-HPASDSEIEELQKSPASD 171
Cdd:TIGR00927  722 EAEGTEDEGEIETGEEGEEVEDEGeGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEiQAGEDGEMKGDEGAEGKV 801
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   172 SETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESEEpprhQASDSENEELPKPRISDSE 251
Cdd:TIGR00927  802 EHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGS----DGGDSEEEEEEEEEEEEEE 877
                          250
                   ....*....|....*.
gi 217330641   252 SEDPPRHQaSDSENEE 267
Cdd:TIGR00927  878 EEEEEEEE-EEEENEE 892
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
67-288 2.53e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 41.57  E-value: 2.53e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   67 TDSENDEPLNLNASDSESEELHRQKDSDSESeerAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNghASDS 146
Cdd:PTZ00108 1186 ADKSKKASVVGNSKRVDSDEKRKLDDKPDNK---KSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSED--NDEF 1260
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  147 ENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEepPKPRMSDSESEELPKPQVSDSES 226
Cdd:PTZ00108 1261 SSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLE--GSLAALKKKKKSEKKTARKKKSK 1338
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 217330641  227 EEPPRHQASDSENEELPKPRISDSESEDpprhqasDSENEELpkpriSDSESEDPPRNQASD 288
Cdd:PTZ00108 1339 TRVKQASASQSSRLLRRPRKKKSDSSSE-------DDDDSEV-----DDSEDEDDEDDEDDD 1388
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
188-343 2.57e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 41.12  E-value: 2.57e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 188 EEPPRHQASDSENEEPPKPrmsdsESEELPKPQVSDSESEEPPRHQASDSENE---ELPKPRISDSESEDPPRHQASDSE 264
Cdd:PRK13108 280 EAPGALRGSEYVVDEALER-----EPAELAAAAVASAASAVGPVGPGEPNQPDdvaEAVKAEVAEVTDEVAAESVVQVAD 354
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 265 NEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDS-ESEGPQKGPASDSETEDASR----HKQKPESDDDSDRENKG 339
Cdd:PRK13108 355 RDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEaASAAPEEPAALASEAHDETEpevpEKAAPIPDPAKPDELAV 434

                 ....
gi 217330641 340 EDTE 343
Cdd:PRK13108 435 AGPG 438
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
74-288 2.78e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 41.23  E-value: 2.78e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  74 PLNLNASDS----ESEELHRQKDSDSESEERAEPP----ASDSENEDVNQHGSDSESEETRKL-PGSDSENEELLNGHAS 144
Cdd:PRK08691 360 PLAAASCDAnaviENTELQSPSAQTAEKETAAKKPqprpEAETAQTPVQTASAAAMPSEGKTAgPVSNQENNDVPPWEDA 439
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 145 DSENEDV-GKHPASDSEIE---ELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQ 220
Cdd:PRK08691 440 PDEAQTAaGTAQTSAKSIQtasEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHEAPAEPFYG 519
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 217330641 221 VSDSESEEPPRhqasdsENEELPKPrisDSESEDPPRHQASDSENEELPKpRISDSESEDPPRNQASD 288
Cdd:PRK08691 520 YGFPDNDCPPE------DGAEIPPP---DWEHAAPADTAGGGADEEAEAG-GIGGNNTPSAPPPEFST 577
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1-320 2.91e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.31  E-value: 2.91e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641    1 MDSEYYSGDQSDDGGATPVQDErDSGSDGEDDVNEqHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNAS 80
Cdd:PHA03307  100 PAREGSPTPPGPSSPDPPPPTP-PPASPPPSPAPD-LSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLS 177
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   81 DSESEElhRQKDSDSE----SEERAEPPASDSENEDVNQHGSDS------ESEETRKLPGSDSENEELLNGHASDSENED 150
Cdd:PHA03307  178 SPEETA--RAPSSPPAepppSTPPAAASPRPPRRSSPISASASSpapapgRSAADDAGASSSDSSSSESSGCGWGPENEC 255
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  151 VGKHPASDSEIEELQKSPASDSETEDAL--KPQISDSESEEPPRHQASDSEnEEPPKPRMSDSESEELPKPQVSDSESEE 228
Cdd:PHA03307  256 PLPRPAPITLPTRIWEASGWNGPSSRPGpaSSSSSPRERSPSPSPSSPGSG-PAPSSPRASSSSSSSRESSSSSTSSSSE 334
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  229 PPRHQASDS--ENEELPKPRiSDSESEDPPRHQASDSENEELPKPRISDSESEdpPRNQASDSENEELPKPRVSDSESEG 306
Cdd:PHA03307  335 SSRGAAVSPgpSPSRSPSPS-RPPPPADPSSPRKRPRPSRAPSSPAASAGRPT--RRRARAAVAGRARRRDATGRFPAGR 411
                         330
                  ....*....|....
gi 217330641  307 PQKGPASDSETEDA 320
Cdd:PHA03307  412 PRPSPLDAGAASGA 425
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
7-241 3.76e-03

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 41.13  E-value: 3.76e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641     7 SGDQSDDGGATpvQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSESEE 86
Cdd:TIGR00927  663 SGGEAEQEGET--ETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEE 740
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641    87 LHRQKDSDSESEERAEPPASDSENEDVNQHGSDSESE----ETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEIE 162
Cdd:TIGR00927  741 VEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDedegEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSE 820
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 217330641   163 ELQKSPASDSETEDAlkPQISDSESEEPPRHQASDSENEEPPkprmSDSESEELPKPQVSDSESEEPPRHQaSDSENEE 241
Cdd:TIGR00927  821 TQADDTEVKDETGEQ--ELNAENQGEAKQDEKGVDGGGGSDG----GDSEEEEEEEEEEEEEEEEEEEEEE-EEEENEE 892
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
253-530 5.08e-03

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 40.66  E-value: 5.08e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 253 EDPPRHQASDSENEELPKPRISDSeseDPPRNQASDSENEELPKPRVSDSESEgpqKGPASDSETEDASRHKQKPESDDD 332
Cdd:NF033609 540 DKPVVPEQPDEPGEIEPIPEDSDS---DPGSDSGSDSSNSDSGSDSGSDSTSD---SGSDSASDSDSASDSDSASDSDSA 613
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 333 SDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKegeeekvakrkaavlSDSEDEEKASAKKSR 412
Cdd:NF033609 614 SDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSD---------------SDSDSDSDSDSDSDS 678
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641 413 VVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKElSDKKNEEKDLFGSDSESGNEEENLIADIFGESGDEEEEEFTGFN 492
Cdd:NF033609 679 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 757
                        250       260       270
                 ....*....|....*....|....*....|....*...
gi 217330641 493 QEDLEEEKGETQVKEAEDSDSDDNIKRGKHMDFLSDFE 530
Cdd:NF033609 758 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 795
PTZ00482 PTZ00482
membrane-attack complex/perforin (MACPF) Superfamily; Provisional
9-176 6.76e-03

membrane-attack complex/perforin (MACPF) Superfamily; Provisional


Pssm-ID: 240433 [Multi-domain]  Cd Length: 844  Bit Score: 40.23  E-value: 6.76e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   9 DQSDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREdglpkghhvtDSENDEPLNlNASDSESEELH 88
Cdd:PTZ00482  88 DDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANN----------DQTNDFDQD-DSSNSQTDQGL 156
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  89 RQKDSDSESEERAEPPASDSENE-DVNQHGSDSESEETRKLPGSDSENEELLNghaSDSENEDVGkhpASDSEIEELQKS 167
Cdd:PTZ00482 157 KQSVNLSSAEKLIEEKKGQTENTfKFYNFGNDGEEAAAKDGGKSKSSDPGPLN---DSDGQGDDG---DPESAEEDKAAS 230

                 ....*....
gi 217330641 168 PASDSETED 176
Cdd:PTZ00482 231 NTRAAYTKA 239
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
163-377 8.88e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 39.64  E-value: 8.88e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  163 ELQKSPASDSETED--ALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEElpkpqVSDSESEEPPRHQASDSENE 240
Cdd:PTZ00108 1168 KLRKPKLKKKEKKKkkSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSG-----SDQEDDEEQKTKPKKSSVKR 1242
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  241 ELPKPRISDSESEDPPRHQASDSENEELPKPRISDSesedPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSETEDA 320
Cdd:PTZ00108 1243 LKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRV----SAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSL 1318
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 217330641  321 SRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDS 377
Cdd:PTZ00108 1319 AALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDD 1375
PRK08581 PRK08581
amidase domain-containing protein;
2-226 8.90e-03

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 39.39  E-value: 8.90e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641   2 DSEYYSGDQSDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKghhvtDSENDEPLNLNASD 81
Cdd:PRK08581 104 INQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQ-----SSKQDKADNQKAPS 178
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217330641  82 SESeelhrQKDSDSESEERAEPPASDSENedvnqhGSDSESEETRKLPGSDSENEEllnghASDSENEDVGKHPASDSEI 161
Cdd:PRK08581 179 SNN-----TKPSTSNKQPNSPKPTQPNQS------NSQPASDDTANQKSSSKDNQS-----MSDSALDSILDQYSEDAKK 242
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 217330641 162 EE---LQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELpkPQVSDSES 226
Cdd:PRK08581 243 TQkdyASQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFENDVNQSNTRSTSLFETG--PSLSNNDD 308
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
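
Each hit in the list above carries a summary line of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...". The following is a minimal sketch of how such a line could be read programmatically and checked against the 0.01 E-value threshold given in the search parameters. The names `HIT_LINE`, `parse_hit`, and `passes_threshold` are hypothetical and not part of any NCBI tool. Treating the threshold as inclusive is also an assumption.

```python
import re

# Hypothetical pattern for the "Pssm-ID" summary line as it appears in this listing.
# The "[Multi-domain]" tag is optional because some hits do not carry it.
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<kind>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    m = HIT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("kind") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold
```

For example, `parse_hit` applied to the PHA03321 summary line yields a record with `pssm_id` 223041 and `cd_length` 694, and `passes_threshold` accepts it under the default threshold.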

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.