NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2462575015|ref|XP_054198865|]
View 

protein IWS1 homolog isoform X2 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
TFIIS_I super family cl00146
N-terminal domain (domain I) of transcription elongation factor S-II (TFIIS); similar to a ...
533-782 1.05e-28

N-terminal domain (domain I) of transcription elongation factor S-II (TFIIS); similar to a domain found in elongin A and CRSP70; likely to be involved in transcription; domain I from TFIIS interacts with RNA polymerase II holoenzyme


The actual alignment was detected with superfamily member COG5139:

Pssm-ID: 469629  Cd Length: 397  Bit Score: 119.42  E-value: 1.05e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 533 DFEMMLQRKKSMSGKRRRNRDGGTFISDADDVVSAMIVKMNEAAEEDRQLNNQKKPALKKLTLLPAVVMHLKKQDLKETF 612
Cdd:COG5139   126 ELGDTGDRQLKAPAASRARRKEDLLEQTVDEISLRLKKRMQDAAKKDNANNLEGRPATGKIKNLPEVSDVLMKKALQDTI 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 613 IDSGVMSAIKEWLSPLPDRSLPALKIREELLKILQELPsVSQETLKHSGIGRAVMYLYKHPKESRSNKDMAGKLINEWSR 692
Cdd:COG5139   206 LDNNILDSVRGWLEPLPDKSLPNIKIQKSLLDVLKTLP-IHTEHLVESGVGRIVYFYTISKKEEKEVRRSAKALVQEWTR 284
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 693 PIFGLTSNYKGmTREEREQRDLEQMPQRRRMNSTGGQTpRRDLEKVLTGEEKALRPGDPGFCARARV---PMPSNKDYVV 769
Cdd:COG5139   285 PIIKPSGNYRD-KRIMQLEFDSEKLRKKSVMDSAKNRK-KKSSGEDPTSRGSSVQTLYEQAAARRNRaaaPAQTTTDYKY 362
                         250
                  ....*....|...
gi 2462575015 770 RPkwnVEMESSRP 782
Cdd:COG5139   363 AP---VSNLSAVP 372
MSCRAMM_ClfA super family cl41352
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
11-371 2.06e-10

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


The actual alignment was detected with superfamily member NF033609:

Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 64.55  E-value: 2.06e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  11 QDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSverhseNETSDREdglpkghhvTDSENDEPlnlNASDSESE 90
Cdd:NF033609  546 EQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGS------DSTSDSG---------SDSASDSD---SASDSDSA 607
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  91 ElhrqkDSDSESEERAeppASDSENEDVNQHGSDSESEETrklpgSDSENEELLNGHASDSENEDVGKHPASDSEIEELQ 170
Cdd:NF033609  608 S-----DSDSASDSDS---ASDSDSASDSDSASDSDSASD-----SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 674
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 171 KSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpqvSDSESEEPPRHQASDSENEELPKP 250
Cdd:NF033609  675 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-------SDSDSDSDSDSDSDSDSDSDSDSD 747
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 251 RISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKgpaSDSETEDASRHKQ 330
Cdd:NF033609  748 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDS 824
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|.
gi 2462575015 331 KPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSE 371
Cdd:NF033609  825 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSE 865
2A1904 super family cl36772
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
245-520 5.19e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


The actual alignment was detected with superfamily member TIGR00927:

Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.83  E-value: 5.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  245 EELPKPRISDSESEDPPRHQASDSENE-ELPKPRISDSESEdPPRNQASDSENE---ELPKPRVSDSESEGPQKGPASDS 320
Cdd:TIGR00927  628 GDLSKGDVAEAEHTGERTGEEGERPTEaEGENGEESGGEAE-QEGETETKGENEsegEIPAERKGEQEGEGEIEAKEADH 706
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  321 ETEDASRHKQ-----KPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDekegeeekva 395
Cdd:TIGR00927  707 KGETEAEEVEhegetEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKE---------- 776
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  396 krkaavlSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKELSDKKNEEKDLFGSDSESGNEEE 475
Cdd:TIGR00927  777 -------DEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDE 849
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 2462575015  476 NLIADifGESGDEEEEEFTGFNQEDLEEEKGETQVKEAEDSDSDD 520
Cdd:TIGR00927  850 KGVDG--GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
 
Name Accession Description Interval E-value
COG5139 COG5139
Uncharacterized conserved protein [Function unknown];
533-782 1.05e-28

Uncharacterized conserved protein [Function unknown];


Pssm-ID: 227468  Cd Length: 397  Bit Score: 119.42  E-value: 1.05e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 533 DFEMMLQRKKSMSGKRRRNRDGGTFISDADDVVSAMIVKMNEAAEEDRQLNNQKKPALKKLTLLPAVVMHLKKQDLKETF 612
Cdd:COG5139   126 ELGDTGDRQLKAPAASRARRKEDLLEQTVDEISLRLKKRMQDAAKKDNANNLEGRPATGKIKNLPEVSDVLMKKALQDTI 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 613 IDSGVMSAIKEWLSPLPDRSLPALKIREELLKILQELPsVSQETLKHSGIGRAVMYLYKHPKESRSNKDMAGKLINEWSR 692
Cdd:COG5139   206 LDNNILDSVRGWLEPLPDKSLPNIKIQKSLLDVLKTLP-IHTEHLVESGVGRIVYFYTISKKEEKEVRRSAKALVQEWTR 284
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 693 PIFGLTSNYKGmTREEREQRDLEQMPQRRRMNSTGGQTpRRDLEKVLTGEEKALRPGDPGFCARARV---PMPSNKDYVV 769
Cdd:COG5139   285 PIIKPSGNYRD-KRIMQLEFDSEKLRKKSVMDSAKNRK-KKSSGEDPTSRGSSVQTLYEQAAARRNRaaaPAQTTTDYKY 362
                         250
                  ....*....|...
gi 2462575015 770 RPkwnVEMESSRP 782
Cdd:COG5139   363 AP---VSNLSAVP 372
Med26 pfam08711
TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is ...
641-694 4.37e-12

TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is conserved from plants to fungi to humans - the number and representation of individual subunits varying with species {1-2]. It is arranged into four different sections, a core, a head, a tail and a kinase-activity part, and the number of subunits within each of these is what varies with species. Overall, Mediator regulates the transcriptional activity of RNA polymerase II but it would appear that each of the four different sections has a slightly different function. Mediator exists in two major forms in human cells: a smaller form that interacts strongly with pol II and activates transcription, and a large form that does not interact strongly with pol II and does not directly activate transcription. Notably, the 'small' and 'large' Mediator complexes differ in their subunit composition: the Med26 subunit preferentially associates with the small, active complex, whereas cdk8, cyclin C, Med12 and Med13 associate with the large Mediator complex. This family includesthe C terminal region of a number of eukaryotic hypothetical proteins which are homologous to the Saccharomyces cerevisiae protein IWS1. IWS1 is known to be an Pol II transcription elongation factor and interacts with Spt6 and Spt5.


Pssm-ID: 462573 [Multi-domain]  Cd Length: 52  Bit Score: 61.38  E-value: 4.37e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 2462575015 641 ELLKILQELPsVSQETLKHSGIGRAVMYLYKHPkESRSNKDMAGKLINEWSRPI 694
Cdd:pfam08711   1 KLLKKLEKLP-VTLELLKSTGIGKVVNKLRKHK-ENPEIKKLAKELVKKWKRLV 52
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
11-371 2.06e-10

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 64.55  E-value: 2.06e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  11 QDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSverhseNETSDREdglpkghhvTDSENDEPlnlNASDSESE 90
Cdd:NF033609  546 EQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGS------DSTSDSG---------SDSASDSD---SASDSDSA 607
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  91 ElhrqkDSDSESEERAeppASDSENEDVNQHGSDSESEETrklpgSDSENEELLNGHASDSENEDVGKHPASDSEIEELQ 170
Cdd:NF033609  608 S-----DSDSASDSDS---ASDSDSASDSDSASDSDSASD-----SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 674
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 171 KSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpqvSDSESEEPPRHQASDSENEELPKP 250
Cdd:NF033609  675 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-------SDSDSDSDSDSDSDSDSDSDSDSD 747
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 251 RISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKgpaSDSETEDASRHKQ 330
Cdd:NF033609  748 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDS 824
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|.
gi 2462575015 331 KPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSE 371
Cdd:NF033609  825 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSE 865
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
9-300 5.53e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 53.37  E-value: 5.53e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   9 SDQDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDRE-DGLPKGHHVTDSENDEPLNLNASDS 87
Cdd:NF033609  608 SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSD 687
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  88 ESEELHRQKDSDSESEERAEpPASDSENEDVNQHGSDSESE-ETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEI 166
Cdd:NF033609  688 SDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 766
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 167 EELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQvSDSESEEPPRHQASDSENEE 246
Cdd:NF033609  767 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSD 845
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....
gi 2462575015 247 LPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSEnEELP 300
Cdd:NF033609  846 SDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK-EPLP 898
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
206-533 1.08e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.60  E-value: 1.08e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 206 EEPPKPRMSDSESEELPKPQVSDSEseePPRHQASDSENEELPKPRISDSESeDPPRHQASDSENEElPKPRISDSESeD 285
Cdd:NF033609  540 DKPVVPEQPDEPGEIEPIPEDSDSD---PGSDSGSDSSNSDSGSDSGSDSTS-DSGSDSASDSDSAS-DSDSASDSDS-A 613
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 286 PPRNQASDSENEElPKPRVSDSESEGPQKGpASDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKF 365
Cdd:NF033609  614 SDSDSASDSDSAS-DSDSASDSDSASDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 366 HSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVLSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDS 445
Cdd:NF033609  692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 771
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 446 EEEAGKELSDKKNEEKDLFGSDSESGNEEENLIADIFGESGDEEEEEFTGFNQEDLEEEKGETQVKEAEDSDSDDNIKRG 525
Cdd:NF033609  772 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 851

                  ....*...
gi 2462575015 526 KHMDFLSD 533
Cdd:NF033609  852 SDSDSESD 859
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
128-476 1.69e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 51.83  E-value: 1.69e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 128 EETRKLPGSDSeneellnghASDSENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEpprHQASDSENEE 207
Cdd:NF033609  559 EDSDSDPGSDS---------GSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDS---DSASDSDSAS 626
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 208 PpkprmSDSESEelpkpqvSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESE-DP 286
Cdd:NF033609  627 D-----SDSASD-------SDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDS 694
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 287 PRNQASDSENEelpkprvSDSESEgpqkgpaSDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFH 366
Cdd:NF033609  695 DSDSDSDSDSD-------SDSDSD-------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 760
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 367 SSDSEEEEHKKQKMDSDEDEKEGeeekvakrkaavlSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDSE 446
Cdd:NF033609  761 DSDSDSDSDSDSDSDSDSDSDSD-------------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 827
                         330       340       350
                  ....*....|....*....|....*....|.
gi 2462575015 447 EEAGKEL-SDKKNEEKDLFGSDSESGNEEEN 476
Cdd:NF033609  828 SDSDSDSdSDSDSDSDSDSDSDSDSDSDSES 858
PRK08581 PRK08581
amidase domain-containing protein;
84-357 3.60e-06

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 50.56  E-value: 3.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  84 ASDSESEELHRQKDSDSESEEraeppasDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSEN--EDVGKHPA 161
Cdd:PRK08581   28 DDPQKDSTAKTTSHDSKKSND-------DETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNiiDFIYKNLP 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 162 SDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpQVSDSESEEPPRHQASD 241
Cdd:PRK08581  101 QTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIK-----NDTDTQSSKQDKADNQK 175
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 242 SENEELPKPRISDSESEDPPRHQASDSEneelpkpriSDSESEDPPRNQASDSENEEL---PKPRVSDSESEGPQKGPAS 318
Cdd:PRK08581  176 APSSNNTKPSTSNKQPNSPKPTQPNQSN---------SQPASDDTANQKSSSKDNQSMsdsALDSILDQYSEDAKKTQKD 246
                         250       260       270
                  ....*....|....*....|....*....|....*....
gi 2462575015 319 DSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSD 357
Cdd:PRK08581  247 YASQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFEND 285
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
9-321 3.99e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 50.68  E-value: 3.99e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   9 SDQDPPEEDDGGAtpvQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSE 88
Cdd:NF033609  602 SDSDSASDSDSAS---DSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 678
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  89 SEELHRQKDSDSESEERAEPPA-SDSENEDVNQHGSDSESEETRKlpgSDSENEELLNGHASDSENEDVGKHPASDSEIE 167
Cdd:NF033609  679 DSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 755
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 168 ELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpqvSDSESEepprhQASDSENEEL 247
Cdd:NF033609  756 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-------SDSDSD-----SDSDSDSDSD 823
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462575015 248 PKpriSDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSE 321
Cdd:NF033609  824 SD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK 894
Ebola_NP pfam05505
Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These ...
47-298 2.93e-05

Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These proteins are responsible for encapsidation of genomic RNA. It has been found that nucleoprotein DNA vaccines can offer protection from the virus.


Pssm-ID: 398905  Cd Length: 717  Bit Score: 47.81  E-value: 2.93e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  47 TGSVERHSENETSdredglpkGHHVTDSENDEPLNLNASDSESEelhrQKDSDSESEERAEPPASDSENEdvNQHGSDSE 126
Cdd:pfam05505 388 TEAITAASLPKTS--------GHYDDDDDIPFPGPINDDDNPGH----QDDDPTDSQDTTIPDVVVDPDD--GSYGEYQS 453
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 127 SEETrklpGSDSENEELLNGhaSDSENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPR--HQASDSE 204
Cdd:pfam05505 454 YSEN----GMNAPDDLVLLN--EDEDDLEDTKPVPNRSTKGGQQKNSQKGQHIEGRQTQSRPIQNVPGPHRtiHHASAPL 527
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 205 NEEPPKPRMSDSESEELPKPQvsdseSEEPPRHQASDSENEELPkPRISDSESED-------------PPRHQASDSENE 271
Cdd:pfam05505 528 TDNDRRNEPSGSTSPRMLTPI-----NEEADPLDDADDETSSLP-PLESDDEEQDrdgtsnrtptvapPAPVYRDHSEKK 601
                         250       260
                  ....*....|....*....|....*..
gi 2462575015 272 ELPKPRISDSESEDPPRNQASDSENEE 298
Cdd:pfam05505 602 ELPQDEQQDQDHTQEARNQDSDNTQSE 628
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
16-208 1.30e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 45.76  E-value: 1.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   16 EDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSESEELHRQ 95
Cdd:TIGR00927  707 KGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAG 786
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   96 KDSDSESEERAEPP--ASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDvgkhpasDSEIEELQKSP 173
Cdd:TIGR00927  787 EDGEMKGDEGAEGKveHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQD-------EKGVDGGGGSD 859
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 2462575015  174 ASDSETEDALKPQISDSESEEPPRHQaSDSENEEP 208
Cdd:TIGR00927  860 GGDSEEEEEEEEEEEEEEEEEEEEEE-EEEENEEP 893
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
245-520 5.19e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.83  E-value: 5.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  245 EELPKPRISDSESEDPPRHQASDSENE-ELPKPRISDSESEdPPRNQASDSENE---ELPKPRVSDSESEGPQKGPASDS 320
Cdd:TIGR00927  628 GDLSKGDVAEAEHTGERTGEEGERPTEaEGENGEESGGEAE-QEGETETKGENEsegEIPAERKGEQEGEGEIEAKEADH 706
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  321 ETEDASRHKQ-----KPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDekegeeekva 395
Cdd:TIGR00927  707 KGETEAEEVEhegetEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKE---------- 776
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  396 krkaavlSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKELSDKKNEEKDLFGSDSESGNEEE 475
Cdd:TIGR00927  777 -------DEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDE 849
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 2462575015  476 NLIADifGESGDEEEEEFTGFNQEDLEEEKGETQVKEAEDSDSDD 520
Cdd:TIGR00927  850 KGVDG--GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
258-535 1.26e-03

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 42.59  E-value: 1.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 258 EDPPRHQASDSENEELPKPRISDSeseDPPRNQASDSENEELPKPRVSDSESEgpqKGPASDSETEDASRHKQKPESDDD 337
Cdd:NF033609  540 DKPVVPEQPDEPGEIEPIPEDSDS---DPGSDSGSDSSNSDSGSDSGSDSTSD---SGSDSASDSDSASDSDSASDSDSA 613
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 338 SDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKegeeekvakrkaavlSDSEDEEKASAKKSR 417
Cdd:NF033609  614 SDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSD---------------SDSDSDSDSDSDSDS 678
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 418 VVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKElSDKKNEEKDLFGSDSESGNEEENLIADIFGESGDEEEEEFTGFN 497
Cdd:NF033609  679 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 757
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 2462575015 498 QEDLEEEKGETQVKEAEDSDSDDNIKRGKHMDFLSDFE 535
Cdd:NF033609  758 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 795
 
Name Accession Description Interval E-value
COG5139 COG5139
Uncharacterized conserved protein [Function unknown];
533-782 1.05e-28

Uncharacterized conserved protein [Function unknown];


Pssm-ID: 227468  Cd Length: 397  Bit Score: 119.42  E-value: 1.05e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 533 DFEMMLQRKKSMSGKRRRNRDGGTFISDADDVVSAMIVKMNEAAEEDRQLNNQKKPALKKLTLLPAVVMHLKKQDLKETF 612
Cdd:COG5139   126 ELGDTGDRQLKAPAASRARRKEDLLEQTVDEISLRLKKRMQDAAKKDNANNLEGRPATGKIKNLPEVSDVLMKKALQDTI 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 613 IDSGVMSAIKEWLSPLPDRSLPALKIREELLKILQELPsVSQETLKHSGIGRAVMYLYKHPKESRSNKDMAGKLINEWSR 692
Cdd:COG5139   206 LDNNILDSVRGWLEPLPDKSLPNIKIQKSLLDVLKTLP-IHTEHLVESGVGRIVYFYTISKKEEKEVRRSAKALVQEWTR 284
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 693 PIFGLTSNYKGmTREEREQRDLEQMPQRRRMNSTGGQTpRRDLEKVLTGEEKALRPGDPGFCARARV---PMPSNKDYVV 769
Cdd:COG5139   285 PIIKPSGNYRD-KRIMQLEFDSEKLRKKSVMDSAKNRK-KKSSGEDPTSRGSSVQTLYEQAAARRNRaaaPAQTTTDYKY 362
                         250
                  ....*....|...
gi 2462575015 770 RPkwnVEMESSRP 782
Cdd:COG5139   363 AP---VSNLSAVP 372
Med26 pfam08711
TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is ...
641-694 4.37e-12

TFIIS helical bundle-like domain; Mediator is a large complex of up to 33 proteins that is conserved from plants to fungi to humans - the number and representation of individual subunits varying with species {1-2]. It is arranged into four different sections, a core, a head, a tail and a kinase-activity part, and the number of subunits within each of these is what varies with species. Overall, Mediator regulates the transcriptional activity of RNA polymerase II but it would appear that each of the four different sections has a slightly different function. Mediator exists in two major forms in human cells: a smaller form that interacts strongly with pol II and activates transcription, and a large form that does not interact strongly with pol II and does not directly activate transcription. Notably, the 'small' and 'large' Mediator complexes differ in their subunit composition: the Med26 subunit preferentially associates with the small, active complex, whereas cdk8, cyclin C, Med12 and Med13 associate with the large Mediator complex. This family includesthe C terminal region of a number of eukaryotic hypothetical proteins which are homologous to the Saccharomyces cerevisiae protein IWS1. IWS1 is known to be an Pol II transcription elongation factor and interacts with Spt6 and Spt5.


Pssm-ID: 462573 [Multi-domain]  Cd Length: 52  Bit Score: 61.38  E-value: 4.37e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 2462575015 641 ELLKILQELPsVSQETLKHSGIGRAVMYLYKHPkESRSNKDMAGKLINEWSRPI 694
Cdd:pfam08711   1 KLLKKLEKLP-VTLELLKSTGIGKVVNKLRKHK-ENPEIKKLAKELVKKWKRLV 52
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
11-371 2.06e-10

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 64.55  E-value: 2.06e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  11 QDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSverhseNETSDREdglpkghhvTDSENDEPlnlNASDSESE 90
Cdd:NF033609  546 EQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGS------DSTSDSG---------SDSASDSD---SASDSDSA 607
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  91 ElhrqkDSDSESEERAeppASDSENEDVNQHGSDSESEETrklpgSDSENEELLNGHASDSENEDVGKHPASDSEIEELQ 170
Cdd:NF033609  608 S-----DSDSASDSDS---ASDSDSASDSDSASDSDSASD-----SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 674
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 171 KSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpqvSDSESEEPPRHQASDSENEELPKP 250
Cdd:NF033609  675 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-------SDSDSDSDSDSDSDSDSDSDSDSD 747
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 251 RISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKgpaSDSETEDASRHKQ 330
Cdd:NF033609  748 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDS 824
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|.
gi 2462575015 331 KPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSE 371
Cdd:NF033609  825 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSE 865
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
9-300 5.53e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 53.37  E-value: 5.53e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   9 SDQDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDRE-DGLPKGHHVTDSENDEPLNLNASDS 87
Cdd:NF033609  608 SDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSD 687
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  88 ESEELHRQKDSDSESEERAEpPASDSENEDVNQHGSDSESE-ETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEI 166
Cdd:NF033609  688 SDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 766
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 167 EELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQvSDSESEEPPRHQASDSENEE 246
Cdd:NF033609  767 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSD 845
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....
gi 2462575015 247 LPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSEnEELP 300
Cdd:NF033609  846 SDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK-EPLP 898
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
206-533 1.08e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.60  E-value: 1.08e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 206 EEPPKPRMSDSESEELPKPQVSDSEseePPRHQASDSENEELPKPRISDSESeDPPRHQASDSENEElPKPRISDSESeD 285
Cdd:NF033609  540 DKPVVPEQPDEPGEIEPIPEDSDSD---PGSDSGSDSSNSDSGSDSGSDSTS-DSGSDSASDSDSAS-DSDSASDSDS-A 613
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 286 PPRNQASDSENEElPKPRVSDSESEGPQKGpASDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKF 365
Cdd:NF033609  614 SDSDSASDSDSAS-DSDSASDSDSASDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 691
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 366 HSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVLSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDS 445
Cdd:NF033609  692 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 771
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 446 EEEAGKELSDKKNEEKDLFGSDSESGNEEENLIADIFGESGDEEEEEFTGFNQEDLEEEKGETQVKEAEDSDSDDNIKRG 525
Cdd:NF033609  772 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 851

                  ....*...
gi 2462575015 526 KHMDFLSD 533
Cdd:NF033609  852 SDSDSESD 859
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
128-476 1.69e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 51.83  E-value: 1.69e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 128 EETRKLPGSDSeneellnghASDSENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEpprHQASDSENEE 207
Cdd:NF033609  559 EDSDSDPGSDS---------GSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDS---DSASDSDSAS 626
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 208 PpkprmSDSESEelpkpqvSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESE-DP 286
Cdd:NF033609  627 D-----SDSASD-------SDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDS 694
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 287 PRNQASDSENEelpkprvSDSESEgpqkgpaSDSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFH 366
Cdd:NF033609  695 DSDSDSDSDSD-------SDSDSD-------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 760
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 367 SSDSEEEEHKKQKMDSDEDEKEGeeekvakrkaavlSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDSE 446
Cdd:NF033609  761 DSDSDSDSDSDSDSDSDSDSDSD-------------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 827
                         330       340       350
                  ....*....|....*....|....*....|.
gi 2462575015 447 EEAGKEL-SDKKNEEKDLFGSDSESGNEEEN 476
Cdd:NF033609  828 SDSDSDSdSDSDSDSDSDSDSDSDSDSDSES 858
PRK08581 PRK08581
amidase domain-containing protein;
84-357 3.60e-06

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 50.56  E-value: 3.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  84 ASDSESEELHRQKDSDSESEEraeppasDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSEN--EDVGKHPA 161
Cdd:PRK08581   28 DDPQKDSTAKTTSHDSKKSND-------DETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNiiDFIYKNLP 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 162 SDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpQVSDSESEEPPRHQASD 241
Cdd:PRK08581  101 QTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIK-----NDTDTQSSKQDKADNQK 175
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 242 SENEELPKPRISDSESEDPPRHQASDSEneelpkpriSDSESEDPPRNQASDSENEEL---PKPRVSDSESEGPQKGPAS 318
Cdd:PRK08581  176 APSSNNTKPSTSNKQPNSPKPTQPNQSN---------SQPASDDTANQKSSSKDNQSMsdsALDSILDQYSEDAKKTQKD 246
                         250       260       270
                  ....*....|....*....|....*....|....*....
gi 2462575015 319 DSETEDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSD 357
Cdd:PRK08581  247 YASQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFEND 285
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
9-321 3.99e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 50.68  E-value: 3.99e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   9 SDQDPPEEDDGGAtpvQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSE 88
Cdd:NF033609  602 SDSDSASDSDSAS---DSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 678
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  89 SEELHRQKDSDSESEERAEPPA-SDSENEDVNQHGSDSESEETRKlpgSDSENEELLNGHASDSENEDVGKHPASDSEIE 167
Cdd:NF033609  679 DSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 755
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 168 ELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEelpkpqvSDSESEepprhQASDSENEEL 247
Cdd:NF033609  756 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-------SDSDSD-----SDSDSDSDSD 823
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462575015 248 PKpriSDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSE 321
Cdd:NF033609  824 SD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSK 894
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
16-334 5.68e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.17  E-value: 5.68e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   16 EDDGGATPVqderdSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNA--SDSESEELH 93
Cdd:PHA03307    28 PGDAADDLL-----SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSlsTLAPASPAR 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   94 RQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEI----EEL 169
Cdd:PHA03307   103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPlsspEET 182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  170 QKSPASDSETEDALKPQISDSESEEPPRHQASDSeneePPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEElPK 249
Cdd:PHA03307   183 ARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS----ASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEC-PL 257
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  250 PRIS--------------DSESEDPPRHQASDSENEELPKP--------------RISDSESEDPPRNQASDSENEELPK 301
Cdd:PHA03307   258 PRPApitlptriweasgwNGPSSRPGPASSSSSPRERSPSPspsspgsgpapsspRASSSSSSSRESSSSSTSSSSESSR 337
                          330       340       350
                   ....*....|....*....|....*....|....
gi 2462575015  302 PR-VSDSESEGPQKGPASDSETEDASRHKQKPES 334
Cdd:PHA03307   338 GAaVSPGPSPSRSPSPSRPPPPADPSSPRKRPRP 371
PTZ00121 PTZ00121
MAEBL; Provisional
88-461 1.66e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 48.98  E-value: 1.66e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   88 ESEELHRQKDSDSESEERAEPPASDSENEDvnqhGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEIE 167
Cdd:PTZ00121  1392 KADEAKKKAEEDKKKADELKKAAAAKKKAD----EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  168 ELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSEN--- 244
Cdd:PTZ00121  1468 EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEkkk 1547
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  245 -EELPKPR-ISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSET 322
Cdd:PTZ00121  1548 aDELKKAEeLKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK 1627
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  323 EDASRHKQKPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKEGEEEKVAKRKAAVL 402
Cdd:PTZ00121  1628 AEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEEL 1707
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462575015  403 SDSEDEEKASAKKSRVVSDADDSDSDAVSDKS--GKREKTIASDSEEEAGKELSDKKNEEK 461
Cdd:PTZ00121  1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAeeDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
55-333 1.98e-05

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 48.53  E-value: 1.98e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  55 ENETSDREDGLPKGHHVTDSENDEPlnlnaSDSESEELHRQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLP 134
Cdd:PTZ00449  500 EEEDSDKHDEPPEGPEASGLPPKAP-----GDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPT 574
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 135 GSDSENEELLNGHASDSENEDVGKHPASDS--------EIEELQKSPASDSETEDALKPQisdseSEEPPRHQASDSENE 206
Cdd:PTZ00449  575 LSKKPEFPKDPKHPKDPEEPKKPKRPRSAQrptrpkspKLPELLDIPKSPKRPESPKSPK-----RPPPPQRPSSPERPE 649
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 207 EPPKPRMSDS-ESEELP-----KPQVSDSESEEPPRHQASDSeNEELPKPRISDSESEDPPRHQASDSENEELPKPRISD 280
Cdd:PTZ00449  650 GPKIIKSPKPpKSPKPPfdpkfKEKFYDDYLDAAAKSKETKT-TVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRD 728
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462575015 281 SES-----EDPPRNQASDSENEELP---KPRVSDSESEGPQKG-PASDSETEDASRHKQKPE 333
Cdd:PTZ00449  729 EEFpfepiGDPDAEQPDDIEFFTPPeeeRTFFHETPADTPLPDiLAEEFKEEDIHAETGEPD 790
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
96-338 2.12e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 48.24  E-value: 2.12e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   96 KDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEellNGHASDSENEDvGKHPASDSEieelqkSPAS 175
Cdd:PHA03307    59 AAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREG---SPTPPGPSSPD-PPPPTPPPA------SPPP 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  176 DSETeDALKPQISDSESEEPPRHQASDSENEEPPKPR-------------MSDSESEELPKPQVSDSESEEPPRHQASDS 242
Cdd:PHA03307   129 SPAP-DLSEMLRPVGSPGPPPAASPPAAGASPAAVASdaassrqaalplsSPEETARAPSSPPAEPPPSTPPAAASPRPP 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  243 EneelPKPRISDSESED---PPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDS----ESEGPQKG 315
Cdd:PHA03307   208 R----RSSPISASASSPapaPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEasgwNGPSSRPG 283
                          250       260
                   ....*....|....*....|...
gi 2462575015  316 PASDSETEDASRHKQKPESDDDS 338
Cdd:PHA03307   284 PASSSSSPRERSPSPSPSSPGSG 306
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
82-341 2.34e-05

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 48.12  E-value: 2.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   82 LNASDSESEELHRQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPA 161
Cdd:PTZ00108  1134 LDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKP 1213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  162 SDSEIEELQKSPASDSETEDALKpqiSDSESEEPPRHQASDSENEEPPKPRMSDSESEELPK--PQVSDSESEEPPrhqa 239
Cdd:PTZ00108  1214 DNKKSNSSGSDQEDDEEQKTKPK---KSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKnaPKRVSAVQYSPP---- 1286
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  240 sdSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASD 319
Cdd:PTZ00108  1287 --PPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSS 1364
                          250       260
                   ....*....|....*....|..
gi 2462575015  320 SETEDASRHKQKPESDDDSDRE 341
Cdd:PTZ00108  1365 SEDDDDSEVDDSEDEDDEDDED 1386
Ebola_NP pfam05505
Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These ...
47-298 2.93e-05

Ebola nucleoprotein; This family consists of Ebola and Marburg virus nucleoproteins. These proteins are responsible for encapsidation of genomic RNA. It has been found that nucleoprotein DNA vaccines can offer protection from the virus.


Pssm-ID: 398905  Cd Length: 717  Bit Score: 47.81  E-value: 2.93e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  47 TGSVERHSENETSdredglpkGHHVTDSENDEPLNLNASDSESEelhrQKDSDSESEERAEPPASDSENEdvNQHGSDSE 126
Cdd:pfam05505 388 TEAITAASLPKTS--------GHYDDDDDIPFPGPINDDDNPGH----QDDDPTDSQDTTIPDVVVDPDD--GSYGEYQS 453
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 127 SEETrklpGSDSENEELLNGhaSDSENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPR--HQASDSE 204
Cdd:pfam05505 454 YSEN----GMNAPDDLVLLN--EDEDDLEDTKPVPNRSTKGGQQKNSQKGQHIEGRQTQSRPIQNVPGPHRtiHHASAPL 527
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 205 NEEPPKPRMSDSESEELPKPQvsdseSEEPPRHQASDSENEELPkPRISDSESED-------------PPRHQASDSENE 271
Cdd:pfam05505 528 TDNDRRNEPSGSTSPRMLTPI-----NEEADPLDDADDETSSLP-PLESDDEEQDrdgtsnrtptvapPAPVYRDHSEKK 601
                         250       260
                  ....*....|....*....|....*..
gi 2462575015 272 ELPKPRISDSESEDPPRNQASDSENEE 298
Cdd:pfam05505 602 ELPQDEQQDQDHTQEARNQDSDNTQSE 628
PHA03321 PHA03321
tegument protein VP11/12; Provisional
192-348 4.89e-05

tegument protein VP11/12; Provisional


Pssm-ID: 223041 [Multi-domain]  Cd Length: 694  Bit Score: 46.87  E-value: 4.89e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 192 SEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRhqaSDSENE 271
Cdd:PHA03321  427 SRQPPGAPAPRRDNDPPPPPRARPGSTPACARRARAQRARDAGPEYVDPLGALRRLPAGAAPPPEPAAAPS---PATYYT 503
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 272 EL--PKPRIsdsesedPPRNQASDSENEELPKPRVSDSESEGP-------QKGPASDSETEDASRHKQK-PESDDDSDRE 341
Cdd:PHA03321  504 RMggGPPRL-------PPRNRATETLRPDWGPPAAAPPEQMEDpylepddDRFDRRDGAAAAATSHPREaPAPDDDPIYE 576

                  ....*..
gi 2462575015 342 NKGEDTE 348
Cdd:PHA03321  577 GVSDSEE 583
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
16-208 1.30e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 45.76  E-value: 1.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   16 EDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSESEELHRQ 95
Cdd:TIGR00927  707 KGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAG 786
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   96 KDSDSESEERAEPP--ASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDvgkhpasDSEIEELQKSP 173
Cdd:TIGR00927  787 EDGEMKGDEGAEGKveHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQD-------EKGVDGGGGSD 859
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 2462575015  174 ASDSETEDALKPQISDSESEEPPRHQaSDSENEEP 208
Cdd:TIGR00927  860 GGDSEEEEEEEEEEEEEEEEEEEEEE-EEEENEEP 893
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
120-379 3.06e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 44.60  E-value: 3.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  120 QHGSDSESEETRKLPGSDSENEELLNGHAS-DSENEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRH 198
Cdd:TIGR00927  639 EHTGERTGEEGERPTEAEGENGEESGGEAEqEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHE 718
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  199 QASDSENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRHQAsdSENEELpkpri 278
Cdd:TIGR00927  719 GETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQA--GEDGEM----- 791
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  279 sdsESEDPPRNQASDSENEELPKPRVSDSESEGPQKGPASDSETEDAsrhKQKPESDDDSDRENKGEDTEMQNDSFHSDS 358
Cdd:TIGR00927  792 ---KGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQ---ELNAENQGEAKQDEKGVDGGGGSDGGDSEE 865
                          250       260
                   ....*....|....*....|.
gi 2462575015  359 HMDRKKFHSSDSEEEEHKKQK 379
Cdd:TIGR00927  866 EEEEEEEEEEEEEEEEEEEEE 886
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
139-346 3.09e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 44.31  E-value: 3.09e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 139 ENEELLNGHASDSENEDVGKHPASDSEIEELQKSPASDSEtedALKPqiSDSESEEPPRHQAsdsENEEPPKPRMSDSES 218
Cdd:PRK08691  373 ENTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASA---AAMP--SEGKTAGPVSNQE---NNDVPPWEDAPDEAQ 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 219 EELPKPQVSD------SESEEPPRHQ-----ASDSENE----ELPKPR-ISDSESEDPPRHQASDSENEELPKPRISDSE 282
Cdd:PRK08691  445 TAAGTAQTSAksiqtaSEAETPPENQvsknkAADNETDaplsEVPSENpIQATPNDEAVETETFAHEAPAEPFYGYGFPD 524
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462575015 283 SEDPPRnqasdsENEELPKPrvsDSESEGPQKGPASDSETEDASRHKQKPESDDDSDRENKGED 346
Cdd:PRK08691  525 NDCPPE------DGAEIPPP---DWEHAAPADTAGGGADEEAEAGGIGGNNTPSAPPPEFSTEN 579
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
16-272 4.49e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.83  E-value: 4.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   16 EDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDSESEELHRQ 95
Cdd:TIGR00927  639 EHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHE 718
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   96 KDSDSESEERAEPPASDSENEDVNQHG-SDSESEETRKLPGSDSENEELLNGHASDSENEDVGK-HPASDSEIEELQKSP 173
Cdd:TIGR00927  719 GETEAEGTEDEGEIETGEEGEEVEDEGeGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEiQAGEDGEMKGDEGAE 798
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  174 ASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESEEpprhQASDSENEELPKPRIS 253
Cdd:TIGR00927  799 GKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGS----DGGDSEEEEEEEEEEE 874
                          250
                   ....*....|....*....
gi 2462575015  254 DSESEDPPRHQaSDSENEE 272
Cdd:TIGR00927  875 EEEEEEEEEEE-EEEENEE 892
ECM1 pfam05782
Extracellular matrix protein 1 (ECM1); This family consists of several eukaryotic ...
208-317 5.15e-04

Extracellular matrix protein 1 (ECM1); This family consists of several eukaryotic extracellular matrix protein 1 (ECM1) sequences. ECM1 has been shown to regulate endochondral bone formation, stimulate the proliferation of endothelial cells and induce angiogenesis. Mutations in the ECM1 gene can cause lipoid proteinosis, a disorder which causes generalized thickening of skin, mucosae and certain viscera. Classical features include beaded eyelid papules and laryngeal infiltration leading to hoarseness.


Pssm-ID: 461739  Cd Length: 518  Bit Score: 43.68  E-value: 5.15e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 208 PPKPR---MSDSESEELPKPQVSDSESEEPPRHQASDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESE 284
Cdd:pfam05782   9 PPQTRglpVDHPDTSQHDPPFEGQSEVQPPPSQEAIPVQEEELPPPQLPVEKKVDPPLPQEAIPLQEELPPPQLPIEQKE 88
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 2462575015 285 -DPPRNQASD----SENEELPKPRVSDSESEGPQKGPA 317
Cdd:pfam05782  89 iDPPFPQQEEitpsKQREEKPAPLVGQGHPEPESWNPA 126
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
245-520 5.19e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.83  E-value: 5.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  245 EELPKPRISDSESEDPPRHQASDSENE-ELPKPRISDSESEdPPRNQASDSENE---ELPKPRVSDSESEGPQKGPASDS 320
Cdd:TIGR00927  628 GDLSKGDVAEAEHTGERTGEEGERPTEaEGENGEESGGEAE-QEGETETKGENEsegEIPAERKGEQEGEGEIEAKEADH 706
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  321 ETEDASRHKQ-----KPESDDDSDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDekegeeekva 395
Cdd:TIGR00927  707 KGETEAEEVEhegetEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKE---------- 776
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  396 krkaavlSDSEDEEKASAKKSRVVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKELSDKKNEEKDLFGSDSESGNEEE 475
Cdd:TIGR00927  777 -------DEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDE 849
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 2462575015  476 NLIADifGESGDEEEEEFTGFNQEDLEEEKGETQVKEAEDSDSDD 520
Cdd:TIGR00927  850 KGVDG--GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
PRK08581 PRK08581
amidase domain-containing protein;
13-231 6.04e-04

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 43.24  E-value: 6.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  13 PPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKghhvtDSENDEPLNLNASDSESeel 92
Cdd:PRK08581  110 KNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQ-----SSKQDKADNQKAPSSNN--- 181
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  93 hrQKDSDSESEERAEPPASDSENedvnqhGSDSESEETRKLPGSDSENEEllnghASDSENEDVGKHPASDSEIEE---L 169
Cdd:PRK08581  182 --TKPSTSNKQPNSPKPTQPNQS------NSQPASDDTANQKSSSKDNQS-----MSDSALDSILDQYSEDAKKTQkdyA 248
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462575015 170 QKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELpkPQVSDSES 231
Cdd:PRK08581  249 SQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFENDVNQSNTRSTSLFETG--PSLSNNDD 308
PHA03169 PHA03169
hypothetical protein; Provisional
153-339 8.31e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 42.65  E-value: 8.31e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 153 NEDVGKHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESE 232
Cdd:PHA03169   49 PAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSP 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 233 EpprhqaSDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDSESEGP 312
Cdd:PHA03169  129 E------SPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSP 202
                         170       180
                  ....*....|....*....|....*..
gi 2462575015 313 QKGPASDSETEDASRHKQKPESDDDSD 339
Cdd:PHA03169  203 PPQSPPDEPGEPQSPTPQQAPSPNTQQ 229
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
103-268 8.79e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 42.66  E-value: 8.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 103 EERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASDSEIEElqKSPASDSETE-D 181
Cdd:PRK13108  293 DEALEREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGE--STPAVEETSEaD 370
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 182 ALKPQISDSESEEPPRHQASDS-ENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASDSENEElpkPRISDSESEDP 260
Cdd:PRK13108  371 IEREQPGDLAGQAPAAHQVDAEaASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAG---PGDDPAEPDGI 447

                  ....*...
gi 2462575015 261 PRHQASDS 268
Cdd:PRK13108  448 RRQDDFSS 455
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
258-535 1.26e-03

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 42.59  E-value: 1.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 258 EDPPRHQASDSENEELPKPRISDSeseDPPRNQASDSENEELPKPRVSDSESEgpqKGPASDSETEDASRHKQKPESDDD 337
Cdd:NF033609  540 DKPVVPEQPDEPGEIEPIPEDSDS---DPGSDSGSDSSNSDSGSDSGSDSTSD---SGSDSASDSDSASDSDSASDSDSA 613
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 338 SDRENKGEDTEMQNDSFHSDSHMDRKKFHSSDSEEEEHKKQKMDSDEDEKegeeekvakrkaavlSDSEDEEKASAKKSR 417
Cdd:NF033609  614 SDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSD---------------SDSDSDSDSDSDSDS 678
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 418 VVSDADDSDSDAVSDKSGKREKTIASDSEEEAGKElSDKKNEEKDLFGSDSESGNEEENLIADIFGESGDEEEEEFTGFN 497
Cdd:NF033609  679 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 757
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 2462575015 498 QEDLEEEKGETQVKEAEDSDSDDNIKRGKHMDFLSDFE 535
Cdd:NF033609  758 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 795
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3-325 1.50e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.47  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015    3 TLLPRGSDQDPPEEDDGGATPVQDER--DSGSDGEDDVNEqHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPL 80
Cdd:PHA03307    94 TLAPASPAREGSPTPPGPSSPDPPPPtpPPASPPPSPAPD-LSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQA 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   81 NLNASDSESEElhRQKDSDSE----SEERAEPPASDSENEDVNQHGSDS------ESEETRKLPGSDSENEELLNGHASD 150
Cdd:PHA03307   173 ALPLSSPEETA--RAPSSPPAepppSTPPAAASPRPPRRSSPISASASSpapapgRSAADDAGASSSDSSSSESSGCGWG 250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  151 SENEDVGKHPASDSEIEELQKSPASDSETEDAL--KPQISDSESEEPPRHQASDSEnEEPPKPRMSDSESEELPKPQVSD 228
Cdd:PHA03307   251 PENECPLPRPAPITLPTRIWEASGWNGPSSRPGpaSSSSSPRERSPSPSPSSPGSG-PAPSSPRASSSSSSSRESSSSST 329
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  229 SESEEPPRHQASDS--ENEELPKPRiSDSESEDPPRHQASDSENEELPKPRISDSESEdpPRNQASDSENEELPKPRVSD 306
Cdd:PHA03307   330 SSSSESSRGAAVSPgpSPSRSPSPS-RPPPPADPSSPRKRPRPSRAPSSPAASAGRPT--RRRARAAVAGRARRRDATGR 406
                          330
                   ....*....|....*....
gi 2462575015  307 SESEGPQKGPASDSETEDA 325
Cdd:PHA03307   407 FPAGRPRPSPLDAGAASGA 425
PHA03247 PHA03247
large tegument protein UL36; Provisional
170-350 1.50e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  170 QKSPASDSETEDALKPQISDSESEEPPRHQASDSE---NEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQA--SDSEN 244
Cdd:PHA03247  2864 RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFalpPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPprPQPPL 2943
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  245 EELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESEDP-PRNQASDSENEELPKPRVSDSES-----EGPQKGPAS 318
Cdd:PHA03247  2944 APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSrEAPASSTPPLTGHSLSRVSSWASslalhEETDPPPVS 3023
                          170       180       190
                   ....*....|....*....|....*....|..
gi 2462575015  319 DSETEDASRHKQkpESDDDSDRENKGEDTEMQ 350
Cdd:PHA03247  3024 LKQTLWPPDDTE--DSDADSLFDSDSERSDLE 3053
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
6-246 1.73e-03

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 41.90  E-value: 1.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015    6 PRGSDQDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDrEDGLPKGHHVTDSENDEPLNLNAS 85
Cdd:TIGR00927  669 QEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEA-EGTEDEGEIETGEEGEEVEDEGEG 747
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   86 DSESEELHRQKDSDSESEERAEPPASDSENEDVN--QHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVGKHPASD 163
Cdd:TIGR00927  748 EAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGeiQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTE 827
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  164 SEIEELQKSPASDSETEDALKPQISDSESEEpprhQASDSENEEPPKPRMSDSESEElpkpqvsdsESEEpprhqASDSE 243
Cdd:TIGR00927  828 VKDETGEQELNAENQGEAKQDEKGVDGGGGS----DGGDSEEEEEEEEEEEEEEEEE---------EEEE-----EEEEE 889

                   ...
gi 2462575015  244 NEE 246
Cdd:TIGR00927  890 NEE 892
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
78-293 2.04e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 41.96  E-value: 2.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   78 EPLNLNASDSESEELHRQKDSDSESEERAEPPASDSENEDVNQHGSDSESEETRKLPGSDSENEELLNGHASDSENEDVG 157
Cdd:PTZ00108  1179 KKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDND 1258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  158 KHPASDSEIEELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRM----SDSESEELPKPQVSDSESEE 233
Cdd:PTZ00108  1259 EFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLegslAALKKKKKSEKKTARKKKSK 1338
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  234 PPRHQASDSENEElPKPRISDSESEDpprhqASDSENEELPkpriSDSESEDPPRNQASD 293
Cdd:PTZ00108  1339 TRVKQASASQSSR-LLRRPRKKKSDS-----SSEDDDDSEV----DDSEDEDDEDDEDDD 1388
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
79-293 2.97e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 41.23  E-value: 2.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  79 PLNLNASDS----ESEELHRQKDSDSESEERAEPP----ASDSENEDVNQHGSDSESEETRKL-PGSDSENEELLNGHAS 149
Cdd:PRK08691  360 PLAAASCDAnaviENTELQSPSAQTAEKETAAKKPqprpEAETAQTPVQTASAAAMPSEGKTAgPVSNQENNDVPPWEDA 439
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 150 DSENEDV-GKHPASDSEIE---ELQKSPASDSETEDALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQ 225
Cdd:PRK08691  440 PDEAQTAaGTAQTSAKSIQtasEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDEAVETETFAHEAPAEPFYG 519
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462575015 226 VSDSESEEPPRhqasdsENEELPKPrisDSESEDPPRHQASDSENEELPKpRISDSESEDPPRNQASD 293
Cdd:PRK08691  520 YGFPDNDCPPE------DGAEIPPP---DWEHAAPADTAGGGADEEAEAG-GIGGNNTPSAPPPEFST 577
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
193-348 3.04e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 41.12  E-value: 3.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 193 EEPPRHQASDSENEEPPKPrmsdsESEELPKPQVSDSESEEPPRHQASDSENE---ELPKPRISDSESEDPPRHQASDSE 269
Cdd:PRK13108  280 EAPGALRGSEYVVDEALER-----EPAELAAAAVASAASAVGPVGPGEPNQPDdvaEAVKAEVAEVTDEVAAESVVQVAD 354
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 270 NEELPKPRISDSESEDPPRNQASDSENEELPKPRVSDS-ESEGPQKGPASDSETEDASR----HKQKPESDDDSDRENKG 344
Cdd:PRK13108  355 RDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEaASAAPEEPAALASEAHDETEpevpEKAAPIPDPAKPDELAV 434

                  ....
gi 2462575015 345 EDTE 348
Cdd:PRK13108  435 AGPG 438
DMP1 pfam07263
Dentin matrix protein 1 (DMP1); This family consists of several mammalian dentin matrix ...
8-273 4.73e-03

Dentin matrix protein 1 (DMP1); This family consists of several mammalian dentin matrix protein 1 (DMP1) sequences. The dentin matrix acidic phosphoprotein 1 (DMP1) gene has been mapped to human chromosome 4q21. DMP1 is a bone and teeth specific protein initially identified from mineralized dentin. DMP1 is primarily localized in the nuclear compartment of undifferentiated osteoblasts. In the nucleus, DMP1 acts as a transcriptional component for activation of osteoblast-specific genes like osteocalcin. During the early phase of osteoblast maturation, Ca(2+) surges into the nucleus from the cytoplasm, triggering the phosphorylation of DMP1 by a nuclear isoform of casein kinase II. This phosphorylated DMP1 is then exported out into the extracellular matrix, where it regulates nucleation of hydroxyapatite. DMP1 is a unique molecule that initiates osteoblast differentiation by transcription in the nucleus and orchestrates mineralized matrix formation extracellularly, at later stages of osteoblast maturation. The DMP1 gene has been found to be ectopically expressed in lung cancer although the reason for this is unknown.


Pssm-ID: 462128 [Multi-domain]  Cd Length: 519  Bit Score: 40.30  E-value: 4.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015   8 GSDQDPPEEDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREDGLPKGHHVTDSENDEPLNLNASDS 87
Cdd:pfam07263 248 ASTQDSGDSQSVEYPSRKFFRKSRISEEDDRGELDDSNTMEEVKSDSTESTSSKEAGLSQSREDSKSESQEDSEESQSQE 327
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  88 ESEELhrqKDSDSESEERAEPPASDSENEdvNQHGSDSESEETRKLPGSDSENEEllngHASD-SENEDVGKHPASDSEI 166
Cdd:pfam07263 328 DSQNS---QDPSSESSQEADLPSQESSSE--SQEEVVSESRGDNPDNTSSSEEDQ----EDSDsSEEDSLSTFSSSESES 398
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 167 EELQkspaSDSETEDALKpqiSDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQASdSENEE 246
Cdd:pfam07263 399 REEQ----ADSESNESLR---SSEESPESSEDENSSSQEGLQSHSASTESQSEESQSEQDSQSEEDDESDSQDS-SRSKE 470
                         250       260
                  ....*....|....*....|....*..
gi 2462575015 247 LPKPRISDSESEDPPRHQASDSENEEL 273
Cdd:pfam07263 471 DSNSTESTSSSEEDGQSKNMEIESRKL 497
PTZ00482 PTZ00482
membrane-attack complex/perforin (MACPF) Superfamily; Provisional
10-181 5.05e-03

membrane-attack complex/perforin (MACPF) Superfamily; Provisional


Pssm-ID: 240433 [Multi-domain]  Cd Length: 844  Bit Score: 40.62  E-value: 5.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  10 DQDPpeeDDGGATPVQDERDSGSDGEDDVNEQHSGSDTGSVERHSENETSDREdglpkghhvtDSENDEPLNlNASDSES 89
Cdd:PTZ00482   87 DDDD---DDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANN----------DQTNDFDQD-DSSNSQT 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  90 EELHRQKDSDSESEERAEPPASDSENE-DVNQHGSDSESEETRKLPGSDSENEELLNghaSDSENEDVGkhpASDSEIEE 168
Cdd:PTZ00482  153 DQGLKQSVNLSSAEKLIEEKKGQTENTfKFYNFGNDGEEAAAKDGGKSKSSDPGPLN---DSDGQGDDG---DPESAEED 226
                         170
                  ....*....|...
gi 2462575015 169 LQKSPASDSETED 181
Cdd:PTZ00482  227 KAASNTRAAYTKA 239
PRK08581 PRK08581
amidase domain-containing protein;
159-416 6.69e-03

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 40.16  E-value: 6.69e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 159 HPASDSEIEELQKSPASDSETEDalkpqisDSESEEPPRHQASDSENEEPPKPRMSDSESEELPKPQVSDSESEEPPRHQ 238
Cdd:PRK08581   26 YADDPQKDSTAKTTSHDSKKSND-------DETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKN 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 239 ASDSENEELPKPRISDSESEDPPRHQASDSENEELPKPRISDSESE-DPPRNQASDSENEELPKPRVSDSESEGPQKGPA 317
Cdd:PRK08581   99 LPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKsTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPS 178
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015 318 SDSETEDASRHKQKPESDDDSdreNKGEDTEMQNDSFHSDSHMDRKKFHSS-------DSEEEEHKKQKMDSDEDEKEGE 390
Cdd:PRK08581  179 SNNTKPSTSNKQPNSPKPTQP---NQSNSQPASDDTANQKSSSKDNQSMSDsaldsilDQYSEDAKKTQKDYASQSKKDK 255
                         250       260
                  ....*....|....*....|....*.
gi 2462575015 391 EEKVAKRKAAVLSDSEDEEKASAKKS 416
Cdd:PRK08581  256 TETSNTKNPQLPTQDELKHKSKPAQS 281
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
168-382 7.04e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 40.03  E-value: 7.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  168 ELQKSPASDSETED--ALKPQISDSESEEPPRHQASDSENEEPPKPRMSDSESEElpkpqVSDSESEEPPRHQASDSENE 245
Cdd:PTZ00108  1168 KLRKPKLKKKEKKKkkSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSG-----SDQEDDEEQKTKPKKSSVKR 1242
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462575015  246 ELPKPRISDSESEDPPRHQASDSENEELPK---PRISDSESEDPPRNQASDSENEelPKPRVSDSESEGPQKGPASDSET 322
Cdd:PTZ00108  1243 LKSKKNNSSKSSEDNDEFSSDDLSKEGKPKnapKRVSAVQYSPPPPSKRPDGESN--GGSKPSSPTKKKVKKRLEGSLAA 1320
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462575015  323 EDASRHKQKPESDDDS--DRENKGEDTEMQNDSFhsdshmdRKKFHSSDSEEEEHKKQKMDS 382
Cdd:PTZ00108  1321 LKKKKKSEKKTARKKKskTRVKQASASQSSRLLR-------RPRKKKSDSSSEDDDDSEVDD 1375
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH