NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907157157|ref|XP_036020290|]
View 

arginine-glutamic acid dipeptide repeats protein isoform X14 [Mus musculus]

Protein Classification

GATA-type transcription factor( domain architecture ID 12975021)

GATA-type transcription factor binds to the DNA consensus sequence [AT]GATA[AG] and may function as a transcription factor/regulator

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
173-1173 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 1090.96  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  173 GKHSMRTRRSRGSgqMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLK 252
Cdd:pfam03154    1 GKHSMRTRRSRGS--MSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLK 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  253 STKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 332
Cdd:pfam03154   79 SAKRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  333 DSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPT 412
Cdd:pfam03154  159 DSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPT 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  413 LHPPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPA 489
Cdd:pfam03154  237 LHPQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPA 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  490 AAHP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPAL 568
Cdd:pfam03154  317 APGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL 396
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  569 KPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPIT 647
Cdd:pfam03154  397 KPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPIT 476
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  648 PPSCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHAS 727
Cdd:pfam03154  477 PPSGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  728 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqK 807
Cdd:pfam03154  555 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-K 633
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  808 ASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAY 887
Cdd:pfam03154  634 ASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAY 713
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  888 HMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASF 967
Cdd:pfam03154  714 HMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASF 793
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  968 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 1047
Cdd:pfam03154  794 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 873
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 1048 SAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQA 1127
Cdd:pfam03154  874 SGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQA 945
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....*.
gi 1907157157 1128 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1173
Cdd:pfam03154  946 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
7-45 3.91e-18

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


:

Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 78.81  E-value: 3.91e-18
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1907157157    7 KRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 45
Cdd:cd11661      8 KLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
ZnF_GATA smart00401
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
108-157 2.03e-15

zinc finger binding to DNA consensus sequence [AT]GATA[AG];


:

Pssm-ID: 214648 [Multi-domain]  Cd Length: 52  Bit Score: 71.30  E-value: 2.03e-15
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 1907157157   108 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 157
Cdd:smart00401    2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
 
Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
173-1173 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 1090.96  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  173 GKHSMRTRRSRGSgqMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLK 252
Cdd:pfam03154    1 GKHSMRTRRSRGS--MSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLK 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  253 STKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 332
Cdd:pfam03154   79 SAKRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  333 DSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPT 412
Cdd:pfam03154  159 DSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPT 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  413 LHPPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPA 489
Cdd:pfam03154  237 LHPQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPA 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  490 AAHP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPAL 568
Cdd:pfam03154  317 APGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL 396
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  569 KPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPIT 647
Cdd:pfam03154  397 KPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPIT 476
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  648 PPSCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHAS 727
Cdd:pfam03154  477 PPSGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  728 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqK 807
Cdd:pfam03154  555 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-K 633
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  808 ASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAY 887
Cdd:pfam03154  634 ASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAY 713
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  888 HMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASF 967
Cdd:pfam03154  714 HMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASF 793
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  968 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 1047
Cdd:pfam03154  794 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 873
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 1048 SAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQA 1127
Cdd:pfam03154  874 SGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQA 945
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....*.
gi 1907157157 1128 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1173
Cdd:pfam03154  946 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
7-45 3.91e-18

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 78.81  E-value: 3.91e-18
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1907157157    7 KRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 45
Cdd:cd11661      8 KLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
ZnF_GATA smart00401
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
108-157 2.03e-15

zinc finger binding to DNA consensus sequence [AT]GATA[AG];


Pssm-ID: 214648 [Multi-domain]  Cd Length: 52  Bit Score: 71.30  E-value: 2.03e-15
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 1907157157   108 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 157
Cdd:smart00401    2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
ZnF_GATA cd00202
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] ...
111-165 2.46e-15

Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C


Pssm-ID: 238123 [Multi-domain]  Cd Length: 54  Bit Score: 71.25  E-value: 2.46e-15
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907157157  111 ACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGELPPIEKPvDPPPFMFKPVK 165
Cdd:cd00202      1 ACSNCGTTTTPLWRRGPSGGSTLCNACGLYWKKHGVMRPLSKR-KKDQIKRRNRK 54
GATA pfam00320
GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This ...
112-147 9.37e-12

GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.


Pssm-ID: 425605 [Multi-domain]  Cd Length: 36  Bit Score: 60.41  E-value: 9.37e-12
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1907157157  112 CRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL 147
Cdd:pfam00320    1 CSNCGTTKTPLWRRGPNGNRTLCNACGLYYKKKGLK 36
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
150-336 1.94e-09

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 62.23  E-value: 1.94e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  150 IEKPVDPP----PFMFKPVKEEDDGL----SGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSP 221
Cdd:NF033609   539 IDKPVVPEqpdePGEIEPIPEDSDSDpgsdSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDS 618
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  222 SAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRS 299
Cdd:NF033609   619 ASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDS 698
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1907157157  300 VNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 336
Cdd:NF033609   699 DSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 734
PHA03247 PHA03247
large tegument protein UL36; Provisional
183-632 3.96e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 3.96e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  183 RGSGQMSTLRSGRKKQPTSPDGRASPINE--DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREK 260
Cdd:PHA03247  2576 RPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD 2655
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  261 VASDTEDTDRITSKKTKTqeiSRPNSPSEGEGESSDSRSVNDEGSS-----DPKDIDQDNRSTSPSIPSPQDNESDSDSS 335
Cdd:PHA03247  2656 PAPGRVSRPRRARRLGRA---AQASSPPQRPRRRAARPTVGSLTSLadpppPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  336 AQQQMLQAQPPAlqaPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHP 415
Cdd:PHA03247  2733 PALPAAPAPPAV---PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES------LPSPWD 2803
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  416 PRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFG-LPSQPSQGQGPLGPSPAAAHPH 494
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdVRRRPPSRSPAAKPAAPARPPV 2883
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  495 STIQLPA-SQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPALKPLSS 573
Cdd:PHA03247  2884 RRLARPAvSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPW 2963
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907157157  574 LSTHHPPSAHPPPLQLmPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP 632
Cdd:PHA03247  2964 LGALVPGRVAVPRFRV-PQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPP 3021
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
207-367 1.01e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.91  E-value: 1.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  207 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPN 285
Cdd:NF033609   716 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 795
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  286 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPS-IPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPP 363
Cdd:NF033609   796 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP 875

                   ....
gi 1907157157  364 GTPQ 367
Cdd:NF033609   876 NSPK 879
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
185-336 1.52e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.06  E-value: 1.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  185 SGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASD 264
Cdd:NF033609   628 SDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 707
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907157157  265 TE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 336
Cdd:NF033609   708 SDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
207-336 2.23e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.67  E-value: 2.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  207 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTeDTDRITSKKTKTQEISRPNS 286
Cdd:NF033609   674 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDS 752
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907157157  287 PSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 336
Cdd:NF033609   753 DSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
 
Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
173-1173 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 1090.96  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  173 GKHSMRTRRSRGSgqMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLK 252
Cdd:pfam03154    1 GKHSMRTRRSRGS--MSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLK 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  253 STKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 332
Cdd:pfam03154   79 SAKRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  333 DSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPT 412
Cdd:pfam03154  159 DSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPT 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  413 LHPPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPA 489
Cdd:pfam03154  237 LHPQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPA 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  490 AAHP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPAL 568
Cdd:pfam03154  317 APGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL 396
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  569 KPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPIT 647
Cdd:pfam03154  397 KPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPIT 476
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  648 PPSCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHAS 727
Cdd:pfam03154  477 PPSGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  728 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqK 807
Cdd:pfam03154  555 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-K 633
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  808 ASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAY 887
Cdd:pfam03154  634 ASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAY 713
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  888 HMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASF 967
Cdd:pfam03154  714 HMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASF 793
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  968 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 1047
Cdd:pfam03154  794 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 873
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157 1048 SAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQA 1127
Cdd:pfam03154  874 SGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQA 945
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....*.
gi 1907157157 1128 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1173
Cdd:pfam03154  946 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
7-45 3.91e-18

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 78.81  E-value: 3.91e-18
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1907157157    7 KRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 45
Cdd:cd11661      8 KLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
ZnF_GATA smart00401
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
108-157 2.03e-15

zinc finger binding to DNA consensus sequence [AT]GATA[AG];


Pssm-ID: 214648 [Multi-domain]  Cd Length: 52  Bit Score: 71.30  E-value: 2.03e-15
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 1907157157   108 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 157
Cdd:smart00401    2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
ZnF_GATA cd00202
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] ...
111-165 2.46e-15

Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C


Pssm-ID: 238123 [Multi-domain]  Cd Length: 54  Bit Score: 71.25  E-value: 2.46e-15
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907157157  111 ACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGELPPIEKPvDPPPFMFKPVK 165
Cdd:cd00202      1 ACSNCGTTTTPLWRRGPSGGSTLCNACGLYWKKHGVMRPLSKR-KKDQIKRRNRK 54
GATA pfam00320
GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This ...
112-147 9.37e-12

GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.


Pssm-ID: 425605 [Multi-domain]  Cd Length: 36  Bit Score: 60.41  E-value: 9.37e-12
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1907157157  112 CRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL 147
Cdd:pfam00320    1 CSNCGTTKTPLWRRGPNGNRTLCNACGLYYKKKGLK 36
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
339-505 1.04e-09

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 62.75  E-value: 1.04e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  339 QM-LQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQgspatsQPPNQTQSTVAPAAHTHIQQAPtlhppr 417
Cdd:pfam09770  202 AMrAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ------QQPQQPQQHPGQGHPVTILQRP------ 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  418 lpsphpplQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQtgpLLQHP-------------GPPQPFGLPSQPSQGQGPL 484
Cdd:pfam09770  270 --------QSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQ---ILQNPnrlsaarvgypqnPQPGVQPAPAHQAHRQQGS 338
                          170       180
                   ....*....|....*....|...
gi 1907157157  485 --GPSPAAAHPHSTIQLPASQSA 505
Cdd:pfam09770  339 fgRQAPIITHPQQLAQLSEEEKA 361
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
150-336 1.94e-09

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 62.23  E-value: 1.94e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  150 IEKPVDPP----PFMFKPVKEEDDGL----SGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSP 221
Cdd:NF033609   539 IDKPVVPEqpdePGEIEPIPEDSDSDpgsdSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDS 618
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  222 SAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRS 299
Cdd:NF033609   619 ASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDS 698
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1907157157  300 VNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 336
Cdd:NF033609   699 DSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 734
PHA03247 PHA03247
large tegument protein UL36; Provisional
183-632 3.96e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 3.96e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  183 RGSGQMSTLRSGRKKQPTSPDGRASPINE--DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREK 260
Cdd:PHA03247  2576 RPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD 2655
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  261 VASDTEDTDRITSKKTKTqeiSRPNSPSEGEGESSDSRSVNDEGSS-----DPKDIDQDNRSTSPSIPSPQDNESDSDSS 335
Cdd:PHA03247  2656 PAPGRVSRPRRARRLGRA---AQASSPPQRPRRRAARPTVGSLTSLadpppPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  336 AQQQMLQAQPPAlqaPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHP 415
Cdd:PHA03247  2733 PALPAAPAPPAV---PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES------LPSPWD 2803
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  416 PRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFG-LPSQPSQGQGPLGPSPAAAHPH 494
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdVRRRPPSRSPAAKPAAPARPPV 2883
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  495 STIQLPA-SQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPALKPLSS 573
Cdd:PHA03247  2884 RRLARPAvSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPW 2963
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907157157  574 LSTHHPPSAHPPPLQLmPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP 632
Cdd:PHA03247  2964 LGALVPGRVAVPRFRV-PQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPP 3021
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
323-493 8.38e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.45  E-value: 8.38e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  323 PSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPA 402
Cdd:PRK07764   597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGA 676
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  403 AHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLH------SQGPPGPHSLQTGPLLQHPG-PPQPFGLPS 475
Cdd:PRK07764   677 APAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQppqaaqGASAPSPAADDPVPLPPEPDdPPDPAGAPA 756
                          170
                   ....*....|....*...
gi 1907157157  476 QPSQGQGPLGPSPAAAHP 493
Cdd:PRK07764   757 QPPPPPAPAPAAAPAAAP 774
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
341-493 9.29e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.45  E-value: 9.29e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  341 LQAQPPALQAPSGAASAPSTAPPGTPQL-------PTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTL 413
Cdd:PRK07764   580 GDWQVEAVVGPAPGAAGGEGPPAPASSGppeeaarPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAV 659
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  414 HPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGP-PQPFGLPSQPSQGQGPLGPSPAAAH 492
Cdd:PRK07764   660 PDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQaDDPAAQPPQAAQGASAPSPAADDPV 739

                   .
gi 1907157157  493 P 493
Cdd:PRK07764   740 P 740
PHA03247 PHA03247
large tegument protein UL36; Provisional
271-637 1.14e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 1.14e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  271 ITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSAQQQMLQAQPPALQA 350
Cdd:PHA03247  2582 VTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPS-PAANEPDPHPPPTVPPPERPRDDPAPGR 2660
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  351 PSGAASAPSTAPPGTPQLPTQGPTPSAtAVPPQGSPATSQPPNQTQSTVAPAahthiqqaPTLHPPRLPsphpplqpmTA 430
Cdd:PHA03247  2661 VSRPRRARRLGRAAQASSPPQRPRRRA-ARPTVGSLTSLADPPPPPPTPEPA--------PHALVSATP---------LP 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  431 PPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQSALQPQQ 510
Cdd:PHA03247  2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPW 2802
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  511 PPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNA------------NLPPPPALKPLSSLSTHH 578
Cdd:PHA03247  2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggsvapggdvrrRPPSRSPAAKPAAPARPP 2882
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  579 PPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP-FPQHP 637
Cdd:PHA03247  2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPpRPQPP 2942
PHA03247 PHA03247
large tegument protein UL36; Provisional
315-744 1.60e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 1.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  315 NRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSG------AASAPSTAPPGTPQLPTQGPTPSATAV-PPQGSPA 387
Cdd:PHA03247  2565 DRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDdrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANePDPHPPP 2644
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  388 TSQPPNQTQSTVAPAA-----HTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHS---LQTG 459
Cdd:PHA03247  2645 TVPPPERPRDDPAPGRvsrprRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSatpLPPG 2724
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  460 PLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHStiqlPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLP 539
Cdd:PHA03247  2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS 2800
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  540 APQAHKHPPHLSGPSPFSLNANLPPPPALKPLSSLSTHHPPSAHPPPLQL--------------MPQSQPLPSSPAQPPG 605
Cdd:PHA03247  2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLplggsvapggdvrrRPPSRSPAAKPAAPAR 2880
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  606 lTQSQSLPPPAASHPTTGLHQVPSQSPFPQHPfvPGGPPPITPPSCPPTSTPPAGPSSSSQPPCSAAVSSGGSVPGAPSC 685
Cdd:PHA03247  2881 -PPVRRLARPAVSRSTESFALPPDQPERPPQP--QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907157157  686 PLPAVQikeeaLDEAEEPESPPPPPRSPSPEPTVvdtPSHASQSARFYKHLDRGYNSCA 744
Cdd:PHA03247  2958 AVPQPW-----LGALVPGRVAVPRFRVPQPAPSR---EAPASSTPPLTGHSLSRVSSWA 3008
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
330-485 3.71e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 51.14  E-value: 3.71e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  330 SDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPAtsqPPNQTQSTVAPAAhthiQQ 409
Cdd:PRK07764   367 ASDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPA---PAAAPQPAPAPAP----AP 439
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907157157  410 APTlhpPRLPSPHPPLQPMTAPPSQSSAQPHPQPslhsQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLG 485
Cdd:PRK07764   440 APP---SPAGNAPAGGAPSPPPAAAPSAQPAPAP----AAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
207-367 1.01e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.91  E-value: 1.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  207 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPN 285
Cdd:NF033609   716 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 795
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  286 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPS-IPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPP 363
Cdd:NF033609   796 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP 875

                   ....
gi 1907157157  364 GTPQ 367
Cdd:NF033609   876 NSPK 879
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
343-461 2.04e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 48.56  E-value: 2.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  343 AQPPAlqAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHPPRLPSPH 422
Cdd:PRK14951   382 ARPEA--AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAA------APAAVALAPAPPA 453
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1907157157  423 PPLQPMTAPPSQSSAQPH-PQPSLHSQGPPGPHSLQTGPL 461
Cdd:PRK14951   454 QAAPETVAIPVRVAPEPAvASAAPAPAAAPAAARLTPTEE 493
PLN02967 PLN02967
kinase
163-294 2.29e-05

kinase


Pssm-ID: 215521 [Multi-domain]  Cd Length: 581  Bit Score: 48.50  E-value: 2.29e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  163 PVKEEDDGLSGKHSMRTRRSRgsgqmstlRSGRKKQPTSPDGRASPINEDIRssgrNSPSAASTSSNDSKAETVKKSA-- 240
Cdd:PLN02967    57 AVDEEPDENGAVSKKKPTRSV--------KRATKKTVVEISEPLEEGSELVV----NEDAALDKESKKTPRRTRRKAAaa 124
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907157157  241 -KKVKEEAASPLKSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGES 294
Cdd:PLN02967   125 sSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEEL 179
PRK10856 PRK10856
cytoskeleton protein RodZ;
324-405 2.62e-05

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 47.71  E-value: 2.62e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  324 SPQDNES---DSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVA 400
Cdd:PRK10856   155 SQNSGQSvplDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGA 234

                   ....*
gi 1907157157  401 PAAHT 405
Cdd:PRK10856   235 APLPT 239
PHA03247 PHA03247
large tegument protein UL36; Provisional
147-560 2.66e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 2.66e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  147 LPPIEKPVDPPPFMFKPVKEEDDGLSGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAAST 226
Cdd:PHA03247  2617 LPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLT 2696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  227 SSNDSKAETVKKSAKKVKEEAASPL---KSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDE 303
Cdd:PHA03247  2697 SLADPPPPPPTPEPAPHALVSATPLppgPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPA 2776
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  304 GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTA---PPGTPQLPTQGPTPSATAV 380
Cdd:PHA03247  2777 AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAqptAPPPPPGPPPPSLPLGGSV 2856
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  381 PPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPH-SLQTG 459
Cdd:PHA03247  2857 APGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQpPPPPP 2936
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  460 PLLQHPGPPQPFGLPSQPSQGQGP-----------------LGPSPAAA-------------HPHSTIQLPASQSALQPQ 509
Cdd:PHA03247  2937 PRPQPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrvavprfRVPQPAPSreapasstppltgHSLSRVSSWASSLALHEE 3016
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  510 QPPRE-----------------QPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLS------------GPSPFSLNA 560
Cdd:PHA03247  3017 TDPPPvslkqtlwppddtedsdADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPeagarespssqfGPPPLSANA 3096
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
316-457 4.21e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 47.72  E-value: 4.21e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  316 RSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTP-------SATAVPPQGSPAT 388
Cdd:pfam09770  204 RAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVtilqrpqSPQPDPAQPSIQP 283
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907157157  389 SQPPNQTQSTVAPAAHTHIQQAPTLhpPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPP---GPHSLQ 457
Cdd:pfam09770  284 QAQQFHQQPPPVPVQPTQILQNPNR--LSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPiitHPQQLA 353
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
272-453 5.30e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.67  E-value: 5.30e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  272 TSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSA--QQQMLQAQPPALQ 349
Cdd:PRK07764   600 PPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDggDGWPAKAGGAAPA 679
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  350 APSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMT 429
Cdd:PRK07764   680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPP 759
                          170       180
                   ....*....|....*....|....
gi 1907157157  430 APPSQSSAQPHPQPSLHSQGPPGP 453
Cdd:PRK07764   760 PPPAPAPAAAPAAAPPPSPPSEEE 783
PRK10263 PRK10263
DNA translocase FtsK; Provisional
349-499 6.12e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.39  E-value: 6.12e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  349 QAPSGAASAPSTAPPGTPQlptQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPM 428
Cdd:PRK10263   738 DGPHEPLFTPIVEPVQQPQ---QPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ 814
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907157157  429 TAPPS-QSSAQPHPQPSLHSQGP-PGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQL 499
Cdd:PRK10263   815 PQYQQpQQPVAPQPQYQQPQQPVaPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVEPVDTFAL 887
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
298-403 1.08e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 46.81  E-value: 1.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  298 RSVNDEGSSDPK--DIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPgTPQLPTQGPTP 375
Cdd:PRK12270    17 QYLADPNSVDPSwrEFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP-KPAAAAAAAAA 95
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1907157157  376 SATAVPPQGSPATSQPPNQTQSTV---APAA 403
Cdd:PRK12270    96 PAAPPAAAAAAAPAAAAVEDEVTPlrgAAAA 126
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
185-336 1.52e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.06  E-value: 1.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  185 SGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASD 264
Cdd:NF033609   628 SDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 707
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907157157  265 TE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 336
Cdd:NF033609   708 SDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
343-554 2.21e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.64  E-value: 2.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  343 AQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPnqtqstvAPAAHTHIQQAPTLHPPrlpsph 422
Cdd:PRK12323   379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP-------APEALAAARQASARGPG------ 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  423 pplqPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPllqhPGPPQPFGLPSQPSQGQGP---LGPSPAAAHPHSTIQL 499
Cdd:PRK12323   446 ----GAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA----PARAAPAAAPAPADDDPPPweeLPPEFASPAPAQPDAA 517
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907157157  500 PASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPS 554
Cdd:PRK12323   518 PAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
207-336 2.23e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.67  E-value: 2.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  207 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTeDTDRITSKKTKTQEISRPNS 286
Cdd:NF033609   674 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDS 752
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907157157  287 PSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 336
Cdd:NF033609   753 DSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
230-400 3.18e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 45.10  E-value: 3.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  230 DSKAETVKKSAKKVKEEAAsPLKSTKRQREKVASDTEdtdriTSKKTKTQEISRPNSPSEGEGESSDSRSVNdEGSSDPK 309
Cdd:PRK14949   630 SPKEGDGKKSSADRKPKTP-PSRAPPASLSKPASSPD-----ASQTSASFDLDPDFELATHQSVPEAALASG-SAPAPPP 702
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  310 DIDQDNRstsPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATS 389
Cdd:PRK14949   703 VPDPYDR---PPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQ 779
                          170
                   ....*....|.
gi 1907157157  390 QPPNQTQSTVA 400
Cdd:PRK14949   780 DTELNLVLLSS 790
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
342-462 4.71e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 44.32  E-value: 4.71e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  342 QAQPPALQAPSGAASAPSTAPPGTPQLPTQGPT---PSATAVPPQGSPATSQPPNqtQSTVAPAAHTHIQQAPTLHPPRL 418
Cdd:PRK14951   387 AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAaapPAPVAAPAAAAPAAAPAAA--PAAVALAPAPPAQAAPETVAIPV 464
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1907157157  419 PSPHPPLQPMTAPPSqssaQPHPQPSLHSQGPPGPHSLQTGPLL 462
Cdd:PRK14951   465 RVAPEPAVASAAPAP----AAAPAAARLTPTEEGDVWHATVQQL 504
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
323-483 6.85e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 43.70  E-value: 6.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  323 PSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQgspATSQPPNQTQSTVAPA 402
Cdd:PRK07994   368 PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQ---LQRAQGATKAKKSEPA 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  403 AHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQG 482
Cdd:PRK07994   445 AASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPELAAKLAAEAIERD 524

                   .
gi 1907157157  483 P 483
Cdd:PRK07994   525 P 525
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
298-410 9.14e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 43.23  E-value: 9.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  298 RSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAlqAPSGAASAPSTAPPGTPQLPTQGPTPSA 377
Cdd:PRK14971   363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPS--APQSATQPAGTPPTVSVDPPAAVPVNPP 440
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1907157157  378 TAVPPQGSPATSQPPNQ----TQSTVAPAAHTHIQQA 410
Cdd:PRK14971   441 STAPQAVRPAQFKEEKKipvsKVSSLGPSTLRPIQEK 477
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
142-454 1.23e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.14  E-value: 1.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  142 KKYGELPPIEK---------PVDPPPFMFKPVKEEDDGLSGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASpined 212
Cdd:PTZ00449   491 KSKKKLAPIEEedsdkhdepPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAK----- 565
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  213 irssgRNSPSAASTSSNDSKAETVKKSAKKVKEEaasplKSTKRQRekvaSDTEDTDRITSKKTKTQEI----SRPNSPS 288
Cdd:PTZ00449   566 -----EHKPSKIPTLSKKPEFPKDPKHPKDPEEP-----KKPKRPR----SAQRPTRPKSPKLPELLDIpkspKRPESPK 631
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  289 EGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIP-SPQDNES--DSDSSAQQQMLQAQPPALQAPSGAASAPSTAP--P 363
Cdd:PTZ00449   632 SPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPfDPKFKEKfyDDYLDAAAKSKETKTTVVLDESFESILKETLPetP 711
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  364 GTP-----QLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQP----MTAPPSQ 434
Cdd:PTZ00449   712 GTPfttprPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEdihaETGEPDE 791
                          330       340
                   ....*....|....*....|
gi 1907157157  435 SSAQPHpQPSLHSQGPPGPH 454
Cdd:PTZ00449   792 AMKRPD-SPSEHEDKPPGDH 810
dnaA PRK14086
chromosomal replication initiator protein DnaA;
316-487 1.48e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.89  E-value: 1.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  316 RSTSPSIPSPQDNESDSDSSAQQQMLQAQP--PALQAPSGAASAPSTAPPGTPQLPTQGPTPSAtavpPQGSPATSQPPn 393
Cdd:PRK14086   115 RRPYEGYGGPRADDRPPGLPRQDQLPTARPayPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRA----PYASPASYAPE- 189
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  394 qtQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQpmtaPPSQSSAQPHPQPS---LHSQGPPGPHSLQTGPLLQHPGPPQP 470
Cdd:PRK14086   190 --QERDREPYDAGRPEYDQRRRDYDHPRPDWDR----PRRDRTDRPEPPPGaghVHRGGPGPPERDDAPVVPIRPSAPGP 263
                          170
                   ....*....|....*..
gi 1907157157  471 FglPSQPSQGQGPLGPS 487
Cdd:PRK14086   264 L--AAQPAPAPGPGEPT 278
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
332-555 2.11e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 42.33  E-value: 2.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  332 SDSSAQQQML-QAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATA------------------------VPPQGSP 386
Cdd:pfam09770   92 SDAIEEEQVRfNRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVrtgyekykepepipdlqvdaslwgVAPKKAA 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  387 ATSQPPnqtqsTVAPAAHTHIQQAPTLHPPRLPSPHPPLQpmTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPG 466
Cdd:pfam09770  172 APAPAP-----QPAAQPASLPAPSRKMMSLEEVEAAMRAQ--AKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  467 PPQPFGLPSQPSQGQGPlgpsPAAAHPHSTI-QLPASQSALQPQQPPREQPLPPAPLAMPHI---------KPPPTTPIP 536
Cdd:pfam09770  245 QPQQQPQQPQQHPGQGH----PVTILQRPQSpQPDPAQPSIQPQAQQFHQQPPPVPVQPTQIlqnpnrlsaARVGYPQNP 320
                          250
                   ....*....|....*....
gi 1907157157  537 QLPAPQAHKHPPHLSGPSP 555
Cdd:pfam09770  321 QPGVQPAPAHQAHRQQGSF 339
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
340-470 2.17e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.01  E-value: 2.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  340 MLQAQPP-ALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqQAPTLHPPRL 418
Cdd:PRK14951   361 LLAFKPAaAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP-----VAAPAAAAPA 435
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907157157  419 PSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLlqHPGPPQP 470
Cdd:PRK14951   436 AAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPA--PAAAPAA 485
PRK10927 PRK10927
cell division protein FtsN;
244-470 2.53e-03

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 41.59  E-value: 2.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  244 KEEAASPLKSTKRQREKVASDTEDTDR-ITSKKTKTQEISRPNSPSEGeGESSDSRSVNDEGSSDPKDIDQDNRSTSPSI 322
Cdd:PRK10927    58 KKEESETLQSQKVTGNGLPPKPEERWRyIKELESRQPGVRAPTEPSAG-GEVKTPEQLTPEQRQLLEQMQADMRQQPTQL 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  323 PSPQDNESDSDSsaQQQMLQAQPpalQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPatsQPPNQTQStvapa 402
Cdd:PRK10927   137 VEVPWNEQTPEQ--RQQTLQRQR---QAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQP---RQSKPAST----- 203
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907157157  403 ahthiqQAPtlhpprlpsphppLQPMTAPPSQSSAQPHPQpslhsqgppgphslQTGPLLQHPGPPQP 470
Cdd:PRK10927   204 ------QQP-------------YQDLLQTPAHTTAQSKPQ--------------QAAPVTRAADAPKP 238
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
343-432 2.82e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 41.72  E-value: 2.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  343 AQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSAtavPPQGSPATsqPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPH 422
Cdd:PRK14950   366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK---EPVRETAT--PPPVPPRPVAPPVPHTPESAPKLTRAAIPVDE 440
                           90
                   ....*....|
gi 1907157157  423 PPLQPMTAPP 432
Cdd:PRK14950   441 KPKYTPPAPP 450
PHA03247 PHA03247
large tegument protein UL36; Provisional
346-691 2.93e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 2.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  346 PALQAPSGAASApsTAPPGTPQlPTQGPTPSATAVPPQGSPATSQPPNQTQ-STVAPAAHTHIQQAPTLHPPRLPSPHPP 424
Cdd:PHA03247  2478 PVYRRPAEARFP--FAAGAAPD-PGGGGPPDPDAPPAPSRLAPAILPDEPVgEPVHPRMLTWIRGLEELASDDAGDPPPP 2554
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  425 LQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAH--------PHST 496
Cdd:PHA03247  2555 LPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHapdppppsPSPA 2634
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  497 IQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGP-SPFSLNANLPPPPALKPLSSLS 575
Cdd:PHA03247  2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTvGSLTSLADPPPPPPTPEPAPHA 2714
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  576 THHPPSAHPPPLQLMPQSQPLPSSPAQP--------PGLTQSQSLPPPAASHPTTGLHQVPSQSPFPQHPfvpGGPPPIT 647
Cdd:PHA03247  2715 LVSATPLPPGPAAARQASPALPAAPAPPavpagpatPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT---RPAVASL 2791
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1907157157  648 PPSCPPTSTPPAGPSSSSQPPCSAAVSSGGSVPGAPSCPLPAVQ 691
Cdd:PHA03247  2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ 2835
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
318-500 3.03e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.79  E-value: 3.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  318 TSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQ----GSPATSQPPN 393
Cdd:PRK12323   400 AAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAaagpRPVAAAAAAA 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  394 QTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQT---GPLLQHPGPPQP 470
Cdd:PRK12323   480 PARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPApaaAPAPRAAAATEP 559
                          170       180       190
                   ....*....|....*....|....*....|
gi 1907157157  471 FGLPSQPSQGQGPLGPSPAAAHPHSTIQLP 500
Cdd:PRK12323   560 VVAPRPPRASASGLPDMFDGDWPALAARLP 589
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
344-432 3.39e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.80  E-value: 3.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  344 QPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPqGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHP 423
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPP-AAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115

                   ....*....
gi 1907157157  424 PLQPMTAPP 432
Cdd:PRK12270   116 EVTPLRGAA 124
PRK08581 PRK08581
amidase domain-containing protein;
200-462 3.44e-03

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 41.70  E-value: 3.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  200 TSPDGRASPINEDIRSSGRNSPSAASTSsNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITS---KKT 276
Cdd:PRK08581    21 TSPTAYADDPQKDSTAKTTSHDSKKSND-DETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDfiyKNL 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  277 KTQEISRPNSPSEGEGESSDSRSVNDEGSSDpKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAL----QAPS 352
Cdd:PRK08581   100 PQTNINQLLTKNKYDDNYSLTTLIQNLFNLN-SDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKadnqKAPS 178
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  353 GAASAPST-------APPGTPQLPTQGPTPSATAVPPQGS--------------------PATSQPPNQTQSTVAPAAHT 405
Cdd:PRK08581   179 SNNTKPSTsnkqpnsPKPTQPNQSNSQPASDDTANQKSSSkdnqsmsdsaldsildqyseDAKKTQKDYASQSKKDKTET 258
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907157157  406 HIQQAPTLHPPRLPSPhpplqpmTAPPSQSSAQPHPQPSLHSQgppgpHSLQTGPLL 462
Cdd:PRK08581   259 SNTKNPQLPTQDELKH-------KSKPAQSFENDVNQSNTRST-----SLFETGPSL 303
PHA03269 PHA03269
envelope glycoprotein C; Provisional
352-467 4.47e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 41.25  E-value: 4.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  352 SGAASAPSTAPpgTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTlhpprlPSPHPPLQPMTAP 431
Cdd:PHA03269    17 LIIANLNTNIP--IPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPT------PAASEKFDPAPAP 88
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 1907157157  432 PSQSSAQPHPQ--PSLHSQGPPGPHSLQTGPLLQHPGP 467
Cdd:PHA03269    89 HQAASRAPDPAvaPQLAAAPKPDAAEAFTSAAQAHEAP 126
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
141-315 4.72e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 41.19  E-value: 4.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  141 FKKYGELPPIEKPVDPPPFMFKPVKEEDDglsgKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNS 220
Cdd:PTZ00108  1223 SDQEDDEEQKTKPKKSSVKRLKSKKNNSS----KSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNG 1298
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  221 PSAASTSSNDSKAETVKKSAKKVKEeaasPLKSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEgESSDSRSV 300
Cdd:PTZ00108  1299 GSKPSSPTKKKVKKRLEGSLAALKK----KKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSS-EDDDDSEV 1373
                          170
                   ....*....|....*
gi 1907157157  301 NDEGSSDPKDIDQDN 315
Cdd:PTZ00108  1374 DDSEDEDDEDDEDDD 1388
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
197-506 5.62e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 40.86  E-value: 5.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  197 KQPTSPDGRASPINEDIRSSgrNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITSKKT 276
Cdd:PRK14949   473 EASSSLDADNSAVPEQIDST--AEQSVVNPSVTDTQVDDTSASNNSAADNTVDDNYSAEDTLESNGLDEGDYAQDSAPLD 550
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  277 KTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNrSTSPSIPSPQDNESDSDSS--------AQQQMLQAQPPAL 348
Cdd:PRK14949   551 AYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPS-SQSLSPISAVTTAAASLADddildavlAARDSLLSDLDAL 629
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  349 QAPSGAASAPSTA--PPGTPQLPTQGPTPSATAVPP--QGSPATSQPPNQTQSTV--APAAHTHIQQAPTLHPPRLPSPH 422
Cdd:PRK14949   630 SPKEGDGKKSSADrkPKTPPSRAPPASLSKPASSPDasQTSASFDLDPDFELATHqsVPEAALASGSAPAPPPVPDPYDR 709
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  423 PplqPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLlqHPGPPQPFGLPSQPSQGQGPlGPSPAAAHPHSTIQLPAS 502
Cdd:PRK14949   710 P---PWEEAPEVASANDGPNNAAEGNLSESVEDASNSEL--QAVEQQATHQPQVQAEAQSP-ASTTALTQTSSEVQDTEL 783

                   ....
gi 1907157157  503 QSAL 506
Cdd:PRK14949   784 NLVL 787
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
331-436 6.55e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 40.53  E-value: 6.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  331 DSDSSAQQQMLQAQPPALQAPSgAASAPSTAPPGTPQLPTQGPTPSatavPPQGSPATSQPPNQTQSTVAPAAHTHIQQA 410
Cdd:PRK14971   366 GDDASGGRGPKQHIKPVFTQPA-AAPQPSAAAAASPSPSQSSAAAQ----PSAPQSATQPAGTPPTVSVDPPAAVPVNPP 440
                           90       100
                   ....*....|....*....|....*.
gi 1907157157  411 PTLHPPRLPSPHPPLQPMtaPPSQSS 436
Cdd:PRK14971   441 STAPQAVRPAQFKEEKKI--PVSKVS 464
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
368-505 7.00e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 7.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  368 LPTQGPTPSATAVPPQGSPAtSQPPNQTQSTVAPAAhthiqqaptlhpprlpsphpplqpMTAPPSQSSAQPHPQPSLHS 447
Cdd:PRK07764   385 LGVAGGAGAPAAAAPSAAAA-APAAAPAPAAAAPAA------------------------AAAPAPAAAPQPAPAPAPAP 439
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907157157  448 QGPPGPHSLQTGPLLQHP----GPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQSA 505
Cdd:PRK07764   440 APPSPAGNAPAGGAPSPPpaaaPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPA 501
PHA03378 PHA03378
EBNA-3B; Provisional
342-476 7.89e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 40.44  E-value: 7.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  342 QAQPPALQAPSGAAS--APSTAPPGTPQLPTQGPTPsatAVPPQGSPATSQPPNQTQSTVAP--AAHTHIQQAPTLHPPR 417
Cdd:PHA03378   688 QWAPGTMQPPPRAPTpmRPPAAPPGRAQRPAAATGR---ARPPAAAPGRARPPAAAPGRARPpaAAPGRARPPAAAPGRA 764
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907157157  418 LPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQ 476
Cdd:PHA03378   765 RPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQ 823
PRK10905 PRK10905
cell division protein DamX; Validated
286-502 7.92e-03

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 39.92  E-value: 7.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  286 SPSEGEGESSDSRSVNDEGSSDpkdiDQDNRSTspsiPSP-QDNESDSDSSAQQQMlqAQPPALQAPSGAASAPstAPPG 364
Cdd:PRK10905    24 STSSSDQTASGEKSIDLAGNAT----DQANGVQ----PAPgTTSAEQTAGNTQQDV--SLPPISSTPTQGQTPV--ATDG 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  365 TPQLPTQG------------------------PTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPS 420
Cdd:PRK10905    92 QQRVEVQGdlnnaltqpqnqqqlnnvavnstlPTEPATVAPVRNGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQAT 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  421 PHPPLQPMTAPP--SQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQ 498
Cdd:PRK10905   172 AKTEPKPVAQTPkrTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYTLQ 251

                   ....
gi 1907157157  499 LPAS 502
Cdd:PRK10905   252 LSSS 255
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
351-496 8.53e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 40.23  E-value: 8.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  351 PSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSphpplqpmta 430
Cdd:PRK07994   361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQ---------- 430
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907157157  431 PPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHST 496
Cdd:PRK07994   431 RAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVAT 496
PHA03269 PHA03269
envelope glycoprotein C; Provisional
375-495 8.71e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 40.10  E-value: 8.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  375 PSATAVPPQGSPATSQPPNQtqstvAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPH 454
Cdd:PHA03269    23 NTNIPIPELHTSAATQKPDP-----APAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPD 97
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907157157  455 SLQTGPLLQHPGPPQPFGLPSQPSQGQGPL---------GPSPAAAHPHS 495
Cdd:PHA03269    98 PAVAPQLAAAPKPDAAEAFTSAAQAHEAPAdagtsaaskKPDPAAHTQHS 147
PTZ00395 PTZ00395
Sec24-related protein; Provisional
195-488 9.41e-03

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 40.44  E-value: 9.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  195 RKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPlkstkrqrekvaSDTEDTDRITSK 274
Cdd:PTZ00395   267 RGASSAAESGYAHHRGSNIASHTPNDNIMHAANNPLNNTNDAQRNAIQGDLVRGAP------------NDKNSFDRGNEK 334
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  275 KTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSipSPQDNESDSDSSAQQQmlqAQPPALQAPSGA 354
Cdd:PTZ00395   335 TYQIYGGFHDGSPNAASAGAPFNGLGNQADGGHINQVHPDARGAWAG--GPHSNASYNCAAYSNA---AQSNAAQSNAGF 409
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907157157  355 ASAPSTAPPGTPQlPTQGPTPSATAV--PPQGSPATSQPPN-QTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAP 431
Cdd:PTZ00395   410 SNAGYSNPGNSNP-GYNNAPNSNTPYnnPPNSNTPYSNPPNsNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAY 488
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907157157  432 PSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGlpSQPSQGQGPLGPSP 488
Cdd:PTZ00395   489 QHRAANQPAANLPTANQPAANNFHGAAGNSVGNPFASRPFG--SAPYGGNAATTADP 543
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH