NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1720409699|ref|XP_030109601|]
View 

arginine-glutamic acid dipeptide repeats protein isoform X1 [Mus musculus]

Protein Classification

arginine-glutamic acid dipeptide repeats protein( domain architecture ID 11562211)

arginine-glutamic acid dipeptide repeats protein (RERE) plays a role as a transcriptional repressor during development

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
568-1568 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 1202.28  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  568 GKHSMRTRRSRGSgqMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLK 647
Cdd:pfam03154    1 GKHSMRTRRSRGS--MSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLK 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  648 STKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 727
Cdd:pfam03154   79 SAKRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  728 DSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPT 807
Cdd:pfam03154  159 DSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPT 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  808 LHPPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPA 884
Cdd:pfam03154  237 LHPQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPA 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  885 AAHP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPAL 963
Cdd:pfam03154  317 APGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL 396
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  964 KPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPIT 1042
Cdd:pfam03154  397 KPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPIT 476
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1043 PPSCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHAS 1122
Cdd:pfam03154  477 PPSGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1123 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqK 1202
Cdd:pfam03154  555 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-K 633
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1203 ASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAY 1282
Cdd:pfam03154  634 ASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAY 713
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1283 HMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASF 1362
Cdd:pfam03154  714 HMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASF 793
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1363 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 1442
Cdd:pfam03154  794 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 873
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1443 SAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQA 1522
Cdd:pfam03154  874 SGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQA 945
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....*.
gi 1720409699 1523 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1568
Cdd:pfam03154  946 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
BAH_MTA cd04709
BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The ...
102-307 3.30e-81

BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The Metastasis-associated protein MTA1 is part of the NURD (nucleosome remodeling and deacetylating) complex and plays a role in cellular transformation and metastasis. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.


:

Pssm-ID: 240060  Cd Length: 164  Bit Score: 263.87  E-value: 3.30e-81
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  102 DVVYRPGDCVYIESRrPNTPYFICSIQDFKLvhssqaccrspapafcdppacslpvapqppqhlseagrgpggSKRDHLL 181
Cdd:cd04709      1 ANMYRVGDYVYFESS-PNNPYLIRRIEELNK------------------------------------------TARGHVE 37
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  182 MNVKWYYRQSEVPDSVYQHLVQDRHNEND-SGRELVITDPVIKNRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK 260
Cdd:cd04709     38 AKVVCYYRRRDIPDSLYQLADQHRRELEEkSDDLTPKQRHQLRHRELFLSRQVETLPATHIRGKCSVTLLNDTESARSYL 117
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1720409699  261 ARVDSFFYILGYNPETRRLNSTQGEIRVGPSHQAKLPDLQPFPSPDG 307
Cdd:cd04709    118 AREDTFFYSLVYDPEQKTLLADQGEIRVGPSYQAKLPDLQPFPSPDG 164
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
395-440 1.33e-22

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


:

Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 91.91  E-value: 1.33e-22
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1720409699  395 CWTEDEVKRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 440
Cdd:cd11661      1 EWSESEAKLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
ZnF_GATA smart00401
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
503-552 2.72e-15

zinc finger binding to DNA consensus sequence [AT]GATA[AG];


:

Pssm-ID: 214648 [Multi-domain]  Cd Length: 52  Bit Score: 71.30  E-value: 2.72e-15
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 1720409699   503 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 552
Cdd:smart00401    2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
ELM2 pfam01448
ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown ...
286-336 1.63e-12

ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in Swiss:O82364. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.


:

Pssm-ID: 460214  Cd Length: 53  Bit Score: 63.40  E-value: 1.63e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720409699  286 IRVGPSHQAKLPDLQPFPSPDGDTVTQHEELVWMP--GVSDCDLLMYLRAARS 336
Cdd:pfam01448    1 IRVGPRYQAEIPELLPPSEEEDRYEEEDELLVWDPnhNLPDRKLDEYLVVARS 53
 
Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
568-1568 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 1202.28  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  568 GKHSMRTRRSRGSgqMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLK 647
Cdd:pfam03154    1 GKHSMRTRRSRGS--MSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLK 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  648 STKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 727
Cdd:pfam03154   79 SAKRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  728 DSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPT 807
Cdd:pfam03154  159 DSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPT 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  808 LHPPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPA 884
Cdd:pfam03154  237 LHPQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPA 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  885 AAHP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPAL 963
Cdd:pfam03154  317 APGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL 396
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  964 KPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPIT 1042
Cdd:pfam03154  397 KPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPIT 476
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1043 PPSCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHAS 1122
Cdd:pfam03154  477 PPSGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1123 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqK 1202
Cdd:pfam03154  555 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-K 633
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1203 ASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAY 1282
Cdd:pfam03154  634 ASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAY 713
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1283 HMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASF 1362
Cdd:pfam03154  714 HMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASF 793
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1363 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 1442
Cdd:pfam03154  794 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 873
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1443 SAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQA 1522
Cdd:pfam03154  874 SGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQA 945
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....*.
gi 1720409699 1523 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1568
Cdd:pfam03154  946 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
BAH_MTA cd04709
BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The ...
102-307 3.30e-81

BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The Metastasis-associated protein MTA1 is part of the NURD (nucleosome remodeling and deacetylating) complex and plays a role in cellular transformation and metastasis. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.


Pssm-ID: 240060  Cd Length: 164  Bit Score: 263.87  E-value: 3.30e-81
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  102 DVVYRPGDCVYIESRrPNTPYFICSIQDFKLvhssqaccrspapafcdppacslpvapqppqhlseagrgpggSKRDHLL 181
Cdd:cd04709      1 ANMYRVGDYVYFESS-PNNPYLIRRIEELNK------------------------------------------TARGHVE 37
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  182 MNVKWYYRQSEVPDSVYQHLVQDRHNEND-SGRELVITDPVIKNRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK 260
Cdd:cd04709     38 AKVVCYYRRRDIPDSLYQLADQHRRELEEkSDDLTPKQRHQLRHRELFLSRQVETLPATHIRGKCSVTLLNDTESARSYL 117
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1720409699  261 ARVDSFFYILGYNPETRRLNSTQGEIRVGPSHQAKLPDLQPFPSPDG 307
Cdd:cd04709    118 AREDTFFYSLVYDPEQKTLLADQGEIRVGPSYQAKLPDLQPFPSPDG 164
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
395-440 1.33e-22

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 91.91  E-value: 1.33e-22
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1720409699  395 CWTEDEVKRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 440
Cdd:cd11661      1 EWSESEAKLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
ZnF_GATA smart00401
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
503-552 2.72e-15

zinc finger binding to DNA consensus sequence [AT]GATA[AG];


Pssm-ID: 214648 [Multi-domain]  Cd Length: 52  Bit Score: 71.30  E-value: 2.72e-15
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 1720409699   503 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 552
Cdd:smart00401    2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
ZnF_GATA cd00202
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] ...
506-560 3.29e-15

Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C


Pssm-ID: 238123 [Multi-domain]  Cd Length: 54  Bit Score: 71.25  E-value: 3.29e-15
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720409699  506 ACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGELPPIEKPvDPPPFMFKPVK 560
Cdd:cd00202      1 ACSNCGTTTTPLWRRGPSGGSTLCNACGLYWKKHGVMRPLSKR-KKDQIKRRNRK 54
ELM2 pfam01448
ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown ...
286-336 1.63e-12

ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in Swiss:O82364. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.


Pssm-ID: 460214  Cd Length: 53  Bit Score: 63.40  E-value: 1.63e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720409699  286 IRVGPSHQAKLPDLQPFPSPDGDTVTQHEELVWMP--GVSDCDLLMYLRAARS 336
Cdd:pfam01448    1 IRVGPRYQAEIPELLPPSEEEDRYEEEDELLVWDPnhNLPDRKLDEYLVVARS 53
GATA pfam00320
GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This ...
507-542 1.25e-11

GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.


Pssm-ID: 425605 [Multi-domain]  Cd Length: 36  Bit Score: 60.41  E-value: 1.25e-11
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720409699  507 CRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL 542
Cdd:pfam00320    1 CSNCGTTKTPLWRRGPNGNRTLCNACGLYYKKKGLK 36
BAH pfam01426
BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been ...
103-281 2.71e-10

BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.


Pssm-ID: 460207  Cd Length: 120  Bit Score: 59.24  E-value: 2.71e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  103 VVYRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPPACSLPVapqppqhlseagrgpggskrdhllm 182
Cdd:pfam01426    1 ETYSVGDFVLVEPDDADEPYYVARIEEL----------------FEDTKNGKKMV------------------------- 39
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  183 NVKWYYRQSEVPdsvyqHLVQDRHNEndsgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK-A 261
Cdd:pfam01426   40 RVQWFYRPEETV-----HRAGKAFNK----------------DELFLSDEEDDVPLSAIIGKCSVLHKSDLESLDPYKiK 98
                          170       180
                   ....*....|....*....|
gi 1720409699  262 RVDSFFYILGYNPETRRLNS 281
Cdd:pfam01426   99 EPDDFFCELLYDPKTKSFKK 118
PHA03247 PHA03247
large tegument protein UL36; Provisional
578-1027 4.35e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.96  E-value: 4.35e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  578 RGSGQMSTLRSGRKKQPTSPDGRASPINE--DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREK 655
Cdd:PHA03247  2576 RPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD 2655
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  656 VASDTEDTDRITSKKTKTqeiSRPNSPSEGEGESSDSRSVNDEGSS-----DPKDIDQDNRSTSPSIPSPQDNESDSDSS 730
Cdd:PHA03247  2656 PAPGRVSRPRRARRLGRA---AQASSPPQRPRRRAARPTVGSLTSLadpppPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  731 AQQQMLQAQPPAlqaPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHP 810
Cdd:PHA03247  2733 PALPAAPAPPAV---PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES------LPSPWD 2803
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  811 PRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFG-LPSQPSQGQGPLGPSPAAAHPH 889
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdVRRRPPSRSPAAKPAAPARPPV 2883
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  890 STIQLPA-SQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPALKPLSS 968
Cdd:PHA03247  2884 RRLARPAvSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPW 2963
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720409699  969 LSTHHPPSAHPPPLQLmPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP 1027
Cdd:PHA03247  2964 LGALVPGRVAVPRFRV-PQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPP 3021
BAH smart00439
Bromo adjacent homology domain;
105-281 1.24e-09

Bromo adjacent homology domain;


Pssm-ID: 214664 [Multi-domain]  Cd Length: 121  Bit Score: 57.69  E-value: 1.24e-09
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699   105 YRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPpacslpvapqppqhlseagrgpGGSKRDHLlmNV 184
Cdd:smart00439    2 ISVGDFVLVEPDDADEPYYIGRIEEI----------------FETK----------------------KNSESKMV--RV 41
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699   185 KWYYRQSEVPdsvyqHLVQDRHNENdsgrelvitdpviknrELFISDYVDTYHAAALRGKCNISHFSDIF--AAREFKAR 262
Cdd:smart00439   42 RWFYRPEETV-----LEKAALFDKN----------------EVFLSDEYDTVPLSDIIGKCNVLYKSDYPglRPEGSIGE 100
                           170
                    ....*....|....*....
gi 1720409699   263 VDSFFYILGYNPETRRLNS 281
Cdd:smart00439  101 PDVFFCESAYDPEKGSFKK 119
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
545-731 3.31e-09

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 61.85  E-value: 3.31e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  545 IEKPVDPP----PFMFKPVKEEDDGL----SGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSP 616
Cdd:NF033609   539 IDKPVVPEqpdePGEIEPIPEDSDSDpgsdSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDS 618
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  617 SAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRS 694
Cdd:NF033609   619 ASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDS 698
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1720409699  695 VNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609   699 DSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 734
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
602-762 1.50e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.91  E-value: 1.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  602 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPN 680
Cdd:NF033609   716 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 795
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  681 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPS-IPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPP 758
Cdd:NF033609   796 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP 875

                   ....
gi 1720409699  759 GTPQ 762
Cdd:NF033609   876 NSPK 879
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
396-441 6.70e-05

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 41.83  E-value: 6.70e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1720409699   396 WTEDEVKRFVKGLRQYG-KNFFRIRKElLPSKETGELITFYYYWKKT 441
Cdd:smart00717    4 WTEEEDELLIELVKKYGkNNWEKIAKE-LPGRTAEQCRERWRNLLKP 49
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
396-439 1.92e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 40.56  E-value: 1.92e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1720409699  396 WTEDEVKRFVKGLRQYGKNFFRIrKELLPSKETGELITFYYYWK 439
Cdd:pfam00249    4 WTPEEDELLLEAVEKLGNRWKKI-AKLLPGRTDNQCKNRWQNYL 46
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
580-731 2.45e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.06  E-value: 2.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  580 SGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASD 659
Cdd:NF033609   628 SDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 707
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720409699  660 TE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609   708 SDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
602-731 3.47e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.29  E-value: 3.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  602 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTeDTDRITSKKTKTQEISRPNS 681
Cdd:NF033609   674 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDS 752
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720409699  682 PSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609   753 DSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
652-784 6.53e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 40.65  E-value: 6.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  652 QREKVASDTEDTDRITSKKTKTQ-EISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDN---------RSTSPSIPSPQ 721
Cdd:TIGR00601    9 QQQKFKIDMEPDETVKELKEKIEaEQGKDAYPVAQQKLIYSGKILSDDKTVKEYKIKEKDfvvvmvskpKTGTGKVAPPA 88
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720409699  722 DNESDSDSSAqqqmlqAQPPALQAPSGAASAPSTAPPGTP---QLPTQGPTPSATAVPPQGSPATS 784
Cdd:TIGR00601   89 ATPTSAPTPT------PSPPASPASGMSAAPASAVEEKSPseeSATATAPESPSTSVPSSGSDAAS 148
 
Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
568-1568 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 1202.28  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  568 GKHSMRTRRSRGSgqMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLK 647
Cdd:pfam03154    1 GKHSMRTRRSRGS--MSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLK 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  648 STKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 727
Cdd:pfam03154   79 SAKRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  728 DSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPT 807
Cdd:pfam03154  159 DSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPT 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  808 LHPPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPA 884
Cdd:pfam03154  237 LHPQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPA 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  885 AAHP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPAL 963
Cdd:pfam03154  317 APGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL 396
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  964 KPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPIT 1042
Cdd:pfam03154  397 KPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPIT 476
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1043 PPSCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHAS 1122
Cdd:pfam03154  477 PPSGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1123 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqK 1202
Cdd:pfam03154  555 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-K 633
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1203 ASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAY 1282
Cdd:pfam03154  634 ASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAY 713
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1283 HMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASF 1362
Cdd:pfam03154  714 HMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASF 793
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1363 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 1442
Cdd:pfam03154  794 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 873
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1443 SAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQA 1522
Cdd:pfam03154  874 SGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQA 945
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....*.
gi 1720409699 1523 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1568
Cdd:pfam03154  946 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
BAH_MTA cd04709
BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The ...
102-307 3.30e-81

BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The Metastasis-associated protein MTA1 is part of the NURD (nucleosome remodeling and deacetylating) complex and plays a role in cellular transformation and metastasis. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.


Pssm-ID: 240060  Cd Length: 164  Bit Score: 263.87  E-value: 3.30e-81
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  102 DVVYRPGDCVYIESRrPNTPYFICSIQDFKLvhssqaccrspapafcdppacslpvapqppqhlseagrgpggSKRDHLL 181
Cdd:cd04709      1 ANMYRVGDYVYFESS-PNNPYLIRRIEELNK------------------------------------------TARGHVE 37
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  182 MNVKWYYRQSEVPDSVYQHLVQDRHNEND-SGRELVITDPVIKNRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK 260
Cdd:cd04709     38 AKVVCYYRRRDIPDSLYQLADQHRRELEEkSDDLTPKQRHQLRHRELFLSRQVETLPATHIRGKCSVTLLNDTESARSYL 117
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1720409699  261 ARVDSFFYILGYNPETRRLNSTQGEIRVGPSHQAKLPDLQPFPSPDG 307
Cdd:cd04709    118 AREDTFFYSLVYDPEQKTLLADQGEIRVGPSYQAKLPDLQPFPSPDG 164
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
395-440 1.33e-22

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 91.91  E-value: 1.33e-22
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1720409699  395 CWTEDEVKRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 440
Cdd:cd11661      1 EWSESEAKLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
ZnF_GATA smart00401
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
503-552 2.72e-15

zinc finger binding to DNA consensus sequence [AT]GATA[AG];


Pssm-ID: 214648 [Multi-domain]  Cd Length: 52  Bit Score: 71.30  E-value: 2.72e-15
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 1720409699   503 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 552
Cdd:smart00401    2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
ZnF_GATA cd00202
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] ...
506-560 3.29e-15

Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C


Pssm-ID: 238123 [Multi-domain]  Cd Length: 54  Bit Score: 71.25  E-value: 3.29e-15
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720409699  506 ACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGELPPIEKPvDPPPFMFKPVK 560
Cdd:cd00202      1 ACSNCGTTTTPLWRRGPSGGSTLCNACGLYWKKHGVMRPLSKR-KKDQIKRRNRK 54
ELM2 pfam01448
ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown ...
286-336 1.63e-12

ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in Swiss:O82364. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.


Pssm-ID: 460214  Cd Length: 53  Bit Score: 63.40  E-value: 1.63e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720409699  286 IRVGPSHQAKLPDLQPFPSPDGDTVTQHEELVWMP--GVSDCDLLMYLRAARS 336
Cdd:pfam01448    1 IRVGPRYQAEIPELLPPSEEEDRYEEEDELLVWDPnhNLPDRKLDEYLVVARS 53
GATA pfam00320
GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This ...
507-542 1.25e-11

GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.


Pssm-ID: 425605 [Multi-domain]  Cd Length: 36  Bit Score: 60.41  E-value: 1.25e-11
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720409699  507 CRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL 542
Cdd:pfam00320    1 CSNCGTTKTPLWRRGPNGNRTLCNACGLYYKKKGLK 36
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
734-900 1.41e-10

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 66.21  E-value: 1.41e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  734 QM-LQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQgspatsQPPNQTQSTVAPAAHTHIQQAPtlhppr 812
Cdd:pfam09770  202 AMrAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ------QQPQQPQQHPGQGHPVTILQRP------ 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  813 lpsphpplQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQtgpLLQHP-------------GPPQPFGLPSQPSQGQGPL 879
Cdd:pfam09770  270 --------QSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQ---ILQNPnrlsaarvgypqnPQPGVQPAPAHQAHRQQGS 338
                          170       180
                   ....*....|....*....|...
gi 1720409699  880 --GPSPAAAHPHSTIQLPASQSA 900
Cdd:pfam09770  339 fgRQAPIITHPQQLAQLSEEEKA 361
BAH pfam01426
BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been ...
103-281 2.71e-10

BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.


Pssm-ID: 460207  Cd Length: 120  Bit Score: 59.24  E-value: 2.71e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  103 VVYRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPPACSLPVapqppqhlseagrgpggskrdhllm 182
Cdd:pfam01426    1 ETYSVGDFVLVEPDDADEPYYVARIEEL----------------FEDTKNGKKMV------------------------- 39
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  183 NVKWYYRQSEVPdsvyqHLVQDRHNEndsgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK-A 261
Cdd:pfam01426   40 RVQWFYRPEETV-----HRAGKAFNK----------------DELFLSDEEDDVPLSAIIGKCSVLHKSDLESLDPYKiK 98
                          170       180
                   ....*....|....*....|
gi 1720409699  262 RVDSFFYILGYNPETRRLNS 281
Cdd:pfam01426   99 EPDDFFCELLYDPKTKSFKK 118
PHA03247 PHA03247
large tegument protein UL36; Provisional
578-1027 4.35e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.96  E-value: 4.35e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  578 RGSGQMSTLRSGRKKQPTSPDGRASPINE--DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREK 655
Cdd:PHA03247  2576 RPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD 2655
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  656 VASDTEDTDRITSKKTKTqeiSRPNSPSEGEGESSDSRSVNDEGSS-----DPKDIDQDNRSTSPSIPSPQDNESDSDSS 730
Cdd:PHA03247  2656 PAPGRVSRPRRARRLGRA---AQASSPPQRPRRRAARPTVGSLTSLadpppPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  731 AQQQMLQAQPPAlqaPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHP 810
Cdd:PHA03247  2733 PALPAAPAPPAV---PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES------LPSPWD 2803
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  811 PRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFG-LPSQPSQGQGPLGPSPAAAHPH 889
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdVRRRPPSRSPAAKPAAPARPPV 2883
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  890 STIQLPA-SQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPALKPLSS 968
Cdd:PHA03247  2884 RRLARPAvSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPW 2963
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720409699  969 LSTHHPPSAHPPPLQLmPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP 1027
Cdd:PHA03247  2964 LGALVPGRVAVPRFRV-PQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPP 3021
BAH smart00439
Bromo adjacent homology domain;
105-281 1.24e-09

Bromo adjacent homology domain;


Pssm-ID: 214664 [Multi-domain]  Cd Length: 121  Bit Score: 57.69  E-value: 1.24e-09
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699   105 YRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPpacslpvapqppqhlseagrgpGGSKRDHLlmNV 184
Cdd:smart00439    2 ISVGDFVLVEPDDADEPYYIGRIEEI----------------FETK----------------------KNSESKMV--RV 41
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699   185 KWYYRQSEVPdsvyqHLVQDRHNENdsgrelvitdpviknrELFISDYVDTYHAAALRGKCNISHFSDIF--AAREFKAR 262
Cdd:smart00439   42 RWFYRPEETV-----LEKAALFDKN----------------EVFLSDEYDTVPLSDIIGKCNVLYKSDYPglRPEGSIGE 100
                           170
                    ....*....|....*....
gi 1720409699   263 VDSFFYILGYNPETRRLNS 281
Cdd:smart00439  101 PDVFFCESAYDPEKGSFKK 119
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
545-731 3.31e-09

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 61.85  E-value: 3.31e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  545 IEKPVDPP----PFMFKPVKEEDDGL----SGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSP 616
Cdd:NF033609   539 IDKPVVPEqpdePGEIEPIPEDSDSDpgsdSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDS 618
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  617 SAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRS 694
Cdd:NF033609   619 ASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDS 698
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1720409699  695 VNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609   699 DSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 734
BAH_fungalPHD cd04710
BAH, or Bromo Adjacent Homology domain, as present in fungal proteins containing PHD domains. ...
101-278 5.13e-08

BAH, or Bromo Adjacent Homology domain, as present in fungal proteins containing PHD domains. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.


Pssm-ID: 240061  Cd Length: 135  Bit Score: 53.14  E-value: 5.13e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  101 DDVVYRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapafcdppacsLPVAPQPPQHLSEAGRGPGGSKRdhl 180
Cdd:cd04710      8 NGELLKVNDHIYMSSEPPGEPYYIGRIMEF------------------------VPKHEFPSGIHARVFPASYFQVR--- 60
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  181 lMNvkWYYRQSEVpdsvyqhlvqDRHNENDSgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK 260
Cdd:cd04710     61 -LN--WYYRPRDI----------SRRVVADS-------------RLLYASMHSDICPIGSVRGKCTVRHRDQIPDLEEYK 114
                          170
                   ....*....|....*...
gi 1720409699  261 ARVDSFFYILGYNPETRR 278
Cdd:cd04710    115 KRPNHFYFDQLFDRYILR 132
BAH cd04370
BAH, or Bromo Adjacent Homology domain (also called ELM1 and BAM for Bromo Adjacent Motif). ...
105-277 6.39e-08

BAH, or Bromo Adjacent Homology domain (also called ELM1 and BAM for Bromo Adjacent Motif). BAH domains have first been described as domains found in the polybromo protein and Yeast Rsc1/Rsc2 (Remodeling of the Structure of Chromatin). They also occur in mammalian DNA methyltransferases and the MTA1 subunits of histone deacetylase complexes. A BAH domain is also found in Yeast Sir3p and in the origin receptor complex protein 1 (Orc1p), where it was found to interact with the N-terminal lobe of the silence information regulator 1 protein (Sir1p), confirming the initial hypothesis that BAH plays a role in protein-protein interactions.


Pssm-ID: 239835 [Multi-domain]  Cd Length: 123  Bit Score: 52.78  E-value: 6.39e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  105 YRPGDCVYIE--SRRPNTPYFICSIQDFklvhssqaccrspapaFCDPpacslpvapqppqhlseagrgpggskRDHLLM 182
Cdd:cd04370      4 YEVGDSVYVEpdDSIKSDPPYIARIEEL----------------WEDT--------------------------NGSKQV 41
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  183 NVKWYYRQSEVPDSVYQHlvqdrHNEndsgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIF--AAREFK 260
Cdd:cd04370     42 KVRWFYRPEETPKGLSPF-----ALR----------------RELFLSDHLDEIPVESIIGKCKVLFVSEFEglKQRPNK 100
                          170
                   ....*....|....*..
gi 1720409699  261 ARVDSFFYILGYNPETR 277
Cdd:cd04370    101 IDTDDFFCRLAYDPTTK 117
PHA03247 PHA03247
large tegument protein UL36; Provisional
666-1032 2.25e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 2.25e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  666 ITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSAQQQMLQAQPPALQA 745
Cdd:PHA03247  2582 VTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPS-PAANEPDPHPPPTVPPPERPRDDPAPGR 2660
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  746 PSGAASAPSTAPPGTPQLPTQGPTPSAtAVPPQGSPATSQPPNQTQSTVAPAahthiqqaPTLHPPRLPsphpplqpmTA 825
Cdd:PHA03247  2661 VSRPRRARRLGRAAQASSPPQRPRRRA-ARPTVGSLTSLADPPPPPPTPEPA--------PHALVSATP---------LP 2722
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  826 PPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQSALQPQQ 905
Cdd:PHA03247  2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPW 2802
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  906 PPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNA------------NLPPPPALKPLSSLSTHH 973
Cdd:PHA03247  2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggsvapggdvrrRPPSRSPAAKPAAPARPP 2882
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  974 PPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP-FPQHP 1032
Cdd:PHA03247  2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPpRPQPP 2942
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
736-888 3.43e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.99  E-value: 3.43e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  736 LQAQPPALQAPSGAASAPSTAPPGTPQL-------PTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTL 808
Cdd:PRK07764   580 GDWQVEAVVGPAPGAAGGEGPPAPASSGppeeaarPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAV 659
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  809 HPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGP-PQPFGLPSQPSQGQGPLGPSPAAAH 887
Cdd:PRK07764   660 PDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQaDDPAAQPPQAAQGASAPSPAADDPV 739

                   .
gi 1720409699  888 P 888
Cdd:PRK07764   740 P 740
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
718-888 3.73e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.99  E-value: 3.73e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  718 PSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPA 797
Cdd:PRK07764   597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGA 676
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  798 AHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLH------SQGPPGPHSLQTGPLLQHPG-PPQPFGLPS 870
Cdd:PRK07764   677 APAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQppqaaqGASAPSPAADDPVPLPPEPDdPPDPAGAPA 756
                          170
                   ....*....|....*...
gi 1720409699  871 QPSQGQGPLGPSPAAAHP 888
Cdd:PRK07764   757 QPPPPPAPAPAAAPAAAP 774
PHA03247 PHA03247
large tegument protein UL36; Provisional
710-1139 6.28e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 6.28e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  710 NRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSG------AASAPSTAPPGTPQLPTQGPTPSATAV-PPQGSPA 782
Cdd:PHA03247  2565 DRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDdrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANePDPHPPP 2644
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  783 TSQPPNQTQSTVAPAA-----HTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHS---LQTG 854
Cdd:PHA03247  2645 TVPPPERPRDDPAPGRvsrprRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSatpLPPG 2724
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  855 PLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHStiqlPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLP 934
Cdd:PHA03247  2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS 2800
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  935 APQAHKHPPHLSGPSPFSLNANLPPPPALKPLSSLSTHHPPSAHPPPLQL--------------MPQSQPLPSSPAQPPG 1000
Cdd:PHA03247  2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLplggsvapggdvrrRPPSRSPAAKPAAPAR 2880
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1001 lTQSQSLPPPAASHPTTGLHQVPSQSPFPQHPfvPGGPPPITPPSCPPTSTPPAGPSSSSQPPCSAAVSSGGSVPGAPSC 1080
Cdd:PHA03247  2881 -PPVRRLARPAVSRSTESFALPPDQPERPPQP--QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720409699 1081 PLPAVQikeeaLDEAEEPESPPPPPRSPSPEPTVvdtPSHASQSARFYKHLDRGYNSCA 1139
Cdd:PHA03247  2958 AVPQPW-----LGALVPGRVAVPRFRVPQPAPSR---EAPASSTPPLTGHSLSRVSSWA 3008
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
725-880 8.56e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.84  E-value: 8.56e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  725 SDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPAtsqPPNQTQSTVAPAAhthiQQ 804
Cdd:PRK07764   367 ASDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPA---PAAAPQPAPAPAP----AP 439
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720409699  805 APTlhpPRLPSPHPPLQPMTAPPSQSSAQPHPQPslhsQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLG 880
Cdd:PRK07764   440 APP---SPAGNAPAGGAPSPPPAAAPSAQPAPAP----AAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508
PHA03247 PHA03247
large tegument protein UL36; Provisional
542-955 1.78e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.02  E-value: 1.78e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  542 LPPIEKPVDPPPFMFKPVKEEDDGLSGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAAST 621
Cdd:PHA03247  2617 LPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLT 2696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  622 SSNDSKAETVKKSAKKVKEEAASPL---KSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDE 698
Cdd:PHA03247  2697 SLADPPPPPPTPEPAPHALVSATPLppgPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPA 2776
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  699 GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTA---PPGTPQLPTQGPTPSATAV 775
Cdd:PHA03247  2777 AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAqptAPPPPPGPPPPSLPLGGSV 2856
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  776 PPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPH-SLQTG 854
Cdd:PHA03247  2857 APGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQpPPPPP 2936
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  855 PLLQHPGPPQPFGLPSQPSQGQGP-----------------LGPSPAAA-------------HPHSTIQLPASQSALQPQ 904
Cdd:PHA03247  2937 PRPQPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrvavprfRVPQPAPSreapasstppltgHSLSRVSSWASSLALHEE 3016
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  905 QPP-----------------REQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLS------------GPSPFSLNA 955
Cdd:PHA03247  3017 TDPppvslkqtlwppddtedSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPeagarespssqfGPPPLSANA 3096
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
738-856 9.98e-06

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 50.10  E-value: 9.98e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  738 AQPPAlqAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHPPRLPSPH 817
Cdd:PRK14951   382 ARPEA--AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAA------APAAVALAPAPPA 453
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1720409699  818 PPLQPMTAPPSQSSAQPH-PQPSLHSQGPPGPHSLQTGPL 856
Cdd:PRK14951   454 QAAPETVAIPVRVAPEPAvASAAPAPAAAPAAARLTPTEE 493
PRK10263 PRK10263
DNA translocase FtsK; Provisional
594-894 1.26e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 50.08  E-value: 1.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  594 PTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTED------TDRIT 667
Cdd:PRK10263   554 PVEAAAAVSPLASGVKKATLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASygiklpSQRAA 633
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  668 SKKTKTQEISRPNSPSEGEGESSDSRSvNDE-----GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPA 742
Cdd:PRK10263   634 EEKAREAQRNQYDSGDQYNDDEIDAMQ-QDElarqfAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQTQQQRYS 712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  743 LQAPSGA--------------------ASAPSTAPPGTP-QLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTH 801
Cdd:PRK10263   713 GEQPAGAnpfslddfefspmkallddgPHEPLFTPIVEPvQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQ 792
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  802 IQQAPTLHPPRLPSPHPPLQPMTAPPS-QSSAQPHPQPSLHSQGP-PGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPL 879
Cdd:PRK10263   793 QPQQPVAPQPQYQQPQQPVAPQPQYQQpQQPVAPQPQYQQPQQPVaPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLL 872
                          330
                   ....*....|....*
gi 1720409699  880 GPSPAAAHPHSTIQL 894
Cdd:PRK10263   873 TPPPSEVEPVDTFAL 887
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
602-762 1.50e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.91  E-value: 1.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  602 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPN 680
Cdd:NF033609   716 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 795
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  681 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPS-IPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPP 758
Cdd:NF033609   796 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP 875

                   ....
gi 1720409699  759 GTPQ 762
Cdd:NF033609   876 NSPK 879
PRK10856 PRK10856
cytoskeleton protein RodZ;
719-800 1.52e-05

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 48.87  E-value: 1.52e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  719 SPQDNES---DSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVA 795
Cdd:PRK10856   155 SQNSGQSvplDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGA 234

                   ....*
gi 1720409699  796 PAAHT 800
Cdd:PRK10856   235 APLPT 239
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
711-852 1.72e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 49.65  E-value: 1.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  711 RSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTP-------SATAVPPQGSPAT 783
Cdd:pfam09770  204 RAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVtilqrpqSPQPDPAQPSIQP 283
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720409699  784 SQPPNQTQSTVAPAAHTHIQQAPTLhpPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPP---GPHSLQ 852
Cdd:pfam09770  284 QAQQFHQQPPPVPVQPTQILQNPNR--LSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPiitHPQQLA 353
PLN02967 PLN02967
kinase
558-689 2.42e-05

kinase


Pssm-ID: 215521 [Multi-domain]  Cd Length: 581  Bit Score: 48.89  E-value: 2.42e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  558 PVKEEDDGLSGKHSMRTRRSRgsgqmstlRSGRKKQPTSPDGRASPINEDIRssgrNSPSAASTSSNDSKAETVKKSA-- 635
Cdd:PLN02967    57 AVDEEPDENGAVSKKKPTRSV--------KRATKKTVVEISEPLEEGSELVV----NEDAALDKESKKTPRRTRRKAAaa 124
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720409699  636 -KKVKEEAASPLKSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGES 689
Cdd:PLN02967   125 sSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEEL 179
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
667-848 2.44e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.21  E-value: 2.44e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  667 TSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSA--QQQMLQAQPPALQ 744
Cdd:PRK07764   600 PPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDggDGWPAKAGGAAPA 679
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  745 APSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMT 824
Cdd:PRK07764   680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPP 759
                          170       180
                   ....*....|....*....|....
gi 1720409699  825 APPSQSSAQPHPQPSLHSQGPPGP 848
Cdd:PRK07764   760 PPPAPAPAAAPAAAPPPSPPSEEE 783
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
396-441 6.70e-05

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 41.83  E-value: 6.70e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1720409699   396 WTEDEVKRFVKGLRQYG-KNFFRIRKElLPSKETGELITFYYYWKKT 441
Cdd:smart00717    4 WTEEEDELLIELVKKYGkNNWEKIAKE-LPGRTAEQCRERWRNLLKP 49
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
396-439 6.79e-05

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 41.79  E-value: 6.79e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1720409699  396 WTEDEVKRFVKGLRQYG-KNFFRIRKElLPSKETGELITFYYYWK 439
Cdd:cd00167      2 WTEEEDELLLEAVKKYGkNNWEKIAKE-LPGRTPKQCRERWRNLL 45
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
738-949 7.63e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.56  E-value: 7.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  738 AQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPnqtqstvAPAAHTHIQQAPTLHPPrlpsph 817
Cdd:PRK12323   379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP-------APEALAAARQASARGPG------ 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  818 pplqPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPllqhPGPPQPFGLPSQPSQGQGP---LGPSPAAAHPHSTIQL 894
Cdd:PRK12323   446 ----GAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA----PARAAPAAAPAPADDDPPPweeLPPEFASPAPAQPDAA 517
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720409699  895 PASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPS 949
Cdd:PRK12323   518 PAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
PRK10927 PRK10927
cell division protein FtsN;
639-865 1.08e-04

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 46.21  E-value: 1.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  639 KEEAASPLKSTKRQREKVASDTEDTDR-ITSKKTKTQEISRPNSPSEGeGESSDSRSVNDEGSSDPKDIDQDNRSTSPSI 717
Cdd:PRK10927    58 KKEESETLQSQKVTGNGLPPKPEERWRyIKELESRQPGVRAPTEPSAG-GEVKTPEQLTPEQRQLLEQMQADMRQQPTQL 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  718 PSPQDNESDSDSsaQQQMLQAQPpalQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPatsQPPNQTQStvapa 797
Cdd:PRK10927   137 VEVPWNEQTPEQ--RQQTLQRQR---QAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQP---RQSKPAST----- 203
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720409699  798 ahthiqQAPtlhpprlpsphppLQPMTAPPSQSSAQPHPQpslhsqgppgphslQTGPLLQHPGPPQP 865
Cdd:PRK10927   204 ------QQP-------------YQDLLQTPAHTTAQSKPQ--------------QAAPVTRAADAPKP 238
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
693-798 1.10e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 47.19  E-value: 1.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  693 RSVNDEGSSDPK--DIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPgTPQLPTQGPTP 770
Cdd:PRK12270    17 QYLADPNSVDPSwrEFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP-KPAAAAAAAAA 95
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1720409699  771 SATAVPPQGSPATSQPPNQTQSTV---APAA 798
Cdd:PRK12270    96 PAAPPAAAAAAAPAAAAVEDEVTPlrgAAAA 126
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
537-849 1.27e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 46.99  E-value: 1.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  537 KKYGELPPIEK---------PVDPPPFMFKPVKEEDDGLSGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASpined 607
Cdd:PTZ00449   491 KSKKKLAPIEEedsdkhdepPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAK----- 565
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  608 irssgRNSPSAASTSSNDSKAETVKKSAKKVKEEaasplKSTKRQRekvaSDTEDTDRITSKKTKTQEI----SRPNSPS 683
Cdd:PTZ00449   566 -----EHKPSKIPTLSKKPEFPKDPKHPKDPEEP-----KKPKRPR----SAQRPTRPKSPKLPELLDIpkspKRPESPK 631
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  684 EGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIP-SPQDNES--DSDSSAQQQMLQAQPPALQAPSGAASAPSTAP--P 758
Cdd:PTZ00449   632 SPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPfDPKFKEKfyDDYLDAAAKSKETKTTVVLDESFESILKETLPetP 711
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  759 GTP-----QLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQP----MTAPPSQ 829
Cdd:PTZ00449   712 GTPfttprPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEdihaETGEPDE 791
                          330       340
                   ....*....|....*....|
gi 1720409699  830 SSAQPHpQPSLHSQGPPGPH 849
Cdd:PTZ00449   792 AMKRPD-SPSEHEDKPPGDH 810
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
561-795 1.44e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 46.64  E-value: 1.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  561 EEDDGLSGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRA--SPINEDIRS---SGRNS----PSAASTSSNDSKaetv 631
Cdd:PRK14949   562 ESYNALSDDEQHSANVQSAQSAAEAQPSSQSLSPISAVTTAaaSLADDDILDavlAARDSllsdLDALSPKEGDGK---- 637
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  632 kKSAKKVKEEAAsPLKSTKRQREKVASDTEdtdriTSKKTKTQEISRPNSPSEGEGESSDSRSVNdEGSSDPKDIDQDNR 711
Cdd:PRK14949   638 -KSSADRKPKTP-PSRAPPASLSKPASSPD-----ASQTSASFDLDPDFELATHQSVPEAALASG-SAPAPPPVPDPYDR 709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  712 stsPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQ 791
Cdd:PRK14949   710 ---PPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQDTELNLV 786

                   ....
gi 1720409699  792 STVA 795
Cdd:PRK14949   787 LLSS 790
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
396-439 1.92e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 40.56  E-value: 1.92e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1720409699  396 WTEDEVKRFVKGLRQYGKNFFRIrKELLPSKETGELITFYYYWK 439
Cdd:pfam00249    4 WTPEEDELLLEAVEKLGNRWKKI-AKLLPGRTDNQCKNRWQNYL 46
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
580-731 2.45e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.06  E-value: 2.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  580 SGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASD 659
Cdd:NF033609   628 SDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 707
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720409699  660 TE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609   708 SDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
737-857 2.66e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.48  E-value: 2.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  737 QAQPPALQAPSGAASAPSTAPPGTPQLPTQGPT---PSATAVPPQGSPATSQPPNqtQSTVAPAAHTHIQQAPTLHPPRL 813
Cdd:PRK14951   387 AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAaapPAPVAAPAAAAPAAAPAAA--PAAVALAPAPPAQAAPETVAIPV 464
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1720409699  814 PSPHPPLQPMTAPPSqssaQPHPQPSLHSQGPPGPHSLQTGPLL 857
Cdd:PRK14951   465 RVAPEPAVASAAPAP----AAAPAAARLTPTEEGDVWHATVQQL 504
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
602-731 3.47e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.29  E-value: 3.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  602 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTeDTDRITSKKTKTQEISRPNS 681
Cdd:NF033609   674 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDS 752
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720409699  682 PSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609   753 DSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
dnaA PRK14086
chromosomal replication initiator protein DnaA;
711-882 4.82e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 44.82  E-value: 4.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  711 RSTSPSIPSPQDNESDSDSSAQQQMLQAQP--PALQAPSGAASAPSTAPPGTPQLPTQGPTPSAtavpPQGSPATSQPPn 788
Cdd:PRK14086   115 RRPYEGYGGPRADDRPPGLPRQDQLPTARPayPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRA----PYASPASYAPE- 189
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  789 qtQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQpmtaPPSQSSAQPHPQPS---LHSQGPPGPHSLQTGPLLQHPGPPQP 865
Cdd:PRK14086   190 --QERDREPYDAGRPEYDQRRRDYDHPRPDWDR----PRRDRTDRPEPPPGaghVHRGGPGPPERDDAPVVPIRPSAPGP 263
                          170
                   ....*....|....*..
gi 1720409699  866 FglPSQPSQGQGPLGPS 882
Cdd:PRK14086   264 L--AAQPAPAPGPGEPT 278
PRK10905 PRK10905
cell division protein DamX; Validated
681-897 5.06e-04

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 44.16  E-value: 5.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  681 SPSEGEGESSDSRSVNDEGSSDpkdiDQDNRSTspsiPSP-QDNESDSDSSAQQQMlqAQPPALQAPSGAASAPstAPPG 759
Cdd:PRK10905    24 STSSSDQTASGEKSIDLAGNAT----DQANGVQ----PAPgTTSAEQTAGNTQQDV--SLPPISSTPTQGQTPV--ATDG 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  760 TPQLPTQG------------------------PTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPS 815
Cdd:PRK10905    92 QQRVEVQGdlnnaltqpqnqqqlnnvavnstlPTEPATVAPVRNGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQAT 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  816 PHPPLQPMTAPP--SQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQ 893
Cdd:PRK10905   172 AKTEPKPVAQTPkrTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYTLQ 251

                   ....
gi 1720409699  894 LPAS 897
Cdd:PRK10905   252 LSSS 255
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
727-1015 5.75e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 44.64  E-value: 5.75e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  727 SDSSAQQQML-QAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATA------------------------VPPQGSP 781
Cdd:pfam09770   92 SDAIEEEQVRfNRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVrtgyekykepepipdlqvdaslwgVAPKKAA 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  782 ATSQPPnqtqsTVAPAAHTHIQQAPTLHPPRLPSPHPPLQpmTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPG 861
Cdd:pfam09770  172 APAPAP-----QPAAQPASLPAPSRKMMSLEEVEAAMRAQ--AKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  862 PPQPFGLPSQPSQGQGplgpspaaaHPHSTIQLPASQSAlqpqqppreqplppaplamphikPPPTTPIPQLPAPQAHKH 941
Cdd:pfam09770  245 QPQQQPQQPQQHPGQG---------HPVTILQRPQSPQP-----------------------DPAQPSIQPQAQQFHQQP 292
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720409699  942 PPHLSGPSPFSLNANLPPPPALKPLSslsthhppsahppplQLMPQSQPLPSSPAQPPGltQSQSLPPPAASHP 1015
Cdd:pfam09770  293 PPVPVQPTQILQNPNRLSAARVGYPQ---------------NPQPGVQPAPAHQAHRQQ--GSFGRQAPIITHP 349
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
715-878 6.33e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 44.47  E-value: 6.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  715 PSIPSPQDNESD----SDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQgspATSQPPNQT 790
Cdd:PRK07994   361 PAAPLPEPEVPPqsaaPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQ---LQRAQGATK 437
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  791 QSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPS 870
Cdd:PRK07994   438 AKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPELAAKLA 517

                   ....*...
gi 1720409699  871 QPSQGQGP 878
Cdd:PRK07994   518 AEAIERDP 525
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
693-805 9.27e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 44.00  E-value: 9.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  693 RSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAlqAPSGAASAPSTAPPGTPQLPTQGPTPSA 772
Cdd:PRK14971   363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPS--APQSATQPAGTPPTVSVDPPAAVPVNPP 440
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1720409699  773 TAVPPQGSPATSQPPNQ----TQSTVAPAAHTHIQQA 805
Cdd:PRK14971   441 STAPQAVRPAQFKEEKKipvsKVSSLGPSTLRPIQEK 477
PHA03378 PHA03378
EBNA-3B; Provisional
703-871 9.82e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.90  E-value: 9.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  703 PKDIDQDNRSTSPSIPSPQDNESDSDSSAQQqMLQAQPPALQAPSGAAS--APSTAPPGTPQLPTQGPTPsatAVPPQGS 780
Cdd:PHA03378   655 PQVEITPYKPTWTQIGHIPYQPSPTGANTML-PIQWAPGTMQPPPRAPTpmRPPAAPPGRAQRPAAATGR---ARPPAAA 730
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  781 PATSQPPNQTQSTVAP--AAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQ 858
Cdd:PHA03378   731 PGRARPPAAAPGRARPpaAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLM 810
                          170
                   ....*....|...
gi 1720409699  859 HPGPPQPFGLPSQ 871
Cdd:PHA03378   811 PRAAPGQQGPTKQ 823
PHA03247 PHA03247
large tegument protein UL36; Provisional
741-1086 1.01e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  741 PALQAPSGAASaPStAPPGTPQlPTQGPTPSATAVPPQGSPATSQPPNQTQ-STVAPAAHTHIQQAPTLHPPRLPSPHPP 819
Cdd:PHA03247  2478 PVYRRPAEARF-PF-AAGAAPD-PGGGGPPDPDAPPAPSRLAPAILPDEPVgEPVHPRMLTWIRGLEELASDDAGDPPPP 2554
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  820 LQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAH--------PHST 891
Cdd:PHA03247  2555 LPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHapdppppsPSPA 2634
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  892 IQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGP-SPFSLNANLPPPPALKPLSSLS 970
Cdd:PHA03247  2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTvGSLTSLADPPPPPPTPEPAPHA 2714
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  971 THHPPSAHPPPLQLMPQSQPLPSSPAQP--------PGLTQSQSLPPPAASHPTTGLHQVPSQSPFPQHPfvpGGPPPIT 1042
Cdd:PHA03247  2715 LVSATPLPPGPAAARQASPALPAAPAPPavpagpatPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT---RPAVASL 2791
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1720409699 1043 PPSCPPTSTPPAGPSSSSQPPCSAAVSSGGSVPGAPSCPLPAVQ 1086
Cdd:PHA03247  2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ 2835
PRK11901 PRK11901
hypothetical protein; Reviewed
700-899 1.20e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 42.75  E-value: 1.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  700 SSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQ---LPTQGPTPSATAVP 776
Cdd:PRK11901    55 GSALKSPTEHESQQSSNNAGAEKNIDLSGSSSLSSGNQSSPSAANNTSDGHDASGVKNTAPPQdisAPPISPTPTQAAPP 134
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  777 PQgsPATSQ----PPN------QTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSlhSQGPP 846
Cdd:PRK11901   135 QT--PNGQQrielPGNisdalsQQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPATAETHPTPPQKPAT--KKPAV 210
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1720409699  847 GPHSlqTGPLLQHPGPpqpfglPSQPSQGQGPLGPSPAAAHPHSTIQLP-ASQS 899
Cdd:PRK11901   211 NHHK--TATVAVPPAT------SGKPKSGAASARALSSAPASHYTLQLSsASRS 256
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
713-895 1.29e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.33  E-value: 1.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  713 TSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQ----GSPATSQPPN 788
Cdd:PRK12323   400 AAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAaagpRPVAAAAAAA 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  789 QTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQT---GPLLQHPGPPQP 865
Cdd:PRK12323   480 PARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPApaaAPAPRAAAATEP 559
                          170       180       190
                   ....*....|....*....|....*....|
gi 1720409699  866 FGLPSQPSQGQGPLGPSPAAAHPHSTIQLP 895
Cdd:PRK12323   560 VVAPRPPRASASGLPDMFDGDWPALAARLP 589
PRK13042 PRK13042
superantigen-like protein SSL4; Reviewed;
712-789 1.41e-03

superantigen-like protein SSL4; Reviewed;


Pssm-ID: 183854 [Multi-domain]  Cd Length: 291  Bit Score: 42.70  E-value: 1.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  712 STSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGT----PQLPTQGPTPSATAVPPQGSPATSQPP 787
Cdd:PRK13042    17 TTGVITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTkveaPQQTPNATTPSSTKVETPQSPTTKQVP 96

                   ..
gi 1720409699  788 NQ 789
Cdd:PRK13042    97 TE 98
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
735-865 1.45e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.16  E-value: 1.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  735 MLQAQPP-ALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqQAPTLHPPRL 813
Cdd:PRK14951   361 LLAFKPAaAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP-----VAAPAAAAPA 435
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1720409699  814 PSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLlqHPGPPQP 865
Cdd:PRK14951   436 AAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPA--PAAAPAA 485
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
607-901 1.95e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 42.79  E-value: 1.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  607 DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREK-VASDTEDTDRITSKKTKTQEISRPNSPSEG 685
Cdd:PRK14949   482 NSAVPEQIDSTAEQSVVNPSVTDTQVDDTSASNNSAADNTVDDNYSAEDtLESNGLDEGDYAQDSAPLDAYQDDYVAFSS 561
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  686 EGESSDSRSVNDEGSSDPKdidQDNRSTSPSIPSPQDNE---SDSDSSAQQQMLQA----------QPPALQAPSGAASA 752
Cdd:PRK14949   562 ESYNALSDDEQHSANVQSA---QSAAEAQPSSQSLSPISavtTAAASLADDDILDAvlaardsllsDLDALSPKEGDGKK 638
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  753 PSTA--PPGTPQLPTQGPTPSATAVPP--QGSPATSQPPNQTQSTV--APAAHTHIQQAPTLHPPRLPSPHPplqPMTAP 826
Cdd:PRK14949   639 SSADrkPKTPPSRAPPASLSKPASSPDasQTSASFDLDPDFELATHqsVPEAALASGSAPAPPPVPDPYDRP---PWEEA 715
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720409699  827 PSQSSAQPHPQPSLHSQGPPGPHSLQTGPLlqHPGPPQPFGLPSQPSQGQGPlGPSPAAAHPHSTIQLPASQSAL 901
Cdd:PRK14949   716 PEVASANDGPNNAAEGNLSESVEDASNSEL--QAVEQQATHQPQVQAEAQSP-ASTTALTQTSSEVQDTELNLVL 787
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
763-900 2.16e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.67  E-value: 2.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  763 LPTQGPTPSATAVPPQGSPAtSQPPNQTQSTVAPAAhthiqqaptlhpprlpsphpplqpMTAPPSQSSAQPHPQPSLHS 842
Cdd:PRK07764   385 LGVAGGAGAPAAAAPSAAAA-APAAAPAPAAAAPAA------------------------AAAPAPAAAPQPAPAPAPAP 439
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720409699  843 QGPPGPHSLQTGPLLQHP----GPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQSA 900
Cdd:PRK07764   440 APPSPAGNAPAGGAPSPPpaaaPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPA 501
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
738-827 2.21e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.49  E-value: 2.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  738 AQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSAtavPPQGSPATsqPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPH 817
Cdd:PRK14950   366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK---EPVRETAT--PPPVPPRPVAPPVPHTPESAPKLTRAAIPVDE 440
                           90
                   ....*....|
gi 1720409699  818 PPLQPMTAPP 827
Cdd:PRK14950   441 KPKYTPPAPP 450
PHA03269 PHA03269
envelope glycoprotein C; Provisional
747-862 2.45e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 42.41  E-value: 2.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  747 SGAASAPSTAPpgTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTlhpprlPSPHPPLQPMTAP 826
Cdd:PHA03269    17 LIIANLNTNIP--IPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPT------PAASEKFDPAPAP 88
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 1720409699  827 PSQSSAQPHPQ--PSLHSQGPPGPHSLQTGPLLQHPGP 862
Cdd:PHA03269    89 HQAASRAPDPAvaPQLAAAPKPDAAEAFTSAAQAHEAP 126
PRK08581 PRK08581
amidase domain-containing protein;
595-857 3.33e-03

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 42.08  E-value: 3.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  595 TSPDGRASPINEDIRSSGRNSPSAASTSsNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITS---KKT 671
Cdd:PRK08581    21 TSPTAYADDPQKDSTAKTTSHDSKKSND-DETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDfiyKNL 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  672 KTQEISRPNSPSEGEGESSDSRSVNDEGSSDpKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAL----QAPS 747
Cdd:PRK08581   100 PQTNINQLLTKNKYDDNYSLTTLIQNLFNLN-SDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKadnqKAPS 178
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  748 GAASAPST-------APPGTPQLPTQGPTPSATAVPPQGS--------------------PATSQPPNQTQSTVAPAAHT 800
Cdd:PRK08581   179 SNNTKPSTsnkqpnsPKPTQPNQSNSQPASDDTANQKSSSkdnqsmsdsaldsildqyseDAKKTQKDYASQSKKDKTET 258
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720409699  801 HIQQAPTLHPPRLPSPhpplqpmTAPPSQSSAQPHPQPSLHSQgppgpHSLQTGPLL 857
Cdd:PRK08581   259 SNTKNPQLPTQDELKH-------KSKPAQSFENDVNQSNTRST-----SLFETGPSL 303
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
739-827 4.59e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.80  E-value: 4.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  739 QPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPqGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHP 818
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPP-AAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115

                   ....*....
gi 1720409699  819 PLQPMTAPP 827
Cdd:PRK12270   116 EVTPLRGAA 124
PHA03269 PHA03269
envelope glycoprotein C; Provisional
770-890 4.76e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 41.64  E-value: 4.76e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  770 PSATAVPPQGSPATSQPPNQtqstvAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPH 849
Cdd:PHA03269    23 NTNIPIPELHTSAATQKPDP-----APAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPD 97
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 1720409699  850 SLQTGPLLQHPGPPQPFGLPSQPSQGQGPL---------GPSPAAAHPHS 890
Cdd:PHA03269    98 PAVAPQLAAAPKPDAAEAFTSAAQAHEAPAdagtsaaskKPDPAAHTQHS 147
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
596-839 4.96e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.48  E-value: 4.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  596 SPDGRASPINEDIRSSGRNSPSAASTSSNDSKaeTVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITSKKTKTQE 675
Cdd:pfam17823   22 PADPRHFVLNKMWNGAGKQNASGDAVPRADNK--SSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSE 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  676 isrpNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSpqdnESDSDSSAQQqmlqAQPPALQAPSGAASAPST 755
Cdd:pfam17823  100 ----PATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPS----EAFSAPRAAA----CRANASAAPRAAIAAASA 167
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  756 APPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPH 835
Cdd:pfam17823  168 PHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGT 247

                   ....
gi 1720409699  836 PQPS 839
Cdd:pfam17823  248 VTPA 251
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
762-1033 4.99e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 41.56  E-value: 4.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  762 QLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPA-----AHTHIQQAPTLhpprlpsphpplQPM-----TAPPSQSS 831
Cdd:pfam09770  105 QQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVrtgyeKYKEPEPIPDL------------QVDaslwgVAPKKAAA 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  832 AQPHPQPSLHSQGPPGPH----SLQ----------TGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPAS 897
Cdd:pfam09770  173 PAPAPQPAAQPASLPAPSrkmmSLEeveaamraqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQ 252
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  898 QsalqpqqppreqplppaplamphikpppttpipqlPAPQAHKHPPHLsgpspfslnanlppppalkplsslsthhppsa 977
Cdd:pfam09770  253 P-----------------------------------QQHPGQGHPVTI-------------------------------- 265
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  978 hpppLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLH----QVPSQSPFPQHPF 1033
Cdd:pfam09770  266 ----LQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQnpnrLSAARVGYPQNPQ 321
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
652-784 6.53e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 40.65  E-value: 6.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  652 QREKVASDTEDTDRITSKKTKTQ-EISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDN---------RSTSPSIPSPQ 721
Cdd:TIGR00601    9 QQQKFKIDMEPDETVKELKEKIEaEQGKDAYPVAQQKLIYSGKILSDDKTVKEYKIKEKDfvvvmvskpKTGTGKVAPPA 88
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720409699  722 DNESDSDSSAqqqmlqAQPPALQAPSGAASAPSTAPPGTP---QLPTQGPTPSATAVPPQGSPATS 784
Cdd:TIGR00601   89 ATPTSAPTPT------PSPPASPASGMSAAPASAVEEKSPseeSATATAPESPSTSVPSSGSDAAS 148
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
536-710 6.55e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 41.19  E-value: 6.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  536 FKKYGELPPIEKPVDPPPFMFKPVKEEDDglsgKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNS 615
Cdd:PTZ00108  1223 SDQEDDEEQKTKPKKSSVKRLKSKKNNSS----KSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNG 1298
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  616 PSAASTSSNDSKAETVKKSAKKVKEeaasPLKSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEgESSDSRSV 695
Cdd:PTZ00108  1299 GSKPSSPTKKKVKKRLEGSLAALKK----KKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSS-EDDDDSEV 1373
                          170
                   ....*....|....*
gi 1720409699  696 NDEGSSDPKDIDQDN 710
Cdd:PTZ00108  1374 DDSEDEDDEDDEDDD 1388
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
729-886 7.26e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 40.99  E-value: 7.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  729 SSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHThiqqaptl 808
Cdd:PRK07003   413 KAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDA-------- 484
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720409699  809 hpprlpsphppLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAA 886
Cdd:PRK07003   485 -----------PPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAAA 551
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
726-831 7.59e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 40.91  E-value: 7.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  726 DSDSSAQQQMLQAQPPALQAPSgAASAPSTAPPGTPQLPTQGPTPSatavPPQGSPATSQPPNQTQSTVAPAAHTHIQQA 805
Cdd:PRK14971   366 GDDASGGRGPKQHIKPVFTQPA-AAPQPSAAAAASPSPSQSSAAAQ----PSAPQSATQPAGTPPTVSVDPPAAVPVNPP 440
                           90       100
                   ....*....|....*....|....*.
gi 1720409699  806 PTLHPPRLPSPHPPLQPMtaPPSQSS 831
Cdd:PRK14971   441 STAPQAVRPAQFKEEKKI--PVSKVS 464
PRK10856 PRK10856
cytoskeleton protein RodZ;
729-832 7.88e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 40.39  E-value: 7.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  729 SSAQ--QQMLQAQPPALQAPSGAASAPSTAPPGTPQlPTQGPTPSATAVP-PQGSPATSQPPNQTQSTVAPAAHThiQQA 805
Cdd:PRK10856   150 SSAElsQNSGQSVPLDTSTTTDPATTPAPAAPVDTT-PTNSQTPAVATAPaPAVDPQQNAVVAPSQANVDTAATP--APA 226
                           90       100
                   ....*....|....*....|....*..
gi 1720409699  806 PTLHPPRLPSPHPPLQPMTAPPSQSSA 832
Cdd:PRK10856   227 APATPDGAAPLPTDQAGVSTPAADPNA 253
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
746-891 8.13e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 40.62  E-value: 8.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  746 PSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSphpplqpmta 825
Cdd:PRK07994   361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQ---------- 430
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720409699  826 PPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHST 891
Cdd:PRK07994   431 RAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVAT 496
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
718-898 8.41e-03

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 40.76  E-value: 8.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  718 PSPQDNESDSDSSAQQQML-QAQPPALQAPSGAAS-------APSTAPPGTPQLPTQGPTPSATAVPPQGSPA------- 782
Cdd:pfam09606  229 MNPQQMGGAPNQVAMQQQQpQQQGQQSQLGMGINQmqqmpqgVGGGAGQGGPGQPMGPPGQQPGAMPNVMSIGdqnnyqq 308
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699  783 -TSQPPNQTQSTVAPAAHTH-----IQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSL--QTG 854
Cdd:pfam09606  309 qQTRQQQQQQGGNHPAAHQQqmnqsVGQGGQVVALGGLNHLETWNPGNFGGLGANPMQRGQPGMMSSPSPVPGQQvrQVT 388
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 1720409699  855 PLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQ 898
Cdd:pfam09606  389 PNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQ 432
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH