|
Name |
Accession |
Description |
Interval |
E-value |
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
568-1568 |
0e+00 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 1202.28 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 568 GKHSMRTRRSRGSgqMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLK 647
Cdd:pfam03154 1 GKHSMRTRRSRGS--MSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLK 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 648 STKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 727
Cdd:pfam03154 79 SAKRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 728 DSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPT 807
Cdd:pfam03154 159 DSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPT 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 808 LHPPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPA 884
Cdd:pfam03154 237 LHPQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPA 316
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 885 AAHP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPAL 963
Cdd:pfam03154 317 APGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL 396
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 964 KPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPIT 1042
Cdd:pfam03154 397 KPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPIT 476
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1043 PPSCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHAS 1122
Cdd:pfam03154 477 PPSGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1123 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqK 1202
Cdd:pfam03154 555 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-K 633
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1203 ASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAY 1282
Cdd:pfam03154 634 ASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAY 713
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1283 HMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASF 1362
Cdd:pfam03154 714 HMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASF 793
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1363 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 1442
Cdd:pfam03154 794 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 873
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1443 SAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQA 1522
Cdd:pfam03154 874 SGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQA 945
|
970 980 990 1000
....*....|....*....|....*....|....*....|....*.
gi 1720409699 1523 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1568
Cdd:pfam03154 946 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
|
|
| BAH_MTA |
cd04709 |
BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The ... |
102-307 |
3.30e-81 |
|
BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The Metastasis-associated protein MTA1 is part of the NURD (nucleosome remodeling and deacetylating) complex and plays a role in cellular transformation and metastasis. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Pssm-ID: 240060 Cd Length: 164 Bit Score: 263.87 E-value: 3.30e-81
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 102 DVVYRPGDCVYIESRrPNTPYFICSIQDFKLvhssqaccrspapafcdppacslpvapqppqhlseagrgpggSKRDHLL 181
Cdd:cd04709 1 ANMYRVGDYVYFESS-PNNPYLIRRIEELNK------------------------------------------TARGHVE 37
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 182 MNVKWYYRQSEVPDSVYQHLVQDRHNEND-SGRELVITDPVIKNRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK 260
Cdd:cd04709 38 AKVVCYYRRRDIPDSLYQLADQHRRELEEkSDDLTPKQRHQLRHRELFLSRQVETLPATHIRGKCSVTLLNDTESARSYL 117
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1720409699 261 ARVDSFFYILGYNPETRRLNSTQGEIRVGPSHQAKLPDLQPFPSPDG 307
Cdd:cd04709 118 AREDTFFYSLVYDPEQKTLLADQGEIRVGPSYQAKLPDLQPFPSPDG 164
|
|
| SANT_MTA3_like |
cd11661 |
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ... |
395-440 |
1.33e-22 |
|
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.
Pssm-ID: 212559 [Multi-domain] Cd Length: 46 Bit Score: 91.91 E-value: 1.33e-22
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1720409699 395 CWTEDEVKRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 440
Cdd:cd11661 1 EWSESEAKLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
|
|
| ZnF_GATA |
smart00401 |
zinc finger binding to DNA consensus sequence [AT]GATA[AG]; |
503-552 |
2.72e-15 |
|
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
Pssm-ID: 214648 [Multi-domain] Cd Length: 52 Bit Score: 71.30 E-value: 2.72e-15
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 1720409699 503 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 552
Cdd:smart00401 2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
|
|
| ZnF_GATA |
cd00202 |
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] ... |
506-560 |
3.29e-15 |
|
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C
Pssm-ID: 238123 [Multi-domain] Cd Length: 54 Bit Score: 71.25 E-value: 3.29e-15
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 1720409699 506 ACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGELPPIEKPvDPPPFMFKPVK 560
Cdd:cd00202 1 ACSNCGTTTTPLWRRGPSGGSTLCNACGLYWKKHGVMRPLSKR-KKDQIKRRNRK 54
|
|
| ELM2 |
pfam01448 |
ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown ... |
286-336 |
1.63e-12 |
|
ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in Swiss:O82364. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
Pssm-ID: 460214 Cd Length: 53 Bit Score: 63.40 E-value: 1.63e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1720409699 286 IRVGPSHQAKLPDLQPFPSPDGDTVTQHEELVWMP--GVSDCDLLMYLRAARS 336
Cdd:pfam01448 1 IRVGPRYQAEIPELLPPSEEEDRYEEEDELLVWDPnhNLPDRKLDEYLVVARS 53
|
|
| GATA |
pfam00320 |
GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This ... |
507-542 |
1.25e-11 |
|
GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
Pssm-ID: 425605 [Multi-domain] Cd Length: 36 Bit Score: 60.41 E-value: 1.25e-11
10 20 30
....*....|....*....|....*....|....*.
gi 1720409699 507 CRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL 542
Cdd:pfam00320 1 CSNCGTTKTPLWRRGPNGNRTLCNACGLYYKKKGLK 36
|
|
| BAH |
pfam01426 |
BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been ... |
103-281 |
2.71e-10 |
|
BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.
Pssm-ID: 460207 Cd Length: 120 Bit Score: 59.24 E-value: 2.71e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 103 VVYRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPPACSLPVapqppqhlseagrgpggskrdhllm 182
Cdd:pfam01426 1 ETYSVGDFVLVEPDDADEPYYVARIEEL----------------FEDTKNGKKMV------------------------- 39
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 183 NVKWYYRQSEVPdsvyqHLVQDRHNEndsgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK-A 261
Cdd:pfam01426 40 RVQWFYRPEETV-----HRAGKAFNK----------------DELFLSDEEDDVPLSAIIGKCSVLHKSDLESLDPYKiK 98
|
170 180
....*....|....*....|
gi 1720409699 262 RVDSFFYILGYNPETRRLNS 281
Cdd:pfam01426 99 EPDDFFCELLYDPKTKSFKK 118
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
578-1027 |
4.35e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 64.96 E-value: 4.35e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 578 RGSGQMSTLRSGRKKQPTSPDGRASPINE--DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREK 655
Cdd:PHA03247 2576 RPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD 2655
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 656 VASDTEDTDRITSKKTKTqeiSRPNSPSEGEGESSDSRSVNDEGSS-----DPKDIDQDNRSTSPSIPSPQDNESDSDSS 730
Cdd:PHA03247 2656 PAPGRVSRPRRARRLGRA---AQASSPPQRPRRRAARPTVGSLTSLadpppPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 731 AQQQMLQAQPPAlqaPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHP 810
Cdd:PHA03247 2733 PALPAAPAPPAV---PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES------LPSPWD 2803
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 811 PRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFG-LPSQPSQGQGPLGPSPAAAHPH 889
Cdd:PHA03247 2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdVRRRPPSRSPAAKPAAPARPPV 2883
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 890 STIQLPA-SQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPALKPLSS 968
Cdd:PHA03247 2884 RRLARPAvSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPW 2963
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*....
gi 1720409699 969 LSTHHPPSAHPPPLQLmPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP 1027
Cdd:PHA03247 2964 LGALVPGRVAVPRFRV-PQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPP 3021
|
|
| BAH |
smart00439 |
Bromo adjacent homology domain; |
105-281 |
1.24e-09 |
|
Bromo adjacent homology domain;
Pssm-ID: 214664 [Multi-domain] Cd Length: 121 Bit Score: 57.69 E-value: 1.24e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 105 YRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPpacslpvapqppqhlseagrgpGGSKRDHLlmNV 184
Cdd:smart00439 2 ISVGDFVLVEPDDADEPYYIGRIEEI----------------FETK----------------------KNSESKMV--RV 41
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 185 KWYYRQSEVPdsvyqHLVQDRHNENdsgrelvitdpviknrELFISDYVDTYHAAALRGKCNISHFSDIF--AAREFKAR 262
Cdd:smart00439 42 RWFYRPEETV-----LEKAALFDKN----------------EVFLSDEYDTVPLSDIIGKCNVLYKSDYPglRPEGSIGE 100
|
170
....*....|....*....
gi 1720409699 263 VDSFFYILGYNPETRRLNS 281
Cdd:smart00439 101 PDVFFCESAYDPEKGSFKK 119
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
545-731 |
3.31e-09 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 61.85 E-value: 3.31e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 545 IEKPVDPP----PFMFKPVKEEDDGL----SGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSP 616
Cdd:NF033609 539 IDKPVVPEqpdePGEIEPIPEDSDSDpgsdSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDS 618
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 617 SAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRS 694
Cdd:NF033609 619 ASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDS 698
|
170 180 190
....*....|....*....|....*....|....*..
gi 1720409699 695 VNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609 699 DSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 734
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
602-762 |
1.50e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.91 E-value: 1.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 602 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPN 680
Cdd:NF033609 716 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 795
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 681 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPS-IPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPP 758
Cdd:NF033609 796 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP 875
|
....
gi 1720409699 759 GTPQ 762
Cdd:NF033609 876 NSPK 879
|
|
| SANT |
smart00717 |
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains; |
396-441 |
6.70e-05 |
|
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 41.83 E-value: 6.70e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 1720409699 396 WTEDEVKRFVKGLRQYG-KNFFRIRKElLPSKETGELITFYYYWKKT 441
Cdd:smart00717 4 WTEEEDELLIELVKKYGkNNWEKIAKE-LPGRTAEQCRERWRNLLKP 49
|
|
| Myb_DNA-binding |
pfam00249 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
396-439 |
1.92e-04 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 459731 [Multi-domain] Cd Length: 46 Bit Score: 40.56 E-value: 1.92e-04
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 1720409699 396 WTEDEVKRFVKGLRQYGKNFFRIrKELLPSKETGELITFYYYWK 439
Cdd:pfam00249 4 WTPEEDELLLEAVEKLGNRWKKI-AKLLPGRTDNQCKNRWQNYL 46
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
580-731 |
2.45e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 46.06 E-value: 2.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 580 SGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASD 659
Cdd:NF033609 628 SDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 707
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720409699 660 TE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609 708 SDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
602-731 |
3.47e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 45.29 E-value: 3.47e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 602 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTeDTDRITSKKTKTQEISRPNS 681
Cdd:NF033609 674 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDS 752
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 1720409699 682 PSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609 753 DSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
|
|
| rad23 |
TIGR00601 |
UV excision repair protein Rad23; All proteins in this family for which functions are known ... |
652-784 |
6.53e-03 |
|
UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273167 [Multi-domain] Cd Length: 378 Bit Score: 40.65 E-value: 6.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 652 QREKVASDTEDTDRITSKKTKTQ-EISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDN---------RSTSPSIPSPQ 721
Cdd:TIGR00601 9 QQQKFKIDMEPDETVKELKEKIEaEQGKDAYPVAQQKLIYSGKILSDDKTVKEYKIKEKDfvvvmvskpKTGTGKVAPPA 88
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720409699 722 DNESDSDSSAqqqmlqAQPPALQAPSGAASAPSTAPPGTP---QLPTQGPTPSATAVPPQGSPATS 784
Cdd:TIGR00601 89 ATPTSAPTPT------PSPPASPASGMSAAPASAVEEKSPseeSATATAPESPSTSVPSSGSDAAS 148
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
568-1568 |
0e+00 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 1202.28 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 568 GKHSMRTRRSRGSgqMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLK 647
Cdd:pfam03154 1 GKHSMRTRRSRGS--MSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLK 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 648 STKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 727
Cdd:pfam03154 79 SAKRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 728 DSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPT 807
Cdd:pfam03154 159 DSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPT 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 808 LHPPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPA 884
Cdd:pfam03154 237 LHPQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPA 316
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 885 AAHP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPAL 963
Cdd:pfam03154 317 APGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL 396
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 964 KPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPIT 1042
Cdd:pfam03154 397 KPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPIT 476
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1043 PPSCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHAS 1122
Cdd:pfam03154 477 PPSGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1123 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqK 1202
Cdd:pfam03154 555 QSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-K 633
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1203 ASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAY 1282
Cdd:pfam03154 634 ASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAY 713
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1283 HMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASF 1362
Cdd:pfam03154 714 HMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASF 793
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1363 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 1442
Cdd:pfam03154 794 HPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQG 873
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1443 SAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQA 1522
Cdd:pfam03154 874 SGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQA 945
|
970 980 990 1000
....*....|....*....|....*....|....*....|....*.
gi 1720409699 1523 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1568
Cdd:pfam03154 946 MHAQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
|
|
| BAH_MTA |
cd04709 |
BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The ... |
102-307 |
3.30e-81 |
|
BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The Metastasis-associated protein MTA1 is part of the NURD (nucleosome remodeling and deacetylating) complex and plays a role in cellular transformation and metastasis. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Pssm-ID: 240060 Cd Length: 164 Bit Score: 263.87 E-value: 3.30e-81
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 102 DVVYRPGDCVYIESRrPNTPYFICSIQDFKLvhssqaccrspapafcdppacslpvapqppqhlseagrgpggSKRDHLL 181
Cdd:cd04709 1 ANMYRVGDYVYFESS-PNNPYLIRRIEELNK------------------------------------------TARGHVE 37
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 182 MNVKWYYRQSEVPDSVYQHLVQDRHNEND-SGRELVITDPVIKNRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK 260
Cdd:cd04709 38 AKVVCYYRRRDIPDSLYQLADQHRRELEEkSDDLTPKQRHQLRHRELFLSRQVETLPATHIRGKCSVTLLNDTESARSYL 117
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1720409699 261 ARVDSFFYILGYNPETRRLNSTQGEIRVGPSHQAKLPDLQPFPSPDG 307
Cdd:cd04709 118 AREDTFFYSLVYDPEQKTLLADQGEIRVGPSYQAKLPDLQPFPSPDG 164
|
|
| SANT_MTA3_like |
cd11661 |
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ... |
395-440 |
1.33e-22 |
|
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.
Pssm-ID: 212559 [Multi-domain] Cd Length: 46 Bit Score: 91.91 E-value: 1.33e-22
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1720409699 395 CWTEDEVKRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 440
Cdd:cd11661 1 EWSESEAKLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
|
|
| ZnF_GATA |
smart00401 |
zinc finger binding to DNA consensus sequence [AT]GATA[AG]; |
503-552 |
2.72e-15 |
|
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
Pssm-ID: 214648 [Multi-domain] Cd Length: 52 Bit Score: 71.30 E-value: 2.72e-15
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 1720409699 503 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 552
Cdd:smart00401 2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
|
|
| ZnF_GATA |
cd00202 |
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] ... |
506-560 |
3.29e-15 |
|
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C
Pssm-ID: 238123 [Multi-domain] Cd Length: 54 Bit Score: 71.25 E-value: 3.29e-15
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 1720409699 506 ACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGELPPIEKPvDPPPFMFKPVK 560
Cdd:cd00202 1 ACSNCGTTTTPLWRRGPSGGSTLCNACGLYWKKHGVMRPLSKR-KKDQIKRRNRK 54
|
|
| ELM2 |
pfam01448 |
ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown ... |
286-336 |
1.63e-12 |
|
ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in Swiss:O82364. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
Pssm-ID: 460214 Cd Length: 53 Bit Score: 63.40 E-value: 1.63e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1720409699 286 IRVGPSHQAKLPDLQPFPSPDGDTVTQHEELVWMP--GVSDCDLLMYLRAARS 336
Cdd:pfam01448 1 IRVGPRYQAEIPELLPPSEEEDRYEEEDELLVWDPnhNLPDRKLDEYLVVARS 53
|
|
| GATA |
pfam00320 |
GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This ... |
507-542 |
1.25e-11 |
|
GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.
Pssm-ID: 425605 [Multi-domain] Cd Length: 36 Bit Score: 60.41 E-value: 1.25e-11
10 20 30
....*....|....*....|....*....|....*.
gi 1720409699 507 CRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL 542
Cdd:pfam00320 1 CSNCGTTKTPLWRRGPNGNRTLCNACGLYYKKKGLK 36
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
734-900 |
1.41e-10 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 66.21 E-value: 1.41e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 734 QM-LQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQgspatsQPPNQTQSTVAPAAHTHIQQAPtlhppr 812
Cdd:pfam09770 202 AMrAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ------QQPQQPQQHPGQGHPVTILQRP------ 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 813 lpsphpplQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQtgpLLQHP-------------GPPQPFGLPSQPSQGQGPL 879
Cdd:pfam09770 270 --------QSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQ---ILQNPnrlsaarvgypqnPQPGVQPAPAHQAHRQQGS 338
|
170 180
....*....|....*....|...
gi 1720409699 880 --GPSPAAAHPHSTIQLPASQSA 900
Cdd:pfam09770 339 fgRQAPIITHPQQLAQLSEEEKA 361
|
|
| BAH |
pfam01426 |
BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been ... |
103-281 |
2.71e-10 |
|
BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.
Pssm-ID: 460207 Cd Length: 120 Bit Score: 59.24 E-value: 2.71e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 103 VVYRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPPACSLPVapqppqhlseagrgpggskrdhllm 182
Cdd:pfam01426 1 ETYSVGDFVLVEPDDADEPYYVARIEEL----------------FEDTKNGKKMV------------------------- 39
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 183 NVKWYYRQSEVPdsvyqHLVQDRHNEndsgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK-A 261
Cdd:pfam01426 40 RVQWFYRPEETV-----HRAGKAFNK----------------DELFLSDEEDDVPLSAIIGKCSVLHKSDLESLDPYKiK 98
|
170 180
....*....|....*....|
gi 1720409699 262 RVDSFFYILGYNPETRRLNS 281
Cdd:pfam01426 99 EPDDFFCELLYDPKTKSFKK 118
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
578-1027 |
4.35e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 64.96 E-value: 4.35e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 578 RGSGQMSTLRSGRKKQPTSPDGRASPINE--DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREK 655
Cdd:PHA03247 2576 RPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD 2655
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 656 VASDTEDTDRITSKKTKTqeiSRPNSPSEGEGESSDSRSVNDEGSS-----DPKDIDQDNRSTSPSIPSPQDNESDSDSS 730
Cdd:PHA03247 2656 PAPGRVSRPRRARRLGRA---AQASSPPQRPRRRAARPTVGSLTSLadpppPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 731 AQQQMLQAQPPAlqaPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHP 810
Cdd:PHA03247 2733 PALPAAPAPPAV---PAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES------LPSPWD 2803
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 811 PRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFG-LPSQPSQGQGPLGPSPAAAHPH 889
Cdd:PHA03247 2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdVRRRPPSRSPAAKPAAPARPPV 2883
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 890 STIQLPA-SQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPALKPLSS 968
Cdd:PHA03247 2884 RRLARPAvSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPW 2963
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*....
gi 1720409699 969 LSTHHPPSAHPPPLQLmPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP 1027
Cdd:PHA03247 2964 LGALVPGRVAVPRFRV-PQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPP 3021
|
|
| BAH |
smart00439 |
Bromo adjacent homology domain; |
105-281 |
1.24e-09 |
|
Bromo adjacent homology domain;
Pssm-ID: 214664 [Multi-domain] Cd Length: 121 Bit Score: 57.69 E-value: 1.24e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 105 YRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPpacslpvapqppqhlseagrgpGGSKRDHLlmNV 184
Cdd:smart00439 2 ISVGDFVLVEPDDADEPYYIGRIEEI----------------FETK----------------------KNSESKMV--RV 41
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 185 KWYYRQSEVPdsvyqHLVQDRHNENdsgrelvitdpviknrELFISDYVDTYHAAALRGKCNISHFSDIF--AAREFKAR 262
Cdd:smart00439 42 RWFYRPEETV-----LEKAALFDKN----------------EVFLSDEYDTVPLSDIIGKCNVLYKSDYPglRPEGSIGE 100
|
170
....*....|....*....
gi 1720409699 263 VDSFFYILGYNPETRRLNS 281
Cdd:smart00439 101 PDVFFCESAYDPEKGSFKK 119
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
545-731 |
3.31e-09 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 61.85 E-value: 3.31e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 545 IEKPVDPP----PFMFKPVKEEDDGL----SGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSP 616
Cdd:NF033609 539 IDKPVVPEqpdePGEIEPIPEDSDSDpgsdSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDS 618
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 617 SAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRS 694
Cdd:NF033609 619 ASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDS 698
|
170 180 190
....*....|....*....|....*....|....*..
gi 1720409699 695 VNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609 699 DSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 734
|
|
| BAH_fungalPHD |
cd04710 |
BAH, or Bromo Adjacent Homology domain, as present in fungal proteins containing PHD domains. ... |
101-278 |
5.13e-08 |
|
BAH, or Bromo Adjacent Homology domain, as present in fungal proteins containing PHD domains. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Pssm-ID: 240061 Cd Length: 135 Bit Score: 53.14 E-value: 5.13e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 101 DDVVYRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapafcdppacsLPVAPQPPQHLSEAGRGPGGSKRdhl 180
Cdd:cd04710 8 NGELLKVNDHIYMSSEPPGEPYYIGRIMEF------------------------VPKHEFPSGIHARVFPASYFQVR--- 60
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 181 lMNvkWYYRQSEVpdsvyqhlvqDRHNENDSgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK 260
Cdd:cd04710 61 -LN--WYYRPRDI----------SRRVVADS-------------RLLYASMHSDICPIGSVRGKCTVRHRDQIPDLEEYK 114
|
170
....*....|....*...
gi 1720409699 261 ARVDSFFYILGYNPETRR 278
Cdd:cd04710 115 KRPNHFYFDQLFDRYILR 132
|
|
| BAH |
cd04370 |
BAH, or Bromo Adjacent Homology domain (also called ELM1 and BAM for Bromo Adjacent Motif). ... |
105-277 |
6.39e-08 |
|
BAH, or Bromo Adjacent Homology domain (also called ELM1 and BAM for Bromo Adjacent Motif). BAH domains have first been described as domains found in the polybromo protein and Yeast Rsc1/Rsc2 (Remodeling of the Structure of Chromatin). They also occur in mammalian DNA methyltransferases and the MTA1 subunits of histone deacetylase complexes. A BAH domain is also found in Yeast Sir3p and in the origin receptor complex protein 1 (Orc1p), where it was found to interact with the N-terminal lobe of the silence information regulator 1 protein (Sir1p), confirming the initial hypothesis that BAH plays a role in protein-protein interactions.
Pssm-ID: 239835 [Multi-domain] Cd Length: 123 Bit Score: 52.78 E-value: 6.39e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 105 YRPGDCVYIE--SRRPNTPYFICSIQDFklvhssqaccrspapaFCDPpacslpvapqppqhlseagrgpggskRDHLLM 182
Cdd:cd04370 4 YEVGDSVYVEpdDSIKSDPPYIARIEEL----------------WEDT--------------------------NGSKQV 41
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 183 NVKWYYRQSEVPDSVYQHlvqdrHNEndsgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIF--AAREFK 260
Cdd:cd04370 42 KVRWFYRPEETPKGLSPF-----ALR----------------RELFLSDHLDEIPVESIIGKCKVLFVSEFEglKQRPNK 100
|
170
....*....|....*..
gi 1720409699 261 ARVDSFFYILGYNPETR 277
Cdd:cd04370 101 IDTDDFFCRLAYDPTTK 117
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
666-1032 |
2.25e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 56.10 E-value: 2.25e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 666 ITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSAQQQMLQAQPPALQA 745
Cdd:PHA03247 2582 VTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPS-PAANEPDPHPPPTVPPPERPRDDPAPGR 2660
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 746 PSGAASAPSTAPPGTPQLPTQGPTPSAtAVPPQGSPATSQPPNQTQSTVAPAahthiqqaPTLHPPRLPsphpplqpmTA 825
Cdd:PHA03247 2661 VSRPRRARRLGRAAQASSPPQRPRRRA-ARPTVGSLTSLADPPPPPPTPEPA--------PHALVSATP---------LP 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 826 PPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQSALQPQQ 905
Cdd:PHA03247 2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPW 2802
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 906 PPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNA------------NLPPPPALKPLSSLSTHH 973
Cdd:PHA03247 2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggsvapggdvrrRPPSRSPAAKPAAPARPP 2882
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 974 PPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP-FPQHP 1032
Cdd:PHA03247 2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPpRPQPP 2942
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
736-888 |
3.43e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 54.99 E-value: 3.43e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 736 LQAQPPALQAPSGAASAPSTAPPGTPQL-------PTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTL 808
Cdd:PRK07764 580 GDWQVEAVVGPAPGAAGGEGPPAPASSGppeeaarPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAV 659
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 809 HPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGP-PQPFGLPSQPSQGQGPLGPSPAAAH 887
Cdd:PRK07764 660 PDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQaDDPAAQPPQAAQGASAPSPAADDPV 739
|
.
gi 1720409699 888 P 888
Cdd:PRK07764 740 P 740
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
718-888 |
3.73e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 54.99 E-value: 3.73e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 718 PSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPA 797
Cdd:PRK07764 597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGA 676
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 798 AHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLH------SQGPPGPHSLQTGPLLQHPG-PPQPFGLPS 870
Cdd:PRK07764 677 APAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQppqaaqGASAPSPAADDPVPLPPEPDdPPDPAGAPA 756
|
170
....*....|....*...
gi 1720409699 871 QPSQGQGPLGPSPAAAHP 888
Cdd:PRK07764 757 QPPPPPAPAPAAAPAAAP 774
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
710-1139 |
6.28e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 6.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 710 NRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSG------AASAPSTAPPGTPQLPTQGPTPSATAV-PPQGSPA 782
Cdd:PHA03247 2565 DRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDdrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANePDPHPPP 2644
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 783 TSQPPNQTQSTVAPAA-----HTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHS---LQTG 854
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRvsrprRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSatpLPPG 2724
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 855 PLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHStiqlPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLP 934
Cdd:PHA03247 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS 2800
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 935 APQAHKHPPHLSGPSPFSLNANLPPPPALKPLSSLSTHHPPSAHPPPLQL--------------MPQSQPLPSSPAQPPG 1000
Cdd:PHA03247 2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLplggsvapggdvrrRPPSRSPAAKPAAPAR 2880
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 1001 lTQSQSLPPPAASHPTTGLHQVPSQSPFPQHPfvPGGPPPITPPSCPPTSTPPAGPSSSSQPPCSAAVSSGGSVPGAPSC 1080
Cdd:PHA03247 2881 -PPVRRLARPAVSRSTESFALPPDQPERPPQP--QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*....
gi 1720409699 1081 PLPAVQikeeaLDEAEEPESPPPPPRSPSPEPTVvdtPSHASQSARFYKHLDRGYNSCA 1139
Cdd:PHA03247 2958 AVPQPW-----LGALVPGRVAVPRFRVPQPAPSR---EAPASSTPPLTGHSLSRVSSWA 3008
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
725-880 |
8.56e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 53.84 E-value: 8.56e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 725 SDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPAtsqPPNQTQSTVAPAAhthiQQ 804
Cdd:PRK07764 367 ASDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPA---PAAAPQPAPAPAP----AP 439
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720409699 805 APTlhpPRLPSPHPPLQPMTAPPSQSSAQPHPQPslhsQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLG 880
Cdd:PRK07764 440 APP---SPAGNAPAGGAPSPPPAAAPSAQPAPAP----AAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
542-955 |
1.78e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 1.78e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 542 LPPIEKPVDPPPFMFKPVKEEDDGLSGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAAST 621
Cdd:PHA03247 2617 LPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLT 2696
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 622 SSNDSKAETVKKSAKKVKEEAASPL---KSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDE 698
Cdd:PHA03247 2697 SLADPPPPPPTPEPAPHALVSATPLppgPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPA 2776
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 699 GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTA---PPGTPQLPTQGPTPSATAV 775
Cdd:PHA03247 2777 AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAqptAPPPPPGPPPPSLPLGGSV 2856
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 776 PPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPH-SLQTG 854
Cdd:PHA03247 2857 APGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQpPPPPP 2936
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 855 PLLQHPGPPQPFGLPSQPSQGQGP-----------------LGPSPAAA-------------HPHSTIQLPASQSALQPQ 904
Cdd:PHA03247 2937 PRPQPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrvavprfRVPQPAPSreapasstppltgHSLSRVSSWASSLALHEE 3016
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 905 QPP-----------------REQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLS------------GPSPFSLNA 955
Cdd:PHA03247 3017 TDPppvslkqtlwppddtedSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPeagarespssqfGPPPLSANA 3096
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
738-856 |
9.98e-06 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 50.10 E-value: 9.98e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 738 AQPPAlqAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHPPRLPSPH 817
Cdd:PRK14951 382 ARPEA--AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAA------APAAVALAPAPPA 453
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 1720409699 818 PPLQPMTAPPSQSSAQPH-PQPSLHSQGPPGPHSLQTGPL 856
Cdd:PRK14951 454 QAAPETVAIPVRVAPEPAvASAAPAPAAAPAAARLTPTEE 493
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
594-894 |
1.26e-05 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 50.08 E-value: 1.26e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 594 PTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTED------TDRIT 667
Cdd:PRK10263 554 PVEAAAAVSPLASGVKKATLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASygiklpSQRAA 633
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 668 SKKTKTQEISRPNSPSEGEGESSDSRSvNDE-----GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPA 742
Cdd:PRK10263 634 EEKAREAQRNQYDSGDQYNDDEIDAMQ-QDElarqfAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQTQQQRYS 712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 743 LQAPSGA--------------------ASAPSTAPPGTP-QLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTH 801
Cdd:PRK10263 713 GEQPAGAnpfslddfefspmkallddgPHEPLFTPIVEPvQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQ 792
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 802 IQQAPTLHPPRLPSPHPPLQPMTAPPS-QSSAQPHPQPSLHSQGP-PGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPL 879
Cdd:PRK10263 793 QPQQPVAPQPQYQQPQQPVAPQPQYQQpQQPVAPQPQYQQPQQPVaPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLL 872
|
330
....*....|....*
gi 1720409699 880 GPSPAAAHPHSTIQL 894
Cdd:PRK10263 873 TPPPSEVEPVDTFAL 887
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
602-762 |
1.50e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.91 E-value: 1.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 602 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPN 680
Cdd:NF033609 716 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 795
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 681 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPS-IPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPP 758
Cdd:NF033609 796 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP 875
|
....
gi 1720409699 759 GTPQ 762
Cdd:NF033609 876 NSPK 879
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
719-800 |
1.52e-05 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 48.87 E-value: 1.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 719 SPQDNES---DSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVA 795
Cdd:PRK10856 155 SQNSGQSvplDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGA 234
|
....*
gi 1720409699 796 PAAHT 800
Cdd:PRK10856 235 APLPT 239
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
711-852 |
1.72e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 49.65 E-value: 1.72e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 711 RSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTP-------SATAVPPQGSPAT 783
Cdd:pfam09770 204 RAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVtilqrpqSPQPDPAQPSIQP 283
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720409699 784 SQPPNQTQSTVAPAAHTHIQQAPTLhpPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPP---GPHSLQ 852
Cdd:pfam09770 284 QAQQFHQQPPPVPVQPTQILQNPNR--LSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPiitHPQQLA 353
|
|
| PLN02967 |
PLN02967 |
kinase |
558-689 |
2.42e-05 |
|
kinase
Pssm-ID: 215521 [Multi-domain] Cd Length: 581 Bit Score: 48.89 E-value: 2.42e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 558 PVKEEDDGLSGKHSMRTRRSRgsgqmstlRSGRKKQPTSPDGRASPINEDIRssgrNSPSAASTSSNDSKAETVKKSA-- 635
Cdd:PLN02967 57 AVDEEPDENGAVSKKKPTRSV--------KRATKKTVVEISEPLEEGSELVV----NEDAALDKESKKTPRRTRRKAAaa 124
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1720409699 636 -KKVKEEAASPLKSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGES 689
Cdd:PLN02967 125 sSDVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEEL 179
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
667-848 |
2.44e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.21 E-value: 2.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 667 TSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSA--QQQMLQAQPPALQ 744
Cdd:PRK07764 600 PPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDggDGWPAKAGGAAPA 679
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 745 APSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMT 824
Cdd:PRK07764 680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPP 759
|
170 180
....*....|....*....|....
gi 1720409699 825 APPSQSSAQPHPQPSLHSQGPPGP 848
Cdd:PRK07764 760 PPPAPAPAAAPAAAPPPSPPSEEE 783
|
|
| SANT |
smart00717 |
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains; |
396-441 |
6.70e-05 |
|
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 41.83 E-value: 6.70e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 1720409699 396 WTEDEVKRFVKGLRQYG-KNFFRIRKElLPSKETGELITFYYYWKKT 441
Cdd:smart00717 4 WTEEEDELLIELVKKYGkNNWEKIAKE-LPGRTAEQCRERWRNLLKP 49
|
|
| SANT |
cd00167 |
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ... |
396-439 |
6.79e-05 |
|
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Pssm-ID: 238096 [Multi-domain] Cd Length: 45 Bit Score: 41.79 E-value: 6.79e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 1720409699 396 WTEDEVKRFVKGLRQYG-KNFFRIRKElLPSKETGELITFYYYWK 439
Cdd:cd00167 2 WTEEEDELLLEAVKKYGkNNWEKIAKE-LPGRTPKQCRERWRNLL 45
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
738-949 |
7.63e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 47.56 E-value: 7.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 738 AQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPnqtqstvAPAAHTHIQQAPTLHPPrlpsph 817
Cdd:PRK12323 379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP-------APEALAAARQASARGPG------ 445
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 818 pplqPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPllqhPGPPQPFGLPSQPSQGQGP---LGPSPAAAHPHSTIQL 894
Cdd:PRK12323 446 ----GAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA----PARAAPAAAPAPADDDPPPweeLPPEFASPAPAQPDAA 517
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 1720409699 895 PASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPS 949
Cdd:PRK12323 518 PAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
|
|
| PRK10927 |
PRK10927 |
cell division protein FtsN; |
639-865 |
1.08e-04 |
|
cell division protein FtsN;
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 46.21 E-value: 1.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 639 KEEAASPLKSTKRQREKVASDTEDTDR-ITSKKTKTQEISRPNSPSEGeGESSDSRSVNDEGSSDPKDIDQDNRSTSPSI 717
Cdd:PRK10927 58 KKEESETLQSQKVTGNGLPPKPEERWRyIKELESRQPGVRAPTEPSAG-GEVKTPEQLTPEQRQLLEQMQADMRQQPTQL 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 718 PSPQDNESDSDSsaQQQMLQAQPpalQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPatsQPPNQTQStvapa 797
Cdd:PRK10927 137 VEVPWNEQTPEQ--RQQTLQRQR---QAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQP---RQSKPAST----- 203
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720409699 798 ahthiqQAPtlhpprlpsphppLQPMTAPPSQSSAQPHPQpslhsqgppgphslQTGPLLQHPGPPQP 865
Cdd:PRK10927 204 ------QQP-------------YQDLLQTPAHTTAQSKPQ--------------QAAPVTRAADAPKP 238
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
693-798 |
1.10e-04 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 47.19 E-value: 1.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 693 RSVNDEGSSDPK--DIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPgTPQLPTQGPTP 770
Cdd:PRK12270 17 QYLADPNSVDPSwrEFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP-KPAAAAAAAAA 95
|
90 100 110
....*....|....*....|....*....|.
gi 1720409699 771 SATAVPPQGSPATSQPPNQTQSTV---APAA 798
Cdd:PRK12270 96 PAAPPAAAAAAAPAAAAVEDEVTPlrgAAAA 126
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
537-849 |
1.27e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 46.99 E-value: 1.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 537 KKYGELPPIEK---------PVDPPPFMFKPVKEEDDGLSGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASpined 607
Cdd:PTZ00449 491 KSKKKLAPIEEedsdkhdepPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAK----- 565
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 608 irssgRNSPSAASTSSNDSKAETVKKSAKKVKEEaasplKSTKRQRekvaSDTEDTDRITSKKTKTQEI----SRPNSPS 683
Cdd:PTZ00449 566 -----EHKPSKIPTLSKKPEFPKDPKHPKDPEEP-----KKPKRPR----SAQRPTRPKSPKLPELLDIpkspKRPESPK 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 684 EGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIP-SPQDNES--DSDSSAQQQMLQAQPPALQAPSGAASAPSTAP--P 758
Cdd:PTZ00449 632 SPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPfDPKFKEKfyDDYLDAAAKSKETKTTVVLDESFESILKETLPetP 711
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 759 GTP-----QLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQP----MTAPPSQ 829
Cdd:PTZ00449 712 GTPfttprPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEdihaETGEPDE 791
|
330 340
....*....|....*....|
gi 1720409699 830 SSAQPHpQPSLHSQGPPGPH 849
Cdd:PTZ00449 792 AMKRPD-SPSEHEDKPPGDH 810
|
|
| PRK14949 |
PRK14949 |
DNA polymerase III subunits gamma and tau; Provisional |
561-795 |
1.44e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237863 [Multi-domain] Cd Length: 944 Bit Score: 46.64 E-value: 1.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 561 EEDDGLSGKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRA--SPINEDIRS---SGRNS----PSAASTSSNDSKaetv 631
Cdd:PRK14949 562 ESYNALSDDEQHSANVQSAQSAAEAQPSSQSLSPISAVTTAaaSLADDDILDavlAARDSllsdLDALSPKEGDGK---- 637
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 632 kKSAKKVKEEAAsPLKSTKRQREKVASDTEdtdriTSKKTKTQEISRPNSPSEGEGESSDSRSVNdEGSSDPKDIDQDNR 711
Cdd:PRK14949 638 -KSSADRKPKTP-PSRAPPASLSKPASSPD-----ASQTSASFDLDPDFELATHQSVPEAALASG-SAPAPPPVPDPYDR 709
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 712 stsPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQ 791
Cdd:PRK14949 710 ---PPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQDTELNLV 786
|
....
gi 1720409699 792 STVA 795
Cdd:PRK14949 787 LLSS 790
|
|
| Myb_DNA-binding |
pfam00249 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
396-439 |
1.92e-04 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 459731 [Multi-domain] Cd Length: 46 Bit Score: 40.56 E-value: 1.92e-04
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 1720409699 396 WTEDEVKRFVKGLRQYGKNFFRIrKELLPSKETGELITFYYYWK 439
Cdd:pfam00249 4 WTPEEDELLLEAVEKLGNRWKKI-AKLLPGRTDNQCKNRWQNYL 46
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
580-731 |
2.45e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 46.06 E-value: 2.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 580 SGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASD 659
Cdd:NF033609 628 SDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 707
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720409699 660 TE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609 708 SDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
737-857 |
2.66e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 45.48 E-value: 2.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 737 QAQPPALQAPSGAASAPSTAPPGTPQLPTQGPT---PSATAVPPQGSPATSQPPNqtQSTVAPAAHTHIQQAPTLHPPRL 813
Cdd:PRK14951 387 AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAaapPAPVAAPAAAAPAAAPAAA--PAAVALAPAPPAQAAPETVAIPV 464
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1720409699 814 PSPHPPLQPMTAPPSqssaQPHPQPSLHSQGPPGPHSLQTGPLL 857
Cdd:PRK14951 465 RVAPEPAVASAAPAP----AAAPAAARLTPTEEGDVWHATVQQL 504
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
602-731 |
3.47e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 45.29 E-value: 3.47e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 602 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTeDTDRITSKKTKTQEISRPNS 681
Cdd:NF033609 674 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDS 752
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 1720409699 682 PSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 731
Cdd:NF033609 753 DSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
711-882 |
4.82e-04 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 44.82 E-value: 4.82e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 711 RSTSPSIPSPQDNESDSDSSAQQQMLQAQP--PALQAPSGAASAPSTAPPGTPQLPTQGPTPSAtavpPQGSPATSQPPn 788
Cdd:PRK14086 115 RRPYEGYGGPRADDRPPGLPRQDQLPTARPayPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRA----PYASPASYAPE- 189
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 789 qtQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQpmtaPPSQSSAQPHPQPS---LHSQGPPGPHSLQTGPLLQHPGPPQP 865
Cdd:PRK14086 190 --QERDREPYDAGRPEYDQRRRDYDHPRPDWDR----PRRDRTDRPEPPPGaghVHRGGPGPPERDDAPVVPIRPSAPGP 263
|
170
....*....|....*..
gi 1720409699 866 FglPSQPSQGQGPLGPS 882
Cdd:PRK14086 264 L--AAQPAPAPGPGEPT 278
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
681-897 |
5.06e-04 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 44.16 E-value: 5.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 681 SPSEGEGESSDSRSVNDEGSSDpkdiDQDNRSTspsiPSP-QDNESDSDSSAQQQMlqAQPPALQAPSGAASAPstAPPG 759
Cdd:PRK10905 24 STSSSDQTASGEKSIDLAGNAT----DQANGVQ----PAPgTTSAEQTAGNTQQDV--SLPPISSTPTQGQTPV--ATDG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 760 TPQLPTQG------------------------PTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPS 815
Cdd:PRK10905 92 QQRVEVQGdlnnaltqpqnqqqlnnvavnstlPTEPATVAPVRNGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQAT 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 816 PHPPLQPMTAPP--SQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQ 893
Cdd:PRK10905 172 AKTEPKPVAQTPkrTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYTLQ 251
|
....
gi 1720409699 894 LPAS 897
Cdd:PRK10905 252 LSSS 255
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
727-1015 |
5.75e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 44.64 E-value: 5.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 727 SDSSAQQQML-QAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATA------------------------VPPQGSP 781
Cdd:pfam09770 92 SDAIEEEQVRfNRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVrtgyekykepepipdlqvdaslwgVAPKKAA 171
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 782 ATSQPPnqtqsTVAPAAHTHIQQAPTLHPPRLPSPHPPLQpmTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPG 861
Cdd:pfam09770 172 APAPAP-----QPAAQPASLPAPSRKMMSLEEVEAAMRAQ--AKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 862 PPQPFGLPSQPSQGQGplgpspaaaHPHSTIQLPASQSAlqpqqppreqplppaplamphikPPPTTPIPQLPAPQAHKH 941
Cdd:pfam09770 245 QPQQQPQQPQQHPGQG---------HPVTILQRPQSPQP-----------------------DPAQPSIQPQAQQFHQQP 292
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720409699 942 PPHLSGPSPFSLNANLPPPPALKPLSslsthhppsahppplQLMPQSQPLPSSPAQPPGltQSQSLPPPAASHP 1015
Cdd:pfam09770 293 PPVPVQPTQILQNPNRLSAARVGYPQ---------------NPQPGVQPAPAHQAHRQQ--GSFGRQAPIITHP 349
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
715-878 |
6.33e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 44.47 E-value: 6.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 715 PSIPSPQDNESD----SDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQgspATSQPPNQT 790
Cdd:PRK07994 361 PAAPLPEPEVPPqsaaPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQ---LQRAQGATK 437
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 791 QSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPS 870
Cdd:PRK07994 438 AKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPELAAKLA 517
|
....*...
gi 1720409699 871 QPSQGQGP 878
Cdd:PRK07994 518 AEAIERDP 525
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
693-805 |
9.27e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 44.00 E-value: 9.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 693 RSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAlqAPSGAASAPSTAPPGTPQLPTQGPTPSA 772
Cdd:PRK14971 363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPS--APQSATQPAGTPPTVSVDPPAAVPVNPP 440
|
90 100 110
....*....|....*....|....*....|....*..
gi 1720409699 773 TAVPPQGSPATSQPPNQ----TQSTVAPAAHTHIQQA 805
Cdd:PRK14971 441 STAPQAVRPAQFKEEKKipvsKVSSLGPSTLRPIQEK 477
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
703-871 |
9.82e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 43.90 E-value: 9.82e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 703 PKDIDQDNRSTSPSIPSPQDNESDSDSSAQQqMLQAQPPALQAPSGAAS--APSTAPPGTPQLPTQGPTPsatAVPPQGS 780
Cdd:PHA03378 655 PQVEITPYKPTWTQIGHIPYQPSPTGANTML-PIQWAPGTMQPPPRAPTpmRPPAAPPGRAQRPAAATGR---ARPPAAA 730
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 781 PATSQPPNQTQSTVAP--AAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQ 858
Cdd:PHA03378 731 PGRARPPAAAPGRARPpaAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLM 810
|
170
....*....|...
gi 1720409699 859 HPGPPQPFGLPSQ 871
Cdd:PHA03378 811 PRAAPGQQGPTKQ 823
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
741-1086 |
1.01e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.16 E-value: 1.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 741 PALQAPSGAASaPStAPPGTPQlPTQGPTPSATAVPPQGSPATSQPPNQTQ-STVAPAAHTHIQQAPTLHPPRLPSPHPP 819
Cdd:PHA03247 2478 PVYRRPAEARF-PF-AAGAAPD-PGGGGPPDPDAPPAPSRLAPAILPDEPVgEPVHPRMLTWIRGLEELASDDAGDPPPP 2554
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 820 LQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAH--------PHST 891
Cdd:PHA03247 2555 LPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHapdppppsPSPA 2634
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 892 IQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGP-SPFSLNANLPPPPALKPLSSLS 970
Cdd:PHA03247 2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTvGSLTSLADPPPPPPTPEPAPHA 2714
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 971 THHPPSAHPPPLQLMPQSQPLPSSPAQP--------PGLTQSQSLPPPAASHPTTGLHQVPSQSPFPQHPfvpGGPPPIT 1042
Cdd:PHA03247 2715 LVSATPLPPGPAAARQASPALPAAPAPPavpagpatPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT---RPAVASL 2791
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 1720409699 1043 PPSCPPTSTPPAGPSSSSQPPCSAAVSSGGSVPGAPSCPLPAVQ 1086
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ 2835
|
|
| PRK11901 |
PRK11901 |
hypothetical protein; Reviewed |
700-899 |
1.20e-03 |
|
hypothetical protein; Reviewed
Pssm-ID: 237015 [Multi-domain] Cd Length: 327 Bit Score: 42.75 E-value: 1.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 700 SSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQ---LPTQGPTPSATAVP 776
Cdd:PRK11901 55 GSALKSPTEHESQQSSNNAGAEKNIDLSGSSSLSSGNQSSPSAANNTSDGHDASGVKNTAPPQdisAPPISPTPTQAAPP 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 777 PQgsPATSQ----PPN------QTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSlhSQGPP 846
Cdd:PRK11901 135 QT--PNGQQrielPGNisdalsQQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPATAETHPTPPQKPAT--KKPAV 210
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1720409699 847 GPHSlqTGPLLQHPGPpqpfglPSQPSQGQGPLGPSPAAAHPHSTIQLP-ASQS 899
Cdd:PRK11901 211 NHHK--TATVAVPPAT------SGKPKSGAASARALSSAPASHYTLQLSsASRS 256
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
713-895 |
1.29e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 43.33 E-value: 1.29e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 713 TSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQ----GSPATSQPPN 788
Cdd:PRK12323 400 AAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAaagpRPVAAAAAAA 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 789 QTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQT---GPLLQHPGPPQP 865
Cdd:PRK12323 480 PARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPApaaAPAPRAAAATEP 559
|
170 180 190
....*....|....*....|....*....|
gi 1720409699 866 FGLPSQPSQGQGPLGPSPAAAHPHSTIQLP 895
Cdd:PRK12323 560 VVAPRPPRASASGLPDMFDGDWPALAARLP 589
|
|
| PRK13042 |
PRK13042 |
superantigen-like protein SSL4; Reviewed; |
712-789 |
1.41e-03 |
|
superantigen-like protein SSL4; Reviewed;
Pssm-ID: 183854 [Multi-domain] Cd Length: 291 Bit Score: 42.70 E-value: 1.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 712 STSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGT----PQLPTQGPTPSATAVPPQGSPATSQPP 787
Cdd:PRK13042 17 TTGVITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTkveaPQQTPNATTPSSTKVETPQSPTTKQVP 96
|
..
gi 1720409699 788 NQ 789
Cdd:PRK13042 97 TE 98
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
735-865 |
1.45e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 43.16 E-value: 1.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 735 MLQAQPP-ALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqQAPTLHPPRL 813
Cdd:PRK14951 361 LLAFKPAaAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP-----VAAPAAAAPA 435
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1720409699 814 PSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLlqHPGPPQP 865
Cdd:PRK14951 436 AAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPA--PAAAPAA 485
|
|
| PRK14949 |
PRK14949 |
DNA polymerase III subunits gamma and tau; Provisional |
607-901 |
1.95e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237863 [Multi-domain] Cd Length: 944 Bit Score: 42.79 E-value: 1.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 607 DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREK-VASDTEDTDRITSKKTKTQEISRPNSPSEG 685
Cdd:PRK14949 482 NSAVPEQIDSTAEQSVVNPSVTDTQVDDTSASNNSAADNTVDDNYSAEDtLESNGLDEGDYAQDSAPLDAYQDDYVAFSS 561
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 686 EGESSDSRSVNDEGSSDPKdidQDNRSTSPSIPSPQDNE---SDSDSSAQQQMLQA----------QPPALQAPSGAASA 752
Cdd:PRK14949 562 ESYNALSDDEQHSANVQSA---QSAAEAQPSSQSLSPISavtTAAASLADDDILDAvlaardsllsDLDALSPKEGDGKK 638
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 753 PSTA--PPGTPQLPTQGPTPSATAVPP--QGSPATSQPPNQTQSTV--APAAHTHIQQAPTLHPPRLPSPHPplqPMTAP 826
Cdd:PRK14949 639 SSADrkPKTPPSRAPPASLSKPASSPDasQTSASFDLDPDFELATHqsVPEAALASGSAPAPPPVPDPYDRP---PWEEA 715
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720409699 827 PSQSSAQPHPQPSLHSQGPPGPHSLQTGPLlqHPGPPQPFGLPSQPSQGQGPlGPSPAAAHPHSTIQLPASQSAL 901
Cdd:PRK14949 716 PEVASANDGPNNAAEGNLSESVEDASNSEL--QAVEQQATHQPQVQAEAQSP-ASTTALTQTSSEVQDTELNLVL 787
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
763-900 |
2.16e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 42.67 E-value: 2.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 763 LPTQGPTPSATAVPPQGSPAtSQPPNQTQSTVAPAAhthiqqaptlhpprlpsphpplqpMTAPPSQSSAQPHPQPSLHS 842
Cdd:PRK07764 385 LGVAGGAGAPAAAAPSAAAA-APAAAPAPAAAAPAA------------------------AAAPAPAAAPQPAPAPAPAP 439
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720409699 843 QGPPGPHSLQTGPLLQHP----GPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQSA 900
Cdd:PRK07764 440 APPSPAGNAPAGGAPSPPpaaaPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPA 501
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
738-827 |
2.21e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 42.49 E-value: 2.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 738 AQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSAtavPPQGSPATsqPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPH 817
Cdd:PRK14950 366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK---EPVRETAT--PPPVPPRPVAPPVPHTPESAPKLTRAAIPVDE 440
|
90
....*....|
gi 1720409699 818 PPLQPMTAPP 827
Cdd:PRK14950 441 KPKYTPPAPP 450
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
747-862 |
2.45e-03 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 42.41 E-value: 2.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 747 SGAASAPSTAPpgTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTlhpprlPSPHPPLQPMTAP 826
Cdd:PHA03269 17 LIIANLNTNIP--IPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPT------PAASEKFDPAPAP 88
|
90 100 110
....*....|....*....|....*....|....*...
gi 1720409699 827 PSQSSAQPHPQ--PSLHSQGPPGPHSLQTGPLLQHPGP 862
Cdd:PHA03269 89 HQAASRAPDPAvaPQLAAAPKPDAAEAFTSAAQAHEAP 126
|
|
| PRK08581 |
PRK08581 |
amidase domain-containing protein; |
595-857 |
3.33e-03 |
|
amidase domain-containing protein;
Pssm-ID: 236304 [Multi-domain] Cd Length: 619 Bit Score: 42.08 E-value: 3.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 595 TSPDGRASPINEDIRSSGRNSPSAASTSsNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITS---KKT 671
Cdd:PRK08581 21 TSPTAYADDPQKDSTAKTTSHDSKKSND-DETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDfiyKNL 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 672 KTQEISRPNSPSEGEGESSDSRSVNDEGSSDpKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAL----QAPS 747
Cdd:PRK08581 100 PQTNINQLLTKNKYDDNYSLTTLIQNLFNLN-SDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKadnqKAPS 178
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 748 GAASAPST-------APPGTPQLPTQGPTPSATAVPPQGS--------------------PATSQPPNQTQSTVAPAAHT 800
Cdd:PRK08581 179 SNNTKPSTsnkqpnsPKPTQPNQSNSQPASDDTANQKSSSkdnqsmsdsaldsildqyseDAKKTQKDYASQSKKDKTET 258
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 1720409699 801 HIQQAPTLHPPRLPSPhpplqpmTAPPSQSSAQPHPQPSLHSQgppgpHSLQTGPLL 857
Cdd:PRK08581 259 SNTKNPQLPTQDELKH-------KSKPAQSFENDVNQSNTRST-----SLFETGPSL 303
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
739-827 |
4.59e-03 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 41.80 E-value: 4.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 739 QPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPqGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHP 818
Cdd:PRK12270 37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPP-AAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
|
....*....
gi 1720409699 819 PLQPMTAPP 827
Cdd:PRK12270 116 EVTPLRGAA 124
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
770-890 |
4.76e-03 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 41.64 E-value: 4.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 770 PSATAVPPQGSPATSQPPNQtqstvAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPH 849
Cdd:PHA03269 23 NTNIPIPELHTSAATQKPDP-----APAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPD 97
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1720409699 850 SLQTGPLLQHPGPPQPFGLPSQPSQGQGPL---------GPSPAAAHPHS 890
Cdd:PHA03269 98 PAVAPQLAAAPKPDAAEAFTSAAQAHEAPAdagtsaaskKPDPAAHTQHS 147
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
596-839 |
4.96e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.48 E-value: 4.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 596 SPDGRASPINEDIRSSGRNSPSAASTSSNDSKaeTVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITSKKTKTQE 675
Cdd:pfam17823 22 PADPRHFVLNKMWNGAGKQNASGDAVPRADNK--SSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSE 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 676 isrpNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSpqdnESDSDSSAQQqmlqAQPPALQAPSGAASAPST 755
Cdd:pfam17823 100 ----PATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPS----EAFSAPRAAA----CRANASAAPRAAIAAASA 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 756 APPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPH 835
Cdd:pfam17823 168 PHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGT 247
|
....
gi 1720409699 836 PQPS 839
Cdd:pfam17823 248 VTPA 251
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
762-1033 |
4.99e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 41.56 E-value: 4.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 762 QLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPA-----AHTHIQQAPTLhpprlpsphpplQPM-----TAPPSQSS 831
Cdd:pfam09770 105 QQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVrtgyeKYKEPEPIPDL------------QVDaslwgVAPKKAAA 172
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 832 AQPHPQPSLHSQGPPGPH----SLQ----------TGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPAS 897
Cdd:pfam09770 173 PAPAPQPAAQPASLPAPSrkmmSLEeveaamraqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQ 252
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 898 QsalqpqqppreqplppaplamphikpppttpipqlPAPQAHKHPPHLsgpspfslnanlppppalkplsslsthhppsa 977
Cdd:pfam09770 253 P-----------------------------------QQHPGQGHPVTI-------------------------------- 265
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 978 hpppLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLH----QVPSQSPFPQHPF 1033
Cdd:pfam09770 266 ----LQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQnpnrLSAARVGYPQNPQ 321
|
|
| rad23 |
TIGR00601 |
UV excision repair protein Rad23; All proteins in this family for which functions are known ... |
652-784 |
6.53e-03 |
|
UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273167 [Multi-domain] Cd Length: 378 Bit Score: 40.65 E-value: 6.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 652 QREKVASDTEDTDRITSKKTKTQ-EISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDN---------RSTSPSIPSPQ 721
Cdd:TIGR00601 9 QQQKFKIDMEPDETVKELKEKIEaEQGKDAYPVAQQKLIYSGKILSDDKTVKEYKIKEKDfvvvmvskpKTGTGKVAPPA 88
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720409699 722 DNESDSDSSAqqqmlqAQPPALQAPSGAASAPSTAPPGTP---QLPTQGPTPSATAVPPQGSPATS 784
Cdd:TIGR00601 89 ATPTSAPTPT------PSPPASPASGMSAAPASAVEEKSPseeSATATAPESPSTSVPSSGSDAAS 148
|
|
| PTZ00108 |
PTZ00108 |
DNA topoisomerase 2-like protein; Provisional |
536-710 |
6.55e-03 |
|
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain] Cd Length: 1388 Bit Score: 41.19 E-value: 6.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 536 FKKYGELPPIEKPVDPPPFMFKPVKEEDDglsgKHSMRTRRSRGSGQMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNS 615
Cdd:PTZ00108 1223 SDQEDDEEQKTKPKKSSVKRLKSKKNNSS----KSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNG 1298
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 616 PSAASTSSNDSKAETVKKSAKKVKEeaasPLKSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEgESSDSRSV 695
Cdd:PTZ00108 1299 GSKPSSPTKKKVKKRLEGSLAALKK----KKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSS-EDDDDSEV 1373
|
170
....*....|....*
gi 1720409699 696 NDEGSSDPKDIDQDN 710
Cdd:PTZ00108 1374 DDSEDEDDEDDEDDD 1388
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
729-886 |
7.26e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 40.99 E-value: 7.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 729 SSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHThiqqaptl 808
Cdd:PRK07003 413 KAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDA-------- 484
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720409699 809 hpprlpsphppLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAA 886
Cdd:PRK07003 485 -----------PPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAAA 551
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
726-831 |
7.59e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 40.91 E-value: 7.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 726 DSDSSAQQQMLQAQPPALQAPSgAASAPSTAPPGTPQLPTQGPTPSatavPPQGSPATSQPPNQTQSTVAPAAHTHIQQA 805
Cdd:PRK14971 366 GDDASGGRGPKQHIKPVFTQPA-AAPQPSAAAAASPSPSQSSAAAQ----PSAPQSATQPAGTPPTVSVDPPAAVPVNPP 440
|
90 100
....*....|....*....|....*.
gi 1720409699 806 PTLHPPRLPSPHPPLQPMtaPPSQSS 831
Cdd:PRK14971 441 STAPQAVRPAQFKEEKKI--PVSKVS 464
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
729-832 |
7.88e-03 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 40.39 E-value: 7.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 729 SSAQ--QQMLQAQPPALQAPSGAASAPSTAPPGTPQlPTQGPTPSATAVP-PQGSPATSQPPNQTQSTVAPAAHThiQQA 805
Cdd:PRK10856 150 SSAElsQNSGQSVPLDTSTTTDPATTPAPAAPVDTT-PTNSQTPAVATAPaPAVDPQQNAVVAPSQANVDTAATP--APA 226
|
90 100
....*....|....*....|....*..
gi 1720409699 806 PTLHPPRLPSPHPPLQPMTAPPSQSSA 832
Cdd:PRK10856 227 APATPDGAAPLPTDQAGVSTPAADPNA 253
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
746-891 |
8.13e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 40.62 E-value: 8.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 746 PSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSphpplqpmta 825
Cdd:PRK07994 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQ---------- 430
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720409699 826 PPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHST 891
Cdd:PRK07994 431 RAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVAT 496
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
718-898 |
8.41e-03 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 40.76 E-value: 8.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 718 PSPQDNESDSDSSAQQQML-QAQPPALQAPSGAAS-------APSTAPPGTPQLPTQGPTPSATAVPPQGSPA------- 782
Cdd:pfam09606 229 MNPQQMGGAPNQVAMQQQQpQQQGQQSQLGMGINQmqqmpqgVGGGAGQGGPGQPMGPPGQQPGAMPNVMSIGdqnnyqq 308
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720409699 783 -TSQPPNQTQSTVAPAAHTH-----IQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSL--QTG 854
Cdd:pfam09606 309 qQTRQQQQQQGGNHPAAHQQqmnqsVGQGGQVVALGGLNHLETWNPGNFGGLGANPMQRGQPGMMSSPSPVPGQQvrQVT 388
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1720409699 855 PLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQ 898
Cdd:pfam09606 389 PNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQ 432
|
|
|