|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
84-340 |
2.41e-29 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 118.98 E-value: 2.41e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 84 DVSGVLIAGGENGNIILYDPSkiiagDKEVVIAQKDkHTGPVRALDVnIFQTNLVASGANESEIYIWDLNNFATPMTPGA 163
Cdd:cd00200 19 PDGKLLATGSGDGTIKVWDLE-----TGELLRTLKG-HTGPVRDVAA-SADGTYLASGSSDKTIRLWDLETGECVRTLTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 164 KTQppeDISCIAWNRQvQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDvaTQMVLASEDDRLpvVQM 243
Cdd:cd00200 92 HTS---YVSSVAFSPD-GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IKL 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 244 WDLRfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPrNPAVLSAAS 323
Cdd:cd00200 162 WDLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGS 238
|
250
....*....|....*..
gi 564382896 324 FDGRIRVYSIMGGSIDG 340
Cdd:cd00200 239 EDGTIRVWDLRTGECVQ 255
|
|
| ACE1-Sec16-like super family |
cl14807 |
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ... |
534-728 |
5.56e-09 |
|
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat complex plays an important role in vesicular traffic of newly synthezised proteins from the endoplasmatic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladded beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein, all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site. The actual alignment was detected with superfamily member cd09233:
Pssm-ID: 449359 [Multi-domain] Cd Length: 314 Bit Score: 59.19 E-value: 5.56e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 534 ITRALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLAQTQKKyFAKSQSKIT---RLITAVVMKNWKEIVESC---- 605
Cdd:cd09233 69 FRNLLLTGNRKEALELAL-DNGLwAHALLLASSLGKETWAEVVSR-FARSESKLNdplQTLYQLFSGNSPEAITELadnp 146
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 606 -----DLKNWREALAAVLTYAKPD-EFSALCDLlgarleseGDSLLRTQ----ACLCYICAGnverlvacwtkAQDGSNP 675
Cdd:cd09233 147 aeaewALGNWREHLAIILSNRTSNlDLEALVEL--------GDLLAQRGlveaAHICYLLAG-----------VPLGPYP 207
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 676 LS-------LQDLIEKVVILR--KAVQLT------QALDTNTVG--ALLAEKMsQYANLLAAQGSIAAAL 728
Cdd:cd09233 208 SSpsscllgGAVHNKSPRTFAtpEAIQLTeiyeyaLSLGNPQFGlpHLQPYKL-IHAARLAELGLVSEAL 276
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
712-1049 |
1.04e-03 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 1.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 712 SQYANLLAAQGSIAAALAFLPDNTNQPDIVQLRDRlCRAQGRsvPGQESSrssyegqplPKGGPGPLAGHPQVSRVQSqq 791
Cdd:PHA03247 2632 SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-ARRLGR--AAQASS---------PPQRPRRRAARPTVGSLTS-- 2697
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 792 yYPQGGNPPPPGFIMHGNVVPNSPAPLPTSPGHMHSQPPPYPQPQPYQPAQQYSFGtGGSAVYRPQQPV-----APPASN 866
Cdd:PHA03247 2698 -LADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPG-GPARPARPPTTAgppapAPPAAP 2775
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 867 AYPNAPYVSPVASYSGQPQMYTAQPASSPTSSSAPLPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTPPAASELPAS- 945
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGs 2855
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 946 -------QRTGPQNGWNDPPALNRVPKKKKLPENFMPPVPITSPIMNPGgdpqPQGLQQQPSASGPRSSHASFPQPHLAG 1018
Cdd:PHA03247 2856 vapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQ----PERPPQPQAPPPPQPQPQPPPPPQPQP 2931
|
330 340 350
....*....|....*....|....*....|.
gi 564382896 1019 GQPfhgiQQPLAQTGMPPSfskPNTEGAPGA 1049
Cdd:PHA03247 2932 PPP----PPPRPQPPLAPT---TDPAGAGEP 2955
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
84-340 |
2.41e-29 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 118.98 E-value: 2.41e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 84 DVSGVLIAGGENGNIILYDPSkiiagDKEVVIAQKDkHTGPVRALDVnIFQTNLVASGANESEIYIWDLNNFATPMTPGA 163
Cdd:cd00200 19 PDGKLLATGSGDGTIKVWDLE-----TGELLRTLKG-HTGPVRDVAA-SADGTYLASGSSDKTIRLWDLETGECVRTLTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 164 KTQppeDISCIAWNRQvQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDvaTQMVLASEDDRLpvVQM 243
Cdd:cd00200 92 HTS---YVSSVAFSPD-GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IKL 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 244 WDLRfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPrNPAVLSAAS 323
Cdd:cd00200 162 WDLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGS 238
|
250
....*....|....*..
gi 564382896 324 FDGRIRVYSIMGGSIDG 340
Cdd:cd00200 239 EDGTIRVWDLRTGECVQ 255
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
89-333 |
2.80e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.92 E-value: 2.80e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 89 LIAGGENGNIILYDpskiIAGDKEvvIAQKDKHTGPVRALDVNiFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPp 168
Cdd:COG2319 177 LASGSDDGTVRLWD----LATGKL--LRTLTGHTGAVRSVAFS-PDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGS- 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 169 edISCIAWNRQVQHiLASASPSGRATVWDLRKNEPIIKVSDHSNRMHcsGLAWHPDvATQMVLASEDDRlpvVQMWDLRf 248
Cdd:COG2319 249 --VRSVAFSPDGRL-LASGSADGTVRLWDLATGELLRTLTGHSGGVN--SVAFSPD-GKLLASGSDDGT---VRLWDLA- 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 249 ASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaASFDGRI 328
Cdd:COG2319 319 TGKLLRTLTGHTGAVRSVAFS-PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLAS-GSADGTV 396
|
....*
gi 564382896 329 RVYSI 333
Cdd:COG2319 397 RLWDL 401
|
|
| ACE1-Sec16-like |
cd09233 |
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ... |
534-728 |
5.56e-09 |
|
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat complex plays an important role in vesicular traffic of newly synthezised proteins from the endoplasmatic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladded beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein, all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.
Pssm-ID: 187750 [Multi-domain] Cd Length: 314 Bit Score: 59.19 E-value: 5.56e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 534 ITRALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLAQTQKKyFAKSQSKIT---RLITAVVMKNWKEIVESC---- 605
Cdd:cd09233 69 FRNLLLTGNRKEALELAL-DNGLwAHALLLASSLGKETWAEVVSR-FARSESKLNdplQTLYQLFSGNSPEAITELadnp 146
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 606 -----DLKNWREALAAVLTYAKPD-EFSALCDLlgarleseGDSLLRTQ----ACLCYICAGnverlvacwtkAQDGSNP 675
Cdd:cd09233 147 aeaewALGNWREHLAIILSNRTSNlDLEALVEL--------GDLLAQRGlveaAHICYLLAG-----------VPLGPYP 207
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 676 LS-------LQDLIEKVVILR--KAVQLT------QALDTNTVG--ALLAEKMsQYANLLAAQGSIAAAL 728
Cdd:cd09233 208 SSpsscllgGAVHNKSPRTFAtpEAIQLTeiyeyaLSLGNPQFGlpHLQPYKL-IHAARLAELGLVSEAL 276
|
|
| Sec16_C |
pfam12931 |
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal ... |
534-728 |
8.36e-08 |
|
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal region is the part that binds to Sec23, a COPII vesicle coat protein. This association is part of the transport vesicle coat structure.
Pssm-ID: 432884 Cd Length: 279 Bit Score: 54.87 E-value: 8.36e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 534 ITRALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLAQTQKKY----FAKSQSKITRLItAVVMK----NWKEIVE- 603
Cdd:pfam12931 1 IRALLLTGDREKALWLAL-DKKLwAHALLIASTLGKEKWKEVVQEFvrseFKGSNNKSGESL-AALYQvfagNSEEAVDe 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 604 --------SCDLKNWREALAAVLTYAKPDEFSALCDlLGARLESEGdslLRTQACLCYICAgNVERLVACWTKAQDGSNP 675
Cdd:pfam12931 79 lvppsknaLWALDNWRETLALVLSNRSPGDVEALLA-LGDLLAQYG---RTEAAHICFLLA-GLPLSQTVLLGADHVRFP 153
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 564382896 676 LSLQDLIEkvvilrkAVQLTQ----ALDTNTVGA-------LLAEKMsQYANLLAAQGSIAAAL 728
Cdd:pfam12931 154 STFGNDLE-------SILLTEiyeyALSLSPPQPpfvglphLLPYKL-QHAAVLAEYGLVSEAQ 209
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
203-333 |
4.31e-04 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 44.69 E-value: 4.31e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 203 PIIKVSdhsNRMHCSGLAWHPDVATQMVLASEDDrlpVVQMWDLrfASSPLRV-LENHARGILAIAWSMADPELLLSCGK 281
Cdd:PLN00181 525 PVVELA---SRSKLSGICWNSYIKSQVASSNFEG---VVQVWDV--ARSQLVTeMKEHEKRVWSIDYSSADPTLLASGSD 596
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 564382896 282 DAKILCSNPNTGEVLYELPTNTQWCFdIQWCPRNPAVLSAASFDGRIRVYSI 333
Cdd:PLN00181 597 DGSVKLWSINQGVSIGTIKTKANICC-VQFPSESGRSLAFGSADHKVYYYDL 647
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
712-1049 |
1.04e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 1.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 712 SQYANLLAAQGSIAAALAFLPDNTNQPDIVQLRDRlCRAQGRsvPGQESSrssyegqplPKGGPGPLAGHPQVSRVQSqq 791
Cdd:PHA03247 2632 SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-ARRLGR--AAQASS---------PPQRPRRRAARPTVGSLTS-- 2697
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 792 yYPQGGNPPPPGFIMHGNVVPNSPAPLPTSPGHMHSQPPPYPQPQPYQPAQQYSFGtGGSAVYRPQQPV-----APPASN 866
Cdd:PHA03247 2698 -LADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPG-GPARPARPPTTAgppapAPPAAP 2775
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 867 AYPNAPYVSPVASYSGQPQMYTAQPASSPTSSSAPLPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTPPAASELPAS- 945
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGs 2855
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 946 -------QRTGPQNGWNDPPALNRVPKKKKLPENFMPPVPITSPIMNPGgdpqPQGLQQQPSASGPRSSHASFPQPHLAG 1018
Cdd:PHA03247 2856 vapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQ----PERPPQPQAPPPPQPQPQPPPPPQPQP 2931
|
330 340 350
....*....|....*....|....*....|.
gi 564382896 1019 GQPfhgiQQPLAQTGMPPSfskPNTEGAPGA 1049
Cdd:PHA03247 2932 PPP----PPPRPQPPLAPT---TDPAGAGEP 2955
|
|
| DUF4813 |
pfam16072 |
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. ... |
848-944 |
5.62e-03 |
|
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 345 and 672 amino acids in length.
Pssm-ID: 435117 [Multi-domain] Cd Length: 288 Bit Score: 40.13 E-value: 5.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 848 TGGSAVYRPQQPVAPPASNAYPNAPYVSPVASYSGQPQMYTAQPASSPTSSSAPLPPPPssgasfqhGGPGAPPSSSAYA 927
Cdd:pfam16072 163 AGGQQPAAPAAPAYPVAPAAYPAQAPAAAPAPAPGAPQTPLAPLNPVAAAPAAAAGAAA--------APVVAAAAPAAAA 234
|
90
....*....|....*..
gi 564382896 928 LPPGTTGTPPAASELPA 944
Cdd:pfam16072 235 PPPPAPAAPPADAAPPA 251
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
292-332 |
7.23e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 35.37 E-value: 7.23e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 564382896 292 TGEVLYELPTNTQWCFDIQWCPRNPAVLSAaSFDGRIRVYS 332
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASG-SDDGTIKLWD 40
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
84-340 |
2.41e-29 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 118.98 E-value: 2.41e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 84 DVSGVLIAGGENGNIILYDPSkiiagDKEVVIAQKDkHTGPVRALDVnIFQTNLVASGANESEIYIWDLNNFATPMTPGA 163
Cdd:cd00200 19 PDGKLLATGSGDGTIKVWDLE-----TGELLRTLKG-HTGPVRDVAA-SADGTYLASGSSDKTIRLWDLETGECVRTLTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 164 KTQppeDISCIAWNRQvQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDvaTQMVLASEDDRLpvVQM 243
Cdd:cd00200 92 HTS---YVSSVAFSPD-GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IKL 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 244 WDLRfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPrNPAVLSAAS 323
Cdd:cd00200 162 WDLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGS 238
|
250
....*....|....*..
gi 564382896 324 FDGRIRVYSIMGGSIDG 340
Cdd:cd00200 239 EDGTIRVWDLRTGECVQ 255
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
13-332 |
1.99e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 107.81 E-value: 1.99e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 13 AWSPAQNhpiYLATGtsaqqldatfSTNASLEIFELD-------LSDPSLDMKSCATFSSSHRyhkliwgphkmdskgdv 85
Cdd:cd00200 16 AFSPDGK---LLATG----------SGDGTIKVWDLEtgellrtLKGHTGPVRDVAASADGTY----------------- 65
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 86 sgvLIAGGENGNIILYDPSKiiagdKEVVIAQKDkHTGPVRALDvniFQTN--LVASGANESEIYIWDLNNFATPMTPGA 163
Cdd:cd00200 66 ---LASGSSDKTIRLWDLET-----GECVRTLTG-HTSYVSSVA---FSPDgrILSSSSRDKTIKVWDVETGKCLTTLRG 133
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 164 KTQPpedISCIAWNrQVQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDvATQMVLASEDDrlpVVQM 243
Cdd:cd00200 134 HTDW---VNSVAFS-PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS--VAFSPD-GEKLLSSSSDG---TIKL 203
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 244 WDLRfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaAS 323
Cdd:cd00200 204 WDLS-TGKCLGTLRGHENGVNSVAFS-PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLAS-GS 280
|
....*....
gi 564382896 324 FDGRIRVYS 332
Cdd:cd00200 281 ADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
89-333 |
2.80e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.92 E-value: 2.80e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 89 LIAGGENGNIILYDpskiIAGDKEvvIAQKDKHTGPVRALDVNiFQTNLVASGANESEIYIWDLNNFATPMTPGAKTQPp 168
Cdd:COG2319 177 LASGSDDGTVRLWD----LATGKL--LRTLTGHTGAVRSVAFS-PDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGS- 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 169 edISCIAWNRQVQHiLASASPSGRATVWDLRKNEPIIKVSDHSNRMHcsGLAWHPDvATQMVLASEDDRlpvVQMWDLRf 248
Cdd:COG2319 249 --VRSVAFSPDGRL-LASGSADGTVRLWDLATGELLRTLTGHSGGVN--SVAFSPD-GKLLASGSDDGT---VRLWDLA- 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 249 ASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaASFDGRI 328
Cdd:COG2319 319 TGKLLRTLTGHTGAVRSVAFS-PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLAS-GSADGTV 396
|
....*
gi 564382896 329 RVYSI 333
Cdd:COG2319 397 RLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
89-333 |
1.16e-22 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 101.91 E-value: 1.16e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 89 LIAGGENGNIILYDpskiIAGDKEvvIAQKDKHTGPVRALDvniFQTN--LVASGANESEIYIWDLNNFATPMTPGAKTQ 166
Cdd:COG2319 135 LASGSADGTVRLWD----LATGKL--LRTLTGHSGAVTSVA---FSPDgkLLASGSDDGTVRLWDLATGKLLRTLTGHTG 205
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 167 PpedISCIAWNRQvQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMHCsgLAWHPDvATQMVLASEDDRlpvVQMWDL 246
Cdd:COG2319 206 A---VRSVAFSPD-GKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRS--VAFSPD-GRLLASGSADGT---VRLWDL 275
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 247 RfASSPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaASFDG 326
Cdd:COG2319 276 A-TGELLRTLTGHSGGVNSVAFS-PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLAS-GSDDG 352
|
....*..
gi 564382896 327 RIRVYSI 333
Cdd:COG2319 353 TVRLWDL 359
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
121-338 |
2.04e-20 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 95.36 E-value: 2.04e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 121 HTGPVRALDVNiFQTNLVASGANESEIYIWDLnnfATPMTPGAKTQPPEDISCIAWNRQvQHILASASPSGRATVWDLRK 200
Cdd:COG2319 77 HTAAVLSVAFS-PDGRLLASASADGTVRLWDL---ATGLLLRTLTGHTGAVRSVAFSPD-GKTLASGSADGTVRLWDLAT 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 201 NEPIIKVSDHSNRMHCsgLAWHPDvATQMVLASEDDRlpvVQMWDLRfASSPLRVLENHARGILAIAWSmADPELLLSCG 280
Cdd:COG2319 152 GKLLRTLTGHSGAVTS--VAFSPD-GKLLASGSDDGT---VRLWDLA-TGKLLRTLTGHTGAVRSVAFS-PDGKLLASGS 223
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 564382896 281 KDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPrNPAVLSAASFDGRIRVYSIMGGSI 338
Cdd:COG2319 224 ADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGEL 280
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
171-337 |
3.31e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 83.54 E-value: 3.31e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 171 ISCIAWNRQvQHILASASPSGRATVWDLRKNEPIIKVSDHSNRMhcSGLAWHPDvATQMVLASEDDrlpVVQMWDLRfAS 250
Cdd:cd00200 12 VTCVAFSPD-GKLLATGSGDGTIKVWDLETGELLRTLKGHTGPV--RDVAASAD-GTYLASGSSDK---TIRLWDLE-TG 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 251 SPLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPrNPAVLSAASFDGRIRV 330
Cdd:cd00200 84 ECVRTLTGHTSYVSSVAFS-PDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSP-DGTFVASSSQDGTIKL 161
|
....*..
gi 564382896 331 YSIMGGS 337
Cdd:cd00200 162 WDLRTGK 168
|
|
| ACE1-Sec16-like |
cd09233 |
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ... |
534-728 |
5.56e-09 |
|
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat complex plays an important role in vesicular traffic of newly synthezised proteins from the endoplasmatic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladded beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein, all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.
Pssm-ID: 187750 [Multi-domain] Cd Length: 314 Bit Score: 59.19 E-value: 5.56e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 534 ITRALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLAQTQKKyFAKSQSKIT---RLITAVVMKNWKEIVESC---- 605
Cdd:cd09233 69 FRNLLLTGNRKEALELAL-DNGLwAHALLLASSLGKETWAEVVSR-FARSESKLNdplQTLYQLFSGNSPEAITELadnp 146
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 606 -----DLKNWREALAAVLTYAKPD-EFSALCDLlgarleseGDSLLRTQ----ACLCYICAGnverlvacwtkAQDGSNP 675
Cdd:cd09233 147 aeaewALGNWREHLAIILSNRTSNlDLEALVEL--------GDLLAQRGlveaAHICYLLAG-----------VPLGPYP 207
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 676 LS-------LQDLIEKVVILR--KAVQLT------QALDTNTVG--ALLAEKMsQYANLLAAQGSIAAAL 728
Cdd:cd09233 208 SSpsscllgGAVHNKSPRTFAtpEAIQLTeiyeyaLSLGNPQFGlpHLQPYKL-IHAARLAELGLVSEAL 276
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
89-247 |
5.98e-09 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 59.54 E-value: 5.98e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 89 LIAGGENGNIILYDpskiIAGDKEVVIaqKDKHTGPVRALDVNiFQTNLVASGANESEIYIWDLNNFATPMTPGAKTqpp 168
Cdd:COG2319 261 LASGSADGTVRLWD----LATGELLRT--LTGHSGGVNSVAFS-PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHT--- 330
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 169 EDISCIAWNRQVQhILASASPSGRATVWDLRKNEPIIKVSDHSNRMHcsGLAWHPD---VATqmvlASEDDRlpvVQMWD 245
Cdd:COG2319 331 GAVRSVAFSPDGK-TLASGSDDGTVRLWDLATGELLRTLTGHTGAVT--SVAFSPDgrtLAS----GSADGT---VRLWD 400
|
..
gi 564382896 246 LR 247
Cdd:COG2319 401 LA 402
|
|
| Sec16_C |
pfam12931 |
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal ... |
534-728 |
8.36e-08 |
|
Sec23-binding domain of Sec16; Sec16 is a multi-domain vesicle coat protein. The C-terminal region is the part that binds to Sec23, a COPII vesicle coat protein. This association is part of the transport vesicle coat structure.
Pssm-ID: 432884 Cd Length: 279 Bit Score: 54.87 E-value: 8.36e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 534 ITRALLTGNFESAVDLCLhDNRM-ADAIILAIAGGQELLAQTQKKY----FAKSQSKITRLItAVVMK----NWKEIVE- 603
Cdd:pfam12931 1 IRALLLTGDREKALWLAL-DKKLwAHALLIASTLGKEKWKEVVQEFvrseFKGSNNKSGESL-AALYQvfagNSEEAVDe 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 604 --------SCDLKNWREALAAVLTYAKPDEFSALCDlLGARLESEGdslLRTQACLCYICAgNVERLVACWTKAQDGSNP 675
Cdd:pfam12931 79 lvppsknaLWALDNWRETLALVLSNRSPGDVEALLA-LGDLLAQYG---RTEAAHICFLLA-GLPLSQTVLLGADHVRFP 153
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 564382896 676 LSLQDLIEkvvilrkAVQLTQ----ALDTNTVGA-------LLAEKMsQYANLLAAQGSIAAAL 728
Cdd:pfam12931 154 STFGNDLE-------SILLTEiyeyALSLSPPQPpfvglphLLPYKL-QHAAVLAEYGLVSEAQ 209
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
252-336 |
7.67e-07 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 52.34 E-value: 7.67e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 252 PLRVLENHARGILAIAWSmADPELLLSCGKDAKILCSNPNTGEVLYELPTNTQWCFDIQWCPRNPAVLSaASFDGRIRVY 331
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFS-PDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLW 78
|
....*
gi 564382896 332 SIMGG 336
Cdd:cd00200 79 DLETG 83
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
203-333 |
4.31e-04 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 44.69 E-value: 4.31e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 203 PIIKVSdhsNRMHCSGLAWHPDVATQMVLASEDDrlpVVQMWDLrfASSPLRV-LENHARGILAIAWSMADPELLLSCGK 281
Cdd:PLN00181 525 PVVELA---SRSKLSGICWNSYIKSQVASSNFEG---VVQVWDV--ARSQLVTeMKEHEKRVWSIDYSSADPTLLASGSD 596
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 564382896 282 DAKILCSNPNTGEVLYELPTNTQWCFdIQWCPRNPAVLSAASFDGRIRVYSI 333
Cdd:PLN00181 597 DGSVKLWSINQGVSIGTIKTKANICC-VQFPSESGRSLAFGSADHKVYYYDL 647
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
712-1049 |
1.04e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 1.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 712 SQYANLLAAQGSIAAALAFLPDNTNQPDIVQLRDRlCRAQGRsvPGQESSrssyegqplPKGGPGPLAGHPQVSRVQSqq 791
Cdd:PHA03247 2632 SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-ARRLGR--AAQASS---------PPQRPRRRAARPTVGSLTS-- 2697
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 792 yYPQGGNPPPPGFIMHGNVVPNSPAPLPTSPGHMHSQPPPYPQPQPYQPAQQYSFGtGGSAVYRPQQPV-----APPASN 866
Cdd:PHA03247 2698 -LADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPG-GPARPARPPTTAgppapAPPAAP 2775
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 867 AYPNAPYVSPVASYSGQPQMYTAQPASSPTSSSAPLPPPPSSGASFQHGGPGAPPSSSAYALPPGTTGTPPAASELPAS- 945
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGs 2855
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 946 -------QRTGPQNGWNDPPALNRVPKKKKLPENFMPPVPITSPIMNPGgdpqPQGLQQQPSASGPRSSHASFPQPHLAG 1018
Cdd:PHA03247 2856 vapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQ----PERPPQPQAPPPPQPQPQPPPPPQPQP 2931
|
330 340 350
....*....|....*....|....*....|.
gi 564382896 1019 GQPfhgiQQPLAQTGMPPSfskPNTEGAPGA 1049
Cdd:PHA03247 2932 PPP----PPPRPQPPLAPT---TDPAGAGEP 2955
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
853-1043 |
4.11e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.40 E-value: 4.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 853 VYRPQQPVAPPASNAYPNAPYVSPVASYSGQPQMYTAQPASSPTSSSAPLPPPPSSGASFQHGGPGAPPSSSAYALPPGT 932
Cdd:PRK12323 382 VAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAA 461
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 933 TGTPPAASELPASQRTGPQNGWNDP----PALNRVPKKKKLPENFMPPVPITSPIMNPGGDPQPQGLQQQPSASGPRSSH 1008
Cdd:PRK12323 462 ARPAAAGPRPVAAAAAAAPARAAPAaapaPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETL 541
|
170 180 190
....*....|....*....|....*....|....*
gi 564382896 1009 ASFPQPHLAggqPFHGIQQPLAQTGMPPSFSKPNT 1043
Cdd:PRK12323 542 APAPAAAPA---PRAAAATEPVVAPRPPRASASGL 573
|
|
| DUF4813 |
pfam16072 |
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. ... |
848-944 |
5.62e-03 |
|
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 345 and 672 amino acids in length.
Pssm-ID: 435117 [Multi-domain] Cd Length: 288 Bit Score: 40.13 E-value: 5.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564382896 848 TGGSAVYRPQQPVAPPASNAYPNAPYVSPVASYSGQPQMYTAQPASSPTSSSAPLPPPPssgasfqhGGPGAPPSSSAYA 927
Cdd:pfam16072 163 AGGQQPAAPAAPAYPVAPAAYPAQAPAAAPAPAPGAPQTPLAPLNPVAAAPAAAAGAAA--------APVVAAAAPAAAA 234
|
90
....*....|....*..
gi 564382896 928 LPPGTTGTPPAASELPA 944
Cdd:pfam16072 235 PPPPAPAAPPADAAPPA 251
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
292-332 |
7.23e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 35.37 E-value: 7.23e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 564382896 292 TGEVLYELPTNTQWCFDIQWCPRNPAVLSAaSFDGRIRVYS 332
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASG-SDDGTIKLWD 40
|
|
|