| Name | Accession | Description | Interval | E-value |
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 87-334 | 1.55e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 116.67 E-value: 1.55e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 87 VIAGGGDsGMLTLYNVThilspGKEPLIAQKQkHTGAVRALDFNPFqGNLLASGASDSEIFIWDLN--HLTVPMTpGSKS 164
Cdd:cd00200 24 LATGSGD-GTIKVWDLE-----TGELLRTLKG-HTGPVRDVAASAD-GTYLASGSSDKTIRLWDLEtgECVRTLT-GHTS 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 165 qnppeDIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiaTQLVLCSEDDRLpvIQLW 244
Cdd:cd00200 95 -----YVSSVAFSPD-GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IKLW 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 245 DLRfASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPrNPPAFSAVSF 324
Cdd:cd00200 163 DLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGSE 239
|
250
....*....|
gi 1046842737 325 DGWISLYSVM 334
Cdd:cd00200 240 DGTIRVWDLR 249
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 10-333 | 9.37e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.24 E-value: 9.37e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 10 AVQAWSPAKQYPVYLATGTSAQQLDASFSTNATLEIFEVDFRDPSLDLKRKGILSVSSRFHklIWGSSSSGLLENTGVIA 89
Cdd:COG2319 17 ALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH--TAAVLSVAFSPDGRLLA 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 90 GGGDSGMLTLYNVTHilspgkEPLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQNPPE 169
Cdd:COG2319 95 SASADGTVRLWDLAT------GLLLRTLTGHTGAVRSVAFSP-DGKTLASGSADGTVRLWDLATGKLLRTL----TGHSG 163
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 170 DIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLRfA 249
Cdd:COG2319 164 AVTSVAFSPD-GKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS--VAFSPD-GKLLASGSADGT---VRLWDLA-T 235
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 250 SSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWIS 329
Cdd:COG2319 236 GKLLRTLTGHSGSVRSVAFS-PDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASG-SDDGTVR 313
|
....
gi 1046842737 330 LYSV 333
Cdd:COG2319 314 LWDL 317
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 798-1076 | 7.38e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 63.80 E-value: 7.38e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 798 PFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVMPFLPSHPIPSvGSWTQSSSDYRVPKPQATLPVHFV 877
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA-GPPAPAPPAAPAAGPPRRLTRPAV 2788
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 878 PGVRPAF-SQPQPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSThlpetprllplppvgppgptPLSSQPAAS 956
Cdd:PHA03247 2789 ASLSESReSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP--------------------PPPPGPPPP 2848
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 957 PVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPiitaplmslGPEPQQALLP 1036
Cdd:PHA03247 2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP---------QPQAPPPPQP 2919
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 1046842737 1037 QSlvsgaslPPPGAPREcslqQLQPLPPEKTQKELPPEHQ 1076
Cdd:PHA03247 2920 QP-------QPPPPPQP----QPPPPPPPRPQPPLAPTTD 2948
|
|
| PTZ00421 | PTZ00421 | coronin; Provisional | 114-312 | 7.33e-06 |
|
coronin; Provisional
Pssm-ID: 173611 [Multi-domain] Cd Length: 493 Bit Score: 49.89 E-value: 7.33e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 114 IAQKQKHTGAVRALDFNPFQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSqnppEDIKALSWNRQvQHILSSAHPSGKA 193
Cdd:PTZ00421 118 IVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHS----DQITSLEWNLD-GSLLCTTSKDKKL 192
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 194 VVWDLRKNEPIIKVSDHSSRMNCSGLaWNPDIATQLVLCSEDDRLPVIQLWDLRFASSPLKVLESHSRGILSVSWSQADA 273
Cdd:PTZ00421 193 NIIDPRDGTIVSSVEAHASAKSQRCL-WAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIPFFDEDT 271
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 1046842737 274 ELL-LSSAKDNQI-----------FCWNLSSSEVVYKLPTQSSWCFDVQWC 312
Cdd:PTZ00421 272 NLLyIGSKGEGNIrcfelmnerltFCSSYSSVEPHKGLCMMPKWSLDTRKC 322
|
|
| ACE1-Sec16-like | cd09233 | Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ... | 560-652 | 1.36e-05 |
|
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; the COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. In the same way Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.
Pssm-ID: 187750 [Multi-domain] Cd Length: 314 Bit Score: 48.41 E-value: 1.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 560 TTEDTDGLLSQA-------LLLGELRSAVELCLKEERFADAIILAQAGDAELLKWTQERYlAKRRTKTSSLLACVVK--- 629
Cdd:cd09233 54 KLVGTDIAEQKAlnrfrnlLLTGNRKEALELALDNGLWAHALLLASSLGKETWAEVVSRF-ARSESKLNDPLQTLYQlfs 132
|
90 100 110
....*....|....*....|....*....|..
gi 1046842737 630 KNWKDLVCACS---------LKNWREALALLL 652
Cdd:cd09233 133 GNSPEAITELAdnpaeaewaLGNWREHLAIIL 164
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 120-150 | 2.48e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 42.30 E-value: 2.48e-05
10 20 30
....*....|....*....|....*....|.
gi 1046842737 120 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 150
Cdd:smart00320 11 HTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 120-150 | 1.66e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.02 E-value: 1.66e-04
10 20 30
....*....|....*....|....*....|.
gi 1046842737 120 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 150
Cdd:pfam00400 10 HTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
|
|
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 810-1073 | 8.55e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.60 E-value: 8.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 810 PKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVmPFLPSHPIPSVGSWTQSSSDYRVP----KPQATLPVHFVPGVRPAF- 884
Cdd:pfam03154 172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPhtliQQTPTLHPQRLPSPHPPLq 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 885 --SQPQPFGGQSVQAINPVGFCGTWPlPGPTPVMAPPDVMQ-PGSTHLpetprlLPLPPVGPPGPTPLSSQPAASPVTFS 961
Cdd:pfam03154 251 pmTQPPPPSQVSPQPLPQPSLHGQMP-PMPHSLQTGPSHMQhPVPPQP------FPLTPQSSQSQVPPGPSPAAPGQSQQ 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 962 VAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQ----ENLQRKKLPETFMPPAPIitapLMSLGPEPQQALLPQ 1037
Cdd:pfam03154 324 RIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPipqlPNPQSHKHPPHLSGPSPF----QMNSNLPPPPALKPL 399
|
250 260 270
....*....|....*....|....*....|....*.
gi 1046842737 1038 SLVSGASLPPPGAPRECSLQQLQPLPPEKTQkelPP 1073
Cdd:pfam03154 400 SSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQ---PP 432
|
|
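The per-hit summary lines ("Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...") and the paired query/Cdd alignment rows in the listing above can be processed programmatically. The following is a minimal Python sketch and is not part of the original report: the names `HitSummary`, `parse_summary`, and `percent_identity` are hypothetical, the regular expression assumes the summary layout shown in this listing, and counting identity only over columns where neither row has a gap is one simple convention among several.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, taken from the listing:
# "Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 116.67 E-value: 1.55e-28"
_SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)"
    r"\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_summary(line: str) -> HitSummary:
    """Extract the numeric fields from a single summary line."""
    m = _SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not a summary line: {line!r}")
    return HitSummary(
        int(m.group(1)), int(m.group(2)), float(m.group(3)), float(m.group(4))
    )


def percent_identity(query: str, subject: str) -> float:
    """Percent identity over columns where neither row has a gap ('-').

    Comparison is case-insensitive; this is a simplifying assumption,
    since the listing mixes upper- and lowercase residues.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    pairs = [
        (q.upper(), s.upper())
        for q, s in zip(query, subject)
        if q != "-" and s != "-"
    ]
    if not pairs:
        return 0.0
    matches = sum(q == s for q, s in pairs)
    return 100.0 * matches / len(pairs)
```

Applied to the cd00200 summary line above, `parse_summary` yields a Cd length of 289 and a bit score of 116.67; `percent_identity` takes the two sequence strings of one alignment block (the `gi` row and the `Cdd:` row, without the leading labels and coordinates).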
|
| Name | Accession | Description | Interval | E-value |
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 87-334 | 1.55e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 116.67 E-value: 1.55e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 87 VIAGGGDsGMLTLYNVThilspGKEPLIAQKQkHTGAVRALDFNPFqGNLLASGASDSEIFIWDLN--HLTVPMTpGSKS 164
Cdd:cd00200 24 LATGSGD-GTIKVWDLE-----TGELLRTLKG-HTGPVRDVAASAD-GTYLASGSSDKTIRLWDLEtgECVRTLT-GHTS 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 165 qnppeDIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiaTQLVLCSEDDRLpvIQLW 244
Cdd:cd00200 95 -----YVSSVAFSPD-GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IKLW 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 245 DLRfASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPrNPPAFSAVSF 324
Cdd:cd00200 163 DLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGSE 239
|
250
....*....|
gi 1046842737 325 DGWISLYSVM 334
Cdd:cd00200 240 DGTIRVWDLR 249
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 13-332 | 7.31e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 114.74 E-value: 7.31e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 13 AWSPAKQYpvyLATGtsaqqldasfSTNATLEIFEVDFRDPSLDLKrkgilsvsSRFHKLIWGSSSSglleNTGVIAGGG 92
Cdd:cd00200 16 AFSPDGKL---LATG----------SGDGTIKVWDLETGELLRTLK--------GHTGPVRDVAASA----DGTYLASGS 70
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 93 DSGMLTLYNVThilspGKEPLIAQKQkHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSQnppeDIK 172
Cdd:cd00200 71 SDKTIRLWDLE-----TGECVRTLTG-HTSYVSSVAFSP-DGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD----WVN 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 173 ALSWNrQVQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDrlpVIQLWDLRfASSP 252
Cdd:cd00200 140 SVAFS-PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS--VAFSPD-GEKLLSSSSDG---TIKLWDLS-TGKC 211
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 253 LKVLESHSRGILSVSWSQaDAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWISLYS 332
Cdd:cd00200 212 LGTLRGHENGVNSVAFSP-DGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG-SADGTIRIWD 289
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 10-333 | 9.37e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.24 E-value: 9.37e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 10 AVQAWSPAKQYPVYLATGTSAQQLDASFSTNATLEIFEVDFRDPSLDLKRKGILSVSSRFHklIWGSSSSGLLENTGVIA 89
Cdd:COG2319 17 ALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH--TAAVLSVAFSPDGRLLA 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 90 GGGDSGMLTLYNVTHilspgkEPLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQNPPE 169
Cdd:COG2319 95 SASADGTVRLWDLAT------GLLLRTLTGHTGAVRSVAFSP-DGKTLASGSADGTVRLWDLATGKLLRTL----TGHSG 163
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 170 DIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLRfA 249
Cdd:COG2319 164 AVTSVAFSPD-GKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS--VAFSPD-GKLLASGSADGT---VRLWDLA-T 235
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 250 SSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWIS 329
Cdd:COG2319 236 GKLLRTLTGHSGSVRSVAFS-PDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASG-SDDGTVR 313
|
....
gi 1046842737 330 LYSV 333
Cdd:COG2319 314 LWDL 317
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 88-333 | 4.56e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 112.31 E-value: 4.56e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 88 IAGGGDSGMLTLYNVTHilspGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSQNp 167
Cdd:COG2319 177 LASGSDDGTVRLWDLAT----GK--LLRTLTGHTGAVRSVAFSP-DGKLLASGSADGTVRLWDLATGKLLRTLTGHSGS- 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 168 pedIKALSWNRQVQHILSsAHPSGKAVVWDLRKNEPIIKVSDHSSRMNcsGLAWNPDiATQLVLCSEDDRlpvIQLWDLR 247
Cdd:COG2319 249 ---VRSVAFSPDGRLLAS-GSADGTVRLWDLATGELLRTLTGHSGGVN--SVAFSPD-GKLLASGSDDGT---VRLWDLA 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 248 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGW 327
Cdd:COG2319 319 -TGKLLRTLTGHTGAVRSVAFS-PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASG-SADGT 395
|
....*.
gi 1046842737 328 ISLYSV 333
Cdd:COG2319 396 VRLWDL 401
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 88-333 | 9.89e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.07 E-value: 9.89e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 88 IAGGGDSGMLTLYNvthiLSPGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQNP 167
Cdd:COG2319 135 LASGSADGTVRLWD----LATGK--LLRTLTGHSGAVTSVAFSP-DGKLLASGSDDGTVRLWDLATGKLLRTL----TGH 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 168 PEDIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLR 247
Cdd:COG2319 204 TGAVRSVAFSPD-GKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRS--VAFSPD-GRLLASGSADGT---VRLWDLA 276
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 248 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGW 327
Cdd:COG2319 277 -TGELLRTLTGHSGGVNSVAFS-PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASG-SDDGT 353
|
....*.
gi 1046842737 328 ISLYSV 333
Cdd:COG2319 354 VRLWDL 359
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 88-292 | 2.13e-19 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 91.90 E-value: 2.13e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 88 IAGGGDSGMLTLYNVThilspgKEPLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSksqnP 167
Cdd:COG2319 219 LASGSADGTVRLWDLA------TGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG----H 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 168 PEDIKALSWNRQVQHILSSAHpSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLR 247
Cdd:COG2319 288 SGGVNSVAFSPDGKLLASGSD-DGTVRLWDLATGKLLRTLTGHTGAVRS--VAFSPD-GKTLASGSDDGT---VRLWDLA 360
|
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 1046842737 248 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSS 292
Cdd:COG2319 361 -TGELLRTLTGHTGAVTSVAFS-PDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 210-333 | 2.94e-10 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 62.74 E-value: 2.94e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 210 HSSRMNCsgLAWNPD---IATqlvlCSEDDRlpvIQLWDLRFaSSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIF 286
Cdd:cd00200 8 HTGGVTC--VAFSPDgklLAT----GSGDGT---IKVWDLET-GELLRTLKGHTGPVRDVAAS-ADGTYLASGSSDKTIR 76
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1046842737 287 CWNLSSSEVVYKLPTQSSWCFDVQWCPrNPPAFSAVSFDGWISLYSV 333
Cdd:cd00200 77 LWDLETGECVRTLTGHTSYVSSVAFSP-DGRILSSSSRDKTIKVWDV 122
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 798-1076 | 7.38e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 63.80 E-value: 7.38e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 798 PFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVMPFLPSHPIPSvGSWTQSSSDYRVPKPQATLPVHFV 877
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA-GPPAPAPPAAPAAGPPRRLTRPAV 2788
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 878 PGVRPAF-SQPQPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSThlpetprllplppvgppgptPLSSQPAAS 956
Cdd:PHA03247 2789 ASLSESReSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP--------------------PPPPGPPPP 2848
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 957 PVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPiitaplmslGPEPQQALLP 1036
Cdd:PHA03247 2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP---------QPQAPPPPQP 2919
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 1046842737 1037 QSlvsgaslPPPGAPREcslqQLQPLPPEKTQKELPPEHQ 1076
Cdd:PHA03247 2920 QP-------QPPPPPQP----QPPPPPPPRPQPPLAPTTD 2948
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 791-1076 | 4.42e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 4.42e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 791 GRQAPAFPFPRVAVGAAlhPKETSSHRMGFQPPR--QVPAPSVRPRAAAQPsvmpflpshpiPSVGSWTQSSSDYRVPKP 868
Cdd:PHA03247 2641 HPPPTVPPPERPRDDPA--PGRVSRPRRARRLGRaaQASSPPQRPRRRAAR-----------PTVGSLTSLADPPPPPPT 2707
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 869 QATLPVHFVPGVrpafsqPQPFGGQSVQAINPVGFCGTWPLP---------GPTPVMAPPDVMQPgsthlpetprllplp 939
Cdd:PHA03247 2708 PEPAPHALVSAT------PLPPGPAAARQASPALPAAPAPPAvpagpatpgGPARPARPPTTAGP--------------- 2766
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 940 pvgppgptpLSSQPAASPVTfsvAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPP--AP 1017
Cdd:PHA03247 2767 ---------PAPAPPAAPAA---GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPptSA 2834
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1046842737 1018 IITAPLMSLGPEPQQALLPQSLVSGASL---PPPGA----------PRECSLQQLQPLPPEKTQKELPPEHQ 1076
Cdd:PHA03247 2835 QPTAPPPPPGPPPPSLPLGGSVAPGGDVrrrPPSRSpaakpaaparPPVRRLARPAVSRSTESFALPPDQPE 2906
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 737-1062 | 2.03e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.63 E-value: 2.03e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 737 PGPATTHRFTQYASLLAAQGSLA-IAMSVLPSDCTQPAVLQLKDRLFHAQGSTVLGRQAPAFPF----PRVAVGAALHPK 811
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTVGSLTsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAapapPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 812 ETSSHR--MGFQPPRQVPA---PSVRPRAAAQPSVMPFLPSHP-IPSvgSWTQSSSDYRVPKPQATLPvhfvPGVRPAFS 885
Cdd:PHA03247 2754 PARPARppTTAGPPAPAPPaapAAGPPRRLTRPAVASLSESREsLPS--PWDPADPPAAVLAPAAALP----PAASPAGP 2827
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 886 QPQPFGGQSVQAINPVGfcgtwPLPGPTPV---MAP-PDVMQPGSTHLPETPRLLPLPPVGPPGptplsSQPAASPVTFS 961
Cdd:PHA03247 2828 LPPPTSAQPTAPPPPPG-----PPPPSLPLggsVAPgGDVRRRPPSRSPAAKPAAPARPPVRRL-----ARPAVSRSTES 2897
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 962 VAHPPGGPGAPRSSALPssgilaTRPGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAPLMSLGPEPQQ---ALLPQS 1038
Cdd:PHA03247 2898 FALPPDQPERPPQPQAP------PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPwlgALVPGR 2971
|
330 340
....*....|....*....|....*
gi 1046842737 1039 L-VSGASLPPPGAPRECSLQQLQPL 1062
Cdd:PHA03247 2972 VaVPRFRVPQPAPSREAPASSTPPL 2996
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 252-335 | 3.31e-06 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 50.41 E-value: 3.31e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 252 PLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWISLY 331
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFS-PDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASG-SSDKTIRLW 78
|
....
gi 1046842737 332 SVMG 335
Cdd:cd00200 79 DLET 82
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 790-1076 | 5.18e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 5.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 790 LGRQAPAFPFPRvavgAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPsVMPFLPSHPIP---SVGSWTQSSSDY-RV 865
Cdd:PHA03247 2791 LSESRESLPSPW----DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP-TAPPPPPGPPPpslPLGGSVAPGGDVrRR 2865
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 866 PKPQATLPVHFVPGVRPAFSQPQPFGGQSVQ--AINPVGfcgtwPLPGPTPVMAPPDVMQPgsthlpeTPRLLPLPPVGP 943
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRLARPAVSRSTEsfALPPDQ-----PERPPQPQAPPPPQPQP-------QPPPPPQPQPPP 2933
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 944 PGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATR---PGPQDTwKVAPASQENLQRKKlpetfmpPAPIIT 1020
Cdd:PHA03247 2934 PPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRfrvPQPAPS-REAPASSTPPLTGH-------SLSRVS 3005
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1046842737 1021 APLMSLG----PEPQQALLPQSLV-------SGASLPPPGAPRECSLQQLQPLPPEKTqkeLPPEHQ 1076
Cdd:PHA03247 3006 SWASSLAlheeTDPPPVSLKQTLWppddtedSDADSLFDSDSERSDLEALDPLPPEPH---DPFAHE 3069
|
|
| PTZ00421 | PTZ00421 | coronin; Provisional | 114-312 | 7.33e-06 |
|
coronin; Provisional
Pssm-ID: 173611 [Multi-domain] Cd Length: 493 Bit Score: 49.89 E-value: 7.33e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 114 IAQKQKHTGAVRALDFNPFQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSqnppEDIKALSWNRQvQHILSSAHPSGKA 193
Cdd:PTZ00421 118 IVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHS----DQITSLEWNLD-GSLLCTTSKDKKL 192
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 194 VVWDLRKNEPIIKVSDHSSRMNCSGLaWNPDIATQLVLCSEDDRLPVIQLWDLRFASSPLKVLESHSRGILSVSWSQADA 273
Cdd:PTZ00421 193 NIIDPRDGTIVSSVEAHASAKSQRCL-WAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIPFFDEDT 271
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 1046842737 274 ELL-LSSAKDNQI-----------FCWNLSSSEVVYKLPTQSSWCFDVQWC 312
Cdd:PTZ00421 272 NLLyIGSKGEGNIrcfelmnerltFCSSYSSVEPHKGLCMMPKWSLDTRKC 322
|
|
| PTZ00420 | PTZ00420 | coronin; Provisional | 52-202 | 7.73e-06 |
|
coronin; Provisional
Pssm-ID: 240412 [Multi-domain] Cd Length: 568 Bit Score: 49.95 E-value: 7.73e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 52 DPSLDLKrkGILSVSSRFHKLIWGSSSSGLLENTGVIAGGGDSGMLTLYNVTHilspgKEPLIAQKqKHTGAVRALDFNP 131
Cdd:PTZ00420 13 DPSNNLF--DDLRICSRVIDSCGIACSSGFVAVPWEVEGGGLIGAIRLENQMR-----KPPVIKLK-GHTSSILDLQFNP 84
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1046842737 132 FQGNLLASGASDSEIFIWDLNH----LTVPMTPGSKSQNPPEDIKALSWNRQVQHILSSAHPSGKAVVWDLrKNE 202
Cdd:PTZ00420 85 CFSEILASGSEDLTIRVWEIPHndesVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDI-ENE 158
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
87-199 |
1.18e-05 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 49.14 E-value: 1.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 87 VIAGGGDSGMLTLYNVthilSPGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQN 166
Cdd:COG2319 302 LLASGSDDGTVRLWDL----ATGK--LLRTLTGHTGAVRSVAFSP-DGKTLASGSDDGTVRLWDLATGELLRTL----TG 370
|
90 100 110
....*....|....*....|....*....|...
gi 1046842737 167 PPEDIKALSWNRQVQHILSSAHpSGKAVVWDLR 199
Cdd:COG2319 371 HTGAVTSVAFSPDGRTLASGSA-DGTVRLWDLA 402
|
|
| ACE1-Sec16-like |
cd09233 |
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ... |
560-652 |
1.36e-05 |
|
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; the COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. In the same way, Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.
Pssm-ID: 187750 [Multi-domain] Cd Length: 314 Bit Score: 48.41 E-value: 1.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 560 TTEDTDGLLSQA-------LLLGELRSAVELCLKEERFADAIILAQAGDAELLKWTQERYlAKRRTKTSSLLACVVK--- 629
Cdd:cd09233 54 KLVGTDIAEQKAlnrfrnlLLTGNRKEALELALDNGLWAHALLLASSLGKETWAEVVSRF-ARSESKLNDPLQTLYQlfs 132
|
90 100 110
....*....|....*....|....*....|..
gi 1046842737 630 KNWKDLVCACS---------LKNWREALALLL 652
Cdd:cd09233 133 GNSPEAITELAdnpaeaewaLGNWREHLAIIL 164
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
203-306 |
2.24e-05 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 48.93 E-value: 2.24e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 203 PIIKVSdhsSRMNCSGLAWNPDIATQLVLCSEDDrlpVIQLWDLrfASSPLKV-LESHSRGILSVSWSQADAELLLSSAK 281
Cdd:PLN00181 525 PVVELA---SRSKLSGICWNSYIKSQVASSNFEG---VVQVWDV--ARSQLVTeMKEHEKRVWSIDYSSADPTLLASGSD 596
|
90 100
....*....|....*....|....*
gi 1046842737 282 DNQIFCWNLSSSEVVYKLPTQSSWC 306
Cdd:PLN00181 597 DGSVKLWSINQGVSIGTIKTKANIC 621
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
120-150 |
2.48e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 42.30 E-value: 2.48e-05
10 20 30
....*....|....*....|....*....|.
gi 1046842737 120 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 150
Cdd:smart00320 11 HTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
791-1081 |
5.10e-05 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 47.74 E-value: 5.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 791 GRQAPAFPFPRVAVGAALHPKETSSHR---MGFQPPRQVPAPSV---------RPRAAAQPSVMPFLPSHPIpsvgswtQ 858
Cdd:PHA03377 644 GPKPKSFWEMRAGRDGSGIQQEPSSRRqpaTQSTPPRPSWLPSVfvlpsvdagRAQPSEESHLSSMSPTQPI-------S 716
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 859 SSSDYRVPKPQATLPVHFVPGVRPAFSQPQPFGG----QSVQAINPvgfcGTW-PLPGPTPVMAppdVMQPGSTHLPETP 933
Cdd:PHA03377 717 HEEQPRYEDPDDPLDLSLHPDQAPPPSHQAPYSGheepQAQQAPYP----GYWePRPPQAPYLG---YQEPQAQGVQVSS 789
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 934 RLLPLPPVGPPGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGpQDTWKVAPASQenlqrkklPETfM 1013
Cdd:PHA03377 790 YPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDGSAGHG-QDQVSQFPHLQ--------SET-G 859
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1046842737 1014 PPAPIITaplmslgpEPQQALLPQSLVSGASL----PPPGAPrecslqqLQPLPpektqKELPPEHQCLKDS 1081
Cdd:PHA03377 860 PPRLQLS--------QVPQLPYSQTLVSSSAPswssPQPRAP-------IRPIP-----TRFPPPPMPLQDS 911
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
821-1022 |
1.28e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 46.38 E-value: 1.28e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 821 QPPRQVPAPsvRPRAAAQPSVMPFLPSHPIPSVGSwtqsssdyRVPKPQATLPVHFVPGVRPAFSQPQPFGGQSVQAINP 900
Cdd:PRK07003 375 RVAGAVPAP--GARAAAAVGASAVPAVTAVTGAAG--------AALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAAD 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 901 vgfcGTWPLPGPTPVMAPPDvmqpgsthlpetprllplppvgpPGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSS 980
Cdd:PRK07003 445 ----GDAPVPAKANARASAD-----------------------SRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAA 497
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1046842737 981 GILATRPGPQDTWKVAPASQENlqrkKLPETFMPPAPIITAP 1022
Cdd:PRK07003 498 APSAATPAAVPDARAPAAASRE----DAPAAAAPPAPEARPP 535
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
120-150 |
1.66e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.02 E-value: 1.66e-04
10 20 30
....*....|....*....|....*....|.
gi 1046842737 120 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 150
Cdd:pfam00400 10 HTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
250-289 |
6.13e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.48 E-value: 6.13e-04
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1046842737 250 SSPLKVLESHSRGILSVSWSQaDAELLLSSAKDNQIFCWN 289
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
810-1073 |
8.55e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.60 E-value: 8.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 810 PKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVmPFLPSHPIPSVGSWTQSSSDYRVP----KPQATLPVHFVPGVRPAF- 884
Cdd:pfam03154 172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPhtliQQTPTLHPQRLPSPHPPLq 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 885 --SQPQPFGGQSVQAINPVGFCGTWPlPGPTPVMAPPDVMQ-PGSTHLpetprlLPLPPVGPPGPTPLSSQPAASPVTFS 961
Cdd:pfam03154 251 pmTQPPPPSQVSPQPLPQPSLHGQMP-PMPHSLQTGPSHMQhPVPPQP------FPLTPQSSQSQVPPGPSPAAPGQSQQ 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 962 VAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQ----ENLQRKKLPETFMPPAPIitapLMSLGPEPQQALLPQ 1037
Cdd:pfam03154 324 RIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPipqlPNPQSHKHPPHLSGPSPF----QMNSNLPPPPALKPL 399
|
250 260 270
....*....|....*....|....*....|....*.
gi 1046842737 1038 SLVSGASLPPPGAPRECSLQQLQPLPPEKTQkelPP 1073
Cdd:pfam03154 400 SSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQ---PP 432
|
|
| PTZ00420 |
PTZ00420 |
coronin; Provisional |
200-311 |
1.36e-03 |
|
coronin; Provisional
Pssm-ID: 240412 [Multi-domain] Cd Length: 568 Bit Score: 42.63 E-value: 1.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 200 KNEPIIKVSDHSSRMncSGLAWNPDIATQLVLCSEDdrlPVIQLWDLRF-------ASSPLKVLESHSRGILSVSWSQAD 272
Cdd:PTZ00420 63 RKPPVIKLKGHTSSI--LDLQFNPCFSEILASGSED---LTIRVWEIPHndesvkeIKDPQCILKGHKKKISIIDWNPMN 137
|
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 1046842737 273 AELLLSSAKDNQIFCWNLSSSEVVYK--LPTQSSwcfDVQW 311
Cdd:PTZ00420 138 YYIMCSSGFDSFVNIWDIENEKRAFQinMPKKLS---SLKW 175
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
791-1064 |
1.66e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.00 E-value: 1.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 791 GRQAPAFpFPRVAVGAALHPK---------ETSSHRMGFQPPrqvPAPSVRPRAAAQPSVMPflpSHPIPsvgswtqsss 861
Cdd:PHA03247 2513 SRLAPAI-LPDEPVGEPVHPRmltwirgleELASDDAGDPPP---PLPPAAPPAAPDRSVPP---PRPAP---------- 2575
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 862 dyRVPKPQATLPVHfVPGVRPAFSQPQpfggqsvqainpvgfcgtwplpgpTPVmAPPDvmqpgsthlpetprllplppv 941
Cdd:PHA03247 2576 --RPSEPAVTSRAR-RPDAPPQSARPR------------------------APV-DDRG--------------------- 2606
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 942 gppgPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKL----------PET 1011
Cdd:PHA03247 2607 ----DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRlgraaqasspPQR 2682
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 1012 FMPPA-PIITAPLMSLG----PEPQQALLPQSLVSGASLPP-PGAPRECS-LQQLQPLPP 1064
Cdd:PHA03247 2683 PRRRAaRPTVGSLTSLAdpppPPPTPEPAPHALVSATPLPPgPAAARQASpALPAAPAPP 2742
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
819-1047 |
1.78e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.56 E-value: 1.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 819 GFQPPRQVPAPSVRPRAAAQ-PSVMPFLPSHPIPSVGSWTQSSSdyrVPKPQATLPVHFVPGVRPAFSQPQPFGGQSVQA 897
Cdd:PRK12323 371 GAGPATAAAAPVAQPAPAAAaPAAAAPAPAAPPAAPAAAPAAAA---AARAVAAAPARRSPAPEALAAARQASARGPGGA 447
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 898 INPVgfcgtwPLPGPTPVMAPPDVMQPgsthlpetprllPLPPVGPPGPTPLSSQPAASPVTFSVAHPPGGPGAPrssAL 977
Cdd:PRK12323 448 PAPA------PAPAAAPAAAARPAAAG------------PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPP---EF 506
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 978 PSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAPLMSLGPEPQQALLPQSLvSGASLPP 1047
Cdd:PRK12323 507 ASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRA-SASGLPD 575
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
809-1051 |
2.55e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 41.98 E-value: 2.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 809 HPKETSSHRMGFQPPRQVPAPSVRPRAAA-QPSVMPfLPSHP-----IPSVGSWTQSSSDYRVPKP---QATLPVHFVPG 879
Cdd:PHA03378 614 HIPETSAPRQWPMPLRPIPMRPLRMQPITfNVLVFP-TPHQPpqveiTPYKPTWTQIGHIPYQPSPtgaNTMLPIQWAPG 692
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 880 -----------VRPAFSQP---QPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSTHLPETPRLLPLPPVGPPG 945
Cdd:PHA03378 693 tmqpppraptpMRPPAAPPgraQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG 772
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 946 PTPLSSQPAASPVtfSVAHPPGGPG-APRSSALPSSGILATR--PGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAP 1022
Cdd:PHA03378 773 APTPQPPPQAPPA--PQQRPRGAPTpQPPPQAGPTSMQLMPRaaPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQA 850
|
250 260
....*....|....*....|....*....
gi 1046842737 1023 LMSLGPEPQQALLPQSLVSGASLPPPGAP 1051
Cdd:PHA03378 851 AAGPTPSPGSGTSDKIVQAPVFYPPVLQP 879
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
793-1030 |
6.99e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.52 E-value: 6.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 793 QAPAFPFPRVAVGAA--LHPKETSSHRMGFQPPRQVPAPsvrPRAAAQPSVMPfLPSHPIPSVG--------SWTQSSSD 862
Cdd:pfam03154 308 QVPPGPSPAAPGQSQqrIHTPPSQSQLQSQQPPREQPLP---PAPLSMPHIKP-PPTTPIPQLPnpqshkhpPHLSGPSP 383
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 863 YRVP---------KPQATLPVHFVPGVRPAFSQPQPfGGQSVQA--INPVGFCGTWPLPGPTPVMAPPDVMQPGSTHLPE 931
Cdd:pfam03154 384 FQMNsnlppppalKPLSSLSTHHPPSAHPPPLQLMP-QSQQLPPppAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPF 462
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 932 TPRLLPLPPVGPpgPTPLSSQPAASPVTFSVAHPPggpgaprSSALPSSGIlatrPGPQDTWKVAPASQenLQRKKLPET 1011
Cdd:pfam03154 463 PQHPFVPGGPPP--ITPPSGPPTSTSSAMPGIQPP-------SSASVSSSG----PVPAAVSCPLPPVQ--IKEEALDEA 527
|
250
....*....|....*....
gi 1046842737 1012 FMPPAPiiTAPLMSLGPEP 1030
Cdd:pfam03154 528 EEPESP--PPPPRSPSPEP 544
|
|
|