Conserved domains on  [gi|1046842737|ref|XP_017444769|]

protein transport protein Sec31B isoform X4 [Rattus norvegicus]

List of domain hits

Name Accession Description Interval E-value
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
87-334 1.55e-28

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 116.67  E-value: 1.55e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737   87 VIAGGGDsGMLTLYNVThilspGKEPLIAQKQkHTGAVRALDFNPFqGNLLASGASDSEIFIWDLN--HLTVPMTpGSKS 164
Cdd:cd00200     24 LATGSGD-GTIKVWDLE-----TGELLRTLKG-HTGPVRDVAASAD-GTYLASGSSDKTIRLWDLEtgECVRTLT-GHTS 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  165 qnppeDIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiaTQLVLCSEDDRLpvIQLW 244
Cdd:cd00200     95 -----YVSSVAFSPD-GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IKLW 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  245 DLRfASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPrNPPAFSAVSF 324
Cdd:cd00200    163 DLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGSE 239
                          250
                   ....*....|
gi 1046842737  325 DGWISLYSVM 334
Cdd:cd00200    240 DGTIRVWDLR 249
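As a rough way to quantify the alignment above, the following sketch counts identical positions between a query row and the matching Cdd row. It assumes the usual CD-Search display convention that uppercase letters are aligned residues, while lowercase letters and dashes mark unaligned residues and gaps. The function name `aligned_identity` is illustrative and is not part of any NCBI tool.

```python
def aligned_identity(query_row: str, subject_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Only columns where both rows show an uppercase residue are treated as
    aligned; lowercase letters and '-' are skipped (assumed convention).
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same number of columns")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Last block of the WD40 alignment: query residues 325-334 vs. cd00200 240-249.
identical, aligned = aligned_identity("DGWISLYSVM", "DGTIRVWDLR")
print(f"{identical} of {aligned} aligned columns are identical")
```

Applying the same function to every block of a hit and summing the counts gives an approximate percent identity over the aligned region; it is not the score CD-Search itself reports.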
PHA03247 super family cl33720
large tegument protein UL36; Provisional
798-1076 7.38e-10

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.80  E-value: 7.38e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  798 PFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVMPFLPSHPIPSvGSWTQSSSDYRVPKPQATLPVHFV 877
Cdd:PHA03247  2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA-GPPAPAPPAAPAAGPPRRLTRPAV 2788
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  878 PGVRPAF-SQPQPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSThlpetprllplppvgppgptPLSSQPAAS 956
Cdd:PHA03247  2789 ASLSESReSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP--------------------PPPPGPPPP 2848
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  957 PVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPiitaplmslGPEPQQALLP 1036
Cdd:PHA03247  2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP---------QPQAPPPPQP 2919
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1046842737 1037 QSlvsgaslPPPGAPREcslqQLQPLPPEKTQKELPPEHQ 1076
Cdd:PHA03247  2920 QP-------QPPPPPQP----QPPPPPPPRPQPPLAPTTD 2948
ACE1-Sec16-like super family cl14807
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
560-652 1.36e-05

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; the COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. In the same way, Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


The actual alignment was detected with superfamily member cd09233:

Pssm-ID: 449359 [Multi-domain]  Cd Length: 314  Bit Score: 48.41  E-value: 1.36e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  560 TTEDTDGLLSQA-------LLLGELRSAVELCLKEERFADAIILAQAGDAELLKWTQERYlAKRRTKTSSLLACVVK--- 629
Cdd:cd09233     54 KLVGTDIAEQKAlnrfrnlLLTGNRKEALELALDNGLWAHALLLASSLGKETWAEVVSRF-ARSESKLNDPLQTLYQlfs 132
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1046842737  630 KNWKDLVCACS---------LKNWREALALLL 652
Cdd:cd09233    133 GNSPEAITELAdnpaeaewaLGNWREHLAIIL 164
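Each hit in the listing above carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value). The sketch below shows one way to pull those fields out of the copied text and rank hits by E-value. The regular expression is inferred from the lines shown on this page only, and the names `HitSummary` and `parse_hit_summary` are hypothetical helpers, not an NCBI API.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Pattern inferred from the summary lines on this page (an assumption).
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_hit_summary(line: str) -> HitSummary:
    """Parse one 'Pssm-ID: ... E-value: ...' line into typed fields."""
    match = _SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    pssm_id, cd_length, bit_score, e_value = match.groups()
    return HitSummary(int(pssm_id), int(cd_length), float(bit_score), float(e_value))


summary_lines = [
    "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.80  E-value: 7.38e-10",
    "Pssm-ID: 449359 [Multi-domain]  Cd Length: 314  Bit Score: 48.41  E-value: 1.36e-05",
    "Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 116.67  E-value: 1.55e-28",
]

# Smaller E-values indicate more significant hits, so sort ascending.
hits = sorted((parse_hit_summary(s) for s in summary_lines), key=lambda h: h.e_value)
for hit in hits:
    print(hit)
```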
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
87-334 1.55e-28

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 116.67  E-value: 1.55e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737   87 VIAGGGDsGMLTLYNVThilspGKEPLIAQKQkHTGAVRALDFNPFqGNLLASGASDSEIFIWDLN--HLTVPMTpGSKS 164
Cdd:cd00200     24 LATGSGD-GTIKVWDLE-----TGELLRTLKG-HTGPVRDVAASAD-GTYLASGSSDKTIRLWDLEtgECVRTLT-GHTS 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  165 qnppeDIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiaTQLVLCSEDDRLpvIQLW 244
Cdd:cd00200     95 -----YVSSVAFSPD-GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS--VAFSPD--GTFVASSSQDGT--IKLW 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  245 DLRfASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPrNPPAFSAVSF 324
Cdd:cd00200    163 DLR-TGKCVATLTGHTGEVNSVAFS-PDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSP-DGYLLASGSE 239
                          250
                   ....*....|
gi 1046842737  325 DGWISLYSVM 334
Cdd:cd00200    240 DGTIRVWDLR 249
WD40 COG2319
WD40 repeat [General function prediction only];
10-333 9.37e-27

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 114.24  E-value: 9.37e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737   10 AVQAWSPAKQYPVYLATGTSAQQLDASFSTNATLEIFEVDFRDPSLDLKRKGILSVSSRFHklIWGSSSSGLLENTGVIA 89
Cdd:COG2319     17 ALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH--TAAVLSVAFSPDGRLLA 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737   90 GGGDSGMLTLYNVTHilspgkEPLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQNPPE 169
Cdd:COG2319     95 SASADGTVRLWDLAT------GLLLRTLTGHTGAVRSVAFSP-DGKTLASGSADGTVRLWDLATGKLLRTL----TGHSG 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  170 DIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLRfA 249
Cdd:COG2319    164 AVTSVAFSPD-GKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS--VAFSPD-GKLLASGSADGT---VRLWDLA-T 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  250 SSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWIS 329
Cdd:COG2319    236 GKLLRTLTGHSGSVRSVAFS-PDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASG-SDDGTVR 313

                   ....
gi 1046842737  330 LYSV 333
Cdd:COG2319    314 LWDL 317
PHA03247 PHA03247
large tegument protein UL36; Provisional
798-1076 7.38e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.80  E-value: 7.38e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  798 PFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVMPFLPSHPIPSvGSWTQSSSDYRVPKPQATLPVHFV 877
Cdd:PHA03247  2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA-GPPAPAPPAAPAAGPPRRLTRPAV 2788
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  878 PGVRPAF-SQPQPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSThlpetprllplppvgppgptPLSSQPAAS 956
Cdd:PHA03247  2789 ASLSESReSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP--------------------PPPPGPPPP 2848
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  957 PVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPiitaplmslGPEPQQALLP 1036
Cdd:PHA03247  2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP---------QPQAPPPPQP 2919
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1046842737 1037 QSlvsgaslPPPGAPREcslqQLQPLPPEKTQKELPPEHQ 1076
Cdd:PHA03247  2920 QP-------QPPPPPQP----QPPPPPPPRPQPPLAPTTD 2948
PTZ00421 PTZ00421
coronin; Provisional
114-312 7.33e-06

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 49.89  E-value: 7.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  114 IAQKQKHTGAVRALDFNPFQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSqnppEDIKALSWNRQvQHILSSAHPSGKA 193
Cdd:PTZ00421   118 IVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHS----DQITSLEWNLD-GSLLCTTSKDKKL 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  194 VVWDLRKNEPIIKVSDHSSRMNCSGLaWNPDIATQLVLCSEDDRLPVIQLWDLRFASSPLKVLESHSRGILSVSWSQADA 273
Cdd:PTZ00421   193 NIIDPRDGTIVSSVEAHASAKSQRCL-WAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIPFFDEDT 271
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1046842737  274 ELL-LSSAKDNQI-----------FCWNLSSSEVVYKLPTQSSWCFDVQWC 312
Cdd:PTZ00421   272 NLLyIGSKGEGNIrcfelmnerltFCSSYSSVEPHKGLCMMPKWSLDTRKC 322
ACE1-Sec16-like cd09233
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
560-652 1.36e-05

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; the COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. In the same way, Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


Pssm-ID: 187750 [Multi-domain]  Cd Length: 314  Bit Score: 48.41  E-value: 1.36e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  560 TTEDTDGLLSQA-------LLLGELRSAVELCLKEERFADAIILAQAGDAELLKWTQERYlAKRRTKTSSLLACVVK--- 629
Cdd:cd09233     54 KLVGTDIAEQKAlnrfrnlLLTGNRKEALELALDNGLWAHALLLASSLGKETWAEVVSRF-ARSESKLNDPLQTLYQlfs 132
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1046842737  630 KNWKDLVCACS---------LKNWREALALLL 652
Cdd:cd09233    133 GNSPEAITELAdnpaeaewaLGNWREHLAIIL 164
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
120-150 2.48e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 42.30  E-value: 2.48e-05
                            10        20        30
                    ....*....|....*....|....*....|.
gi 1046842737   120 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 150
Cdd:smart00320   11 HTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
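Each alignment row above has the form label, start coordinate, residues, end coordinate. A quick sanity check when copying such rows out of the page is that the number of letters in the row (gaps excluded, lowercase unaligned residues included) equals end minus start plus one. The sketch below illustrates that check; the row pattern is inferred from this page, and `parse_alignment_row` and `coordinates_consistent` are hypothetical names.

```python
import re

# Row layout inferred from this page: "<label> <start> <residues> <end>".
_ROW_RE = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)$")


def parse_alignment_row(line: str) -> tuple[str, int, str, int]:
    """Split an alignment row into (label, start, residues, end)."""
    match = _ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not an alignment row: {line!r}")
    label, start, residues, end = match.groups()
    return label, int(start), residues, int(end)


def coordinates_consistent(line: str) -> bool:
    """True if the residue count matches the start/end coordinates."""
    _, start, residues, end = parse_alignment_row(line)
    # Letters of either case consume a sequence position; '-' does not.
    n_residues = sum(ch.isalpha() for ch in residues)
    return n_residues == end - start + 1


rows = [
    "gi 1046842737   120 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 150",
    "Cdd:smart00320   11 HTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40",
]
for row in rows:
    print(parse_alignment_row(row)[0], coordinates_consistent(row))
```

A row that fails the check usually means characters were lost or wrapped during copy and paste, which is worth ruling out before computing anything from the alignment.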
WD40 pfam00400
WD domain, G-beta repeat;
120-150 1.66e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.02  E-value: 1.66e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1046842737  120 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 150
Cdd:pfam00400   10 HTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
810-1073 8.55e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.60  E-value: 8.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  810 PKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVmPFLPSHPIPSVGSWTQSSSDYRVP----KPQATLPVHFVPGVRPAF- 884
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPhtliQQTPTLHPQRLPSPHPPLq 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  885 --SQPQPFGGQSVQAINPVGFCGTWPlPGPTPVMAPPDVMQ-PGSTHLpetprlLPLPPVGPPGPTPLSSQPAASPVTFS 961
Cdd:pfam03154  251 pmTQPPPPSQVSPQPLPQPSLHGQMP-PMPHSLQTGPSHMQhPVPPQP------FPLTPQSSQSQVPPGPSPAAPGQSQQ 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  962 VAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQ----ENLQRKKLPETFMPPAPIitapLMSLGPEPQQALLPQ 1037
Cdd:pfam03154  324 RIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPipqlPNPQSHKHPPHLSGPSPF----QMNSNLPPPPALKPL 399
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1046842737 1038 SLVSGASLPPPGAPRECSLQQLQPLPPEKTQkelPP 1073
Cdd:pfam03154  400 SSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQ---PP 432
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
13-332 7.31e-28

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 114.74  E-value: 7.31e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737   13 AWSPAKQYpvyLATGtsaqqldasfSTNATLEIFEVDFRDPSLDLKrkgilsvsSRFHKLIWGSSSSglleNTGVIAGGG 92
Cdd:cd00200     16 AFSPDGKL---LATG----------SGDGTIKVWDLETGELLRTLK--------GHTGPVRDVAASA----DGTYLASGS 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737   93 DSGMLTLYNVThilspGKEPLIAQKQkHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSQnppeDIK 172
Cdd:cd00200     71 SDKTIRLWDLE-----TGECVRTLTG-HTSYVSSVAFSP-DGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD----WVN 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  173 ALSWNrQVQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDrlpVIQLWDLRfASSP 252
Cdd:cd00200    140 SVAFS-PDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS--VAFSPD-GEKLLSSSSDG---TIKLWDLS-TGKC 211
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  253 LKVLESHSRGILSVSWSQaDAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWISLYS 332
Cdd:cd00200    212 LGTLRGHENGVNSVAFSP-DGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG-SADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
88-333 4.56e-26

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 112.31  E-value: 4.56e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737   88 IAGGGDSGMLTLYNVTHilspGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSQNp 167
Cdd:COG2319    177 LASGSDDGTVRLWDLAT----GK--LLRTLTGHTGAVRSVAFSP-DGKLLASGSADGTVRLWDLATGKLLRTLTGHSGS- 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  168 pedIKALSWNRQVQHILSsAHPSGKAVVWDLRKNEPIIKVSDHSSRMNcsGLAWNPDiATQLVLCSEDDRlpvIQLWDLR 247
Cdd:COG2319    249 ---VRSVAFSPDGRLLAS-GSADGTVRLWDLATGELLRTLTGHSGGVN--SVAFSPD-GKLLASGSDDGT---VRLWDLA 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  248 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGW 327
Cdd:COG2319    319 -TGKLLRTLTGHTGAVRSVAFS-PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASG-SADGT 395

                   ....*.
gi 1046842737  328 ISLYSV 333
Cdd:COG2319    396 VRLWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
88-333 9.89e-25

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 108.07  E-value: 9.89e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737   88 IAGGGDSGMLTLYNvthiLSPGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQNP 167
Cdd:COG2319    135 LASGSADGTVRLWD----LATGK--LLRTLTGHSGAVTSVAFSP-DGKLLASGSDDGTVRLWDLATGKLLRTL----TGH 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  168 PEDIKALSWNRQvQHILSSAHPSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLR 247
Cdd:COG2319    204 TGAVRSVAFSPD-GKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRS--VAFSPD-GRLLASGSADGT---VRLWDLA 276
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  248 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGW 327
Cdd:COG2319    277 -TGELLRTLTGHSGGVNSVAFS-PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASG-SDDGT 353

                   ....*.
gi 1046842737  328 ISLYSV 333
Cdd:COG2319    354 VRLWDL 359
WD40 COG2319
WD40 repeat [General function prediction only];
88-292 2.13e-19

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 91.90  E-value: 2.13e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737   88 IAGGGDSGMLTLYNVThilspgKEPLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPGSksqnP 167
Cdd:COG2319    219 LASGSADGTVRLWDLA------TGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG----H 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  168 PEDIKALSWNRQVQHILSSAHpSGKAVVWDLRKNEPIIKVSDHSSRMNCsgLAWNPDiATQLVLCSEDDRlpvIQLWDLR 247
Cdd:COG2319    288 SGGVNSVAFSPDGKLLASGSD-DGTVRLWDLATGKLLRTLTGHTGAVRS--VAFSPD-GKTLASGSDDGT---VRLWDLA 360
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1046842737  248 fASSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSS 292
Cdd:COG2319    361 -TGELLRTLTGHTGAVTSVAFS-PDGRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
210-333 2.94e-10

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 62.74  E-value: 2.94e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  210 HSSRMNCsgLAWNPD---IATqlvlCSEDDRlpvIQLWDLRFaSSPLKVLESHSRGILSVSWSqADAELLLSSAKDNQIF 286
Cdd:cd00200      8 HTGGVTC--VAFSPDgklLAT----GSGDGT---IKVWDLET-GELLRTLKGHTGPVRDVAAS-ADGTYLASGSSDKTIR 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1046842737  287 CWNLSSSEVVYKLPTQSSWCFDVQWCPrNPPAFSAVSFDGWISLYSV 333
Cdd:cd00200     77 LWDLETGECVRTLTGHTSYVSSVAFSP-DGRILSSSSRDKTIKVWDV 122
PHA03247 PHA03247
large tegument protein UL36; Provisional
798-1076 7.38e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.80  E-value: 7.38e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  798 PFPRVAVGAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVMPFLPSHPIPSvGSWTQSSSDYRVPKPQATLPVHFV 877
Cdd:PHA03247  2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA-GPPAPAPPAAPAAGPPRRLTRPAV 2788
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  878 PGVRPAF-SQPQPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSThlpetprllplppvgppgptPLSSQPAAS 956
Cdd:PHA03247  2789 ASLSESReSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP--------------------PPPPGPPPP 2848
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  957 PVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPiitaplmslGPEPQQALLP 1036
Cdd:PHA03247  2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP---------QPQAPPPPQP 2919
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1046842737 1037 QSlvsgaslPPPGAPREcslqQLQPLPPEKTQKELPPEHQ 1076
Cdd:PHA03247  2920 QP-------QPPPPPQP----QPPPPPPPRPQPPLAPTTD 2948
PHA03247 PHA03247
large tegument protein UL36; Provisional
791-1076 4.42e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 4.42e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  791 GRQAPAFPFPRVAVGAAlhPKETSSHRMGFQPPR--QVPAPSVRPRAAAQPsvmpflpshpiPSVGSWTQSSSDYRVPKP 868
Cdd:PHA03247  2641 HPPPTVPPPERPRDDPA--PGRVSRPRRARRLGRaaQASSPPQRPRRRAAR-----------PTVGSLTSLADPPPPPPT 2707
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  869 QATLPVHFVPGVrpafsqPQPFGGQSVQAINPVGFCGTWPLP---------GPTPVMAPPDVMQPgsthlpetprllplp 939
Cdd:PHA03247  2708 PEPAPHALVSAT------PLPPGPAAARQASPALPAAPAPPAvpagpatpgGPARPARPPTTAGP--------------- 2766
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  940 pvgppgptpLSSQPAASPVTfsvAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPP--AP 1017
Cdd:PHA03247  2767 ---------PAPAPPAAPAA---GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPptSA 2834
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1046842737 1018 IITAPLMSLGPEPQQALLPQSLVSGASL---PPPGA----------PRECSLQQLQPLPPEKTQKELPPEHQ 1076
Cdd:PHA03247  2835 QPTAPPPPPGPPPPSLPLGGSVAPGGDVrrrPPSRSpaakpaaparPPVRRLARPAVSRSTESFALPPDQPE 2906
PHA03247 PHA03247
large tegument protein UL36; Provisional
737-1062 2.03e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 2.03e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  737 PGPATTHRFTQYASLLAAQGSLA-IAMSVLPSDCTQPAVLQLKDRLFHAQGSTVLGRQAPAFPF----PRVAVGAALHPK 811
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPTVGSLTsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAapapPAVPAGPATPGG 2753
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  812 ETSSHR--MGFQPPRQVPA---PSVRPRAAAQPSVMPFLPSHP-IPSvgSWTQSSSDYRVPKPQATLPvhfvPGVRPAFS 885
Cdd:PHA03247  2754 PARPARppTTAGPPAPAPPaapAAGPPRRLTRPAVASLSESREsLPS--PWDPADPPAAVLAPAAALP----PAASPAGP 2827
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  886 QPQPFGGQSVQAINPVGfcgtwPLPGPTPV---MAP-PDVMQPGSTHLPETPRLLPLPPVGPPGptplsSQPAASPVTFS 961
Cdd:PHA03247  2828 LPPPTSAQPTAPPPPPG-----PPPPSLPLggsVAPgGDVRRRPPSRSPAAKPAAPARPPVRRL-----ARPAVSRSTES 2897
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  962 VAHPPGGPGAPRSSALPssgilaTRPGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAPLMSLGPEPQQ---ALLPQS 1038
Cdd:PHA03247  2898 FALPPDQPERPPQPQAP------PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPwlgALVPGR 2971
                          330       340
                   ....*....|....*....|....*
gi 1046842737 1039 L-VSGASLPPPGAPRECSLQQLQPL 1062
Cdd:PHA03247  2972 VaVPRFRVPQPAPSREAPASSTPPL 2996
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
252-335 3.31e-06

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 50.41  E-value: 3.31e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  252 PLKVLESHSRGILSVSWSqADAELLLSSAKDNQIFCWNLSSSEVVYKLPTQSSWCFDVQWCPRNPPAFSAvSFDGWISLY 331
Cdd:cd00200      1 LRRTLKGHTGGVTCVAFS-PDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASG-SSDKTIRLW 78

                   ....
gi 1046842737  332 SVMG 335
Cdd:cd00200     79 DLET 82
PHA03247 PHA03247
large tegument protein UL36; Provisional
790-1076 5.18e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 5.18e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  790 LGRQAPAFPFPRvavgAALHPKETSSHRMGFQPPRQVPAPSVRPRAAAQPsVMPFLPSHPIP---SVGSWTQSSSDY-RV 865
Cdd:PHA03247  2791 LSESRESLPSPW----DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP-TAPPPPPGPPPpslPLGGSVAPGGDVrRR 2865
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  866 PKPQATLPVHFVPGVRPAFSQPQPFGGQSVQ--AINPVGfcgtwPLPGPTPVMAPPDVMQPgsthlpeTPRLLPLPPVGP 943
Cdd:PHA03247  2866 PPSRSPAAKPAAPARPPVRRLARPAVSRSTEsfALPPDQ-----PERPPQPQAPPPPQPQP-------QPPPPPQPQPPP 2933
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  944 PGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATR---PGPQDTwKVAPASQENLQRKKlpetfmpPAPIIT 1020
Cdd:PHA03247  2934 PPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRfrvPQPAPS-REAPASSTPPLTGH-------SLSRVS 3005
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1046842737 1021 APLMSLG----PEPQQALLPQSLV-------SGASLPPPGAPRECSLQQLQPLPPEKTqkeLPPEHQ 1076
Cdd:PHA03247  3006 SWASSLAlheeTDPPPVSLKQTLWppddtedSDADSLFDSDSERSDLEALDPLPPEPH---DPFAHE 3069
PTZ00421 PTZ00421
coronin; Provisional
114-312 7.33e-06

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 49.89  E-value: 7.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  114 IAQKQKHTGAVRALDFNPFQGNLLASGASDSEIFIWDLNHLTVPMTPGSKSqnppEDIKALSWNRQvQHILSSAHPSGKA 193
Cdd:PTZ00421   118 IVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHS----DQITSLEWNLD-GSLLCTTSKDKKL 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  194 VVWDLRKNEPIIKVSDHSSRMNCSGLaWNPDIATQLVLCSEDDRLPVIQLWDLRFASSPLKVLESHSRGILSVSWSQADA 273
Cdd:PTZ00421   193 NIIDPRDGTIVSSVEAHASAKSQRCL-WAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIPFFDEDT 271
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1046842737  274 ELL-LSSAKDNQI-----------FCWNLSSSEVVYKLPTQSSWCFDVQWC 312
Cdd:PTZ00421   272 NLLyIGSKGEGNIrcfelmnerltFCSSYSSVEPHKGLCMMPKWSLDTRKC 322
PTZ00420 PTZ00420
coronin; Provisional
52-202 7.73e-06

coronin; Provisional


Pssm-ID: 240412 [Multi-domain]  Cd Length: 568  Bit Score: 49.95  E-value: 7.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737   52 DPSLDLKrkGILSVSSRFHKLIWGSSSSGLLENTGVIAGGGDSGMLTLYNVTHilspgKEPLIAQKqKHTGAVRALDFNP 131
Cdd:PTZ00420    13 DPSNNLF--DDLRICSRVIDSCGIACSSGFVAVPWEVEGGGLIGAIRLENQMR-----KPPVIKLK-GHTSSILDLQFNP 84
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1046842737  132 FQGNLLASGASDSEIFIWDLNH----LTVPMTPGSKSQNPPEDIKALSWNRQVQHILSSAHPSGKAVVWDLrKNE 202
Cdd:PTZ00420    85 CFSEILASGSEDLTIRVWEIPHndesVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDI-ENE 158
WD40 COG2319
WD40 repeat [General function prediction only];
87-199 1.18e-05

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 49.14  E-value: 1.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737   87 VIAGGGDSGMLTLYNVthilSPGKepLIAQKQKHTGAVRALDFNPfQGNLLASGASDSEIFIWDLNHLTVPMTPgsksQN 166
Cdd:COG2319    302 LLASGSDDGTVRLWDL----ATGK--LLRTLTGHTGAVRSVAFSP-DGKTLASGSDDGTVRLWDLATGELLRTL----TG 370
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1046842737  167 PPEDIKALSWNRQVQHILSSAHpSGKAVVWDLR 199
Cdd:COG2319    371 HTGAVTSVAFSPDGRTLASGSA-DGTVRLWDLA 402
ACE1-Sec16-like cd09233
Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; COPII coat ...
560-652 1.36e-05

Ancestral coatomer element 1 (ACE1) of COPII coat complex assembly protein Sec16; the COPII coat complex plays an important role in vesicular traffic of newly synthesized proteins from the endoplasmic reticulum (ER) to the Golgi apparatus by mediating the formation of transport vesicles. COPII consists of an outer coat, made up of the scaffold proteins Sec31 and Sec13, and the cargo adaptor complex, Sec23 and Sec24, which are recruited by the small GTPase Sar1. Sec16 is involved in the early steps of the assembly process. Sec16 forms elongated heterotetramers with Sec13, Sec13-(Sec16)2-Sec13. It interacts with Sec13 by insertion of a single beta-blade to close the six-bladed beta propeller of Sec13. In the same way, Sec13 interacts with Sec31 and Nup145C, a nuclear pore protein; all of these contain a structurally related ancestral coatomer element 1 (ACE1). Sec16 is believed to be a key component in maintaining the integrity of the ER exit site.


Pssm-ID: 187750 [Multi-domain]  Cd Length: 314  Bit Score: 48.41  E-value: 1.36e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  560 TTEDTDGLLSQA-------LLLGELRSAVELCLKEERFADAIILAQAGDAELLKWTQERYlAKRRTKTSSLLACVVK--- 629
Cdd:cd09233     54 KLVGTDIAEQKAlnrfrnlLLTGNRKEALELALDNGLWAHALLLASSLGKETWAEVVSRF-ARSESKLNDPLQTLYQlfs 132
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1046842737  630 KNWKDLVCACS---------LKNWREALALLL 652
Cdd:cd09233    133 GNSPEAITELAdnpaeaewaLGNWREHLAIIL 164
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
203-306 2.24e-05

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 48.93  E-value: 2.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  203 PIIKVSdhsSRMNCSGLAWNPDIATQLVLCSEDDrlpVIQLWDLrfASSPLKV-LESHSRGILSVSWSQADAELLLSSAK 281
Cdd:PLN00181   525 PVVELA---SRSKLSGICWNSYIKSQVASSNFEG---VVQVWDV--ARSQLVTeMKEHEKRVWSIDYSSADPTLLASGSD 596
                           90       100
                   ....*....|....*....|....*
gi 1046842737  282 DNQIFCWNLSSSEVVYKLPTQSSWC 306
Cdd:PLN00181   597 DGSVKLWSINQGVSIGTIKTKANIC 621
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
120-150 2.48e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 42.30  E-value: 2.48e-05
                            10        20        30
                    ....*....|....*....|....*....|.
gi 1046842737   120 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 150
Cdd:smart00320   11 HTGPVTSVAFSP-DGKYLASGSDDGTIKLWD 40
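
The paired rows in each alignment block above (the query `gi` row over the `Cdd:` row) can be compared column by column. The following is a minimal sketch, not part of the CD-Search output: the function name `alignment_identity` is hypothetical, `-` is treated as a gap, and lowercase query letters are compared case-insensitively, which is an assumption about how this display marks unaligned residues.

```python
def alignment_identity(query_row, subject_row):
    """Count identical and aligned columns in two equal-length alignment rows.

    Assumptions (not taken from the report itself): '-' marks a gap, and
    residue case is ignored when comparing.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            continue  # skip columns where either row has a gap
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, aligned


# Rows copied from the smart00320 hit (query residues 120-150).
query = "HTGAVRALDFNPfQGNLLASGASDSEIFIWD"
subject = "HTGPVTSVAFSP-DGKYLASGSDDGTIKLWD"
identical, aligned = alignment_identity(query, subject)
print(f"{identical} identical of {aligned} aligned columns")
```

Only the residue strings are passed in; the row labels and coordinates would need to be stripped first if the rows are read directly from the report.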
PHA03377 PHA03377
EBNA-3C; Provisional
791-1081 5.10e-05

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 47.74  E-value: 5.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  791 GRQAPAFPFPRVAVGAALHPKETSSHR---MGFQPPRQVPAPSV---------RPRAAAQPSVMPFLPSHPIpsvgswtQ 858
Cdd:PHA03377   644 GPKPKSFWEMRAGRDGSGIQQEPSSRRqpaTQSTPPRPSWLPSVfvlpsvdagRAQPSEESHLSSMSPTQPI-------S 716
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  859 SSSDYRVPKPQATLPVHFVPGVRPAFSQPQPFGG----QSVQAINPvgfcGTW-PLPGPTPVMAppdVMQPGSTHLPETP 933
Cdd:PHA03377   717 HEEQPRYEDPDDPLDLSLHPDQAPPPSHQAPYSGheepQAQQAPYP----GYWePRPPQAPYLG---YQEPQAQGVQVSS 789
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  934 RLLPLPPVGPPGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGpQDTWKVAPASQenlqrkklPETfM 1013
Cdd:PHA03377   790 YPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDGSAGHG-QDQVSQFPHLQ--------SET-G 859
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1046842737 1014 PPAPIITaplmslgpEPQQALLPQSLVSGASL----PPPGAPrecslqqLQPLPpektqKELPPEHQCLKDS 1081
Cdd:PHA03377   860 PPRLQLS--------QVPQLPYSQTLVSSSAPswssPQPRAP-------IRPIP-----TRFPPPPMPLQDS 911
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
821-1022 1.28e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 46.38  E-value: 1.28e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  821 QPPRQVPAPsvRPRAAAQPSVMPFLPSHPIPSVGSwtqsssdyRVPKPQATLPVHFVPGVRPAFSQPQPFGGQSVQAINP 900
Cdd:PRK07003   375 RVAGAVPAP--GARAAAAVGASAVPAVTAVTGAAG--------AALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAAD 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  901 vgfcGTWPLPGPTPVMAPPDvmqpgsthlpetprllplppvgpPGPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSS 980
Cdd:PRK07003   445 ----GDAPVPAKANARASAD-----------------------SRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAA 497
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1046842737  981 GILATRPGPQDTWKVAPASQENlqrkKLPETFMPPAPIITAP 1022
Cdd:PRK07003   498 APSAATPAAVPDARAPAAASRE----DAPAAAAPPAPEARPP 535
WD40 pfam00400
WD domain, G-beta repeat;
120-150 1.66e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.02  E-value: 1.66e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1046842737  120 HTGAVRALDFNPfQGNLLASGASDSEIFIWD 150
Cdd:pfam00400   10 HTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
WD40 pfam00400
WD domain, G-beta repeat;
250-289 6.13e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.48  E-value: 6.13e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1046842737  250 SSPLKVLESHSRGILSVSWSQaDAELLLSSAKDNQIFCWN 289
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSP-DGKLLASGSDDGTVKVWD 39
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
810-1073 8.55e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.60  E-value: 8.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  810 PKETSSHRMGFQPPRQVPAPSVRPRAAAQPSVmPFLPSHPIPSVGSWTQSSSDYRVP----KPQATLPVHFVPGVRPAF- 884
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPhtliQQTPTLHPQRLPSPHPPLq 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  885 --SQPQPFGGQSVQAINPVGFCGTWPlPGPTPVMAPPDVMQ-PGSTHLpetprlLPLPPVGPPGPTPLSSQPAASPVTFS 961
Cdd:pfam03154  251 pmTQPPPPSQVSPQPLPQPSLHGQMP-PMPHSLQTGPSHMQhPVPPQP------FPLTPQSSQSQVPPGPSPAAPGQSQQ 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  962 VAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQ----ENLQRKKLPETFMPPAPIitapLMSLGPEPQQALLPQ 1037
Cdd:pfam03154  324 RIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPipqlPNPQSHKHPPHLSGPSPF----QMNSNLPPPPALKPL 399
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1046842737 1038 SLVSGASLPPPGAPRECSLQQLQPLPPEKTQkelPP 1073
Cdd:pfam03154  400 SSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQ---PP 432
PTZ00420 PTZ00420
coronin; Provisional
200-311 1.36e-03

coronin; Provisional


Pssm-ID: 240412 [Multi-domain]  Cd Length: 568  Bit Score: 42.63  E-value: 1.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  200 KNEPIIKVSDHSSRMncSGLAWNPDIATQLVLCSEDdrlPVIQLWDLRF-------ASSPLKVLESHSRGILSVSWSQAD 272
Cdd:PTZ00420    63 RKPPVIKLKGHTSSI--LDLQFNPCFSEILASGSED---LTIRVWEIPHndesvkeIKDPQCILKGHKKKISIIDWNPMN 137
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1046842737  273 AELLLSSAKDNQIFCWNLSSSEVVYK--LPTQSSwcfDVQW 311
Cdd:PTZ00420   138 YYIMCSSGFDSFVNIWDIENEKRAFQinMPKKLS---SLKW 175
PHA03247 PHA03247
large tegument protein UL36; Provisional
791-1064 1.66e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 1.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  791 GRQAPAFpFPRVAVGAALHPK---------ETSSHRMGFQPPrqvPAPSVRPRAAAQPSVMPflpSHPIPsvgswtqsss 861
Cdd:PHA03247  2513 SRLAPAI-LPDEPVGEPVHPRmltwirgleELASDDAGDPPP---PLPPAAPPAAPDRSVPP---PRPAP---------- 2575
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  862 dyRVPKPQATLPVHfVPGVRPAFSQPQpfggqsvqainpvgfcgtwplpgpTPVmAPPDvmqpgsthlpetprllplppv 941
Cdd:PHA03247  2576 --RPSEPAVTSRAR-RPDAPPQSARPR------------------------APV-DDRG--------------------- 2606
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  942 gppgPTPLSSQPAASPVTFSVAHPPGGPGAPRSSALPSSGILATRPGPQDTWKVAPASQENLQRKKL----------PET 1011
Cdd:PHA03247  2607 ----DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRlgraaqasspPQR 2682
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737 1012 FMPPA-PIITAPLMSLG----PEPQQALLPQSLVSGASLPP-PGAPRECS-LQQLQPLPP 1064
Cdd:PHA03247  2683 PRRRAaRPTVGSLTSLAdpppPPPTPEPAPHALVSATPLPPgPAAARQASpALPAAPAPP 2742
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
819-1047 1.78e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.56  E-value: 1.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  819 GFQPPRQVPAPSVRPRAAAQ-PSVMPFLPSHPIPSVGSWTQSSSdyrVPKPQATLPVHFVPGVRPAFSQPQPFGGQSVQA 897
Cdd:PRK12323   371 GAGPATAAAAPVAQPAPAAAaPAAAAPAPAAPPAAPAAAPAAAA---AARAVAAAPARRSPAPEALAAARQASARGPGGA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  898 INPVgfcgtwPLPGPTPVMAPPDVMQPgsthlpetprllPLPPVGPPGPTPLSSQPAASPVTFSVAHPPGGPGAPrssAL 977
Cdd:PRK12323   448 PAPA------PAPAAAPAAAARPAAAG------------PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPP---EF 506
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  978 PSSGILATRPGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAPLMSLGPEPQQALLPQSLvSGASLPP 1047
Cdd:PRK12323   507 ASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRA-SASGLPD 575
PHA03378 PHA03378
EBNA-3B; Provisional
809-1051 2.55e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.98  E-value: 2.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  809 HPKETSSHRMGFQPPRQVPAPSVRPRAAA-QPSVMPfLPSHP-----IPSVGSWTQSSSDYRVPKP---QATLPVHFVPG 879
Cdd:PHA03378   614 HIPETSAPRQWPMPLRPIPMRPLRMQPITfNVLVFP-TPHQPpqveiTPYKPTWTQIGHIPYQPSPtgaNTMLPIQWAPG 692
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  880 -----------VRPAFSQP---QPFGGQSVQAINPVGFCGTWPLPGPTPVMAPPDVMQPGSTHLPETPRLLPLPPVGPPG 945
Cdd:PHA03378   693 tmqpppraptpMRPPAAPPgraQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG 772
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  946 PTPLSSQPAASPVtfSVAHPPGGPG-APRSSALPSSGILATR--PGPQDTWKVAPASQENLQRKKLPETFMPPAPIITAP 1022
Cdd:PHA03378   773 APTPQPPPQAPPA--PQQRPRGAPTpQPPPQAGPTSMQLMPRaaPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQA 850
                          250       260
                   ....*....|....*....|....*....
gi 1046842737 1023 LMSLGPEPQQALLPQSLVSGASLPPPGAP 1051
Cdd:PHA03378   851 AAGPTPSPGSGTSDKIVQAPVFYPPVLQP 879
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
793-1030 6.99e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.52  E-value: 6.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  793 QAPAFPFPRVAVGAA--LHPKETSSHRMGFQPPRQVPAPsvrPRAAAQPSVMPfLPSHPIPSVG--------SWTQSSSD 862
Cdd:pfam03154  308 QVPPGPSPAAPGQSQqrIHTPPSQSQLQSQQPPREQPLP---PAPLSMPHIKP-PPTTPIPQLPnpqshkhpPHLSGPSP 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  863 YRVP---------KPQATLPVHFVPGVRPAFSQPQPfGGQSVQA--INPVGFCGTWPLPGPTPVMAPPDVMQPGSTHLPE 931
Cdd:pfam03154  384 FQMNsnlppppalKPLSSLSTHHPPSAHPPPLQLMP-QSQQLPPppAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPF 462
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046842737  932 TPRLLPLPPVGPpgPTPLSSQPAASPVTFSVAHPPggpgaprSSALPSSGIlatrPGPQDTWKVAPASQenLQRKKLPET 1011
Cdd:pfam03154  463 PQHPFVPGGPPP--ITPPSGPPTSTSSAMPGIQPP-------SSASVSSSG----PVPAAVSCPLPPVQ--IKEEALDEA 527
                          250
                   ....*....|....*....
gi 1046842737 1012 FMPPAPiiTAPLMSLGPEP 1030
Cdd:pfam03154  528 EEPESP--PPPPRSPSPEP 544
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
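
Each hit in this report carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search parameters above give an E-value threshold of 0.01. As a rough sketch of how those lines could be read programmatically, the Python below parses one summary line and checks it against the threshold. The names `parse_summary` and `passes_threshold` are hypothetical, the pattern is fitted only to the line layout seen in this report, and treating the threshold as inclusive is an assumption.

```python
import re

# Pattern fitted to the summary lines in this report, for example:
# "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 91.90  E-value: 2.13e-19"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of a hit summary line, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """True if the hit's E-value is at or below the threshold (assumed inclusive)."""
    return hit["e_value"] <= threshold


line = "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 91.90  E-value: 2.13e-19"
hit = parse_summary(line)
print(hit, passes_threshold(hit))
```

Lines that do not follow this layout, such as description or alignment rows, return `None` and can simply be skipped.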
