NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|143682198|sp|Q3U3T8|]
View 

RecName: Full=WD repeat-containing protein 62

Protein Classification

WD40 repeat domain-containing protein( domain architecture ID 1019192)

WD40 repeat domain-containing protein folds into a beta-propeller structure and functions as a scaffold, providing a platform for the interaction and assembly of several proteins into a signalosome; similar to a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
364-744 1.32e-36

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 143.90  E-value: 1.32e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  364 ALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIRFWNL 442
Cdd:COG2319    83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFSPDGKTLaSGSADGTVRLWDL 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  443 DSAsdtrwqknifsdsllkvvyvendiQHLQDLSHFPDRgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRIHELH 522
Cdd:COG2319   150 ATG------------------------KLLRTLTGHSGA-------------VTSVAFSPDGKLLASGSDDGTVRLWDLA 192
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  523 FMDELIKVEAHDAEVLCLEYSkPEtGvTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFagTRDVQMI-SCGA 601
Cdd:COG2319   193 TGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAF--SPDGRLLaSGSA 266
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  602 DKSIYFRSAQqaSDGLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDEGSllkVHV 681
Cdd:COG2319   267 DGTVRLWDLA--TGELLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAF 338
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 143682198  682 DPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL 744
Cdd:COG2319   339 SPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
93-444 1.18e-23

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 105.38  E-value: 1.18e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198   93 VVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGenGHRPTVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVS 172
Cdd:COG2319   144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  173 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEASTEAKVTStvpLVGRSGilgel 250
Cdd:COG2319   222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLATGELLRT---LTGHSG----- 289
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  251 hnnifcgvacgrgrmagntfcvsysgllcqfnekrvldkWINlkvslssCLCVS--DELIFCGCTDGIVRIFQAHSLLYL 328
Cdd:COG2319   290 ---------------------------------------GVN-------SVAFSpdGKLLASGSDDGTVRLWDLATGKLL 323
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  329 TNLpKPHYLGVDvahgldssflfhrkaeavypdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVW 408
Cdd:COG2319   324 RTL-TGHTGAVR-----------------------SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTG---HTGAVT 376
                         330       340       350
                  ....*....|....*....|....*....|....*..
gi 143682198  409 NVevypefedqrACLPSGTFL-TCSSDNTIRFWNLDS 444
Cdd:COG2319   377 SV----------AFSPDGRTLaSGSADGTVRLWDLAT 403
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
983-1450 2.08e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.46  E-value: 2.08e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198   983 EETEAGPEDQQGDTYLRVSSVSS-KDQSPPEDSGESEAELECS---FAAAHSSAPQTDPGPHLTMTAGKPEYPSteelSQ 1058
Cdd:pfam03154  128 DEGSSDPKDIDQDNRSTSPSIPSpQDNESDSDSSAQQQILQTQppvLQAQSGAASPPSPPPPGTTQAATAGPTP----SA 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1059 PELPGLGNGSLPQTPEQEKflrhhfetltdapteelfhGSLGDIKISETEDYFFNPRLSISTqflSRLQKTSRCPPrlPL 1138
Cdd:pfam03154  204 PSVPPQGSPATSQPPNQTQ-------------------STAAPHTLIQQTPTLHPQRLPSPH---PPLQPMTQPPP--PS 259
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1139 HLMKSPEAQPVGQGGNQPKAGPLRAGTGYMSSDGTNVLSGQKAEETQEALslldrkPPTPTSVLTTGREQSISAPSSCSY 1218
Cdd:pfam03154  260 QVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQV------PPGPSPAAPGQSQQRIHTPPSQSQ 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1219 LES-------------TTSSHAKTTRSISLGDSEGPVTAELPQSLHKPlSPGQELQAIPTTVALT--SSIKDHEPAplsw 1283
Cdd:pfam03154  334 LQSqqppreqplppapLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGP-SPFQMNSNLPPPPALKplSSLSTHHPP---- 408
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1284 gnheARASLKLTLSSVCEQLLSPPPQEPPITHVWSQEP--VDVPPSMAVTVASFCAPSP-----VDMSTLGLHSSMFLPK 1356
Cdd:pfam03154  409 ----SAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPpaASHPPTSGLHQVPSQSPFPqhpfvPGGPPPITPPSGPPTS 484
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1357 TSASGPLTPPAHLQLLETRSRVPGSTAALLEPT------PDASGVIADSPGHWDTEVPTPELlgsVESVLHRLQTA-FQE 1429
Cdd:pfam03154  485 TSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVqikeeaLDEAEEPESPPPPPRSPSPEPTV---VNTPSHASQSArFYK 561
                          490       500       510
                   ....*....|....*....|....*....|.
gi 143682198  1430 AL----------DLYRMLVSSSQLGPEQQQA 1450
Cdd:pfam03154  562 HLdrgynscartDLYFMPLAGSKLAKKREEA 592
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
364-744 1.32e-36

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 143.90  E-value: 1.32e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  364 ALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIRFWNL 442
Cdd:COG2319    83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFSPDGKTLaSGSADGTVRLWDL 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  443 DSAsdtrwqknifsdsllkvvyvendiQHLQDLSHFPDRgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRIHELH 522
Cdd:COG2319   150 ATG------------------------KLLRTLTGHSGA-------------VTSVAFSPDGKLLASGSDDGTVRLWDLA 192
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  523 FMDELIKVEAHDAEVLCLEYSkPEtGvTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFagTRDVQMI-SCGA 601
Cdd:COG2319   193 TGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAF--SPDGRLLaSGSA 266
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  602 DKSIYFRSAQqaSDGLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDEGSllkVHV 681
Cdd:COG2319   267 DGTVRLWDLA--TGELLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAF 338
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 143682198  682 DPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL 744
Cdd:COG2319   339 SPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
364-742 3.16e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 113.20  E-value: 3.16e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  364 ALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypefedqRACLPSGTFLTCSSDNTIRFWNLD 443
Cdd:cd00200    14 CVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKG---HTGPVRDV---------AASADGTYLASGSSDKTIRLWDLE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  444 SASDTRwqknIFsdsllkvvyvendIQHLQDlshfpdrgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRIHELHF 523
Cdd:cd00200    82 TGECVR----TL-------------TGHTSY--------------------VSSVAFSPDGRILSSSSRDKTIKVWDVET 124
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  524 MDELIKVEAHDAEVLCLEYSKPETgvtLLASASRDRLIHVLNVEKNYnLEQTLDDHSSSITAIKFAGTRDvQMISCGADK 603
Cdd:cd00200   125 GKCLTTLRGHTDWVNSVAFSPDGT---FVASSSQDGTIKLWDLRTGK-CVATLTGHTGEVNSVAFSPDGE-KLLSSSSDG 199
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  604 SIyfrsaqqasdglhfvrthhvaekttlydmdiditqkyvavacqdrnvRVYNTVSGKQKKCYkgsQGDEGSLLKVHVDP 683
Cdd:cd00200   200 TI-----------------------------------------------KLWDLSTGKCLGTL---RGHENGVNSVAFSP 229
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 143682198  684 SGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 742
Cdd:cd00200   230 DGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
WD40 COG2319
WD40 repeat [General function prediction only];
93-444 1.18e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 105.38  E-value: 1.18e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198   93 VVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGenGHRPTVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVS 172
Cdd:COG2319   144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  173 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEASTEAKVTStvpLVGRSGilgel 250
Cdd:COG2319   222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLATGELLRT---LTGHSG----- 289
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  251 hnnifcgvacgrgrmagntfcvsysgllcqfnekrvldkWINlkvslssCLCVS--DELIFCGCTDGIVRIFQAHSLLYL 328
Cdd:COG2319   290 ---------------------------------------GVN-------SVAFSpdGKLLASGSDDGTVRLWDLATGKLL 323
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  329 TNLpKPHYLGVDvahgldssflfhrkaeavypdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVW 408
Cdd:COG2319   324 RTL-TGHTGAVR-----------------------SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTG---HTGAVT 376
                         330       340       350
                  ....*....|....*....|....*....|....*..
gi 143682198  409 NVevypefedqrACLPSGTFL-TCSSDNTIRFWNLDS 444
Cdd:COG2319   377 SV----------AFSPDGRTLaSGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
111-441 2.68e-23

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 102.03  E-value: 2.68e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  111 RKSLSALAFSPDGKYIVTG-ENGhrpTVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 189
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGsGDG---TIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  190 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVTSTvplvgrsgilgelHNN-IFCGVACGRGRMA 266
Cdd:cd00200    84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVWDVETGKCLTTLRG-------------HTDwVNSVAFSPDGTFV 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  267 gntFCVSYSGL-----LCQFNEKRVL---DKWINlkvslssCLCVSD--ELIFCGCTDGIVRIFqahsllyltNLPKPHY 336
Cdd:cd00200   151 ---ASSSQDGTiklwdLRTGKCVATLtghTGEVN-------SVAFSPdgEKLLSSSSDGTIKLW---------DLSTGKC 211
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  337 LGVDVAHGldssflfhrkaEAVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypef 416
Cdd:cd00200   212 LGTLRGHE-----------NGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNSVTSL------ 267
                         330       340
                  ....*....|....*....|....*.
gi 143682198  417 edqrACLPSGTFL-TCSSDNTIRFWN 441
Cdd:cd00200   268 ----AWSPDGKRLaSGSADGTIRIWD 289
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
983-1450 2.08e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.46  E-value: 2.08e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198   983 EETEAGPEDQQGDTYLRVSSVSS-KDQSPPEDSGESEAELECS---FAAAHSSAPQTDPGPHLTMTAGKPEYPSteelSQ 1058
Cdd:pfam03154  128 DEGSSDPKDIDQDNRSTSPSIPSpQDNESDSDSSAQQQILQTQppvLQAQSGAASPPSPPPPGTTQAATAGPTP----SA 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1059 PELPGLGNGSLPQTPEQEKflrhhfetltdapteelfhGSLGDIKISETEDYFFNPRLSISTqflSRLQKTSRCPPrlPL 1138
Cdd:pfam03154  204 PSVPPQGSPATSQPPNQTQ-------------------STAAPHTLIQQTPTLHPQRLPSPH---PPLQPMTQPPP--PS 259
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1139 HLMKSPEAQPVGQGGNQPKAGPLRAGTGYMSSDGTNVLSGQKAEETQEALslldrkPPTPTSVLTTGREQSISAPSSCSY 1218
Cdd:pfam03154  260 QVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQV------PPGPSPAAPGQSQQRIHTPPSQSQ 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1219 LES-------------TTSSHAKTTRSISLGDSEGPVTAELPQSLHKPlSPGQELQAIPTTVALT--SSIKDHEPAplsw 1283
Cdd:pfam03154  334 LQSqqppreqplppapLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGP-SPFQMNSNLPPPPALKplSSLSTHHPP---- 408
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1284 gnheARASLKLTLSSVCEQLLSPPPQEPPITHVWSQEP--VDVPPSMAVTVASFCAPSP-----VDMSTLGLHSSMFLPK 1356
Cdd:pfam03154  409 ----SAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPpaASHPPTSGLHQVPSQSPFPqhpfvPGGPPPITPPSGPPTS 484
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1357 TSASGPLTPPAHLQLLETRSRVPGSTAALLEPT------PDASGVIADSPGHWDTEVPTPELlgsVESVLHRLQTA-FQE 1429
Cdd:pfam03154  485 TSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVqikeeaLDEAEEPESPPPPPRSPSPEPTV---VNTPSHASQSArFYK 561
                          490       500       510
                   ....*....|....*....|....*....|.
gi 143682198  1430 AL----------DLYRMLVSSSQLGPEQQQA 1450
Cdd:pfam03154  562 HLdrgynscartDLYFMPLAGSKLAKKREEA 592
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
704-743 2.93e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 42.68  E-value: 2.93e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 143682198    704 SGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWH 743
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
705-742 1.86e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.02  E-value: 1.86e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 143682198   705 GECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 742
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
102-141 2.78e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.91  E-value: 2.78e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 143682198    102 KQQHIFNTTRKSLSALAFSPDGKYIVTG-ENGhrpTVRIWD 141
Cdd:smart00320    3 ELLKTLKGHTGPVTSVAFSPDGKYLASGsDDG---TIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
101-141 4.26e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.55  E-value: 4.26e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 143682198   101 NKQQHIFNTTRKSLSALAFSPDGKYIVTG-ENGhrpTVRIWD 141
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGsDDG---TVKVWD 39
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
364-744 1.32e-36

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 143.90  E-value: 1.32e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  364 ALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIRFWNL 442
Cdd:COG2319    83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFSPDGKTLaSGSADGTVRLWDL 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  443 DSAsdtrwqknifsdsllkvvyvendiQHLQDLSHFPDRgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRIHELH 522
Cdd:COG2319   150 ATG------------------------KLLRTLTGHSGA-------------VTSVAFSPDGKLLASGSDDGTVRLWDLA 192
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  523 FMDELIKVEAHDAEVLCLEYSkPEtGvTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFagTRDVQMI-SCGA 601
Cdd:COG2319   193 TGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAF--SPDGRLLaSGSA 266
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  602 DKSIYFRSAQqaSDGLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDEGSllkVHV 681
Cdd:COG2319   267 DGTVRLWDLA--TGELLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAF 338
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 143682198  682 DPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL 744
Cdd:COG2319   339 SPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
364-742 3.16e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 113.20  E-value: 3.16e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  364 ALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypefedqRACLPSGTFLTCSSDNTIRFWNLD 443
Cdd:cd00200    14 CVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKG---HTGPVRDV---------AASADGTYLASGSSDKTIRLWDLE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  444 SASDTRwqknIFsdsllkvvyvendIQHLQDlshfpdrgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRIHELHF 523
Cdd:cd00200    82 TGECVR----TL-------------TGHTSY--------------------VSSVAFSPDGRILSSSSRDKTIKVWDVET 124
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  524 MDELIKVEAHDAEVLCLEYSKPETgvtLLASASRDRLIHVLNVEKNYnLEQTLDDHSSSITAIKFAGTRDvQMISCGADK 603
Cdd:cd00200   125 GKCLTTLRGHTDWVNSVAFSPDGT---FVASSSQDGTIKLWDLRTGK-CVATLTGHTGEVNSVAFSPDGE-KLLSSSSDG 199
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  604 SIyfrsaqqasdglhfvrthhvaekttlydmdiditqkyvavacqdrnvRVYNTVSGKQKKCYkgsQGDEGSLLKVHVDP 683
Cdd:cd00200   200 TI-----------------------------------------------KLWDLSTGKCLGTL---RGHENGVNSVAFSP 229
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 143682198  684 SGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 742
Cdd:cd00200   230 DGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
492-744 4.79e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 112.81  E-value: 4.79e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  492 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSKpetGVTLLASASRDRLIHVLNVEKNyN 571
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  572 LEQTLDDHSSSITAIKFAGTRDVqMISCGADKSIyfRSAQQASDGLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 651
Cdd:cd00200    85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTI--KVWDVETGKCLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  652 VRVYNTVSGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLI 731
Cdd:cd00200   159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
                         250
                  ....*....|...
gi 143682198  732 TVSGDSCVFIWHL 744
Cdd:cd00200   236 SGSEDGTIRVWDL 248
WD40 COG2319
WD40 repeat [General function prediction only];
42-567 2.69e-26

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 113.47  E-value: 2.69e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198   42 LRRRTRLAAAPEDTVQNRVTLEKVLGITAQNSSGLTCDPGTGHVAYLAGCVVVVLNPKENKQQHIFNTTRKSLSALAFSP 121
Cdd:COG2319     9 LAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  122 DGKYIVTGenGHRPTVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVSMGYQHdmVLNVWDWKKDIVVAS-NKVSCR 200
Cdd:COG2319    89 DGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTlTGHSGA 164
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  201 VIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVtstvpLVGrsgilgelHNNIFCGVAcgrgrmagntfcVSYSGllc 279
Cdd:COG2319   165 VTSVAFSPDGKLLASGSdDGTVRLWDLATGKLLRT-----LTG--------HTGAVRSVA------------FSPDG--- 216
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  280 qfnekrvldkwinlkvslssclcvsdELIFCGCTDGIVRIFQAHSLLYLTNLPkphylgvdvAHGldssflfhrkaEAVY 359
Cdd:COG2319   217 --------------------------KLLASGSADGTVRLWDLATGKLLRTLT---------GHS-----------GSVR 250
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  360 pdtvALTFDPVHQWLSCVYKDHSIYIWDVKDiDEVSKIWSElfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIR 438
Cdd:COG2319   251 ----SVAFSPDGRLLASGSADGTVRLWDLAT-GELLRTLTG--HSGGVNSV----------AFSPDGKLLaSGSDDGTVR 313
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  439 FWNLDSAsdtrwqknifsdsllkvvyvendiQHLQDLSHFPDRgsengtpmdmkagVRVMQVSPDGQHLASGDRSGNLRI 518
Cdd:COG2319   314 LWDLATG------------------------KLLRTLTGHTGA-------------VRSVAFSPDGKTLASGSDDGTVRL 356
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*....
gi 143682198  519 HELHFMDELIKVEAHDAEVLCLEYSKPEtgvTLLASASRDRLIHVLNVE 567
Cdd:COG2319   357 WDLATGELLRTLTGHTGAVTSVAFSPDG---RTLASGSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
533-742 2.94e-25

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 107.42  E-value: 2.94e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  533 HDAEVLCLEYSkpeTGVTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFAGTRDvQMISCGADKSIYFrsaqQ 612
Cdd:cd00200     8 HTGGVTCVAFS---PDGKLLATGSGDGTIKVWDLETG-ELLRTLKGHTGPVRDVAASADGT-YLASGSSDKTIRL----W 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  613 ASDGLHFVRTHHVAEKTtLYDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDegsLLKVHVDPSGTFLATSC 692
Cdd:cd00200    79 DLETGECVRTLTGHTSY-VSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDW---VNSVAFSPDGTFVASSS 154
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 143682198  693 SDKSISLIDFYSGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 742
Cdd:cd00200   155 QDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
WD40 COG2319
WD40 repeat [General function prediction only];
93-444 1.18e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 105.38  E-value: 1.18e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198   93 VVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGenGHRPTVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVS 172
Cdd:COG2319   144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  173 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEASTEAKVTStvpLVGRSGilgel 250
Cdd:COG2319   222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLATGELLRT---LTGHSG----- 289
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  251 hnnifcgvacgrgrmagntfcvsysgllcqfnekrvldkWINlkvslssCLCVS--DELIFCGCTDGIVRIFQAHSLLYL 328
Cdd:COG2319   290 ---------------------------------------GVN-------SVAFSpdGKLLASGSDDGTVRLWDLATGKLL 323
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  329 TNLpKPHYLGVDvahgldssflfhrkaeavypdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVW 408
Cdd:COG2319   324 RTL-TGHTGAVR-----------------------SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTG---HTGAVT 376
                         330       340       350
                  ....*....|....*....|....*....|....*..
gi 143682198  409 NVevypefedqrACLPSGTFL-TCSSDNTIRFWNLDS 444
Cdd:COG2319   377 SV----------AFSPDGRTLaSGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
2-518 1.73e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.99  E-value: 1.73e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198    2 AALAAGGYTRSDTIEKLSSVMAGVPARRNQSSPPPAPPLCLRRRTRLAAAPEDTVQNRVTLEKVLGITAQNSSGLTCDPG 81
Cdd:COG2319    11 AASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDG 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198   82 TGHVAYLAGCVVVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGENGHrpTVRIWDVEEKTQVAEMLGHKYGVACV 161
Cdd:COG2319    91 RLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTLTGHSGAVTSV 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  162 AFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVtstvp 239
Cdd:COG2319   169 AFSPDGKLLASGSD--DGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSaDGTVRLWDLATGKLLRT----- 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  240 LVGRSGILgelhnnifcgvacgrgrmagntFCVSYSGllcqfnekrvldkwinlkvslssclcvSDELIFCGCTDGIVRI 319
Cdd:COG2319   242 LTGHSGSV----------------------RSVAFSP---------------------------DGRLLASGSADGTVRL 272
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  320 FqahsllyltnlpkphylgvDVAHGLDSSFLFHRKAeAVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWS 399
Cdd:COG2319   273 W-------------------DLATGELLRTLTGHSG-GVN----SVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTG 328
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  400 elfHSSFVWNVevypefedqrACLPSGTFL-TCSSDNTIRFWNLdsasDTRWQKNIFSdsllkvvyvendiQHlqdlshf 478
Cdd:COG2319   329 ---HTGAVRSV----------AFSPDGKTLaSGSDDGTVRLWDL----ATGELLRTLT-------------GH------- 371
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|
gi 143682198  479 pdrgsengtpmdmKAGVRVMQVSPDGQHLASGDRSGNLRI 518
Cdd:COG2319   372 -------------TGAVTSVAFSPDGRTLASGSADGTVRL 398
WD40 COG2319
WD40 repeat [General function prediction only];
480-750 2.32e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.61  E-value: 2.32e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  480 DRGSENGTPMDMKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSkPETgvTLLASASRDR 559
Cdd:COG2319    66 AAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS-PDG--KTLASGSADG 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  560 LIHVLNVEKNyNLEQTLDDHSSSITAIKFA--GTRdvqMISCGADKSIYFRSAQQASDgLHFVRTHhvaeKTTLYDMDID 637
Cdd:COG2319   143 TVRLWDLATG-KLLRTLTGHSGAVTSVAFSpdGKL---LASGSDDGTVRLWDLATGKL-LRTLTGH----TGAVRSVAFS 213
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  638 ITQKYVAVACQDRNVRVYNTVSGKQKKCYKGsqgDEGSLLKVHVDPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEI 717
Cdd:COG2319   214 PDGKLLASGSADGTVRLWDLATGKLLRTLTG---HSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGG 290
                         250       260       270
                  ....*....|....*....|....*....|....*
gi 143682198  718 VTGMKFTYDCRHLITVSGDSCVFIWHL--GPEITT 750
Cdd:COG2319   291 VNSVAFSPDGKLLASGSDDGTVRLWDLatGKLLRT 325
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
111-441 2.68e-23

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 102.03  E-value: 2.68e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  111 RKSLSALAFSPDGKYIVTG-ENGhrpTVRIWDVEEKTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 189
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGsGDG---TIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  190 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASTEAKVTSTvplvgrsgilgelHNN-IFCGVACGRGRMA 266
Cdd:cd00200    84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVWDVETGKCLTTLRG-------------HTDwVNSVAFSPDGTFV 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  267 gntFCVSYSGL-----LCQFNEKRVL---DKWINlkvslssCLCVSD--ELIFCGCTDGIVRIFqahsllyltNLPKPHY 336
Cdd:cd00200   151 ---ASSSQDGTiklwdLRTGKCVATLtghTGEVN-------SVAFSPdgEKLLSSSSDGTIKLW---------DLSTGKC 211
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  337 LGVDVAHGldssflfhrkaEAVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIWSelfHSSFVWNVevypef 416
Cdd:cd00200   212 LGTLRGHE-----------NGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNSVTSL------ 267
                         330       340
                  ....*....|....*....|....*.
gi 143682198  417 edqrACLPSGTFL-TCSSDNTIRFWN 441
Cdd:cd00200   268 ----AWSPDGKRLaSGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
299-605 1.23e-18

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 88.16  E-value: 1.23e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  299 SCLCVSD--ELIFCGCTDGIVRIFQAHSLLYLTNLpKPHYLGV-DVAHGLDSSFLF------------------------ 351
Cdd:cd00200    13 TCVAFSPdgKLLATGSGDGTIKVWDLETGELLRTL-KGHTGPVrDVAASADGTYLAsgssdktirlwdletgecvrtltg 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  352 HRKAeaVYpdtvALTFDPVHQWLSCVYKDHSIYIWDVKDIDEVSKIwseLFHSSFVWNVEVypefedqracLPSGTFLTC 431
Cdd:cd00200    92 HTSY--VS----SVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL---RGHTDWVNSVAF----------SPDGTFVAS 152
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  432 SS-DNTIRFWNLDSasdtrwqknifsdsllkvvyvendiqhlqdlshfpdrGSENGTPMDMKAGVRVMQVSPDGQHLASG 510
Cdd:cd00200   153 SSqDGTIKLWDLRT-------------------------------------GKCVATLTGHTGEVNSVAFSPDGEKLLSS 195
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  511 DRSGNLRIHELHfMDELIKV-EAHDAEVLCLEYSKPEtgvTLLASASRDRLIHVLNVEKNYNLeQTLDDHSSSITAIKFA 589
Cdd:cd00200   196 SSDGTIKLWDLS-TGKCLGTlRGHENGVNSVAFSPDG---YLLASGSEDGTIRVWDLRTGECV-QTLSGHTNSVTSLAWS 270
                         330
                  ....*....|....*.
gi 143682198  590 GTRDVqMISCGADKSI 605
Cdd:cd00200   271 PDGKR-LASGSADGTI 285
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
153-563 1.05e-16

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 82.38  E-value: 1.05e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  153 GHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWFLEASt 230
Cdd:cd00200     7 GHTGGVTCVAFSPDGKLLATGSG--DGTIKVWDLETGELLRTLKGhTGPVRDVAASADGTYLASGSsDKTIRLWDLETG- 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  231 eaKVTSTvpLVGrsgilgelHNnifcgvacgrgrmaGNTFCVSYSgllcqfnekrvldkwinlkvslssclcVSDELIFC 310
Cdd:cd00200    84 --ECVRT--LTG--------HT--------------SYVSSVAFS---------------------------PDGRILSS 110
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  311 GCTDGIVRIFQAHSLLYLTnlpkphylgvdvahgldsSFLFHRKaeavypDTVALTFDPVHQWLSCVYKDHSIYIWDVKD 390
Cdd:cd00200   111 SSRDKTIKVWDVETGKCLT------------------TLRGHTD------WVNSVAFSPDGTFVASSSQDGTIKLWDLRT 166
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  391 IdevSKIWSELFHSSFVWNVEVYPEfedqraclpSGTFLTCSSDNTIRFWNLDSAsdtrwqknifsdsllkvvyvendiQ 470
Cdd:cd00200   167 G---KCVATLTGHTGEVNSVAFSPD---------GEKLLSSSSDGTIKLWDLSTG------------------------K 210
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  471 HLQDLshfpdRGSENgtpmdmkaGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSkPETGVt 550
Cdd:cd00200   211 CLGTL-----RGHEN--------GVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWS-PDGKR- 275
                         410
                  ....*....|...
gi 143682198  551 lLASASRDRLIHV 563
Cdd:cd00200   276 -LASGSADGTIRI 287
WD40 COG2319
WD40 repeat [General function prediction only];
480-750 1.05e-14

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 78.03  E-value: 1.05e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  480 DRGSENGTPMDMKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELIKVEAHDAEVLCLEYSkpeTGVTLLASASRDR 559
Cdd:COG2319    24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFS---PDGRLLASASADG 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  560 LIHVLNVEKNYNLeQTLDDHSSSITAIKFAgtrdvqmiscgadksiyfrsaqqaSDGlhfvrthhvaekttlydmdidit 639
Cdd:COG2319   101 TVRLWDLATGLLL-RTLTGHTGAVRSVAFS------------------------PDG----------------------- 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  640 qKYVAVACQDRNVRVYNTVSGKQKKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISLIDFYSGECVAKMFGHSEIVT 719
Cdd:COG2319   133 -KTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTS---VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 208
                         250       260       270
                  ....*....|....*....|....*....|...
gi 143682198  720 GMKFTYDCRHLITVSGDSCVFIWHL--GPEITT 750
Cdd:COG2319   209 SVAFSPDGKLLASGSADGTVRLWDLatGKLLRT 241
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
87-224 1.77e-13

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 72.75  E-value: 1.77e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198   87 YLAGC----VVVVLNPKENKQQHIFNTTRKSLSALAFSPDGKYIVTGENGHrpTVRIWDVEEKTQVAEMLGHKYGVACVA 162
Cdd:cd00200   107 ILSSSsrdkTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG--TIKLWDLRTGKCVATLTGHTGEVNSVA 184
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 143682198  163 FSPNMKHIVSMGyqHDMVLNVWDWKKDIVVASNKVSC-RVIALSFSEDSSYFVTVG-NRHVRFW 224
Cdd:cd00200   185 FSPDGEKLLSSS--SDGTIKLWDLSTGKCLGTLRGHEnGVNSVAFSPDGYLLASGSeDGTIRVW 246
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
632-755 1.48e-12

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 70.06  E-value: 1.48e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  632 YDMDIDITQKYVAVACQDRNVRVYNTVSGKQKKCYKGSqgdEGSLLKVHVDPSGTFLATSCSDKSISLIDFYSGECVAKM 711
Cdd:cd00200    13 TCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGH---TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTL 89
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 143682198  712 FGHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL-GPEITTCMKQH 755
Cdd:cd00200    90 TGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVeTGKCLTTLRGH 134
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
983-1450 2.08e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.46  E-value: 2.08e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198   983 EETEAGPEDQQGDTYLRVSSVSS-KDQSPPEDSGESEAELECS---FAAAHSSAPQTDPGPHLTMTAGKPEYPSteelSQ 1058
Cdd:pfam03154  128 DEGSSDPKDIDQDNRSTSPSIPSpQDNESDSDSSAQQQILQTQppvLQAQSGAASPPSPPPPGTTQAATAGPTP----SA 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1059 PELPGLGNGSLPQTPEQEKflrhhfetltdapteelfhGSLGDIKISETEDYFFNPRLSISTqflSRLQKTSRCPPrlPL 1138
Cdd:pfam03154  204 PSVPPQGSPATSQPPNQTQ-------------------STAAPHTLIQQTPTLHPQRLPSPH---PPLQPMTQPPP--PS 259
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1139 HLMKSPEAQPVGQGGNQPKAGPLRAGTGYMSSDGTNVLSGQKAEETQEALslldrkPPTPTSVLTTGREQSISAPSSCSY 1218
Cdd:pfam03154  260 QVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQV------PPGPSPAAPGQSQQRIHTPPSQSQ 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1219 LES-------------TTSSHAKTTRSISLGDSEGPVTAELPQSLHKPlSPGQELQAIPTTVALT--SSIKDHEPAplsw 1283
Cdd:pfam03154  334 LQSqqppreqplppapLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGP-SPFQMNSNLPPPPALKplSSLSTHHPP---- 408
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1284 gnheARASLKLTLSSVCEQLLSPPPQEPPITHVWSQEP--VDVPPSMAVTVASFCAPSP-----VDMSTLGLHSSMFLPK 1356
Cdd:pfam03154  409 ----SAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPpaASHPPTSGLHQVPSQSPFPqhpfvPGGPPPITPPSGPPTS 484
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198  1357 TSASGPLTPPAHLQLLETRSRVPGSTAALLEPT------PDASGVIADSPGHWDTEVPTPELlgsVESVLHRLQTA-FQE 1429
Cdd:pfam03154  485 TSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVqikeeaLDEAEEPESPPPPPRSPSPEPTV---VNTPSHASQSArFYK 561
                          490       500       510
                   ....*....|....*....|....*....|.
gi 143682198  1430 AL----------DLYRMLVSSSQLGPEQQQA 1450
Cdd:pfam03154  562 HLdrgynscartDLYFMPLAGSKLAKKREEA 592
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
704-743 2.93e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 42.68  E-value: 2.93e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 143682198    704 SGECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIWH 743
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
705-742 1.86e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.02  E-value: 1.86e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 143682198   705 GECVAKMFGHSEIVTGMKFTYDCRHLITVSGDSCVFIW 742
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
634-726 3.23e-04

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 41.11  E-value: 3.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 143682198   634 MDIditqkyVAVACQDRNVRVYNTvSGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISLIDFYSGECVAKMF 712
Cdd:pfam12894    7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
                           90
                   ....*....|....
gi 143682198   713 GHSEIVTGMKFTYD 726
Cdd:pfam12894   78 AGSDLITCLGWGEN 91
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
102-141 2.78e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.91  E-value: 2.78e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 143682198    102 KQQHIFNTTRKSLSALAFSPDGKYIVTG-ENGhrpTVRIWD 141
Cdd:smart00320    3 ELLKTLKGHTGPVTSVAFSPDGKYLASGsDDG---TIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
101-141 4.26e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.55  E-value: 4.26e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 143682198   101 NKQQHIFNTTRKSLSALAFSPDGKYIVTG-ENGhrpTVRIWD 141
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGsDDG---TVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
145-185 4.33e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.52  E-value: 4.33e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 143682198    145 KTQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 185
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
713-761 6.08e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 40.40  E-value: 6.08e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 143682198  713 GHSEIVTGMKFTYDCRHLITVSGDSCVFIWHL-GPEITTCMKQHLLEINH 761
Cdd:cd00200     7 GHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLeTGELLRTLKGHTGPVRD 56
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH