NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2217371896|ref|XP_047277709|]
View 

WD repeat-containing protein 97 isoform X8 [Homo sapiens]

Protein Classification

WD40 repeat domain-containing protein( domain architecture ID 13234759)

WD40 repeat domain-containing protein folds into a beta-propeller structure and functions as a scaffold, providing a platform for the interaction and assembly of several proteins into a signalosome; similar to a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly

CATH:  2.130.10.10
Gene Ontology:  GO:0005515
SCOP:  4002744

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
200-370 1.57e-15

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 78.92  E-value: 1.57e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  200 TCCLPVPDLRLLLVAEMNSSLALWQFRSGGRRLVLRGsalHpppspTGRLMRLAVAPVPPhhvlrcFAAYGSA---VLTF 276
Cdd:cd00200     97 SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRG---H-----TDWVNSVAFSPDGT------FVASSSQdgtIKLW 162
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  277 DLHAWTLVDVRRDlHKTTISDLAYCEEVEAMVTASRDSTVKVWEAD-WQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDG 355
Cdd:cd00200    163 DLRTGKCVATLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLStGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDG 241
                          170
                   ....*....|....*
gi 2217371896  356 TLRTWDLQAAAQVGE 370
Cdd:cd00200    242 TIRVWDLRTGECVQT 256
WD40 COG2319
WD40 repeat [General function prediction only];
206-719 7.76e-13

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 72.25  E-value: 7.76e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  206 PDLRLLLVAEMNSSLALWQFRSGGRRLVLRGSALHPPPSPTGRLMRLAVAPVPPHHVLRCFAAYGSAVLTFDLHAWTLVD 285
Cdd:COG2319      4 ADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLS 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  286 VRRDLHKTTIsdlayceeveamVTASRDSTVKVWEAD-WQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDGTLRTWDLQA 364
Cdd:COG2319     84 VAFSPDGRLL------------ASASADGTVRLWDLAtGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLAT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  365 AAQVGEvalgFWGQDKLSRRV-----GRLLApvrpgwpvlslCASS---MQLWRVR--ELYSPLAQLPAKVLHVQVAPAl 434
Cdd:COG2319    152 GKLLRT----LTGHSGAVTSVafspdGKLLA-----------SGSDdgtVRLWDLAtgKLLRTLTGHTGAVRSVAFSPD- 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  435 papahqslPTRLVCACADGSVYLLSAATGRIVSSLllepedcaaavayclprealwlltrAGHLVRANAArcpmsvlhrv 514
Cdd:COG2319    216 --------GKLLASGSADGTVRLWDLATGKLLRTL-------------------------TGHSGSVRSV---------- 252
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  515 cpppppapqpcclhlyshltdlegAFSSweivrqhwgelrcssvacawknKNRYLpVVGHTDGTLSVleW-LSSKTVFQT 593
Cdd:COG2319    253 ------------------------AFSP----------------------DGRLL-ASGSADGTVRL--WdLATGELLRT 283
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  594 EAHSPGPVVAIASTWNS--IVSSGGDLTVKMWRVfpyaeESLSLLRTFS---CCYPAVALCALGRRVTAGFEDpdsatyG 668
Cdd:COG2319    284 LTGHSGGVNSVAFSPDGklLASGSDDGTVRLWDL-----ATGKLLRTLTghtGAVRSVAFSPDGKTLASGSDD------G 352
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2217371896  669 LVQFGLGDSPRLDHRPQdDPTDHITGLCCCPTLKLYACSSLDCTVRIWTAE 719
Cdd:COG2319    353 TVRLWDLATGELLRTLT-GHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
200-370 1.57e-15

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 78.92  E-value: 1.57e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  200 TCCLPVPDLRLLLVAEMNSSLALWQFRSGGRRLVLRGsalHpppspTGRLMRLAVAPVPPhhvlrcFAAYGSA---VLTF 276
Cdd:cd00200     97 SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRG---H-----TDWVNSVAFSPDGT------FVASSSQdgtIKLW 162
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  277 DLHAWTLVDVRRDlHKTTISDLAYCEEVEAMVTASRDSTVKVWEAD-WQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDG 355
Cdd:cd00200    163 DLRTGKCVATLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLStGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDG 241
                          170
                   ....*....|....*
gi 2217371896  356 TLRTWDLQAAAQVGE 370
Cdd:cd00200    242 TIRVWDLRTGECVQT 256
WD40 COG2319
WD40 repeat [General function prediction only];
128-469 3.31e-15

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 79.57  E-value: 3.31e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  128 AGRLHLHKEDGWAQETLLAPVRLTGLVTVLGPLGAVGRFVGWGPAGLAILRPNLSLLWLSEQGVGRAPGWAPTCCLPVPD 207
Cdd:COG2319     10 AAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPD 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  208 LRLLLVAEMNSSLALWQFRSGGRRLVLRGsalHPPP------SPTGRlmRLAVApvpphhvlrcfaAYGSAVLTFDLHAW 281
Cdd:COG2319     90 GRLLASASADGTVRLWDLATGLLLRTLTG---HTGAvrsvafSPDGK--TLASG------------SADGTVRLWDLATG 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  282 TLVDVRRDlHKTTISDLAYCEEVEAMVTASRDSTVKVWE-ADWQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDGTLRTW 360
Cdd:COG2319    153 KLLRTLTG-HSGAVTSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  361 DLQAAAQVGEVAlgfwGQDKLSRRV-----GRLLApvrpgwpvlslCAS---SMQLWRV--RELYSPLAQLPAKVLHVQV 430
Cdd:COG2319    232 DLATGKLLRTLT----GHSGSVRSVafspdGRLLA-----------SGSadgTVRLWDLatGELLRTLTGHSGGVNSVAF 296
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 2217371896  431 APAlpapahqslPTRLVCACADGSVYLLSAATGRIVSSL 469
Cdd:COG2319    297 SPD---------GKLLASGSDDGTVRLWDLATGKLLRTL 326
WD40 COG2319
WD40 repeat [General function prediction only];
206-719 7.76e-13

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 72.25  E-value: 7.76e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  206 PDLRLLLVAEMNSSLALWQFRSGGRRLVLRGSALHPPPSPTGRLMRLAVAPVPPHHVLRCFAAYGSAVLTFDLHAWTLVD 285
Cdd:COG2319      4 ADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLS 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  286 VRRDLHKTTIsdlayceeveamVTASRDSTVKVWEAD-WQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDGTLRTWDLQA 364
Cdd:COG2319     84 VAFSPDGRLL------------ASASADGTVRLWDLAtGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLAT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  365 AAQVGEvalgFWGQDKLSRRV-----GRLLApvrpgwpvlslCASS---MQLWRVR--ELYSPLAQLPAKVLHVQVAPAl 434
Cdd:COG2319    152 GKLLRT----LTGHSGAVTSVafspdGKLLA-----------SGSDdgtVRLWDLAtgKLLRTLTGHTGAVRSVAFSPD- 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  435 papahqslPTRLVCACADGSVYLLSAATGRIVSSLllepedcaaavayclprealwlltrAGHLVRANAArcpmsvlhrv 514
Cdd:COG2319    216 --------GKLLASGSADGTVRLWDLATGKLLRTL-------------------------TGHSGSVRSV---------- 252
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  515 cpppppapqpcclhlyshltdlegAFSSweivrqhwgelrcssvacawknKNRYLpVVGHTDGTLSVleW-LSSKTVFQT 593
Cdd:COG2319    253 ------------------------AFSP----------------------DGRLL-ASGSADGTVRL--WdLATGELLRT 283
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  594 EAHSPGPVVAIASTWNS--IVSSGGDLTVKMWRVfpyaeESLSLLRTFS---CCYPAVALCALGRRVTAGFEDpdsatyG 668
Cdd:COG2319    284 LTGHSGGVNSVAFSPDGklLASGSDDGTVRLWDL-----ATGKLLRTLTghtGAVRSVAFSPDGKTLASGSDD------G 352
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2217371896  669 LVQFGLGDSPRLDHRPQdDPTDHITGLCCCPTLKLYACSSLDCTVRIWTAE 719
Cdd:COG2319    353 TVRLWDLATGELLRTLT-GHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
291-624 1.01e-12

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 70.44  E-value: 1.01e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  291 HKTTISDLAYCEEVEAMVTASRDSTVKVW--EADWQIRmVFVGHTGPVTAMTVLPNTTLVLSASQDGTLRTWDLQaaaqv 368
Cdd:cd00200      8 HTGGVTCVAFSPDGKLLATGSGDGTIKVWdlETGELLR-TLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLE----- 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  369 gevalgfwgQDKLSRRVGRLLAPVR-----PGWPVLSLCAS--SMQLWRVRElYSPLAQLPAK---VLHVQVAPAlpapa 438
Cdd:cd00200     82 ---------TGECVRTLTGHTSYVSsvafsPDGRILSSSSRdkTIKVWDVET-GKCLTTLRGHtdwVNSVAFSPD----- 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  439 hqslPTRLVCACADGSVYLLSAATGRIVSSLLLEpEDCAAAVAYcLPREAlwlltragHLVRANAARCpmsvlhrvcppp 518
Cdd:cd00200    147 ----GTFVASSSQDGTIKLWDLRTGKCVATLTGH-TGEVNSVAF-SPDGE--------KLLSSSSDGT------------ 200
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  519 ppapqpccLHLYShltdlegaFSSWEIVRQHWGELRcSSVACAWkNKNRYLPVVGHTDGTLSVLEWLSSKTVFQTEAHSp 598
Cdd:cd00200    201 --------IKLWD--------LSTGKCLGTLRGHEN-GVNSVAF-SPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHT- 261
                          330       340
                   ....*....|....*....|....*...
gi 2217371896  599 GPVVAIA--STWNSIVSSGGDLTVKMWR 624
Cdd:cd00200    262 NSVTSLAwsPDGKRLASGSADGTIRIWD 289
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
322-361 2.52e-07

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 48.46  E-value: 2.52e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2217371896   322 DWQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDGTLRTWD 361
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
323-361 2.42e-06

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 45.80  E-value: 2.42e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2217371896  323 WQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDGTLRTWD 361
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
200-370 1.57e-15

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 78.92  E-value: 1.57e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  200 TCCLPVPDLRLLLVAEMNSSLALWQFRSGGRRLVLRGsalHpppspTGRLMRLAVAPVPPhhvlrcFAAYGSA---VLTF 276
Cdd:cd00200     97 SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRG---H-----TDWVNSVAFSPDGT------FVASSSQdgtIKLW 162
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  277 DLHAWTLVDVRRDlHKTTISDLAYCEEVEAMVTASRDSTVKVWEAD-WQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDG 355
Cdd:cd00200    163 DLRTGKCVATLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLStGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDG 241
                          170
                   ....*....|....*
gi 2217371896  356 TLRTWDLQAAAQVGE 370
Cdd:cd00200    242 TIRVWDLRTGECVQT 256
WD40 COG2319
WD40 repeat [General function prediction only];
128-469 3.31e-15

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 79.57  E-value: 3.31e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  128 AGRLHLHKEDGWAQETLLAPVRLTGLVTVLGPLGAVGRFVGWGPAGLAILRPNLSLLWLSEQGVGRAPGWAPTCCLPVPD 207
Cdd:COG2319     10 AAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPD 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  208 LRLLLVAEMNSSLALWQFRSGGRRLVLRGsalHPPP------SPTGRlmRLAVApvpphhvlrcfaAYGSAVLTFDLHAW 281
Cdd:COG2319     90 GRLLASASADGTVRLWDLATGLLLRTLTG---HTGAvrsvafSPDGK--TLASG------------SADGTVRLWDLATG 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  282 TLVDVRRDlHKTTISDLAYCEEVEAMVTASRDSTVKVWE-ADWQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDGTLRTW 360
Cdd:COG2319    153 KLLRTLTG-HSGAVTSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  361 DLQAAAQVGEVAlgfwGQDKLSRRV-----GRLLApvrpgwpvlslCAS---SMQLWRV--RELYSPLAQLPAKVLHVQV 430
Cdd:COG2319    232 DLATGKLLRTLT----GHSGSVRSVafspdGRLLA-----------SGSadgTVRLWDLatGELLRTLTGHSGGVNSVAF 296
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 2217371896  431 APAlpapahqslPTRLVCACADGSVYLLSAATGRIVSSL 469
Cdd:COG2319    297 SPD---------GKLLASGSDDGTVRLWDLATGKLLRTL 326
WD40 COG2319
WD40 repeat [General function prediction only];
206-719 7.76e-13

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 72.25  E-value: 7.76e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  206 PDLRLLLVAEMNSSLALWQFRSGGRRLVLRGSALHPPPSPTGRLMRLAVAPVPPHHVLRCFAAYGSAVLTFDLHAWTLVD 285
Cdd:COG2319      4 ADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLS 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  286 VRRDLHKTTIsdlayceeveamVTASRDSTVKVWEAD-WQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDGTLRTWDLQA 364
Cdd:COG2319     84 VAFSPDGRLL------------ASASADGTVRLWDLAtGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLAT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  365 AAQVGEvalgFWGQDKLSRRV-----GRLLApvrpgwpvlslCASS---MQLWRVR--ELYSPLAQLPAKVLHVQVAPAl 434
Cdd:COG2319    152 GKLLRT----LTGHSGAVTSVafspdGKLLA-----------SGSDdgtVRLWDLAtgKLLRTLTGHTGAVRSVAFSPD- 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  435 papahqslPTRLVCACADGSVYLLSAATGRIVSSLllepedcaaavayclprealwlltrAGHLVRANAArcpmsvlhrv 514
Cdd:COG2319    216 --------GKLLASGSADGTVRLWDLATGKLLRTL-------------------------TGHSGSVRSV---------- 252
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  515 cpppppapqpcclhlyshltdlegAFSSweivrqhwgelrcssvacawknKNRYLpVVGHTDGTLSVleW-LSSKTVFQT 593
Cdd:COG2319    253 ------------------------AFSP----------------------DGRLL-ASGSADGTVRL--WdLATGELLRT 283
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  594 EAHSPGPVVAIASTWNS--IVSSGGDLTVKMWRVfpyaeESLSLLRTFS---CCYPAVALCALGRRVTAGFEDpdsatyG 668
Cdd:COG2319    284 LTGHSGGVNSVAFSPDGklLASGSDDGTVRLWDL-----ATGKLLRTLTghtGAVRSVAFSPDGKTLASGSDD------G 352
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2217371896  669 LVQFGLGDSPRLDHRPQdDPTDHITGLCCCPTLKLYACSSLDCTVRIWTAE 719
Cdd:COG2319    353 TVRLWDLATGELLRTLT-GHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
291-624 1.01e-12

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 70.44  E-value: 1.01e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  291 HKTTISDLAYCEEVEAMVTASRDSTVKVW--EADWQIRmVFVGHTGPVTAMTVLPNTTLVLSASQDGTLRTWDLQaaaqv 368
Cdd:cd00200      8 HTGGVTCVAFSPDGKLLATGSGDGTIKVWdlETGELLR-TLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLE----- 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  369 gevalgfwgQDKLSRRVGRLLAPVR-----PGWPVLSLCAS--SMQLWRVRElYSPLAQLPAK---VLHVQVAPAlpapa 438
Cdd:cd00200     82 ---------TGECVRTLTGHTSYVSsvafsPDGRILSSSSRdkTIKVWDVET-GKCLTTLRGHtdwVNSVAFSPD----- 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  439 hqslPTRLVCACADGSVYLLSAATGRIVSSLLLEpEDCAAAVAYcLPREAlwlltragHLVRANAARCpmsvlhrvcppp 518
Cdd:cd00200    147 ----GTFVASSSQDGTIKLWDLRTGKCVATLTGH-TGEVNSVAF-SPDGE--------KLLSSSSDGT------------ 200
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  519 ppapqpccLHLYShltdlegaFSSWEIVRQHWGELRcSSVACAWkNKNRYLPVVGHTDGTLSVLEWLSSKTVFQTEAHSp 598
Cdd:cd00200    201 --------IKLWD--------LSTGKCLGTLRGHEN-GVNSVAF-SPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHT- 261
                          330       340
                   ....*....|....*....|....*...
gi 2217371896  599 GPVVAIA--STWNSIVSSGGDLTVKMWR 624
Cdd:cd00200    262 NSVTSLAwsPDGKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
200-369 6.44e-12

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 68.13  E-value: 6.44e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  200 TCCLPVPDLRLLLVAEMNSSLALWQFRSGGRRLVLRGsalHpppspTGRLMrlAVAPVPPHHVLRCFAAYGSAVLtFDLH 279
Cdd:cd00200     55 RDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTG---H-----TSYVS--SVAFSPDGRILSSSSRDKTIKV-WDVE 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  280 AWTLVDVRRDlHKTTISDLAYCEEVEAMVTASRDSTVKVWEAD-WQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDGTLR 358
Cdd:cd00200    124 TGKCLTTLRG-HTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRtGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIK 202
                          170
                   ....*....|.
gi 2217371896  359 TWDLQAAAQVG 369
Cdd:cd00200    203 LWDLSTGKCLG 213
WD40 COG2319
WD40 repeat [General function prediction only];
206-462 8.56e-12

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 69.17  E-value: 8.56e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  206 PDLRLLLVAEMNSSLALWQFRSGGRRLVLRGsalHPPP------SPTGRlmRLAVApvpphhvlrcfaAYGSAVLTFDLH 279
Cdd:COG2319    172 PDGKLLASGSDDGTVRLWDLATGKLLRTLTG---HTGAvrsvafSPDGK--LLASG------------SADGTVRLWDLA 234
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  280 AWTLVDVRRDlHKTTISDLAYCEEVEAMVTASRDSTVKVWEAD-WQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDGTLR 358
Cdd:COG2319    235 TGKLLRTLTG-HSGSVRSVAFSPDGRLLASGSADGTVRLWDLAtGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVR 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  359 TWDLQAaaqvGEVALGFWGQDKLSRRV-----GRLLApvrpgwpvlslCASS---MQLWRV--RELYSPLAQLPAKVLHV 428
Cdd:COG2319    314 LWDLAT----GKLLRTLTGHTGAVRSVafspdGKTLA-----------SGSDdgtVRLWDLatGELLRTLTGHTGAVTSV 378
                          250       260       270
                   ....*....|....*....|....*....|....
gi 2217371896  429 QVAPAlpapahqslPTRLVCACADGSVYLLSAAT 462
Cdd:COG2319    379 AFSPD---------GRTLASGSADGTVRLWDLAT 403
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
322-361 2.52e-07

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 48.46  E-value: 2.52e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2217371896   322 DWQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDGTLRTWD 361
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
323-361 2.42e-06

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 45.80  E-value: 2.42e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2217371896  323 WQIRMVFVGHTGPVTAMTVLPNTTLVLSASQDGTLRTWD 361
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
559-716 1.25e-04

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 45.79  E-value: 1.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  559 ACAWKNKNRYLpVVGHTDGTLSVLEWLSSKTVFQTEAHSPGPVVAIASTW-NSIVSSGGDLTVKMWRVfpyaeESLSLLR 637
Cdd:cd00200     14 CVAFSPDGKLL-ATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADgTYLASGSSDKTIRLWDL-----ETGECVR 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  638 TFS--CCYP-AVALCALGRRVTAGFEDP-----DSATYGLVQFGLGdsprldHrpqddpTDHITGLCCCPTLKLYACSSL 709
Cdd:cd00200     88 TLTghTSYVsSVAFSPDGRILSSSSRDKtikvwDVETGKCLTTLRG------H------TDWVNSVAFSPDGTFVASSSQ 155

                   ....*..
gi 2217371896  710 DCTVRIW 716
Cdd:cd00200    156 DGTIKLW 162
WD40 COG2319
WD40 repeat [General function prediction only];
571-751 9.19e-03

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 40.28  E-value: 9.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  571 VVGHTDGTLSVLEWLSSKTVFQTEAHSpGPVVAIASTWNS--IVSSGGDLTVKMWRVfpyaeESLSLLRTF---SCCYPA 645
Cdd:COG2319     94 ASASADGTVRLWDLATGLLLRTLTGHT-GAVRSVAFSPDGktLASGSADGTVRLWDL-----ATGKLLRTLtghSGAVTS 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217371896  646 VALCALGRRVTAGFEDpdsatyGLVQFGLGDSPRLDHRPQDdPTDHITGLCCCPTLKLYACSSLDCTVRIWTAE--NRLL 723
Cdd:COG2319    168 VAFSPDGKLLASGSDD------GTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKLLASGSADGTVRLWDLAtgKLLR 240
                          170       180
                   ....*....|....*....|....*...
gi 2217371896  724 RLLQLNGAPQALAFcSNSGDLvLALGSR 751
Cdd:COG2319    241 TLTGHSGSVRSVAF-SPDGRL-LASGSA 266
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH