Conserved domains on  [gi|2462601033|ref|XP_054207839|]

dmX-like protein 1 isoform X1 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
Rav1p_C super family cl13644
RAVE protein 1 C terminal; This domain family is found in eukaryotes, and is typically between ...
1142-1875 2.20e-74

RAVE protein 1 C terminal; This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.


The actual alignment was detected with superfamily member pfam12234:

Pssm-ID: 432413  Cd Length: 637  Bit Score: 262.50  E-value: 2.20e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1142 LDWMSREDGSHILTVGIGSKLFMYGPLAgkvQDQTGKEtlafPLWestkvVPLSKFvllrsvdlvsSVDGSPPFPVS-LS 1220
Cdd:pfam12234   76 LDWTSTPDSQSILAVGFPHHVLLLTQLR---YDYTNKG----PSW-----APIRKI----------DIRDLTPHPIGdSI 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1221 WVRDGILVVGMDCEMHVYcqwqpsSKQEPVITDSYSGSTPSITSLIKQSNSssglhppkktltrsmtslaqkicgkktaf 1300
Cdd:pfam12234  134 WLDDGTLVVAAGNQLFIY------DKWLDLRLPDDPFTLRSIGSRKILSND----------------------------- 178
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1301 dpsvdmedsgLFEAAHVLSPTLPQYHPLQLLELMDLGKVRRAKAILSHLVKCIagevvalneaesnherrlrsltisasg 1380
Cdd:pfam12234  179 ----------LFHLVSVLNGPLPVYHPQFLIQCLLAGKLELVKEILLRLFKEL--------------------------- 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1381 sttrdpqafnKAENTDYTEIDSVPPLPLYALLAADDdscyssleksSNESTLSKSNQLSKESYDELFQTqllmtdthmle 1460
Cdd:pfam12234  222 ----------KFYSEDLEDLDSFLGIDLEKFLKDDD----------KAYSKNKAFTSSSDDDDPDPYET----------- 270
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1461 tdeentkprvidlsqysptyFGPEHAQVLSGHLLHSSLPGLSRMEQMSLMALADTIATTStdigesrdrsQGGETLDECG 1540
Cdd:pfam12234  271 --------------------FNEEVASSLNEKLTKISLPQLTRHEQITLINVIEAVGEVE----------KHRRSLDENG 320
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1541 LKFLLAVRLHTflttslpAYRAQLLHQGLSTSHFAWAFHSVAEEELLNMLPAMQKDDPTWSELRAMGVGWWVRNTRILRK 1620
Cdd:pfam12234  321 ARFLLGFKLHL-------LHKKRTSQSSLSWRDISWALHSDNQEILLDLVSRHYGNKLLWEAARESGIFMWLKDIEALRA 393
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1621 CIEKVAKAAfYRKN---DPLDAAIFYLAMKKKAVIWGLYR-AEKN---TRMTQFFGHNFEDERWRKAALKNAFSLLGKQR 1693
Cdd:pfam12234  394 QFEVIARNE-YTKSderDPVDCSLFYLALKKKQVLQGLWRmASWHpeqAKTLKFLSNDFSEPRWRTAALKNAFALLSKHR 472
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1694 FEHSAAFFLLAGCLRDAIEVCLEKLNDIQLALVIARLYESefDTSAAYKSILRKKVLgidsPVSElcslninMHHDPFLR 1773
Cdd:pfam12234  473 YEYAAAFFLLADSLKDAVNVLLRQLKDLQLAIAVARVYEG--DDGPVLRELLEERVL----PLAI-------KEGDRWLA 539
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1774 SMAYWILEDYSGALETLIKQPIRENDDQVLSASNPTVFNFYNYLRTHPLL------LRRhfgssdtfSTHMSLTGKSGLA 1847
Cdd:pfam12234  540 SWAFWMLKRRDLAVRALVTPPYDLLENTDLKKSDPASPVSKSFLTDDPALvllyqqLRK--------KTLQTLKGALKVT 611
                          730       740
                   ....*....|....*....|....*...
gi 2462601033 1848 GTInlsERRLFFTTASAHLKAGCPMLAL 1875
Cdd:pfam12234  612 PKE---EYDFVLRVARIYDRMGCDLLAL 636
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2806-3063 1.41e-22

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 100.87  E-value: 1.41e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2806 VRRMTSHPTLPYYLTGAQDGSVRMFEWGHSQQITCFRsGGNSRVTRMRFNYQGNKFGIVDADGYLSLYQTNWKCCPVTgs 2885
Cdd:cd00200     12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRT-- 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2886 mpkpyltWQCHNKTANDFVFVSSSSLIATAGlstDNRNVCLWDTlvaPANSLVHAFTCHDSGATVLAYAPKHQLLISGGR 2965
Cdd:cd00200     89 -------LTGHTSYVSSVAFSPDGRILSSSS---RDKTIKVWDV---ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQ 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2966 KGFTYVFDLCQRQQRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWSLSTFGLLHTFVSeharqsiFRNIGTGVMQI 3045
Cdd:cd00200    156 DGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRG-------HENGVNSVAFS 228
                          250
                   ....*....|....*...
gi 2462601033 3046 ETGpaNHIFSCGADGTMK 3063
Cdd:cd00200    229 PDG--YLLASGSEDGTIR 244
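Each hit above carries a statistics line of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...", and each alignment row labelled "Cdd:" gives the first and last model position covered. A minimal Python sketch for reading those values and estimating how much of the domain model an alignment spans might look like the following. The names `parse_stats` and `model_coverage` are hypothetical helpers introduced here for illustration, and the regular expression assumes the exact line layout seen in this page dump rather than any official NCBI output specification.

```python
import re

# Pattern for the per-hit statistics line as it appears in this page dump.
# The optional "[Multi-domain]" tag is captured when present.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the statistics of one hit as a dict, or None if the line does not match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def model_coverage(first_pos, last_pos, cd_length):
    """Fraction of the domain model spanned by the alignment (positions are inclusive)."""
    return (last_pos - first_pos + 1) / cd_length


if __name__ == "__main__":
    stats = parse_stats(
        "Pssm-ID: 432413  Cd Length: 637  Bit Score: 262.50  E-value: 2.20e-74"
    )
    # The pfam12234 alignment above runs from model position 76 to 636.
    coverage = model_coverage(76, 636, stats["cd_length"])
    print(stats, f"coverage={coverage:.2f}")
```

The coverage figure only measures the span between the first and last aligned model positions; it does not subtract gaps inside the alignment.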
 
Name Accession Description Interval E-value
Rav1p_C pfam12234
RAVE protein 1 C terminal; This domain family is found in eukaryotes, and is typically between ...
1142-1875 2.20e-74

RAVE protein 1 C terminal; This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.


Pssm-ID: 432413  Cd Length: 637  Bit Score: 262.50  E-value: 2.20e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1142 LDWMSREDGSHILTVGIGSKLFMYGPLAgkvQDQTGKEtlafPLWestkvVPLSKFvllrsvdlvsSVDGSPPFPVS-LS 1220
Cdd:pfam12234   76 LDWTSTPDSQSILAVGFPHHVLLLTQLR---YDYTNKG----PSW-----APIRKI----------DIRDLTPHPIGdSI 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1221 WVRDGILVVGMDCEMHVYcqwqpsSKQEPVITDSYSGSTPSITSLIKQSNSssglhppkktltrsmtslaqkicgkktaf 1300
Cdd:pfam12234  134 WLDDGTLVVAAGNQLFIY------DKWLDLRLPDDPFTLRSIGSRKILSND----------------------------- 178
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1301 dpsvdmedsgLFEAAHVLSPTLPQYHPLQLLELMDLGKVRRAKAILSHLVKCIagevvalneaesnherrlrsltisasg 1380
Cdd:pfam12234  179 ----------LFHLVSVLNGPLPVYHPQFLIQCLLAGKLELVKEILLRLFKEL--------------------------- 221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1381 sttrdpqafnKAENTDYTEIDSVPPLPLYALLAADDdscyssleksSNESTLSKSNQLSKESYDELFQTqllmtdthmle 1460
Cdd:pfam12234  222 ----------KFYSEDLEDLDSFLGIDLEKFLKDDD----------KAYSKNKAFTSSSDDDDPDPYET----------- 270
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1461 tdeentkprvidlsqysptyFGPEHAQVLSGHLLHSSLPGLSRMEQMSLMALADTIATTStdigesrdrsQGGETLDECG 1540
Cdd:pfam12234  271 --------------------FNEEVASSLNEKLTKISLPQLTRHEQITLINVIEAVGEVE----------KHRRSLDENG 320
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1541 LKFLLAVRLHTflttslpAYRAQLLHQGLSTSHFAWAFHSVAEEELLNMLPAMQKDDPTWSELRAMGVGWWVRNTRILRK 1620
Cdd:pfam12234  321 ARFLLGFKLHL-------LHKKRTSQSSLSWRDISWALHSDNQEILLDLVSRHYGNKLLWEAARESGIFMWLKDIEALRA 393
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1621 CIEKVAKAAfYRKN---DPLDAAIFYLAMKKKAVIWGLYR-AEKN---TRMTQFFGHNFEDERWRKAALKNAFSLLGKQR 1693
Cdd:pfam12234  394 QFEVIARNE-YTKSderDPVDCSLFYLALKKKQVLQGLWRmASWHpeqAKTLKFLSNDFSEPRWRTAALKNAFALLSKHR 472
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1694 FEHSAAFFLLAGCLRDAIEVCLEKLNDIQLALVIARLYESefDTSAAYKSILRKKVLgidsPVSElcslninMHHDPFLR 1773
Cdd:pfam12234  473 YEYAAAFFLLADSLKDAVNVLLRQLKDLQLAIAVARVYEG--DDGPVLRELLEERVL----PLAI-------KEGDRWLA 539
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 1774 SMAYWILEDYSGALETLIKQPIRENDDQVLSASNPTVFNFYNYLRTHPLL------LRRhfgssdtfSTHMSLTGKSGLA 1847
Cdd:pfam12234  540 SWAFWMLKRRDLAVRALVTPPYDLLENTDLKKSDPASPVSKSFLTDDPALvllyqqLRK--------KTLQTLKGALKVT 611
                          730       740
                   ....*....|....*....|....*...
gi 2462601033 1848 GTInlsERRLFFTTASAHLKAGCPMLAL 1875
Cdd:pfam12234  612 PKE---EYDFVLRVARIYDRMGCDLLAL 636
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2806-3063 1.41e-22

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 100.87  E-value: 1.41e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2806 VRRMTSHPTLPYYLTGAQDGSVRMFEWGHSQQITCFRsGGNSRVTRMRFNYQGNKFGIVDADGYLSLYQTNWKCCPVTgs 2885
Cdd:cd00200     12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRT-- 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2886 mpkpyltWQCHNKTANDFVFVSSSSLIATAGlstDNRNVCLWDTlvaPANSLVHAFTCHDSGATVLAYAPKHQLLISGGR 2965
Cdd:cd00200     89 -------LTGHTSYVSSVAFSPDGRILSSSS---RDKTIKVWDV---ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQ 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2966 KGFTYVFDLCQRQQRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWSLSTFGLLHTFVSeharqsiFRNIGTGVMQI 3045
Cdd:cd00200    156 DGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRG-------HENGVNSVAFS 228
                          250
                   ....*....|....*...
gi 2462601033 3046 ETGpaNHIFSCGADGTMK 3063
Cdd:cd00200    229 PDG--YLLASGSEDGTIR 244
WD40 COG2319
WD40 repeat [General function prediction only];
2819-3063 5.33e-20

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 95.36  E-value: 5.33e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2819 LTGAQDGSVRMFEWGHSQQITCFRsGGNSRVTRMRFNYQGNKFGIVDADGYLSLYQTNwkccpvTGsmpKPYLTWQCHNK 2898
Cdd:COG2319    136 ASGSADGTVRLWDLATGKLLRTLT-GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA------TG---KLLRTLTGHTG 205
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2899 TANDFVFVSSSSLIATAGlstDNRNVCLWDtlvAPANSLVHAFTCHDSGATVLAYAPKHQLLISGGRKGFTYVFDLCQRQ 2978
Cdd:COG2319    206 AVRSVAFSPDGKLLASGS---ADGTVRLWD---LATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGE 279
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2979 QRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWSLSTFGLLHTFVSEHARqsifrnigtgVMQIETGPA-NHIFSCG 3057
Cdd:COG2319    280 LLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGA----------VRSVAFSPDgKTLASGS 349

                   ....*.
gi 2462601033 3058 ADGTMK 3063
Cdd:COG2319    350 DDGTVR 355
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2976-3015 3.26e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.07  E-value: 3.26e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2462601033  2976 QRQQRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWS 3015
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
2977-3015 2.76e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.41  E-value: 2.76e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2462601033 2977 RQQRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWS 3015
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
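In the alignment blocks above, the "gi" row is the query and the "Cdd:" row is the domain model; dashes mark gaps, and lowercase query residues sit opposite gaps and are not aligned to the model. Under those reading assumptions, a small Python sketch can count identical residues over the aligned columns of one pair of rows. The function name `aligned_identity` is a hypothetical helper, and the treatment of lowercase letters reflects how this dump appears to be laid out rather than a documented guarantee.

```python
def aligned_identity(query_row, model_row):
    """Count (identical, aligned) columns for one pair of alignment rows.

    A column is counted as aligned when neither row has a gap ('-') and the
    query residue is uppercase. Lowercase query residues are treated as
    unaligned insertions and skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")

    aligned = 0
    identical = 0
    for query_res, model_res in zip(query_row, model_row):
        if query_res == "-" or model_res == "-" or query_res.islower():
            continue
        aligned += 1
        if query_res == model_res.upper():
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Rows copied from the pfam00400 hit above (query residues 2977-3015).
    query = "RQQRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWS"
    model = "GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD"
    identical, aligned = aligned_identity(query, model)
    print(f"{identical} identical of {aligned} aligned columns")
```

For multi-row alignments such as the cd00200 hit, the same function can be applied to each pair of rows and the counts summed.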
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2844-3064 5.04e-21

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 96.25  E-value: 5.04e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2844 GGNSRVTRMRFNYQGNKFGIVDADGYLSLYQTNWKCCPVTGsmpkpyltwQCHNKTANDFVFVSSSSLIATAGlstDNRN 2923
Cdd:cd00200      7 GHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTL---------KGHTGPVRDVAASADGTYLASGS---SDKT 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2924 VCLWDTlvaPANSLVHAFTCHDSGATVLAYAPKHQLLISGGRKGFTYVFDLCQRQQRQLFQSHDSPVKAVAVDPTEEYFV 3003
Cdd:cd00200     75 IRLWDL---ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVA 151
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462601033 3004 TGSAEGNIKIWSLSTFGLLHTFvSEHARQsifrniGTGVMQIETGpaNHIFSCGADGTMKM 3064
Cdd:cd00200    152 SSSQDGTIKLWDLRTGKCVATL-TGHTGE------VNSVAFSPDG--EKLLSSSSDGTIKL 203
WD40 COG2319
WD40 repeat [General function prediction only];
2820-3063 2.42e-19

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 93.44  E-value: 2.42e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2820 TGAQDGSVRMFEWGHSQQITCFRsGGNSRVTRMRFNYQGNKFGIVDADGYLSLYQTNwkccpvTGsmpKPYLTWQCHNKT 2899
Cdd:COG2319    179 SGSDDGTVRLWDLATGKLLRTLT-GHTGAVRSVAFSPDGKLLASGSADGTVRLWDLA------TG---KLLRTLTGHSGS 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2900 ANDFVFVSSSSLIATAGlstDNRNVCLWDtlvAPANSLVHAFTCHDSGATVLAYAPKHQLLISGGRKGFTYVFDLCQRQQ 2979
Cdd:COG2319    249 VRSVAFSPDGRLLASGS---ADGTVRLWD---LATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKL 322
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2980 RQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWSLSTFGLLHTFvSEHARqsifrnigtGVMQIETGPA-NHIFSCGA 3058
Cdd:COG2319    323 LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTL-TGHTG---------AVTSVAFSPDgRTLASGSA 392

                   ....*
gi 2462601033 3059 DGTMK 3063
Cdd:COG2319    393 DGTVR 397
WD40 COG2319
WD40 repeat [General function prediction only];
2820-3018 1.95e-18

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 90.74  E-value: 1.95e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2820 TGAQDGSVRMFEWGHSQQITCFRsGGNSRVTRMRFNYQGNKFGIVDADGYLSLYQTNwkccpvTGSMPKpylTWQCHNKT 2899
Cdd:COG2319    221 SGSADGTVRLWDLATGKLLRTLT-GHSGSVRSVAFSPDGRLLASGSADGTVRLWDLA------TGELLR---TLTGHSGG 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2900 ANDFVFVSSSSLIATAGlstDNRNVCLWDtlvAPANSLVHAFTCHDSGATVLAYAPKHQLLISGGRKGFTYVFDLCQRQQ 2979
Cdd:COG2319    291 VNSVAFSPDGKLLASGS---DDGTVRLWD---LATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGEL 364
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2462601033 2980 RQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWSLST 3018
Cdd:COG2319    365 LRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2804-3015 1.56e-16

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 82.77  E-value: 1.56e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2804 NNVRRMTSHPTLPYYLTGAQDGSVRMFEWGHSQQITCFRsGGNSRVTRMRFNyQGNKFGIV-DADGYLSLYQT-NWKCCP 2881
Cdd:cd00200     94 SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLR-GHTDWVNSVAFS-PDGTFVASsSQDGTIKLWDLrTGKCVA 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2882 V-TGsmpkpyltwqcHNKTANDFVFVSSSSLIATAGlstDNRNVCLWDTLVApanSLVHAFTCHDSGATVLAYAPKHQLL 2960
Cdd:cd00200    172 TlTG-----------HTGEVNSVAFSPDGEKLLSSS---SDGTIKLWDLSTG---KCLGTLRGHENGVNSVAFSPDGYLL 234
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462601033 2961 ISGGRKGFTYVFDLCQRQQRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWS 3015
Cdd:cd00200    235 ASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
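
The summary line of the cd00200 hit above (Pssm-ID, Cd Length, Bit Score, E-value) and the model positions in its Cdd row (94 to 289) can be read programmatically. The following Python sketch assumes this plain-text layout; the regular expression and the function names are hypothetical and are not part of any NCBI tool. It parses the summary line and estimates how much of the 289-position model the alignment spans.

```python
import re

# Summary line copied from the cd00200 hit above.
SUMMARY = "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 82.77  E-value: 1.56e-16"

# Hypothetical pattern for this page's text layout (an assumption, not an NCBI format spec).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of one hit summary line as a dict."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"unrecognised summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(model_from, model_to, cd_length):
    """Fraction of the domain model spanned by the alignment (inclusive coordinates)."""
    return (model_to - model_from + 1) / cd_length


hit = parse_summary(SUMMARY)
# Model positions 94 and 289 are the first and last Cdd coordinates in the alignment above.
coverage = model_coverage(94, 289, hit["cd_length"])
print(hit)
print(f"model coverage: {coverage:.1%}")
```

The coverage figure counts the span between the first and last aligned model positions, so gapped columns inside the alignment are included in it.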
WD40 COG2319
WD40 repeat [General function prediction only];
2891-3063 1.73e-14

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 78.41  E-value: 1.73e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2891 LTWQCHNKTANDFVFVSSSSLIATAGlstDNRNVCLWDtlvAPANSLVHAFTCHDSGATVLAYAPKHQLLISGGRKGFTY 2970
Cdd:COG2319     72 ATLLGHTAAVLSVAFSPDGRLLASAS---ADGTVRLWD---LATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVR 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2971 VFDLCQRQQRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWSLSTFGLLHTFvSEHARqsifrnigtGVMQIETGPA 3050
Cdd:COG2319    146 LWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTL-TGHTG---------AVRSVAFSPD 215
                          170
                   ....*....|....
gi 2462601033 3051 -NHIFSCGADGTMK 3063
Cdd:COG2319    216 gKLLASGSADGTVR 229
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
2937-3064 2.41e-11

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 67.36  E-value: 2.41e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2937 LVHAFTCHDSGATVLAYAPKHQLLISGGRKGFTYVFDLCQRQQRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWSL 3016
Cdd:cd00200      1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDL 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2462601033 3017 STFGLLHTFVSeHARqsifrnigtGVMQIETGPANHIF-SCGADGTMKM 3064
Cdd:cd00200     81 ETGECVRTLTG-HTS---------YVSSVAFSPDGRILsSSSRDKTIKV 119
WD40 COG2319
WD40 repeat [General function prediction only];
2904-3025 1.82e-10

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 66.09  E-value: 1.82e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462601033 2904 VFVSSSSLIATAGLSTDNRNVCLWDtlvAPANSLVHAFTCHDSGATVLAYAPKHQLLISGGRKGFTYVFDLCQRQQRQLF 2983
Cdd:COG2319     40 ASLAASPDGARLAAGAGDLTLLLLD---AAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL 116
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 2462601033 2984 QSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWSLSTFGLLHTF 3025
Cdd:COG2319    117 TGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTL 158
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
2976-3015 3.26e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.07  E-value: 3.26e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2462601033  2976 QRQQRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWS 3015
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
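
The WD40 description earlier in this listing mentions a GH dipeptide near the N-terminus and a WD dipeptide at the C-terminus of a roughly 40-residue repeat. As a rough illustration, the Python sketch below looks for those two landmarks in the smart00320 alignment above and counts identical columns. The function names are hypothetical, and the dipeptide check is a naive string test, not the profile-based scoring that CD-Search performs. Both rows are copied from the alignment, which contains no gaps.

```python
# Rows copied from the smart00320 alignment above (40 columns, no gaps).
QUERY = "QRQQRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWS"      # gi 2462601033, residues 2976-3015
CONSENSUS = "SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD"  # Cdd:smart00320, positions 1-40


def repeat_landmarks(seq):
    """Report the length, the 0-based index of the first 'GH' (-1 if absent),
    and whether the sequence ends with 'WD'."""
    return {
        "length": len(seq),
        "gh_index": seq.find("GH"),
        "ends_with_wd": seq.endswith("WD"),
    }


def identical_columns(a, b):
    """Count positions where two equal-length, ungapped rows carry the same residue."""
    if len(a) != len(b):
        raise ValueError("rows must have the same length")
    return sum(x == y for x, y in zip(a, b))


print("model:", repeat_landmarks(CONSENSUS))
print("query:", repeat_landmarks(QUERY))
print("identical columns:", identical_columns(QUERY, CONSENSUS), "of", len(QUERY))
```

In this alignment the model row carries both dipeptides, while the query row has SH and WS at the corresponding positions, so a literal motif search on the query alone would not flag this repeat.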
WD40 pfam00400
WD domain, G-beta repeat;
2977-3015 2.76e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.41  E-value: 2.76e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2462601033 2977 RQQRQLFQSHDSPVKAVAVDPTEEYFVTGSAEGNIKIWS 3015
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
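
To show how the E-value threshold of 0.01 relates to the listing, the Python sketch below applies it to the six hits whose headers appear in this part of the page and merges their query intervals. The accessions, intervals, and E-values are copied from the hit headers. The helper names are hypothetical, and treating the threshold as inclusive is an assumption.

```python
# (accession, query interval, E-value) copied from the hit headers in this part of the listing.
HITS = [
    ("cd00200", (2804, 3015), 1.56e-16),
    ("COG2319", (2891, 3063), 1.73e-14),
    ("cd00200", (2937, 3064), 2.41e-11),
    ("COG2319", (2904, 3025), 1.82e-10),
    ("smart00320", (2976, 3015), 3.26e-05),
    ("pfam00400", (2977, 3015), 2.76e-04),
]

E_VALUE_THRESHOLD = 0.01  # from the preset options; inclusive comparison is an assumption


def filter_hits(hits, threshold=E_VALUE_THRESHOLD):
    """Keep hits whose E-value does not exceed the threshold."""
    return [hit for hit in hits if hit[2] <= threshold]


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


kept = filter_hits(HITS)
footprint = merge_intervals([interval for _, interval, _ in kept])
print(f"{len(kept)} of {len(HITS)} hits pass the threshold")
print("merged query footprint:", footprint)
```

All six hits fall below the threshold, and their intervals overlap into a single stretch of the query.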
