|
Name |
Accession |
Description |
Interval |
E-value |
| Raptor_N |
pfam14538 |
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor ... |
55-206 |
5.21e-96 |
|
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor protein. It has been identified to have a CASPase like structure. It conserves the characteristic cys/his dyad of the caspases, suggesting it may have peptidase activity.
Pssm-ID: 464202 Cd Length: 152 Bit Score: 304.20 E-value: 5.21e-96
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 55 MKTVSVALVLCLNVGVDPPDVVKTTPCARLECWIDPLSMGPQKALETIGANLQKQYENWQPRARYKQSLDPTVDEVKKLC 134
Cdd:pfam14538 1 LKTVSVALVLCLNIGVDPPDVVKTKPCARLECWIDPSSMSPQKALEEIGKNLQDQYESWQPRARYKQSLDPSVEDVKKLC 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 154146249 135 TSLRRNAKEERVLFHYNGHGVPRPTVNGEVWVFNKNYTQYIPLSIYDLQTWMGSPSIFVYDCSNAGLIVKSF 206
Cdd:pfam14538 81 SKLRRNAKDERVLFHYNGHGVPRPTSNGEIWVFNKDYTQYIPLSIYDLFSWLGSPSIFIFDCSNAGNLLNAF 152
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1027-1322 |
1.89e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 99.33 E-value: 1.89e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1027 SVVKFHPFTPCIAVADKDSICF-WDWEKGEKLDYF--HNGNPRYTRVTAmeylngqDCSLLLTATDDGAIRVWknfaDLE 1103
Cdd:cd00200 13 TCVAFSPDGKLLATGSGDGTIKvWDLETGELLRTLkgHTGPVRDVAASA-------DGTYLASGSSDKTIRLW----DLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1104 KNpEMVTAWQGLSDMLpttrgagMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVA 1183
Cdd:cd00200 82 TG-ECVRTLTGHTSYV-------SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL-RGHTDWVNSVAFSPDGTFVAS 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1184 GLGDGSIRVYDrrMALSECrVMTYREHTAWVVKayLQKHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQI-VKGLTALDI 1261
Cdd:cd00200 153 SSQDGTIKLWD--LRTGKC-VATLTGHTGEVNS--VAFSPDGeKLLSSSSDGTIKLWDLSTGKCLGTLRGhENGVNSVAF 227
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 154146249 1262 HPQANLIACGSMNqftaiyngngeliNNIKYYDGFMGQRV-------GAISCLAFHPHWPHLAVGSND 1322
Cdd:cd00200 228 SPDGYLLASGSED-------------GTIRVWDLRTGECVqtlsghtNSVTSLAWSPDGKRLASGSAD 282
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
563-670 |
1.86e-08 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 54.25 E-value: 1.86e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 563 LEQLSDPHPLLRQWVAICLGRIWQnfdsarwcgvrDSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDH 642
Cdd:COG1413 53 LEALKDPDPEVRAAAAEALGRIGD-----------PEAVPALIAALKDEDPEVRRAAAEALG--------RLGDPAAVPA 113
|
90 100
....*....|....*....|....*...
gi 154146249 643 nvammLAQLINDGSPMVRKELVVALSHL 670
Cdd:COG1413 114 -----LLEALKDPDWEVRRAAARALGRL 136
|
|
| COG5096 super family |
cl34899 |
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular ... |
472-696 |
1.97e-04 |
|
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular transport]; The actual alignment was detected with superfamily member COG5096:
Pssm-ID: 227427 [Multi-domain] Cd Length: 757 Bit Score: 45.87 E-value: 1.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 472 IFPYVLKLLQSSARELRPLLVFIW---AK------ILAVDSscqadLVKDNGHKyflsvlaDPYMpaehRTMTAFILAVI 542
Cdd:COG5096 56 LFPDVIKNVATRDVELKRLLYLYLeryAKlkpelaLLAVNT-----IQKDLQDP-------NEEI----RGFALRTLSLL 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 543 vnsyttgQEACLQGNLIAICLEQLSDPHPLLRQWVAICLGRIWQnFDSARWCGVRDSAHEKLysLLSDPIPEVRCAAVFA 622
Cdd:COG5096 120 -------RVKELLGNIIDPIKKLLTDPHAYVRKTAALAVAKLYR-LDKDLYHELGLIDILKE--LVADSDPIVIANALAS 189
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 154146249 623 LGTFVGNSAErtDHSTTIDHNVAMMLAQLINDGSPMVRKELVVALSHLVVQYESN---FCT-VALQFMEEEKNYPLPS 696
Cdd:COG5096 190 LAEIDPELAH--GYSLEVILRIPQLDLLSLSVSTEWLLLIILEVLTERVPTTPDSaedFEErLSPPLQHNNAEVLLIA 265
|
|
|
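The pairwise alignments above can be summarized numerically. The following is a minimal sketch, not part of the CD-Search output, that computes percent identity between one query row and the matching Cdd row. It assumes that lowercase letters mark unaligned residues and that `-` marks a gap, which is how the rows above appear to be laid out; the function name `percent_identity` is hypothetical.

```python
def percent_identity(query_row: str, cdd_row: str) -> float:
    """Percent of aligned columns in which the two rows carry the same residue.

    Assumption: columns containing a gap ('-') or a lowercase (unaligned)
    residue in either row are skipped and do not count as aligned.
    """
    if len(query_row) != len(cdd_row):
        raise ValueError("alignment rows must have the same length")

    aligned = 0
    identical = 0
    for q, c in zip(query_row, cdd_row):
        if q == "-" or c == "-" or q.islower() or c.islower():
            continue  # gap or unaligned column
        aligned += 1
        if q == c:
            identical += 1

    # Avoid division by zero when no column is aligned.
    return 100.0 * identical / aligned if aligned else 0.0


# First ten columns of the Raptor_N alignment (query residues 55-64).
print(percent_identity("MKTVSVALVL", "LKTVSVALVL"))  # 90.0
```

Only the sequence strings should be passed in; the `gi 154146249 55` and `Cdd:pfam14538 1` prefixes and the trailing residue numbers need to be stripped first.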
|
Name |
Accession |
Description |
Interval |
E-value |
| Raptor_N |
pfam14538 |
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor ... |
55-206 |
5.21e-96 |
|
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor protein. It has been identified to have a CASPase like structure. It conserves the characteristic cys/his dyad of the caspases, suggesting it may have peptidase activity.
Pssm-ID: 464202 Cd Length: 152 Bit Score: 304.20 E-value: 5.21e-96
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 55 MKTVSVALVLCLNVGVDPPDVVKTTPCARLECWIDPLSMGPQKALETIGANLQKQYENWQPRARYKQSLDPTVDEVKKLC 134
Cdd:pfam14538 1 LKTVSVALVLCLNIGVDPPDVVKTKPCARLECWIDPSSMSPQKALEEIGKNLQDQYESWQPRARYKQSLDPSVEDVKKLC 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 154146249 135 TSLRRNAKEERVLFHYNGHGVPRPTVNGEVWVFNKNYTQYIPLSIYDLQTWMGSPSIFVYDCSNAGLIVKSF 206
Cdd:pfam14538 81 SKLRRNAKDERVLFHYNGHGVPRPTSNGEIWVFNKDYTQYIPLSIYDLFSWLGSPSIFIFDCSNAGNLLNAF 152
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1027-1322 |
1.89e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 99.33 E-value: 1.89e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1027 SVVKFHPFTPCIAVADKDSICF-WDWEKGEKLDYF--HNGNPRYTRVTAmeylngqDCSLLLTATDDGAIRVWknfaDLE 1103
Cdd:cd00200 13 TCVAFSPDGKLLATGSGDGTIKvWDLETGELLRTLkgHTGPVRDVAASA-------DGTYLASGSSDKTIRLW----DLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1104 KNpEMVTAWQGLSDMLpttrgagMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVA 1183
Cdd:cd00200 82 TG-ECVRTLTGHTSYV-------SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL-RGHTDWVNSVAFSPDGTFVAS 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1184 GLGDGSIRVYDrrMALSECrVMTYREHTAWVVKayLQKHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQI-VKGLTALDI 1261
Cdd:cd00200 153 SSQDGTIKLWD--LRTGKC-VATLTGHTGEVNS--VAFSPDGeKLLSSSSDGTIKLWDLSTGKCLGTLRGhENGVNSVAF 227
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 154146249 1262 HPQANLIACGSMNqftaiyngngeliNNIKYYDGFMGQRV-------GAISCLAFHPHWPHLAVGSND 1322
Cdd:cd00200 228 SPDGYLLASGSED-------------GTIRVWDLRTGECVqtlsghtNSVTSLAWSPDGKRLASGSAD 282
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1038-1331 |
3.63e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 82.27 E-value: 3.63e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1038 IAVADKD-SICFWDWEKGEKLDYF--HNGnprytRVTAMEYL-NGQdcsLLLTATDDGAIRVWknfaDLEKNPEMVTawq 1113
Cdd:COG2319 135 LASGSADgTVRLWDLATGKLLRTLtgHSG-----AVTSVAFSpDGK---LLASGSDDGTVRLW----DLATGKLLRT--- 199
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1114 glsdmLPTTRGAGMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVAGLGDGSIRVY 1193
Cdd:COG2319 200 -----LTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTL-TGHSGSVRSVAFSPDGRLLASGSADGTVRLW 273
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1194 DRRmalSECRVMTYREHTAWVVKAYLqkHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQI-VKGLTALDIHPQANLIACG 1271
Cdd:COG2319 274 DLA---TGELLRTLTGHSGGVNSVAF--SPDGkLLASGSDDGTVRLWDLATGKLLRTLTGhTGAVRSVAFSPDGKTLASG 348
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 154146249 1272 SMNQFTAIYN-GNGELINNIKyydgfmgQRVGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1331
Cdd:COG2319 349 SDDGTVRLWDlATGELLRTLT-------GHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
563-670 |
1.86e-08 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 54.25 E-value: 1.86e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 563 LEQLSDPHPLLRQWVAICLGRIWQnfdsarwcgvrDSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDH 642
Cdd:COG1413 53 LEALKDPDPEVRAAAAEALGRIGD-----------PEAVPALIAALKDEDPEVRRAAAEALG--------RLGDPAAVPA 113
|
90 100
....*....|....*....|....*...
gi 154146249 643 nvammLAQLINDGSPMVRKELVVALSHL 670
Cdd:COG1413 114 -----LLEALKDPDWEVRRAAARALGRL 136
|
|
| HEAT_2 |
pfam13646 |
HEAT repeats; This family includes multiple HEAT repeats. |
563-668 |
3.52e-05 |
|
HEAT repeats; This family includes multiple HEAT repeats.
Pssm-ID: 433376 [Multi-domain] Cd Length: 88 Bit Score: 43.48 E-value: 3.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 563 LEQL-SDPHPLLRQWVAICLGRIwqNFDSARwcgvrdsahEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTID 641
Cdd:pfam13646 5 LQALlRDPDPEVRAAAIRALGRI--GDPEAV---------PALLELLKDEDPAVRRAAAEALG--------KIGDPEALP 65
|
90 100
....*....|....*....|....*..
gi 154146249 642 HnvamMLAQLINDGSPMVRKELVVALS 668
Cdd:pfam13646 66 A----LLELLRDDDDDVVRAAAAEALA 88
|
|
| COG5096 |
COG5096 |
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular ... |
472-696 |
1.97e-04 |
|
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 227427 [Multi-domain] Cd Length: 757 Bit Score: 45.87 E-value: 1.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 472 IFPYVLKLLQSSARELRPLLVFIW---AK------ILAVDSscqadLVKDNGHKyflsvlaDPYMpaehRTMTAFILAVI 542
Cdd:COG5096 56 LFPDVIKNVATRDVELKRLLYLYLeryAKlkpelaLLAVNT-----IQKDLQDP-------NEEI----RGFALRTLSLL 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 543 vnsyttgQEACLQGNLIAICLEQLSDPHPLLRQWVAICLGRIWQnFDSARWCGVRDSAHEKLysLLSDPIPEVRCAAVFA 622
Cdd:COG5096 120 -------RVKELLGNIIDPIKKLLTDPHAYVRKTAALAVAKLYR-LDKDLYHELGLIDILKE--LVADSDPIVIANALAS 189
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 154146249 623 LGTFVGNSAErtDHSTTIDHNVAMMLAQLINDGSPMVRKELVVALSHLVVQYESN---FCT-VALQFMEEEKNYPLPS 696
Cdd:COG5096 190 LAEIDPELAH--GYSLEVILRIPQLDLLSLSVSTEWLLLIILEVLTERVPTTPDSaedFEErLSPPLQHNNAEVLLIA 265
|
|
|
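Several hits in the table above cover overlapping stretches of the query, for example HEAT (563-670) inside COG5096 (472-696). The sketch below shows one way to measure that overlap from the Interval column. It assumes intervals are written as `start-end` with 1-based, inclusive residue positions; the helper names `parse_interval` and `overlap_length` are hypothetical.

```python
def parse_interval(text: str) -> tuple[int, int]:
    """Turn an Interval cell such as '563-670' into (start, end)."""
    start_text, end_text = text.strip().split("-")
    start, end = int(start_text), int(end_text)
    if start > end:
        raise ValueError(f"interval start exceeds end: {text!r}")
    return start, end


def overlap_length(interval_a: str, interval_b: str) -> int:
    """Number of query residues shared by two inclusive intervals."""
    a_start, a_end = parse_interval(interval_a)
    b_start, b_end = parse_interval(interval_b)
    # Inclusive coordinates: add 1 to the difference, clamp at zero.
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


print(overlap_length("563-670", "472-696"))   # 108: HEAT lies inside COG5096
print(overlap_length("55-206", "1027-1322"))  # 0: Raptor_N and WD40 are disjoint
```

An overlap equal to the full length of the shorter interval, as in the first call, indicates that one hit is nested inside the other.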
|
Name |
Accession |
Description |
Interval |
E-value |
| Raptor_N |
pfam14538 |
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor ... |
55-206 |
5.21e-96 |
|
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor protein. It has been identified to have a CASPase like structure. It conserves the characteristic cys/his dyad of the caspases, suggesting it may have peptidase activity.
Pssm-ID: 464202 Cd Length: 152 Bit Score: 304.20 E-value: 5.21e-96
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 55 MKTVSVALVLCLNVGVDPPDVVKTTPCARLECWIDPLSMGPQKALETIGANLQKQYENWQPRARYKQSLDPTVDEVKKLC 134
Cdd:pfam14538 1 LKTVSVALVLCLNIGVDPPDVVKTKPCARLECWIDPSSMSPQKALEEIGKNLQDQYESWQPRARYKQSLDPSVEDVKKLC 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 154146249 135 TSLRRNAKEERVLFHYNGHGVPRPTVNGEVWVFNKNYTQYIPLSIYDLQTWMGSPSIFVYDCSNAGLIVKSF 206
Cdd:pfam14538 81 SKLRRNAKDERVLFHYNGHGVPRPTSNGEIWVFNKDYTQYIPLSIYDLFSWLGSPSIFIFDCSNAGNLLNAF 152
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1027-1322 |
1.89e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 99.33 E-value: 1.89e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1027 SVVKFHPFTPCIAVADKDSICF-WDWEKGEKLDYF--HNGNPRYTRVTAmeylngqDCSLLLTATDDGAIRVWknfaDLE 1103
Cdd:cd00200 13 TCVAFSPDGKLLATGSGDGTIKvWDLETGELLRTLkgHTGPVRDVAASA-------DGTYLASGSSDKTIRLW----DLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1104 KNpEMVTAWQGLSDMLpttrgagMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVA 1183
Cdd:cd00200 82 TG-ECVRTLTGHTSYV-------SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL-RGHTDWVNSVAFSPDGTFVAS 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1184 GLGDGSIRVYDrrMALSECrVMTYREHTAWVVKayLQKHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQI-VKGLTALDI 1261
Cdd:cd00200 153 SSQDGTIKLWD--LRTGKC-VATLTGHTGEVNS--VAFSPDGeKLLSSSSDGTIKLWDLSTGKCLGTLRGhENGVNSVAF 227
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 154146249 1262 HPQANLIACGSMNqftaiyngngeliNNIKYYDGFMGQRV-------GAISCLAFHPHWPHLAVGSND 1322
Cdd:cd00200 228 SPDGYLLASGSED-------------GTIRVWDLRTGECVqtlsghtNSVTSLAWSPDGKRLASGSAD 282
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1123-1330 |
6.91e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 82.77 E-value: 6.91e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1123 RGAGMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVAGLGDGSIRVYDRRmalSEC 1202
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTL-KGHTGPVRDVAASADGTYLASGSSDKTIRLWDLE---TGE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1203 RVMTYREHTAWVvkAYLQKHPEGHIVSVS-VNGDVRFFDPRMPESVNVMQ-IVKGLTALDIHPQANLIACGSMNQFTAIY 1280
Cdd:cd00200 85 CVRTLTGHTSYV--SSVAFSPDGRILSSSsRDKTIKVWDVETGKCLTTLRgHTDWVNSVAFSPDGTFVASSSQDGTIKLW 162
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 154146249 1281 NG-NGELINNIKYYDGFmgqrvgaISCLAFHPHWPHLAVGSNDYYISVYSV 1330
Cdd:cd00200 163 DLrTGKCVATLTGHTGE-------VNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1038-1331 |
3.63e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 82.27 E-value: 3.63e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1038 IAVADKD-SICFWDWEKGEKLDYF--HNGnprytRVTAMEYL-NGQdcsLLLTATDDGAIRVWknfaDLEKNPEMVTawq 1113
Cdd:COG2319 135 LASGSADgTVRLWDLATGKLLRTLtgHSG-----AVTSVAFSpDGK---LLASGSDDGTVRLW----DLATGKLLRT--- 199
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1114 glsdmLPTTRGAGMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVAGLGDGSIRVY 1193
Cdd:COG2319 200 -----LTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTL-TGHSGSVRSVAFSPDGRLLASGSADGTVRLW 273
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1194 DRRmalSECRVMTYREHTAWVVKAYLqkHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQI-VKGLTALDIHPQANLIACG 1271
Cdd:COG2319 274 DLA---TGELLRTLTGHSGGVNSVAF--SPDGkLLASGSDDGTVRLWDLATGKLLRTLTGhTGAVRSVAFSPDGKTLASG 348
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 154146249 1272 SMNQFTAIYN-GNGELINNIKyydgfmgQRVGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1331
Cdd:COG2319 349 SDDGTVRLWDlATGELLRTLT-------GHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1019-1331 |
2.18e-14 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 76.87 E-value: 2.18e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1019 LNRNPGVPSVVKFHPFTPCIAVADKD-SICFWDWEKGEKLDYF--HNGNPRYTRVTAmeylngqDCSLLLTATDDGAIRV 1095
Cdd:COG2319 74 LLGHTAAVLSVAFSPDGRLLASASADgTVRLWDLATGLLLRTLtgHTGAVRSVAFSP-------DGKTLASGSADGTVRL 146
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1096 WknfaDLEKNPEMVTawqglsdmLPTTRGAGMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCD 1175
Cdd:COG2319 147 W----DLATGKLLRT--------LTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTL-TGHTGAVRSVAFS 213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1176 SHRSLIVAGLGDGSIRVYDrrMALSECrVMTYREHTAWVVK-AYlqkHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQIV 1253
Cdd:COG2319 214 PDGKLLASGSADGTVRLWD--LATGKL-LRTLTGHSGSVRSvAF---SPDGrLLASGSADGTVRLWDLATGELLRTLTGH 287
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1254 KG-LTALDIHPQANLIACGSMNQFTAIYN-GNGELINNIKyydgfmgQRVGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1331
Cdd:COG2319 288 SGgVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLT-------GHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLA 360
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1029-1240 |
1.86e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 72.37 E-value: 1.86e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1029 VKFHPFTPCIAVADKDSICF-WDWEKGEKLDYF--HNGNprytrVTAMEYLngQDCSLLLTATDDGAIRVWknfaDLEKN 1105
Cdd:cd00200 57 VAASADGTYLASGSSDKTIRlWDLETGECVRTLtgHTSY-----VSSVAFS--PDGRILSSSSRDKTIKVW----DVETG 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1106 pEMVTAWQGLSDmlpttrgAGMVVDWEQETGLLMSSGDVRIVRIWDT--------------------------------- 1152
Cdd:cd00200 126 -KCLTTLRGHTD-------WVNSVAFSPDGTFVASSSQDGTIKLWDLrtgkcvatltghtgevnsvafspdgekllssss 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1153 DRETKVQDIPTGADSC--------VTSLSCDSHRSLIVAGLGDGSIRVYDRRMAlsECrVMTYREHTAWVVKAYLqkHPE 1224
Cdd:cd00200 198 DGTIKLWDLSTGKCLGtlrghengVNSVAFSPDGYLLASGSEDGTIRVWDLRTG--EC-VQTLSGHTNSVTSLAW--SPD 272
|
250
....*....|....*..
gi 154146249 1225 GH-IVSVSVNGDVRFFD 1240
Cdd:cd00200 273 GKrLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1025-1331 |
1.91e-12 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 70.71 E-value: 1.91e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1025 VPSVVKFHPFTPCIAVADKDSICFWDWEKGEKLDYFHNGNPRYTRVTAMeylngQDCSLLLTATDDGAIRVWknfaDLEk 1104
Cdd:COG2319 39 VASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFS-----PDGRLLASASADGTVRLW----DLA- 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1105 NPEMVTAWQGLSDMLPTtrgagmvVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVAG 1184
Cdd:COG2319 109 TGLLLRTLTGHTGAVRS-------VAFSPDGKTLASGSADGTVRLWDLATGKLLRTL-TGHSGAVTSVAFSPDGKLLASG 180
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1185 LGDGSIRVYDrrmALSECRVMTYREHTAWVVK-AYlqkHPEGH-IVSVSVNGDVRFFDprmPESVNVMQIVKG----LTA 1258
Cdd:COG2319 181 SDDGTVRLWD---LATGKLLRTLTGHTGAVRSvAF---SPDGKlLASGSADGTVRLWD---LATGKLLRTLTGhsgsVRS 251
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 154146249 1259 LDIHPQANLIACGSMNQFTAIYN-GNGELInnikyydGFMGQRVGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1331
Cdd:COG2319 252 VAFSPDGRLLASGSADGTVRLWDlATGELL-------RTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLA 318
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1110-1331 |
9.52e-09 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 59.15 E-value: 9.52e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1110 TAWQGLSDMLPTTRGAGMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQdIPTGADSCVTSLSCDSHRSLIVAGLGDGS 1189
Cdd:COG2319 23 AALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLA-TLLGHTAAVLSVAFSPDGRLLASASADGT 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1190 IRVYDrrmALSECRVMTYREHTAWVVKAYLqkHPEGH-IVSVSVNGDVRFFDPRMPESVNVMQIVKG-LTALDIHPQANL 1267
Cdd:COG2319 102 VRLWD---LATGLLLRTLTGHTGAVRSVAF--SPDGKtLASGSADGTVRLWDLATGKLLRTLTGHSGaVTSVAFSPDGKL 176
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 154146249 1268 IACGSMNQFTAIYN-GNGELINNIKYYDgfmgqrvGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1331
Cdd:COG2319 177 LASGSDDGTVRLWDlATGKLLRTLTGHT-------GAVRSVAFSPDGKLLASGSADGTVRLWDLA 234
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
563-670 |
1.86e-08 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 54.25 E-value: 1.86e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 563 LEQLSDPHPLLRQWVAICLGRIWQnfdsarwcgvrDSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDH 642
Cdd:COG1413 53 LEALKDPDPEVRAAAAEALGRIGD-----------PEAVPALIAALKDEDPEVRRAAAEALG--------RLGDPAAVPA 113
|
90 100
....*....|....*....|....*...
gi 154146249 643 nvammLAQLINDGSPMVRKELVVALSHL 670
Cdd:COG1413 114 -----LLEALKDPDWEVRRAAARALGRL 136
|
|
| WDR74 |
cd22857 |
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ... |
1038-1333 |
3.15e-07 |
|
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage of the pre-rRNA processing pathway. NVL2 is a type II double ring, AAA-ATPase, that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness, by inducing nuclear beta-catenin accumulation and driving downstream Wnt-responsive genes expression. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain, and is required for nucleolar localization.
Pssm-ID: 439303 [Multi-domain] Cd Length: 325 Bit Score: 53.77 E-value: 3.15e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1038 IAVADKD-SICFWDWEKGEKLDYFHNGNPRYT-----RVTAMEYLNGQdcslLLTATDDGAIRVWKNFADLEKNPEmVTA 1111
Cdd:cd22857 47 LAVARKNgTVEVLDPENGDLLASFSDSEPATKlseedHFVGLHLFSGT----LLTCTSKGSLRSTKLPDDSTASSS-PTA 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1112 WQGLSDMLPTTRGagmvvdwEQETGLLMSSGDVRIVRIWDTdrETKVQDI---------------PTgadsCVTS---LS 1173
Cdd:cd22857 122 WVCLGGNLLCMRV-------DPNENYFAFGGKEVELNVWDL--EEKPGKIwraknvpndslglrvPV----WVTDltfLS 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1174 CDSHRSlIVAGLGDGSIRVYD----RRmalsecRVM--TYREHTAWVVkaylQKHPEGHIVSVSVN-GDVRFFDPRmpes 1246
Cdd:cd22857 189 KDDHRK-IVTGTGYHQVRLYDtraqRR------PVVsvDFGETPIKAV----AEDPDGHTVYVGDTsGDLASIDLR---- 253
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 1247 vnvmqivkgltaldihpqanliacgsmnqftaiyngNGELINNikyYDGFMGqrvGAISCLAFHPHWPHLAVGSNDYYIS 1326
Cdd:cd22857 254 ------------------------------------TGKLLGC---FKGKCG---GSIRSIARHPELPLIASCGLDRYLR 291
|
....*..
gi 154146249 1327 VYSVEKR 1333
Cdd:cd22857 292 IWDTETR 298
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
563-670 |
6.32e-07 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 50.01 E-value: 6.32e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 563 LEQLSDPHPLLRQWVAICLGRIWQnfdsarwcgvrDSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDH 642
Cdd:COG1413 22 IAALADEDPDVRAAAARALGRLGD-----------PRAVPALLEALKDPDPEVRAAAAEALG--------RIGDPEAVPA 82
|
90 100
....*....|....*....|....*...
gi 154146249 643 nvammLAQLINDGSPMVRKELVVALSHL 670
Cdd:COG1413 83 -----LIAALKDEDPEVRRAAAEALGRL 105
|
|
| HEAT_2 |
pfam13646 |
HEAT repeats; This family includes multiple HEAT repeats. |
563-668 |
3.52e-05 |
|
HEAT repeats; This family includes multiple HEAT repeats.
Pssm-ID: 433376 [Multi-domain] Cd Length: 88 Bit Score: 43.48 E-value: 3.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 563 LEQL-SDPHPLLRQWVAICLGRIwqNFDSARwcgvrdsahEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTID 641
Cdd:pfam13646 5 LQALlRDPDPEVRAAAIRALGRI--GDPEAV---------PALLELLKDEDPAVRRAAAEALG--------KIGDPEALP 65
|
90 100
....*....|....*....|....*..
gi 154146249 642 HnvamMLAQLINDGSPMVRKELVVALS 668
Cdd:pfam13646 66 A----LLELLRDDDDDVVRAAAAEALA 88
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
559-624 |
3.69e-05 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 45.00 E-value: 3.69e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 154146249 559 IAICLEQLSDPHPLLRQWVAICLGRIWqnfdsarwcgvRDSAHEKLYSLLSDPIPEVRCAAVFALG 624
Cdd:COG1413 80 VPALIAALKDEDPEVRRAAAEALGRLG-----------DPAAVPALLEALKDPDWEVRRAAARALG 134
|
|
| COG5096 |
COG5096 |
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular ... |
472-696 |
1.97e-04 |
|
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 227427 [Multi-domain] Cd Length: 757 Bit Score: 45.87 E-value: 1.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 472 IFPYVLKLLQSSARELRPLLVFIW---AK------ILAVDSscqadLVKDNGHKyflsvlaDPYMpaehRTMTAFILAVI 542
Cdd:COG5096 56 LFPDVIKNVATRDVELKRLLYLYLeryAKlkpelaLLAVNT-----IQKDLQDP-------NEEI----RGFALRTLSLL 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 154146249 543 vnsyttgQEACLQGNLIAICLEQLSDPHPLLRQWVAICLGRIWQnFDSARWCGVRDSAHEKLysLLSDPIPEVRCAAVFA 622
Cdd:COG5096 120 -------RVKELLGNIIDPIKKLLTDPHAYVRKTAALAVAKLYR-LDKDLYHELGLIDILKE--LVADSDPIVIANALAS 189
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 154146249 623 LGTFVGNSAErtDHSTTIDHNVAMMLAQLINDGSPMVRKELVVALSHLVVQYESN---FCT-VALQFMEEEKNYPLPS 696
Cdd:COG5096 190 LAEIDPELAH--GYSLEVILRIPQLDLLSLSVSTEWLLLIILEVLTERVPTTPDSaedFEErLSPPLQHNNAEVLLIA 265
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
598-670 |
1.69e-03 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 40.00 E-value: 1.69e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 154146249 598 DSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDHnvammLAQLINDGSPMVRKELVVALSHL 670
Cdd:COG1413 15 PAAVPALIAALADEDPDVRAAAARALG--------RLGDPRAVPA-----LLEALKDPDPEVRAAAAEALGRI 74
|
|
| HEAT |
pfam02985 |
HEAT repeat; The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see ... |
604-627 |
6.01e-03 |
|
HEAT repeat; The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514).
Pssm-ID: 460773 Cd Length: 31 Bit Score: 35.58 E-value: 6.01e-03
|
|
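Every hit above carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`, optionally with a `[Multi-domain]` tag. As a rough sketch for working with this listing programmatically, the regular expression below extracts those fields. The pattern is inferred only from the lines shown here and may not cover every variant CD-Search can emit; `parse_summary` and the dictionary keys are hypothetical names.

```python
import re

# Pattern inferred from the summary lines in this listing (an assumption,
# not an official format specification).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("tag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


hit = parse_summary(
    "Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 99.33 E-value: 1.89e-22"
)
print(hit["pssm_id"], hit["multi_domain"], hit["e_value"])
```

Parsed records can then be sorted by `e_value` to reproduce the ordering used in the table, with the smallest (most significant) values first.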