|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
879-1174 |
3.42e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 98.18 E-value: 3.42e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 879 SVVKFHPFTPCIAVADKDSICF-WDWEKGEKLDYF--HNGNPRYTRVTAmeylngqDCSLLLTATDDGAIRVWknfaDLE 955
Cdd:cd00200 13 TCVAFSPDGKLLATGSGDGTIKvWDLETGELLRTLkgHTGPVRDVAASA-------DGTYLASGSSDKTIRLW----DLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 956 KNpEMVTAWQGLSDMLpttrgagMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVA 1035
Cdd:cd00200 82 TG-ECVRTLTGHTSYV-------SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL-RGHTDWVNSVAFSPDGTFVAS 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1036 GLGDGSIRVYDrrMALSECrVMTYREHTAWVVKayLQKHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQI-VKGLTALDI 1113
Cdd:cd00200 153 SSQDGTIKLWD--LRTGKC-VATLTGHTGEVNS--VAFSPDGeKLLSSSSDGTIKLWDLSTGKCLGTLRGhENGVNSVAF 227
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907083413 1114 HPQANLIACGSMNqftaiyngngeliNNIKYYDGFMGQRV-------GAISCLAFHPHWPHLAVGSND 1174
Cdd:cd00200 228 SPDGYLLASGSED-------------GTIRVWDLRTGECVqtlsghtNSVTSLAWSPDGKRLASGSAD 282
|
|
| Raptor_N super family |
cl46306 |
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor ... |
8-58 |
2.98e-18 |
|
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor protein. It has been identified to have a CASPase like structure. It conserves the characteristic cys/his dyad of the caspases suggesting it may have a peptidase activity. The actual alignment was detected with superfamily member pfam14538:
Pssm-ID: 480646 Cd Length: 152 Bit Score: 82.71 E-value: 2.98e-18
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 1907083413 8 IKNCSCFIsenHDENYTQYIPLSIYDLQTWMGSPSIFVYDCSNAGLIVKSF 58
Cdd:pfam14538 105 TSNGEIWV---FNKDYTQYIPLSIYDLFSWLGSPSIFIFDCSNAGNLLNAF 152
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
415-522 |
1.43e-08 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 54.63 E-value: 1.43e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 415 LEQLSDPHPLLRQWVAICLGRIWQnfdsarwcgvrDSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDH 494
Cdd:COG1413 53 LEALKDPDPEVRAAAAEALGRIGD-----------PEAVPALIAALKDEDPEVRRAAAEALG--------RLGDPAAVPA 113
|
90 100
....*....|....*....|....*...
gi 1907083413 495 nvammLAQLINDGSPMVRKELVVALSHL 522
Cdd:COG1413 114 -----LLEALKDPDWEVRRAAARALGRL 136
|
|
| COG5096 super family |
cl34899 |
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular ... |
324-548 |
1.72e-04 |
|
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular transport]; The actual alignment was detected with superfamily member COG5096:
Pssm-ID: 227427 [Multi-domain] Cd Length: 757 Bit Score: 45.87 E-value: 1.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 324 IFPYVLKLLQSSARELRPLLVFIW---AK------ILAVDSscqadLVKDNGHKyflsvlaDPYMpaehRTMTAFILAVI 394
Cdd:COG5096 56 LFPDVIKNVATRDVELKRLLYLYLeryAKlkpelaLLAVNT-----IQKDLQDP-------NEEI----RGFALRTLSLL 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 395 vnsyttgQEACLQGNLIAICLEQLSDPHPLLRQWVAICLGRIWQnFDSARWCGVRDSAHEKLysLLSDPIPEVRCAAVFA 474
Cdd:COG5096 120 -------RVKELLGNIIDPIKKLLTDPHAYVRKTAALAVAKLYR-LDKDLYHELGLIDILKE--LVADSDPIVIANALAS 189
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907083413 475 LGTFVGNSAErtDHSTTIDHNVAMMLAQLINDGSPMVRKELVVALSHLVVQYESN---FCT-VALQFMEEEKNYPLPS 548
Cdd:COG5096 190 LAEIDPELAH--GYSLEVILRIPQLDLLSLSVSTEWLLLIILEVLTERVPTTPDSaedFEErLSPPLQHNNAEVLLIA 265
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
879-1174 |
3.42e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 98.18 E-value: 3.42e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 879 SVVKFHPFTPCIAVADKDSICF-WDWEKGEKLDYF--HNGNPRYTRVTAmeylngqDCSLLLTATDDGAIRVWknfaDLE 955
Cdd:cd00200 13 TCVAFSPDGKLLATGSGDGTIKvWDLETGELLRTLkgHTGPVRDVAASA-------DGTYLASGSSDKTIRLW----DLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 956 KNpEMVTAWQGLSDMLpttrgagMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVA 1035
Cdd:cd00200 82 TG-ECVRTLTGHTSYV-------SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL-RGHTDWVNSVAFSPDGTFVAS 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1036 GLGDGSIRVYDrrMALSECrVMTYREHTAWVVKayLQKHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQI-VKGLTALDI 1113
Cdd:cd00200 153 SSQDGTIKLWD--LRTGKC-VATLTGHTGEVNS--VAFSPDGeKLLSSSSDGTIKLWDLSTGKCLGTLRGhENGVNSVAF 227
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907083413 1114 HPQANLIACGSMNqftaiyngngeliNNIKYYDGFMGQRV-------GAISCLAFHPHWPHLAVGSND 1174
Cdd:cd00200 228 SPDGYLLASGSED-------------GTIRVWDLRTGECVqtlsghtNSVTSLAWSPDGKRLASGSAD 282
|
|
| Raptor_N |
pfam14538 |
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor ... |
8-58 |
2.98e-18 |
|
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor protein. It has been identified to have a CASPase like structure. It conserves the characteristic cys/his dyad of the caspases suggesting it may have a peptidase activity.
Pssm-ID: 464202 Cd Length: 152 Bit Score: 82.71 E-value: 2.98e-18
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 1907083413 8 IKNCSCFIsenHDENYTQYIPLSIYDLQTWMGSPSIFVYDCSNAGLIVKSF 58
Cdd:pfam14538 105 TSNGEIWV---FNKDYTQYIPLSIYDLFSWLGSPSIFIFDCSNAGNLLNAF 152
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
890-1183 |
3.45e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 82.27 E-value: 3.45e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 890 IAVADKD-SICFWDWEKGEKLDYF--HNGnprytRVTAMEYL-NGQdcsLLLTATDDGAIRVWknfaDLEKNPEMVTawq 965
Cdd:COG2319 135 LASGSADgTVRLWDLATGKLLRTLtgHSG-----AVTSVAFSpDGK---LLASGSDDGTVRLW----DLATGKLLRT--- 199
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 966 glsdmLPTTRGAGMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVAGLGDGSIRVY 1045
Cdd:COG2319 200 -----LTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTL-TGHSGSVRSVAFSPDGRLLASGSADGTVRLW 273
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1046 DRRmalSECRVMTYREHTAWVVKAYLqkHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQI-VKGLTALDIHPQANLIACG 1123
Cdd:COG2319 274 DLA---TGELLRTLTGHSGGVNSVAF--SPDGkLLASGSDDGTVRLWDLATGKLLRTLTGhTGAVRSVAFSPDGKTLASG 348
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907083413 1124 SMNQFTAIYN-GNGELINNIKyydgfmgQRVGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1183
Cdd:COG2319 349 SDDGTVRLWDlATGELLRTLT-------GHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
415-522 |
1.43e-08 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 54.63 E-value: 1.43e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 415 LEQLSDPHPLLRQWVAICLGRIWQnfdsarwcgvrDSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDH 494
Cdd:COG1413 53 LEALKDPDPEVRAAAAEALGRIGD-----------PEAVPALIAALKDEDPEVRRAAAEALG--------RLGDPAAVPA 113
|
90 100
....*....|....*....|....*...
gi 1907083413 495 nvammLAQLINDGSPMVRKELVVALSHL 522
Cdd:COG1413 114 -----LLEALKDPDWEVRRAAARALGRL 136
|
|
| HEAT_2 |
pfam13646 |
HEAT repeats; This family includes multiple HEAT repeats. |
415-520 |
2.59e-05 |
|
HEAT repeats; This family includes multiple HEAT repeats.
Pssm-ID: 433376 [Multi-domain] Cd Length: 88 Bit Score: 43.87 E-value: 2.59e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 415 LEQL-SDPHPLLRQWVAICLGRIwqNFDSARwcgvrdsahEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTID 493
Cdd:pfam13646 5 LQALlRDPDPEVRAAAIRALGRI--GDPEAV---------PALLELLKDEDPAVRRAAAEALG--------KIGDPEALP 65
|
90 100
....*....|....*....|....*..
gi 1907083413 494 HnvamMLAQLINDGSPMVRKELVVALS 520
Cdd:pfam13646 66 A----LLELLRDDDDDVVRAAAAEALA 88
|
|
| COG5096 |
COG5096 |
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular ... |
324-548 |
1.72e-04 |
|
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 227427 [Multi-domain] Cd Length: 757 Bit Score: 45.87 E-value: 1.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 324 IFPYVLKLLQSSARELRPLLVFIW---AK------ILAVDSscqadLVKDNGHKyflsvlaDPYMpaehRTMTAFILAVI 394
Cdd:COG5096 56 LFPDVIKNVATRDVELKRLLYLYLeryAKlkpelaLLAVNT-----IQKDLQDP-------NEEI----RGFALRTLSLL 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 395 vnsyttgQEACLQGNLIAICLEQLSDPHPLLRQWVAICLGRIWQnFDSARWCGVRDSAHEKLysLLSDPIPEVRCAAVFA 474
Cdd:COG5096 120 -------RVKELLGNIIDPIKKLLTDPHAYVRKTAALAVAKLYR-LDKDLYHELGLIDILKE--LVADSDPIVIANALAS 189
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907083413 475 LGTFVGNSAErtDHSTTIDHNVAMMLAQLINDGSPMVRKELVVALSHLVVQYESN---FCT-VALQFMEEEKNYPLPS 548
Cdd:COG5096 190 LAEIDPELAH--GYSLEVILRIPQLDLLSLSVSTEWLLLIILEVLTERVPTTPDSaedFEErLSPPLQHNNAEVLLIA 265
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
879-1174 |
3.42e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 98.18 E-value: 3.42e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 879 SVVKFHPFTPCIAVADKDSICF-WDWEKGEKLDYF--HNGNPRYTRVTAmeylngqDCSLLLTATDDGAIRVWknfaDLE 955
Cdd:cd00200 13 TCVAFSPDGKLLATGSGDGTIKvWDLETGELLRTLkgHTGPVRDVAASA-------DGTYLASGSSDKTIRLW----DLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 956 KNpEMVTAWQGLSDMLpttrgagMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVA 1035
Cdd:cd00200 82 TG-ECVRTLTGHTSYV-------SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL-RGHTDWVNSVAFSPDGTFVAS 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1036 GLGDGSIRVYDrrMALSECrVMTYREHTAWVVKayLQKHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQI-VKGLTALDI 1113
Cdd:cd00200 153 SSQDGTIKLWD--LRTGKC-VATLTGHTGEVNS--VAFSPDGeKLLSSSSDGTIKLWDLSTGKCLGTLRGhENGVNSVAF 227
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907083413 1114 HPQANLIACGSMNqftaiyngngeliNNIKYYDGFMGQRV-------GAISCLAFHPHWPHLAVGSND 1174
Cdd:cd00200 228 SPDGYLLASGSED-------------GTIRVWDLRTGECVqtlsghtNSVTSLAWSPDGKRLASGSAD 282
|
|
| Raptor_N |
pfam14538 |
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor ... |
8-58 |
2.98e-18 |
|
Raptor N-terminal CASPase like domain; This domain is found at the N-terminus of the Raptor protein. It has been identified to have a CASPase like structure. It conserves the characteristic cys/his dyad of the caspases suggesting it may have a peptidase activity.
Pssm-ID: 464202 Cd Length: 152 Bit Score: 82.71 E-value: 2.98e-18
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 1907083413 8 IKNCSCFIsenHDENYTQYIPLSIYDLQTWMGSPSIFVYDCSNAGLIVKSF 58
Cdd:pfam14538 105 TSNGEIWV---FNKDYTQYIPLSIYDLFSWLGSPSIFIFDCSNAGNLLNAF 152
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
975-1182 |
9.31e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 82.00 E-value: 9.31e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 975 RGAGMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVAGLGDGSIRVYDRRmalSEC 1054
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTL-KGHTGPVRDVAASADGTYLASGSSDKTIRLWDLE---TGE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1055 RVMTYREHTAWVvkAYLQKHPEGHIVSVS-VNGDVRFFDPRMPESVNVMQ-IVKGLTALDIHPQANLIACGSMNQFTAIY 1132
Cdd:cd00200 85 CVRTLTGHTSYV--SSVAFSPDGRILSSSsRDKTIKVWDVETGKCLTTLRgHTDWVNSVAFSPDGTFVASSSQDGTIKLW 162
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 1907083413 1133 NG-NGELINNIKYYDGFmgqrvgaISCLAFHPHWPHLAVGSNDYYISVYSV 1182
Cdd:cd00200 163 DLrTGKCVATLTGHTGE-------VNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
890-1183 |
3.45e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 82.27 E-value: 3.45e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 890 IAVADKD-SICFWDWEKGEKLDYF--HNGnprytRVTAMEYL-NGQdcsLLLTATDDGAIRVWknfaDLEKNPEMVTawq 965
Cdd:COG2319 135 LASGSADgTVRLWDLATGKLLRTLtgHSG-----AVTSVAFSpDGK---LLASGSDDGTVRLW----DLATGKLLRT--- 199
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 966 glsdmLPTTRGAGMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVAGLGDGSIRVY 1045
Cdd:COG2319 200 -----LTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTL-TGHSGSVRSVAFSPDGRLLASGSADGTVRLW 273
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1046 DRRmalSECRVMTYREHTAWVVKAYLqkHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQI-VKGLTALDIHPQANLIACG 1123
Cdd:COG2319 274 DLA---TGELLRTLTGHSGGVNSVAF--SPDGkLLASGSDDGTVRLWDLATGKLLRTLTGhTGAVRSVAFSPDGKTLASG 348
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907083413 1124 SMNQFTAIYN-GNGELINNIKyydgfmgQRVGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1183
Cdd:COG2319 349 SDDGTVRLWDlATGELLRTLT-------GHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
871-1183 |
2.02e-14 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 76.87 E-value: 2.02e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 871 LNRNPGVPSVVKFHPFTPCIAVADKD-SICFWDWEKGEKLDYF--HNGNPRYTRVTAmeylngqDCSLLLTATDDGAIRV 947
Cdd:COG2319 74 LLGHTAAVLSVAFSPDGRLLASASADgTVRLWDLATGLLLRTLtgHTGAVRSVAFSP-------DGKTLASGSADGTVRL 146
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 948 WknfaDLEKNPEMVTawqglsdmLPTTRGAGMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCD 1027
Cdd:COG2319 147 W----DLATGKLLRT--------LTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTL-TGHTGAVRSVAFS 213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1028 SHRSLIVAGLGDGSIRVYDrrMALSECrVMTYREHTAWVVK-AYlqkHPEG-HIVSVSVNGDVRFFDPRMPESVNVMQIV 1105
Cdd:COG2319 214 PDGKLLASGSADGTVRLWD--LATGKL-LRTLTGHSGSVRSvAF---SPDGrLLASGSADGTVRLWDLATGELLRTLTGH 287
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1106 KG-LTALDIHPQANLIACGSMNQFTAIYN-GNGELINNIKyydgfmgQRVGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1183
Cdd:COG2319 288 SGgVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLT-------GHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLA 360
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
881-1092 |
2.93e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 71.60 E-value: 2.93e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 881 VKFHPFTPCIAVADKDSICF-WDWEKGEKLDYF--HNGNprytrVTAMEYLngQDCSLLLTATDDGAIRVWknfaDLEKN 957
Cdd:cd00200 57 VAASADGTYLASGSSDKTIRlWDLETGECVRTLtgHTSY-----VSSVAFS--PDGRILSSSSRDKTIKVW----DVETG 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 958 pEMVTAWQGLSDmlpttrgAGMVVDWEQETGLLMSSGDVRIVRIWDT--------------------------------- 1004
Cdd:cd00200 126 -KCLTTLRGHTD-------WVNSVAFSPDGTFVASSSQDGTIKLWDLrtgkcvatltghtgevnsvafspdgekllssss 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1005 DRETKVQDIPTGADSC--------VTSLSCDSHRSLIVAGLGDGSIRVYDRRMAlsECrVMTYREHTAWVVKAYLqkHPE 1076
Cdd:cd00200 198 DGTIKLWDLSTGKCLGtlrghengVNSVAFSPDGYLLASGSEDGTIRVWDLRTG--EC-VQTLSGHTNSVTSLAW--SPD 272
|
250
....*....|....*..
gi 1907083413 1077 GH-IVSVSVNGDVRFFD 1092
Cdd:cd00200 273 GKrLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
877-1183 |
1.77e-12 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 70.71 E-value: 1.77e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 877 VPSVVKFHPFTPCIAVADKDSICFWDWEKGEKLDYFHNGNPRYTRVTAMeylngQDCSLLLTATDDGAIRVWknfaDLEk 956
Cdd:COG2319 39 VASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFS-----PDGRLLASASADGTVRLW----DLA- 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 957 NPEMVTAWQGLSDMLPTtrgagmvVDWEQETGLLMSSGDVRIVRIWDTDRETKVQDIpTGADSCVTSLSCDSHRSLIVAG 1036
Cdd:COG2319 109 TGLLLRTLTGHTGAVRS-------VAFSPDGKTLASGSADGTVRLWDLATGKLLRTL-TGHSGAVTSVAFSPDGKLLASG 180
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1037 LGDGSIRVYDrrmALSECRVMTYREHTAWVVK-AYlqkHPEGH-IVSVSVNGDVRFFDprmPESVNVMQIVKG----LTA 1110
Cdd:COG2319 181 SDDGTVRLWD---LATGKLLRTLTGHTGAVRSvAF---SPDGKlLASGSADGTVRLWD---LATGKLLRTLTGhsgsVRS 251
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907083413 1111 LDIHPQANLIACGSMNQFTAIYN-GNGELInnikyydGFMGQRVGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1183
Cdd:COG2319 252 VAFSPDGRLLASGSADGTVRLWDlATGELL-------RTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLA 318
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
962-1183 |
8.87e-09 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 59.15 E-value: 8.87e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 962 TAWQGLSDMLPTTRGAGMVVDWEQETGLLMSSGDVRIVRIWDTDRETKVQdIPTGADSCVTSLSCDSHRSLIVAGLGDGS 1041
Cdd:COG2319 23 AALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLA-TLLGHTAAVLSVAFSPDGRLLASASADGT 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1042 IRVYDrrmALSECRVMTYREHTAWVVKAYLqkHPEGH-IVSVSVNGDVRFFDPRMPESVNVMQIVKG-LTALDIHPQANL 1119
Cdd:COG2319 102 VRLWD---LATGLLLRTLTGHTGAVRSVAF--SPDGKtLASGSADGTVRLWDLATGKLLRTLTGHSGaVTSVAFSPDGKL 176
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907083413 1120 IACGSMNQFTAIYN-GNGELINNIKYYDgfmgqrvGAISCLAFHPHWPHLAVGSNDYYISVYSVE 1183
Cdd:COG2319 177 LASGSDDGTVRLWDlATGKLLRTLTGHT-------GAVRSVAFSPDGKLLASGSADGTVRLWDLA 234
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
415-522 |
1.43e-08 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 54.63 E-value: 1.43e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 415 LEQLSDPHPLLRQWVAICLGRIWQnfdsarwcgvrDSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDH 494
Cdd:COG1413 53 LEALKDPDPEVRAAAAEALGRIGD-----------PEAVPALIAALKDEDPEVRRAAAEALG--------RLGDPAAVPA 113
|
90 100
....*....|....*....|....*...
gi 1907083413 495 nvammLAQLINDGSPMVRKELVVALSHL 522
Cdd:COG1413 114 -----LLEALKDPDWEVRRAAARALGRL 136
|
|
| WDR74 |
cd22857 |
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ... |
890-1185 |
2.75e-07 |
|
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage of the pre-rRNA processing pathway. NVL2 is a type II double ring, AAA-ATPase, that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness, by inducing nuclear beta-catenin accumulation and driving downstream Wnt-responsive genes expression. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain, and is required for nucleolar localization.
Pssm-ID: 439303 [Multi-domain] Cd Length: 325 Bit Score: 53.77 E-value: 2.75e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 890 IAVADKD-SICFWDWEKGEKLDYFHNGNPRYT-----RVTAMEYLNGQdcslLLTATDDGAIRVWKNFADLEKNPEmVTA 963
Cdd:cd22857 47 LAVARKNgTVEVLDPENGDLLASFSDSEPATKlseedHFVGLHLFSGT----LLTCTSKGSLRSTKLPDDSTASSS-PTA 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 964 WQGLSDMLPTTRGagmvvdwEQETGLLMSSGDVRIVRIWDTdrETKVQDI---------------PTgadsCVTS---LS 1025
Cdd:cd22857 122 WVCLGGNLLCMRV-------DPNENYFAFGGKEVELNVWDL--EEKPGKIwraknvpndslglrvPV----WVTDltfLS 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1026 CDSHRSlIVAGLGDGSIRVYD----RRmalsecRVM--TYREHTAWVVkaylQKHPEGHIVSVSVN-GDVRFFDPRmpes 1098
Cdd:cd22857 189 KDDHRK-IVTGTGYHQVRLYDtraqRR------PVVsvDFGETPIKAV----AEDPDGHTVYVGDTsGDLASIDLR---- 253
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 1099 vnvmqivkgltaldihpqanliacgsmnqftaiyngNGELINNikyYDGFMGqrvGAISCLAFHPHWPHLAVGSNDYYIS 1178
Cdd:cd22857 254 ------------------------------------TGKLLGC---FKGKCG---GSIRSIARHPELPLIASCGLDRYLR 291
|
....*..
gi 1907083413 1179 VYSVEKR 1185
Cdd:cd22857 292 IWDTETR 298
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
415-522 |
4.69e-07 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 50.40 E-value: 4.69e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 415 LEQLSDPHPLLRQWVAICLGRIWQnfdsarwcgvrDSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDH 494
Cdd:COG1413 22 IAALADEDPDVRAAAARALGRLGD-----------PRAVPALLEALKDPDPEVRAAAAEALG--------RIGDPEAVPA 82
|
90 100
....*....|....*....|....*...
gi 1907083413 495 nvammLAQLINDGSPMVRKELVVALSHL 522
Cdd:COG1413 83 -----LIAALKDEDPEVRRAAAEALGRL 105
|
|
| HEAT_2 |
pfam13646 |
HEAT repeats; This family includes multiple HEAT repeats. |
415-520 |
2.59e-05 |
|
HEAT repeats; This family includes multiple HEAT repeats.
Pssm-ID: 433376 [Multi-domain] Cd Length: 88 Bit Score: 43.87 E-value: 2.59e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 415 LEQL-SDPHPLLRQWVAICLGRIwqNFDSARwcgvrdsahEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTID 493
Cdd:pfam13646 5 LQALlRDPDPEVRAAAIRALGRI--GDPEAV---------PALLELLKDEDPAVRRAAAEALG--------KIGDPEALP 65
|
90 100
....*....|....*....|....*..
gi 1907083413 494 HnvamMLAQLINDGSPMVRKELVVALS 520
Cdd:pfam13646 66 A----LLELLRDDDDDVVRAAAAEALA 88
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
411-476 |
2.93e-05 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 45.00 E-value: 2.93e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907083413 411 IAICLEQLSDPHPLLRQWVAICLGRIWqnfdsarwcgvRDSAHEKLYSLLSDPIPEVRCAAVFALG 476
Cdd:COG1413 80 VPALIAALKDEDPEVRRAAAEALGRLG-----------DPAAVPALLEALKDPDWEVRRAAARALG 134
|
|
| COG5096 |
COG5096 |
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular ... |
324-548 |
1.72e-04 |
|
Vesicle coat complex, various subunits [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 227427 [Multi-domain] Cd Length: 757 Bit Score: 45.87 E-value: 1.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 324 IFPYVLKLLQSSARELRPLLVFIW---AK------ILAVDSscqadLVKDNGHKyflsvlaDPYMpaehRTMTAFILAVI 394
Cdd:COG5096 56 LFPDVIKNVATRDVELKRLLYLYLeryAKlkpelaLLAVNT-----IQKDLQDP-------NEEI----RGFALRTLSLL 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907083413 395 vnsyttgQEACLQGNLIAICLEQLSDPHPLLRQWVAICLGRIWQnFDSARWCGVRDSAHEKLysLLSDPIPEVRCAAVFA 474
Cdd:COG5096 120 -------RVKELLGNIIDPIKKLLTDPHAYVRKTAALAVAKLYR-LDKDLYHELGLIDILKE--LVADSDPIVIANALAS 189
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907083413 475 LGTFVGNSAErtDHSTTIDHNVAMMLAQLINDGSPMVRKELVVALSHLVVQYESN---FCT-VALQFMEEEKNYPLPS 548
Cdd:COG5096 190 LAEIDPELAH--GYSLEVILRIPQLDLLSLSVSTEWLLLIILEVLTERVPTTPDSaedFEErLSPPLQHNNAEVLLIA 265
|
|
| HEAT |
COG1413 |
HEAT repeat [General function prediction only]; |
450-522 |
1.38e-03 |
|
HEAT repeat [General function prediction only];
Pssm-ID: 441023 [Multi-domain] Cd Length: 137 Bit Score: 40.38 E-value: 1.38e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907083413 450 DSAHEKLYSLLSDPIPEVRCAAVFALGtfvgnsaeRTDHSTTIDHnvammLAQLINDGSPMVRKELVVALSHL 522
Cdd:COG1413 15 PAAVPALIAALADEDPDVRAAAARALG--------RLGDPRAVPA-----LLEALKDPDPEVRAAAAEALGRI 74
|
|
| HEAT |
pfam02985 |
HEAT repeat; The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see ... |
456-479 |
6.14e-03 |
|
HEAT repeat; The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514).
Pssm-ID: 460773 Cd Length: 31 Bit Score: 35.58 E-value: 6.14e-03
|
|
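
Each hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", with an optional "[Multi-domain]" flag. A minimal sketch of how such a line might be parsed for downstream filtering follows. The names `HitStats` and `parse_stats` are hypothetical helpers introduced here for illustration and are not part of any NCBI tool; the regular expression assumes the field order shown in this listing.

```python
import re
from typing import NamedTuple, Optional


class HitStats(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the field order used in the listing above.
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*([\d.eE+-]+)"
)


def parse_stats(line: str) -> Optional[HitStats]:
    """Return the parsed fields, or None if the line is not a summary line."""
    m = _STATS_RE.search(line)
    if m is None:
        return None
    return HitStats(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


if __name__ == "__main__":
    line = "Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 98.18 E-value: 3.42e-22"
    print(parse_stats(line))
```

Lines that do not match the pattern (alignment rows, rulers, descriptions) return None, so the function can be mapped over the whole listing and the None results dropped.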