NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|157819841|ref|NP_001102059|]
View 

mitogen-activated protein kinase-binding protein 1 [Rattus norvegicus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
297-735 3.79e-41

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 157.38  E-value: 3.79e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  297 GCADGTVRLFNPSNLHFLSTLPRPHALGTDIATITEASRLFSGGANARypdtiALTFDPANQWLSCVYNDHSIYVWDVRD 376
Cdd:COG2319    35 LAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVL-----SVAFSPDGRLLASASADGTVRLWDLAT 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  377 PKKVGKVYSalyHSSCVWSVEVYPeikDSNQaclppssFITCSSDNTIRLWNTESsgvhGSALHrnilsndlikiiyvdg 456
Cdd:COG2319   110 GLLLRTLTG---HTGAVRSVAFSP---DGKT-------LASGSADGTVRLWDLAT----GKLLR---------------- 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  457 ntqalldtelpggdkadgSLMDPRVGIRSVCISPNGQHLASGDRMGTLRVHELQSLSELLKVEAHDSEILCLEYSkPDtG 536
Cdd:COG2319   157 ------------------TLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS-PD-G 216
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  537 lKLLASASRDRLIHVLDAGREySLQQTLDEHSSSITAVKFAAsDGQvRMISCGADKSIYFRTAQkSGEGVQfTRTHHVVR 616
Cdd:COG2319   217 -KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAFSP-DGR-LLASGSADGTVRLWDLA-TGELLR-TLTGHSGG 290
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  617 KTTLydmdvepSW----KYTAIGCQDRNIRIFNISSGKQKKLFKGSQGEDGTlikVQTDPSGIYIATSCSDKNLSIFDFF 692
Cdd:COG2319   291 VNSV-------AFspdgKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAFSPDGKTLASGSDDGTVRLWDLA 360
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|...
gi 157819841  693 SGECVATMFGHSEIVTGMKFSNDCKHLISVSGDSCIFVWRLSS 735
Cdd:COG2319   361 TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
16-511 1.29e-31

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 129.26  E-value: 1.29e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841   16 LLRSPSIKLRRSKAGNRREDLSSKVTLEKVLGVTVSGGRGLACDPRSG-LVAYPAGCVVVLFNPRKHKQHHILNSSRKTI 94
Cdd:COG2319    44 ASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRlLASASADGTVRLWDLATGLLLRTLTGHTGAV 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841   95 TALAFSPDGKYLVTGESGHmpAVRVWDVAERNQVAELQEHKYGVACVAFSPSAKYIVSVGYqhDMIVNVWAWKKNIVVAS 174
Cdd:COG2319   124 RSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSD--DGTVRLWDLATGKLLRT 199
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  175 -NKVSSRVTAVSFSEDCSYFVTAG-NRHIKFWyldDSKTSKVNATVPllGRSGLLgelrnnlfTDVAcgrgkkadstfci 252
Cdd:COG2319   200 lTGHTGAVRSVAFSPDGKLLASGSaDGTVRLW---DLATGKLLRTLT--GHSGSV--------RSVA------------- 253
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  253 tssgllceFS-DRRLLdkwvelrntdsftttvahcisVSqeyifcGCADGTVRLFNPSNLHFLSTLPRPhalgtdiatit 331
Cdd:COG2319   254 --------FSpDGRLL---------------------AS------GSADGTVRLWDLATGELLRTLTGH----------- 287
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  332 easrlfSGGANarypdtiALTFDPANQWLSCVYNDHSIYVWDVRDPKKVgKVYSAlyHSSCVWSVEVYPeikDSNQaclp 411
Cdd:COG2319   288 ------SGGVN-------SVAFSPDGKLLASGSDDGTVRLWDLATGKLL-RTLTG--HTGAVRSVAFSP---DGKT---- 344
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  412 pssFITCSSDNTIRLWNTESSGvhgsalhrnilsndlikiiyvdgntqalLDTELPGGDKAdgslmdprvgIRSVCISPN 491
Cdd:COG2319   345 ---LASGSDDGTVRLWDLATGE----------------------------LLRTLTGHTGA----------VTSVAFSPD 383
                         490       500
                  ....*....|....*....|
gi 157819841  492 GQHLASGDRMGTLRVHELQS 511
Cdd:COG2319   384 GRTLASGSADGTVRLWDLAT 403
PHA03307 super family cl33723
transcriptional regulator ICP4; Provisional
843-1213 6.27e-05

transcriptional regulator ICP4; Provisional


The actual alignment was detected with superfamily member PHA03307:

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.86  E-value: 6.27e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  843 LGPAPTAN--TGPKRRGRWAQPGVELSVRSMLDLRQLETLAPSPRGPSQDSLAVSPTGPGKHSPQAADLSCAsqnerAPR 920
Cdd:PHA03307   59 AAACDRFEppTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSP-----APD 133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  921 LQASQPCSCPHIIQllSQEEGVFAQDLESAPIEDGIVYPEPSDSPTMDTSAfqvqAPTGGSLGRVYPGSRGSEKHSPDSa 1000
Cdd:PHA03307  134 LSEMLRPVGSPGPP--PAASPPAAGASPAAVASDAASSRQAALPLSSPEET----ARAPSSPPAEPPPSTPPAAASPRP- 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1001 csvdysssrlsSPEHPNEDSESTEPLSVDGVSSDLEEQAEGEEEEEEEGGTGLCGLQEGSPHTPDQEQFLKQHFETLANG 1080
Cdd:PHA03307  207 -----------PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW 275
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1081 AAPGgpARALERTESRSISSRfllqvqtSPLREPSLSSSGlALMSRPDQVSQVSGEQLKGSGAT------------PPGA 1148
Cdd:PHA03307  276 NGPS--SRPGPASSSSSPRER-------SPSPSPSSPGSG-PAPSSPRASSSSSSSRESSSSSTssssessrgaavSPGP 345
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 157819841 1149 PPEMEPSSGNsgPKQVAPVLLPRRRTNLDNSWASKKTAATRPLAGLQKAQSVHSLVPQDEVPSRP 1213
Cdd:PHA03307  346 SPSRSPSPSR--PPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1144-1408 1.87e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 1.87e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1144 TPPGAPPEMEPSSGNSGPKQVAPVLLPRRRTNLDNSWASKKTAATRPLaglqkaqsvhslvPQDEVPSRPLLFQAEVQGS 1223
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPL-------------PPDTHAPDPPPPSPSPAAN 2636
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1224 LGSLPQADGCPSQSHSYWNPTTSSVAKLARSISVGENPGLAAEPQAP----APIRTSPFNKLALPSRahlvldiPKPLPD 1299
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPrrraARPTVGSLTSLADPPP-------PPPTPE 2709
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1300 RPTLTTFSPVSKGLAHSETEQSGPSVSLGKTHTAIEKHSCLGEGTTPKSRTecQAHPGPNHPCAQQLPVSnllqGPESMQ 1379
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP--PTTAGPPAPAPPAAPAA----GPPRRL 2783
                         250       260
                  ....*....|....*....|....*....
gi 157819841 1380 PLSPEKTRNPVESSRPGAALSQDSELALS 1408
Cdd:PHA03247 2784 TRPAVASLSESRESLPSPWDPADPPAAVL 2812
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
297-735 3.79e-41

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 157.38  E-value: 3.79e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  297 GCADGTVRLFNPSNLHFLSTLPRPHALGTDIATITEASRLFSGGANARypdtiALTFDPANQWLSCVYNDHSIYVWDVRD 376
Cdd:COG2319    35 LAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVL-----SVAFSPDGRLLASASADGTVRLWDLAT 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  377 PKKVGKVYSalyHSSCVWSVEVYPeikDSNQaclppssFITCSSDNTIRLWNTESsgvhGSALHrnilsndlikiiyvdg 456
Cdd:COG2319   110 GLLLRTLTG---HTGAVRSVAFSP---DGKT-------LASGSADGTVRLWDLAT----GKLLR---------------- 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  457 ntqalldtelpggdkadgSLMDPRVGIRSVCISPNGQHLASGDRMGTLRVHELQSLSELLKVEAHDSEILCLEYSkPDtG 536
Cdd:COG2319   157 ------------------TLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS-PD-G 216
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  537 lKLLASASRDRLIHVLDAGREySLQQTLDEHSSSITAVKFAAsDGQvRMISCGADKSIYFRTAQkSGEGVQfTRTHHVVR 616
Cdd:COG2319   217 -KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAFSP-DGR-LLASGSADGTVRLWDLA-TGELLR-TLTGHSGG 290
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  617 KTTLydmdvepSW----KYTAIGCQDRNIRIFNISSGKQKKLFKGSQGEDGTlikVQTDPSGIYIATSCSDKNLSIFDFF 692
Cdd:COG2319   291 VNSV-------AFspdgKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAFSPDGKTLASGSDDGTVRLWDLA 360
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|...
gi 157819841  693 SGECVATMFGHSEIVTGMKFSNDCKHLISVSGDSCIFVWRLSS 735
Cdd:COG2319   361 TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
16-511 1.29e-31

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 129.26  E-value: 1.29e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841   16 LLRSPSIKLRRSKAGNRREDLSSKVTLEKVLGVTVSGGRGLACDPRSG-LVAYPAGCVVVLFNPRKHKQHHILNSSRKTI 94
Cdd:COG2319    44 ASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRlLASASADGTVRLWDLATGLLLRTLTGHTGAV 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841   95 TALAFSPDGKYLVTGESGHmpAVRVWDVAERNQVAELQEHKYGVACVAFSPSAKYIVSVGYqhDMIVNVWAWKKNIVVAS 174
Cdd:COG2319   124 RSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSD--DGTVRLWDLATGKLLRT 199
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  175 -NKVSSRVTAVSFSEDCSYFVTAG-NRHIKFWyldDSKTSKVNATVPllGRSGLLgelrnnlfTDVAcgrgkkadstfci 252
Cdd:COG2319   200 lTGHTGAVRSVAFSPDGKLLASGSaDGTVRLW---DLATGKLLRTLT--GHSGSV--------RSVA------------- 253
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  253 tssgllceFS-DRRLLdkwvelrntdsftttvahcisVSqeyifcGCADGTVRLFNPSNLHFLSTLPRPhalgtdiatit 331
Cdd:COG2319   254 --------FSpDGRLL---------------------AS------GSADGTVRLWDLATGELLRTLTGH----------- 287
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  332 easrlfSGGANarypdtiALTFDPANQWLSCVYNDHSIYVWDVRDPKKVgKVYSAlyHSSCVWSVEVYPeikDSNQaclp 411
Cdd:COG2319   288 ------SGGVN-------SVAFSPDGKLLASGSDDGTVRLWDLATGKLL-RTLTG--HTGAVRSVAFSP---DGKT---- 344
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  412 pssFITCSSDNTIRLWNTESSGvhgsalhrnilsndlikiiyvdgntqalLDTELPGGDKAdgslmdprvgIRSVCISPN 491
Cdd:COG2319   345 ---LASGSDDGTVRLWDLATGE----------------------------LLRTLTGHTGA----------VTSVAFSPD 383
                         490       500
                  ....*....|....*....|
gi 157819841  492 GQHLASGDRMGTLRVHELQS 511
Cdd:COG2319   384 GRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
350-732 9.44e-31

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 123.60  E-value: 9.44e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  350 ALTFDPANQWLSCVYNDHSIYVWDVRDpkkvGKVYSALY-HSSCVWSVEVYPeikDSNQaclppssFITCSSDNTIRLWN 428
Cdd:cd00200    14 CVAFSPDGKLLATGSGDGTIKVWDLET----GELLRTLKgHTGPVRDVAASA---DGTY-------LASGSSDKTIRLWD 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  429 TESSGVhgsalhrnilsndlikIIYVDGNTQAlldtelpggdkadgslmdprvgIRSVCISPNGQHLASGDRMGTLRVHE 508
Cdd:cd00200    80 LETGEC----------------VRTLTGHTSY----------------------VSSVAFSPDGRILSSSSRDKTIKVWD 121
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  509 LQSLSELLKVEAHDSEILCLEYSKPDTglkLLASASRDRLIHVLDAgREYSLQQTLDEHSSSITAVKFaaSDGQVRMISC 588
Cdd:cd00200   122 VETGKCLTTLRGHTDWVNSVAFSPDGT---FVASSSQDGTIKLWDL-RTGKCVATLTGHTGEVNSVAF--SPDGEKLLSS 195
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  589 GADKSIyfrtaqksgegvqftrthhvvrkttlydmdvepswkytaigcqdrniRIFNISSGKQKKLFkgsQGEDGTLIKV 668
Cdd:cd00200   196 SSDGTI-----------------------------------------------KLWDLSTGKCLGTL---RGHENGVNSV 225
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 157819841  669 QTDPSGiYIATSCS-DKNLSIFDFFSGECVATMFGHSEIVTGMKFSNDCKHLISVSGDSCIFVWR 732
Cdd:cd00200   226 AFSPDG-YLLASGSeDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
84-428 8.04e-30

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 120.90  E-value: 8.04e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841   84 HHILNSSRKTITALAFSPDGKYLVTGESGHMpaVRVWDVAERNQVAELQEHKYGVACVAFSPSAKYIVSVGYqhDMIVNV 163
Cdd:cd00200     2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRL 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  164 WAWKKNIVVAS-NKVSSRVTAVSFSEDCSYFVTAG-NRHIKFWYLDDSKTSKVnatvpLLGRSGllgelrnnlftDVACG 241
Cdd:cd00200    78 WDLETGECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVWDVETGKCLTT-----LRGHTD-----------WVNSV 141
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  242 RGKKaDSTFcITSSGllcefSDR--RLldkWvELRNTDSFTTTVAH-----CISVS--QEYIFCGCADGTVRLFNPSNLH 312
Cdd:cd00200   142 AFSP-DGTF-VASSS-----QDGtiKL---W-DLRTGKCVATLTGHtgevnSVAFSpdGEKLLSSSSDGTIKLWDLSTGK 210
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  313 FLSTLpRPHalgTDIATiteasrlfsgganarypdtiALTFDPANQWLSCVYNDHSIYVWDVRDPKKVGKVYSalyHSSC 392
Cdd:cd00200   211 CLGTL-RGH---ENGVN--------------------SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNS 263
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 157819841  393 VWSVEVYPEIKDsnqaclppssFITCSSDNTIRLWN 428
Cdd:cd00200   264 VTSLAWSPDGKR----------LASGSADGTIRIWD 289
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
693-731 2.64e-07

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 48.08  E-value: 2.64e-07
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 157819841    693 SGECVATMFGHSEIVTGMKFSNDCKHLISVSGDSCIFVW 731
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
WD40 pfam00400
WD domain, G-beta repeat;
694-731 7.36e-06

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 44.26  E-value: 7.36e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 157819841   694 GECVATMFGHSEIVTGMKFSNDCKHLISVSGDSCIFVW 731
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
843-1213 6.27e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.86  E-value: 6.27e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  843 LGPAPTAN--TGPKRRGRWAQPGVELSVRSMLDLRQLETLAPSPRGPSQDSLAVSPTGPGKHSPQAADLSCAsqnerAPR 920
Cdd:PHA03307   59 AAACDRFEppTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSP-----APD 133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  921 LQASQPCSCPHIIQllSQEEGVFAQDLESAPIEDGIVYPEPSDSPTMDTSAfqvqAPTGGSLGRVYPGSRGSEKHSPDSa 1000
Cdd:PHA03307  134 LSEMLRPVGSPGPP--PAASPPAAGASPAAVASDAASSRQAALPLSSPEET----ARAPSSPPAEPPPSTPPAAASPRP- 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1001 csvdysssrlsSPEHPNEDSESTEPLSVDGVSSDLEEQAEGEEEEEEEGGTGLCGLQEGSPHTPDQEQFLKQHFETLANG 1080
Cdd:PHA03307  207 -----------PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW 275
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1081 AAPGgpARALERTESRSISSRfllqvqtSPLREPSLSSSGlALMSRPDQVSQVSGEQLKGSGAT------------PPGA 1148
Cdd:PHA03307  276 NGPS--SRPGPASSSSSPRER-------SPSPSPSSPGSG-PAPSSPRASSSSSSSRESSSSSTssssessrgaavSPGP 345
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 157819841 1149 PPEMEPSSGNsgPKQVAPVLLPRRRTNLDNSWASKKTAATRPLAGLQKAQSVHSLVPQDEVPSRP 1213
Cdd:PHA03307  346 SPSRSPSPSR--PPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
117-282 2.97e-04

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 45.46  E-value: 2.97e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  117 VRVWDVAERNQVAELQEHKYGVACVAFSPSAKYIVSVGyQHDMIVNVWAWKKNIVVASNKVSSRVTAVSFSEDCSYFVTA 196
Cdd:PLN00181  557 VQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASG-SDDGSVKLWSINQGVSIGTIKTKANICCVQFPSESGRSLAF 635
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  197 GNRHIKFWYLDDSktskvNATVPL---LGRSGLLGELRnnlFTDVACGRGKKADST-----FCITSSGL----LCEFSDR 264
Cdd:PLN00181  636 GSADHKVYYYDLR-----NPKLPLctmIGHSKTVSYVR---FVDSSTLVSSSTDNTlklwdLSMSISGInetpLHSFMGH 707
                         170
                  ....*....|....*...
gi 157819841  265 RLLDKWVELRNTDSFTTT 282
Cdd:PLN00181  708 TNVKNFVGLSVSDGYIAT 725
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
80-121 3.77e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.22  E-value: 3.77e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 157819841     80 KHKQHHILNSSRKTITALAFSPDGKYLVTG-ESGHmpaVRVWD 121
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGsDDGT---IKLWD 40
eIF2A pfam08662
Eukaryotic translation initiation factor eIF2A; This is a family of eukaryotic translation ...
73-151 5.40e-04

Eukaryotic translation initiation factor eIF2A; This is a family of eukaryotic translation initiation factors.


Pssm-ID: 462552 [Multi-domain]  Cd Length: 194  Bit Score: 42.65  E-value: 5.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841    73 VVLFNPRKHKQHHILNSSRKTItalAFSPDGKYLVTGESGHMP-AVRVWDVAERNQVAElQEHKYGVACvAFSPSAKYIV 151
Cdd:pfam08662   85 VSFFDLKGNVIHSFGEQPRNTI---FWSPFGRLVLLAGFGNLAgDIEFWDVVNKKKIAT-AEASNATLC-EWSPDGRYFL 159
PHA03247 PHA03247
large tegument protein UL36; Provisional
1144-1408 1.87e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 1.87e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1144 TPPGAPPEMEPSSGNSGPKQVAPVLLPRRRTNLDNSWASKKTAATRPLaglqkaqsvhslvPQDEVPSRPLLFQAEVQGS 1223
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPL-------------PPDTHAPDPPPPSPSPAAN 2636
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1224 LGSLPQADGCPSQSHSYWNPTTSSVAKLARSISVGENPGLAAEPQAP----APIRTSPFNKLALPSRahlvldiPKPLPD 1299
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPrrraARPTVGSLTSLADPPP-------PPPTPE 2709
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1300 RPTLTTFSPVSKGLAHSETEQSGPSVSLGKTHTAIEKHSCLGEGTTPKSRTecQAHPGPNHPCAQQLPVSnllqGPESMQ 1379
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP--PTTAGPPAPAPPAAPAA----GPPRRL 2783
                         250       260
                  ....*....|....*....|....*....
gi 157819841 1380 PLSPEKTRNPVESSRPGAALSQDSELALS 1408
Cdd:PHA03247 2784 TRPAVASLSESRESLPSPWDPADPPAAVL 2812
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
297-735 3.79e-41

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 157.38  E-value: 3.79e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  297 GCADGTVRLFNPSNLHFLSTLPRPHALGTDIATITEASRLFSGGANARypdtiALTFDPANQWLSCVYNDHSIYVWDVRD 376
Cdd:COG2319    35 LAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVL-----SVAFSPDGRLLASASADGTVRLWDLAT 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  377 PKKVGKVYSalyHSSCVWSVEVYPeikDSNQaclppssFITCSSDNTIRLWNTESsgvhGSALHrnilsndlikiiyvdg 456
Cdd:COG2319   110 GLLLRTLTG---HTGAVRSVAFSP---DGKT-------LASGSADGTVRLWDLAT----GKLLR---------------- 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  457 ntqalldtelpggdkadgSLMDPRVGIRSVCISPNGQHLASGDRMGTLRVHELQSLSELLKVEAHDSEILCLEYSkPDtG 536
Cdd:COG2319   157 ------------------TLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS-PD-G 216
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  537 lKLLASASRDRLIHVLDAGREySLQQTLDEHSSSITAVKFAAsDGQvRMISCGADKSIYFRTAQkSGEGVQfTRTHHVVR 616
Cdd:COG2319   217 -KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAFSP-DGR-LLASGSADGTVRLWDLA-TGELLR-TLTGHSGG 290
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  617 KTTLydmdvepSW----KYTAIGCQDRNIRIFNISSGKQKKLFKGSQGEDGTlikVQTDPSGIYIATSCSDKNLSIFDFF 692
Cdd:COG2319   291 VNSV-------AFspdgKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAFSPDGKTLASGSDDGTVRLWDLA 360
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|...
gi 157819841  693 SGECVATMFGHSEIVTGMKFSNDCKHLISVSGDSCIFVWRLSS 735
Cdd:COG2319   361 TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
25-554 1.42e-32

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 131.96  E-value: 1.42e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841   25 RRSKAGNRREDLSSKVTLEKVLGVTVSGGRGLACDPRSGLVAYPAGCVVVLFNPRKHKQHHILNSSRKTITALAFSPDGK 104
Cdd:COG2319    12 ASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGR 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  105 YLVTGesGHMPAVRVWDVAERNQVAELQEHKYGVACVAFSPSAKYIVSVGYQHdmIVNVWAWKKNIVVAS-NKVSSRVTA 183
Cdd:COG2319    92 LLASA--SADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTlTGHSGAVTS 167
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  184 VSFSEDCSYFVTAG-NRHIKFWYLDDSKTSKVnatvpLLGRSGllgelrnnlftdvacgrgkkadstfcitssgllcefs 262
Cdd:COG2319   168 VAFSPDGKLLASGSdDGTVRLWDLATGKLLRT-----LTGHTG------------------------------------- 205
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  263 drrlldkWVelrntdsftTTVAhcISVSQEYIFCGCADGTVRLFNPSNLHFLSTLPRPHALGTDIAtiteasrlfsggan 342
Cdd:COG2319   206 -------AV---------RSVA--FSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVA-------------- 253
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  343 arypdtialtFDPANQWLSCVYNDHSIYVWDVRDPKkvgKVYSALYHSSCVWSVEVYPeikDSNQaclppssFITCSSDN 422
Cdd:COG2319   254 ----------FSPDGRLLASGSADGTVRLWDLATGE---LLRTLTGHSGGVNSVAFSP---DGKL-------LASGSDDG 310
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  423 TIRLWNTESsgvhGSALHrnilsndlikiiyvdgntqalldtelpggdkadgSLMDPRVGIRSVCISPNGQHLASGDRMG 502
Cdd:COG2319   311 TVRLWDLAT----GKLLR----------------------------------TLTGHTGAVRSVAFSPDGKTLASGSDDG 352
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|..
gi 157819841  503 TLRVHELQSLSELLKVEAHDSEILCLEYSkPDTglKLLASASRDRLIHVLDA 554
Cdd:COG2319   353 TVRLWDLATGELLRTLTGHTGAVTSVAFS-PDG--RTLASGSADGTVRLWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
16-511 1.29e-31

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 129.26  E-value: 1.29e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841   16 LLRSPSIKLRRSKAGNRREDLSSKVTLEKVLGVTVSGGRGLACDPRSG-LVAYPAGCVVVLFNPRKHKQHHILNSSRKTI 94
Cdd:COG2319    44 ASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRlLASASADGTVRLWDLATGLLLRTLTGHTGAV 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841   95 TALAFSPDGKYLVTGESGHmpAVRVWDVAERNQVAELQEHKYGVACVAFSPSAKYIVSVGYqhDMIVNVWAWKKNIVVAS 174
Cdd:COG2319   124 RSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSD--DGTVRLWDLATGKLLRT 199
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  175 -NKVSSRVTAVSFSEDCSYFVTAG-NRHIKFWyldDSKTSKVNATVPllGRSGLLgelrnnlfTDVAcgrgkkadstfci 252
Cdd:COG2319   200 lTGHTGAVRSVAFSPDGKLLASGSaDGTVRLW---DLATGKLLRTLT--GHSGSV--------RSVA------------- 253
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  253 tssgllceFS-DRRLLdkwvelrntdsftttvahcisVSqeyifcGCADGTVRLFNPSNLHFLSTLPRPhalgtdiatit 331
Cdd:COG2319   254 --------FSpDGRLL---------------------AS------GSADGTVRLWDLATGELLRTLTGH----------- 287
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  332 easrlfSGGANarypdtiALTFDPANQWLSCVYNDHSIYVWDVRDPKKVgKVYSAlyHSSCVWSVEVYPeikDSNQaclp 411
Cdd:COG2319   288 ------SGGVN-------SVAFSPDGKLLASGSDDGTVRLWDLATGKLL-RTLTG--HTGAVRSVAFSP---DGKT---- 344
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  412 pssFITCSSDNTIRLWNTESSGvhgsalhrnilsndlikiiyvdgntqalLDTELPGGDKAdgslmdprvgIRSVCISPN 491
Cdd:COG2319   345 ---LASGSDDGTVRLWDLATGE----------------------------LLRTLTGHTGA----------VTSVAFSPD 383
                         490       500
                  ....*....|....*....|
gi 157819841  492 GQHLASGDRMGTLRVHELQS 511
Cdd:COG2319   384 GRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
350-732 9.44e-31

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 123.60  E-value: 9.44e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  350 ALTFDPANQWLSCVYNDHSIYVWDVRDpkkvGKVYSALY-HSSCVWSVEVYPeikDSNQaclppssFITCSSDNTIRLWN 428
Cdd:cd00200    14 CVAFSPDGKLLATGSGDGTIKVWDLET----GELLRTLKgHTGPVRDVAASA---DGTY-------LASGSSDKTIRLWD 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  429 TESSGVhgsalhrnilsndlikIIYVDGNTQAlldtelpggdkadgslmdprvgIRSVCISPNGQHLASGDRMGTLRVHE 508
Cdd:cd00200    80 LETGEC----------------VRTLTGHTSY----------------------VSSVAFSPDGRILSSSSRDKTIKVWD 121
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  509 LQSLSELLKVEAHDSEILCLEYSKPDTglkLLASASRDRLIHVLDAgREYSLQQTLDEHSSSITAVKFaaSDGQVRMISC 588
Cdd:cd00200   122 VETGKCLTTLRGHTDWVNSVAFSPDGT---FVASSSQDGTIKLWDL-RTGKCVATLTGHTGEVNSVAF--SPDGEKLLSS 195
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  589 GADKSIyfrtaqksgegvqftrthhvvrkttlydmdvepswkytaigcqdrniRIFNISSGKQKKLFkgsQGEDGTLIKV 668
Cdd:cd00200   196 SSDGTI-----------------------------------------------KLWDLSTGKCLGTL---RGHENGVNSV 225
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 157819841  669 QTDPSGiYIATSCS-DKNLSIFDFFSGECVATMFGHSEIVTGMKFSNDCKHLISVSGDSCIFVWR 732
Cdd:cd00200   226 AFSPDG-YLLASGSeDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
84-428 8.04e-30

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 120.90  E-value: 8.04e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841   84 HHILNSSRKTITALAFSPDGKYLVTGESGHMpaVRVWDVAERNQVAELQEHKYGVACVAFSPSAKYIVSVGYqhDMIVNV 163
Cdd:cd00200     2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRL 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  164 WAWKKNIVVAS-NKVSSRVTAVSFSEDCSYFVTAG-NRHIKFWYLDDSKTSKVnatvpLLGRSGllgelrnnlftDVACG 241
Cdd:cd00200    78 WDLETGECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVWDVETGKCLTT-----LRGHTD-----------WVNSV 141
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  242 RGKKaDSTFcITSSGllcefSDR--RLldkWvELRNTDSFTTTVAH-----CISVS--QEYIFCGCADGTVRLFNPSNLH 312
Cdd:cd00200   142 AFSP-DGTF-VASSS-----QDGtiKL---W-DLRTGKCVATLTGHtgevnSVAFSpdGEKLLSSSSDGTIKLWDLSTGK 210
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  313 FLSTLpRPHalgTDIATiteasrlfsgganarypdtiALTFDPANQWLSCVYNDHSIYVWDVRDPKKVGKVYSalyHSSC 392
Cdd:cd00200   211 CLGTL-RGH---ENGVN--------------------SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNS 263
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 157819841  393 VWSVEVYPEIKDsnqaclppssFITCSSDNTIRLWN 428
Cdd:cd00200   264 VTSLAWSPDGKR----------LASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
281-594 6.61e-29

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 121.17  E-value: 6.61e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  281 TTVAhcISVSQEYIFCGCADGTVRLFNPSNLHFLSTLPRPHALGTDIAtiteasrlfsgganarypdtialtFDPANQWL 360
Cdd:COG2319   124 RSVA--FSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVA------------------------FSPDGKLL 177
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  361 SCVYNDHSIYVWDVRDPKKVGKVYSalyHSSCVWSVEVYPeikDSNQaclppssFITCSSDNTIRLWNTE---------- 430
Cdd:COG2319   178 ASGSDDGTVRLWDLATGKLLRTLTG---HTGAVRSVAFSP---DGKL-------LASGSADGTVRLWDLAtgkllrtltg 244
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  431 -SSGVHGSALHRN---ILSNDLIKIIYV-DGNTQALLDTelpggdkadgsLMDPRVGIRSVCISPNGQHLASGDRMGTLR 505
Cdd:COG2319   245 hSGSVRSVAFSPDgrlLASGSADGTVRLwDLATGELLRT-----------LTGHSGGVNSVAFSPDGKLLASGSDDGTVR 313
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  506 VHELQSLSELLKVEAHDSEILCLEYSkPDTglKLLASASRDRLIHVLDAgREYSLQQTLDEHSSSITAVKFAAsDGQvRM 585
Cdd:COG2319   314 LWDLATGKLLRTLTGHTGAVRSVAFS-PDG--KTLASGSDDGTVRLWDL-ATGELLRTLTGHTGAVTSVAFSP-DGR-TL 387

                  ....*....
gi 157819841  586 ISCGADKSI 594
Cdd:COG2319   388 ASGSADGTV 396
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
482-736 1.31e-28

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 117.44  E-value: 1.31e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  482 GIRSVCISPNGQHLASGDRMGTLRVHELQSLSELLKVEAHDSEILCLEYSkPDTglKLLASASRDRLIHVLDAGREYSLQ 561
Cdd:cd00200    11 GVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS-ADG--TYLASGSSDKTIRLWDLETGECVR 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  562 qTLDEHSSSITAVKFAASDgqvRMI-SCGADKSIYFRTAQkSGEGVQFTRTHhvvrKTTLYDMDVEPSWKYTAIGCQDRN 640
Cdd:cd00200    88 -TLTGHTSYVSSVAFSPDG---RILsSSSRDKTIKVWDVE-TGKCLTTLRGH----TDWVNSVAFSPDGTFVASSSQDGT 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  641 IRIFNISSGKQKKLFKGSQGEdgtLIKVQTDPSGIYIATSCSDKNLSIFDFFSGECVATMFGHSEIVTGMKFSNDCKHLI 720
Cdd:cd00200   159 IKLWDLRTGKCVATLTGHTGE---VNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
                         250
                  ....*....|....*.
gi 157819841  721 SVSGDSCIFVWRLSSE 736
Cdd:cd00200   236 SGSEDGTIRVWDLRTG 251
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
286-594 1.65e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 113.97  E-value: 1.65e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  286 CISVSQEYIFCGCADGTVRLFNPSNLHFLSTLpRPHALG-TDIATITEASRLFSGGANarypDTI--------------- 349
Cdd:cd00200    16 AFSPDGKLLATGSGDGTIKVWDLETGELLRTL-KGHTGPvRDVAASADGTYLASGSSD----KTIrlwdletgecvrtlt 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  350 -------ALTFDPANQWLSCVYNDHSIYVWDVRDPKkvgKVYSALYHSSCVWSVEVypeikdsnqacLPPSSFITCSS-D 421
Cdd:cd00200    91 ghtsyvsSVAFSPDGRILSSSSRDKTIKVWDVETGK---CLTTLRGHTDWVNSVAF-----------SPDGTFVASSSqD 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  422 NTIRLWNTESsgvhgsalhrnilsndlIKIIYVdgntqalldtelpggdkadgsLMDPRVGIRSVCISPNGQHLASGDRM 501
Cdd:cd00200   157 GTIKLWDLRT-----------------GKCVAT---------------------LTGHTGEVNSVAFSPDGEKLLSSSSD 198
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  502 GTLRVHELQSLSELLKVEAHDSEILCLEYSKPDtglKLLASASRDRLIHVLDaGREYSLQQTLDEHSSSITAVKFAAsDG 581
Cdd:cd00200   199 GTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDG---YLLASGSEDGTIRVWD-LRTGECVQTLSGHTNSVTSLAWSP-DG 273
                         330
                  ....*....|...
gi 157819841  582 QvRMISCGADKSI 594
Cdd:cd00200   274 K-RLASGSADGTI 285
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
521-739 4.62e-26

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 109.73  E-value: 4.62e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  521 HDSEILCLEYSkPDTglKLLASASRDRLIHVLDAGREySLQQTLDEHSSSITAVKFAASDGQvrMISCGADKSIYFRTAQ 600
Cdd:cd00200     8 HTGGVTCVAFS-PDG--KLLATGSGDGTIKVWDLETG-ELLRTLKGHTGPVRDVAASADGTY--LASGSSDKTIRLWDLE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  601 KSGEGVQFTrtHHvvrKTTLYDMDVEPSWKYTAIGCQDRNIRIFNISSGKQKKLFKGSQGedgTLIKVQTDPSGIYIATS 680
Cdd:cd00200    82 TGECVRTLT--GH---TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTFVASS 153
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 157819841  681 CSDKNLSIFDFFSGECVATMFGHSEIVTGMKFSNDCKHLISVSGDSCIFVWRLSSEMTI 739
Cdd:cd00200   154 SQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCL 212
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
75-165 6.80e-13

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 70.83  E-value: 6.80e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841   75 LFNPRKHKQHHILNSSRKTITALAFSPDGKYLVTG-ESGHmpaVRVWDVAERNQVAELQEHKYGVACVAFSPSAKYIVSV 153
Cdd:cd00200   203 LWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGsEDGT---IRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
                          90
                  ....*....|..
gi 157819841  154 GyqHDMIVNVWA 165
Cdd:cd00200   280 S--ADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
73-219 5.99e-11

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 65.05  E-value: 5.99e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841   73 VVLFNPRKHKQHHILNSSRKTITALAFSPDGKYLVTGESGHMpaVRVWDVAERNQVAELQEHKYGVACVAFSPSAKYIVS 152
Cdd:cd00200   117 IKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGT--IKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLS 194
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  153 VGyqHDMIVNVWAWKKNIVVASNKV-SSRVTAVSFSEDcSYFVTAG--NRHIKFWyldDSKTSKVNATVP 219
Cdd:cd00200   195 SS--SDGTIKLWDLSTGKCLGTLRGhENGVNSVAFSPD-GYLLASGseDGTIRVW---DLRTGECVQTLS 258
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
693-731 2.64e-07

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 48.08  E-value: 2.64e-07
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 157819841    693 SGECVATMFGHSEIVTGMKFSNDCKHLISVSGDSCIFVW 731
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
WD40 pfam00400
WD domain, G-beta repeat;
694-731 7.36e-06

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 44.26  E-value: 7.36e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 157819841   694 GECVATMFGHSEIVTGMKFSNDCKHLISVSGDSCIFVW 731
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
843-1213 6.27e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.86  E-value: 6.27e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  843 LGPAPTAN--TGPKRRGRWAQPGVELSVRSMLDLRQLETLAPSPRGPSQDSLAVSPTGPGKHSPQAADLSCAsqnerAPR 920
Cdd:PHA03307   59 AAACDRFEppTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSP-----APD 133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  921 LQASQPCSCPHIIQllSQEEGVFAQDLESAPIEDGIVYPEPSDSPTMDTSAfqvqAPTGGSLGRVYPGSRGSEKHSPDSa 1000
Cdd:PHA03307  134 LSEMLRPVGSPGPP--PAASPPAAGASPAAVASDAASSRQAALPLSSPEET----ARAPSSPPAEPPPSTPPAAASPRP- 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1001 csvdysssrlsSPEHPNEDSESTEPLSVDGVSSDLEEQAEGEEEEEEEGGTGLCGLQEGSPHTPDQEQFLKQHFETLANG 1080
Cdd:PHA03307  207 -----------PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW 275
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1081 AAPGgpARALERTESRSISSRfllqvqtSPLREPSLSSSGlALMSRPDQVSQVSGEQLKGSGAT------------PPGA 1148
Cdd:PHA03307  276 NGPS--SRPGPASSSSSPRER-------SPSPSPSSPGSG-PAPSSPRASSSSSSSRESSSSSTssssessrgaavSPGP 345
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 157819841 1149 PPEMEPSSGNsgPKQVAPVLLPRRRTNLDNSWASKKTAATRPLAGLQKAQSVHSLVPQDEVPSRP 1213
Cdd:PHA03307  346 SPSRSPSPSR--PPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
45-121 2.68e-04

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 44.63  E-value: 2.68e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 157819841   45 VLGVTVSGGRGLacdprsgLVAYPAGCVVVLFNPRKHKQHHILNSSRKTITALAFSPDGKYLVTGESGHMpaVRVWD 121
Cdd:cd00200   222 VNSVAFSPDGYL-------LASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGT--IRIWD 289
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
117-282 2.97e-04

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 45.46  E-value: 2.97e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  117 VRVWDVAERNQVAELQEHKYGVACVAFSPSAKYIVSVGyQHDMIVNVWAWKKNIVVASNKVSSRVTAVSFSEDCSYFVTA 196
Cdd:PLN00181  557 VQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASG-SDDGSVKLWSINQGVSIGTIKTKANICCVQFPSESGRSLAF 635
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841  197 GNRHIKFWYLDDSktskvNATVPL---LGRSGLLGELRnnlFTDVACGRGKKADST-----FCITSSGL----LCEFSDR 264
Cdd:PLN00181  636 GSADHKVYYYDLR-----NPKLPLctmIGHSKTVSYVR---FVDSSTLVSSSTDNTlklwdLSMSISGInetpLHSFMGH 707
                         170
                  ....*....|....*...
gi 157819841  265 RLLDKWVELRNTDSFTTT 282
Cdd:PLN00181  708 TNVKNFVGLSVSDGYIAT 725
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
80-121 3.77e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.22  E-value: 3.77e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 157819841     80 KHKQHHILNSSRKTITALAFSPDGKYLVTG-ESGHmpaVRVWD 121
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGsDDGT---IKLWD 40
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
94-150 5.05e-04

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 44.64  E-value: 5.05e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 157819841   94 ITALAFSPDGKYLVTGESG--HMPAVRVWDVAErNQVAELQEHKYGVACVAFSPSAKYI 150
Cdd:COG4946   434 ISDLAWSPDSKWLAYSKPGpnQLSQIFLYDVET-GKTVQLTDGRYDDGSPAFSPDGKYL 491
eIF2A pfam08662
Eukaryotic translation initiation factor eIF2A; This is a family of eukaryotic translation ...
73-151 5.40e-04

Eukaryotic translation initiation factor eIF2A; This is a family of eukaryotic translation initiation factors.


Pssm-ID: 462552 [Multi-domain]  Cd Length: 194  Bit Score: 42.65  E-value: 5.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841    73 VVLFNPRKHKQHHILNSSRKTItalAFSPDGKYLVTGESGHMP-AVRVWDVAERNQVAElQEHKYGVACvAFSPSAKYIV 151
Cdd:pfam08662   85 VSFFDLKGNVIHSFGEQPRNTI---FWSPFGRLVLLAGFGNLAgDIEFWDVVNKKKIAT-AEASNATLC-EWSPDGRYFL 159
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
125-164 7.00e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 38.45  E-value: 7.00e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 157819841    125 RNQVAELQEHKYGVACVAFSPSAKYIVSVGYqhDMIVNVW 164
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLW 39
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
696-736 1.04e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 42.71  E-value: 1.04e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 157819841  696 CVATMFGHSEIVTGMKFSNDCKHLISVSGDSCIFVWRLSSE 736
Cdd:cd00200     1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETG 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
1144-1408 1.87e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 1.87e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1144 TPPGAPPEMEPSSGNSGPKQVAPVLLPRRRTNLDNSWASKKTAATRPLaglqkaqsvhslvPQDEVPSRPLLFQAEVQGS 1223
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPL-------------PPDTHAPDPPPPSPSPAAN 2636
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1224 LGSLPQADGCPSQSHSYWNPTTSSVAKLARSISVGENPGLAAEPQAP----APIRTSPFNKLALPSRahlvldiPKPLPD 1299
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPrrraARPTVGSLTSLADPPP-------PPPTPE 2709
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 157819841 1300 RPTLTTFSPVSKGLAHSETEQSGPSVSLGKTHTAIEKHSCLGEGTTPKSRTecQAHPGPNHPCAQQLPVSnllqGPESMQ 1379
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP--PTTAGPPAPAPPAAPAA----GPPRRL 2783
                         250       260
                  ....*....|....*....|....*....
gi 157819841 1380 PLSPEKTRNPVESSRPGAALSQDSELALS 1408
Cdd:PHA03247 2784 TRPAVASLSESRESLPSPWDPADPPAAVL 2812
WD40 pfam00400
WD domain, G-beta repeat;
82-121 2.14e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 37.32  E-value: 2.14e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 157819841    82 KQHHILNSSRKTITALAFSPDGKYLVTGESGHMpaVRVWD 121
Cdd:pfam00400    2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGT--VKVWD 39
WD40 pfam00400
WD domain, G-beta repeat;
125-164 6.70e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 35.78  E-value: 6.70e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 157819841   125 RNQVAELQEHKYGVACVAFSPSAKYIVSVGYqhDMIVNVW 164
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSD--DGTVKVW 38
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH