| Name | Accession | Description | Interval | E-value |
| TLE_N | pfam03920 | Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor ... | 17-120 | 4.16e-76 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor proteins is involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 240.41 E-value: 4.16e-76
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 17 FKFSVLEICDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLSAICAQMVPFLT 96
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 356991206 97 QEHQQQVLQAVDRAKQ-------------QQNQLQPL 120
Cdd:pfam03920 81 QEHQQQVAQAVERAKQvtmaelnaiigqqQQLQAQHL 117
|
|
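Each hit in this listing carries a one-line summary (Pssm-ID, optional [Multi-domain] flag, Cd Length, Bit Score, E-value). The following is a minimal sketch, not part of the search output, of how such a line could be read into typed fields. The names `parse_summary` and `model_coverage` are hypothetical, and the field labels are assumed to appear exactly as they do in the summary lines here.

```python
import re

# Hypothetical helper: the pattern mirrors the summary lines in this listing,
# e.g. "Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 157.38 E-value: 6.78e-42".
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def model_coverage(cd_from, cd_to, cd_length):
    """Fraction of the domain model spanned by the aligned model positions (inclusive)."""
    return (cd_to - cd_from + 1) / cd_length


tle_n = parse_summary("Pssm-ID: 461094 Cd Length: 117 Bit Score: 240.41 E-value: 4.16e-76")
# The TLE_N alignment runs over model positions 1-117 of a 117-residue model.
tle_n_coverage = model_coverage(1, 117, tle_n["cd_length"])
```

Under these assumptions, the coverage helper makes it easy to separate hits that span a whole model, such as TLE_N, from partial hits further down the listing.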
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 391-730 | 6.78e-42 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 157.38 E-value: 6.78e-42
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 391 LRGSSVSLPGIPVAKPAYSFHVSADGQMQPVPFPSDALVGTGIPRHARQLHTLAHGEVVCAVTISSSTQHVYTGGK-GCV 469
Cdd:COG2319 65 AAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSAdGTV 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 470 KVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAK 549
Cdd:COG2319 145 RLWDLA---TGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA--TGKLLRTLTGHTGAVRSVAFSPDGK 217
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 550 VCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHC 628
Cdd:COG2319 218 LLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTlTGHSGGVNSVAFS 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 629 PNQDWLAVGMESSHVEVLHVRKPEK-YQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQSKE-SSSVLS 706
Cdd:COG2319 298 PDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTGAVTS 377
|
330 340
....*....|....*....|....
gi 356991206 707 CDISRNNKYIVTGSGDKKATVYEV 730
Cdd:COG2319 378 VAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 391-730 | 2.72e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 150.06 E-value: 2.72e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 391 LRGSSVSLPGIPVAKPAYSFHVSADGQMQPVPFPSDALVGTGIPRHARQLHTLAHGEVVCAVTISSSTQHVYTGGK-GCV 469
Cdd:COG2319 23 AALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASAdGTV 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 470 KVWDVgqpgsKTPVAQLDCLNRDNYIRSCKLLPDGQSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAK 549
Cdd:COG2319 103 RLWDL-----ATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLA--TGKLLRTLTGHSGAVTSVAFSPDGK 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 550 VCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHC 628
Cdd:COG2319 176 LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTlTGHSGSVRSVAFS 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 629 PNQDWLAVGMESSHVEVLHVR-KPEKYQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQSK-ESSSVLS 706
Cdd:COG2319 256 PDGRLLASGSADGTVRLWDLAtGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRS 335
|
330 340
....*....|....*....|....
gi 356991206 707 CDISRNNKYIVTGSGDKKATVYEV 730
Cdd:COG2319 336 VAFSPDGKTLASGSDDGTVRLWDL 359
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 438-690 | 9.56e-37 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 142.74 E-value: 9.56e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 438 RQLHTL-AHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 515
Cdd:COG2319 153 KLLRTLtGHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 516 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWT 595
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 596 GGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVR-KPEKYQLRLHESCVLSLKFASCGR 673
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAtGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 356991206 674 WFVSTGKDNLLNAWRTP 690
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 445-729 | 1.80e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 139.01 E-value: 1.80e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 445 HGEVVCAVTISSSTQHVYTGGK-GCVKVWDV-GQPGSKTPVAQLDClnrdnyIRSCKLLPDGQSLIVGGEASTLSIWDLa 522
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLeTGELLRTLKGHTGP------VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 523 aPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTV 602
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 603 RCWDLREGR---QLQQHDfsSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCVLSLKFASCGRWFVST 678
Cdd:cd00200 160 KLWDLRTGKcvaTLTGHT--GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASG 237
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 356991206 679 GKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISRNNKYIVTGSGDKKATVYE 729
Cdd:cd00200 238 SEDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
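The cd00200 description above characterizes a WD40 repeat as roughly 40 residues with a GH dipeptide near the N-terminus and a WD dipeptide at the C-terminus. The following is a rough sketch of a naive scan for that spacing pattern in a sequence. It is a heuristic for illustration only and is not how CD-Search scores hits. The function name `find_gh_wd` and the gap bounds of 12 to 30 residues are assumptions, and many real repeats, including several in this query, carry substitutions at either dipeptide and would be missed.

```python
import re


def find_gh_wd(seq, start=1, min_gap=12, max_gap=30):
    """Find 'GH ... WD' stretches in an upper-case protein sequence.

    `start` is the 1-based residue number of seq[0]. The gap bounds are
    illustrative assumptions, not values taken from the domain model.
    Returns (first_residue, last_residue, matched_text) tuples.
    """
    # Lazy quantifier: stop at the first WD that lies far enough from the GH.
    pattern = re.compile(r"GH[A-Z]{%d,%d}?WD" % (min_gap, max_gap))
    return [
        (start + m.start(), start + m.end() - 1, m.group())
        for m in pattern.finditer(seq)
    ]


# Query residues 571-606, the segment matched by smart00320 and pfam00400 in this listing.
segment = "VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD"
hits = find_gh_wd(segment, start=571)
```

On this segment the scan reports a single stretch from the GH at residue 576 to the WD ending at residue 606, which lies inside the 571-606 interval reported for the single-repeat models.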
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 431-688 | 9.05e-34 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 130.92 E-value: 9.05e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 431 TGIPRHARQLHTlahGEVVCAVTISSSTQhVYTGGK-GCVKVWDVGQPgsktpvaqlDCLNR----DNYIRSCKLLPDGQ 505
Cdd:cd00200 40 TGELLRTLKGHT---GPVRDVAASADGTY-LASGSSdKTIRLWDLETG---------ECVRTltghTSYVSSVAFSPDGR 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 506 SLIVGGEASTLSIWDLaaPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCID 585
Cdd:cd00200 107 ILSSSSRDKTIKVWDV--ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVA 184
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 586 ISDYGTRLWTGGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCV 663
Cdd:cd00200 185 FSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGEcVQTLSGHTNSV 264
|
250 260
....*....|....*....|....*
gi 356991206 664 LSLKFASCGRWFVSTGKDNLLNAWR 688
Cdd:cd00200 265 TSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 539-730 | 1.07e-23 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 102.03 E-value: 1.07e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 539 CYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQ-QHD 617
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRtLTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 618 FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCVLSLKFASCGRwFVSTGK-DNLLNAWRTPYGASI 695
Cdd:cd00200 92 HTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKcLTTLRGHTDWVNSVAFSPDGT-FVASSSqDGTIKLWDLRTGKCV 170
|
170 180 190
....*....|....*....|....*....|....*..
gi 356991206 696 --FQSkESSSVLSCDISRNNKYIVTGSGDKKATVYEV 730
Cdd:cd00200 171 atLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 500-730 | 1.09e-17 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 86.12 E-value: 1.09e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 500 LLPDGQSLIVGGEASTLSIWDLAAPTPRIKAELTSSAPAcyALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTD 579
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVA--SLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 580 GASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPEK-YQLR 657
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTlTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLlRTLT 159
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 356991206 658 LHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISRNNKYIVTGSGDKKATVYEV 730
Cdd:COG2319 160 GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDL 233
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 570-730 | 1.49e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 83.92 E-value: 1.49e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 570 MVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREG---RQLQQHdfSSQIFSLGHCPNQDWLAVGMESSHVEVL 646
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 647 HVRKPEK-YQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISRNNKYIVTGSGDK 723
Cdd:cd00200 79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 356991206 724 KATVYEV 730
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 438-567 | 9.33e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 70.71 E-value: 9.33e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 438 RQLHTLA-HGEVVCAVTISSSTQHVYTGGKGC-VKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 515
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 356991206 516 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQN 567
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 235-424 | 3.46e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 3.46e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 235 PSEPPSPVT--TPCGKAPLCIPARRDLTDSPASLASSLGSPLPRSKDIALNDLPTGTPASRSCGTSPPQDSSTPGPSSAS 312
Cdd:PHA03247 2739 PAPPAVPAGpaTPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 313 HLCQLAAQPAAPTDSIALRSPLTLSSPFTSSFSLGSHSTLNGDLSMPGSyvglhlspqvsssvvyGRSPLMAFESHPHLR 392
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP----------------SRSPAAKPAAPARPP 2882
|
170 180 190
....*....|....*....|....*....|..
gi 356991206 393 GSSVSLPgiPVAKPAYSFHVSADGQmQPVPFP 424
Cdd:PHA03247 2883 VRRLARP--AVSRSTESFALPPDQP-ERPPQP 2911
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 571-606 | 7.07e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 46.15 E-value: 7.07e-07
10 20 30
....*....|....*....|....*....|....*.
gi 356991206 571 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 606
Cdd:smart00320 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 571-606 | 9.94e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 42.72 E-value: 9.94e-06
10 20 30
....*....|....*....|....*....|....*.
gi 356991206 571 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 606
Cdd:pfam00400 4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| YncE | COG3391 | DNA-binding beta-propeller fold protein YncE [General function prediction only]; | 500-607 | 1.21e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 41.22 E-value: 1.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 500 LLPDGQSLIVGGEAS-TLSIWDLAapTPRIKAEL-TSSAPacYALAVSPDAKVCFSCCSDGN-----IVVWDLQNQAMVR 572
Cdd:COG3391 117 VDPDGGRLYVADSGNgRVSVIDTA--TGKVVATIpVGAGP--HGIAVDPDGKRLYVANSGSNtvsviVSVIDTATGKVVA 192
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 356991206 573 QFQGHtDGASCIDISDYGTRLW--------TGGLDNTVRCWDL 607
Cdd:COG3391 193 TIPVG-GGPVGVAVSPDGRRLYvanrgsntSNGGSNTVSVIDL 234
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 525-564 | 1.66e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 1.66e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 356991206 525 TPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWD 564
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 188-407 | 3.26e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 40.92 E-value: 3.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 188 ESLVEEDHPSSRGGSGKQQRAEDkDLSGP----YDSEEDKSDyNLVVDEDQPSEPPSPVTTPCGKAPLCIPARRDLTDSP 263
Cdd:PHA03307 12 EAAAEGGEFFPRPPATPGDAADD-LLSGSqgqlVSDSAELAA-VTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 264 ASLAS-SLGSPLPRSKDIALNDLPTGTPASR----SCGTSPPQDSSTPGPSSASHLCQLAAQPAAPTDSIAlRSPLTLSS 338
Cdd:PHA03307 90 WSLSTlAPASPAREGSPTPPGPSSPDPPPPTpppaSPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPA-AVASDAAS 168
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 356991206 339 PFTSSFSlgshstlngdLSMPGSYVglHLSPQVSSSVVyGRSPLMAFESHPHLRGSSVSLP-GIPVAKPA 407
Cdd:PHA03307 169 SRQAALP----------LSSPEETA--RAPSSPPAEPP-PSTPPAAASPRPPRRSSPISASaSSPAPAPG 225
|
|
|