|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
789-1187 |
1.61e-14 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 79.60 E-value: 1.61e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 789 PRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSEL 868
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 869 LQ------EKRLQEARAREATRGpvvLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAakpgTSGSwq 942
Cdd:PHA03247 2696 TSladpppPPPTPEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT----TAGP-- 2766
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 943 EAGTSAKDKRLSTMQALPLAPVFSEAEgTAPAASQAPALGPGQISVSCPESGLGQSQAPAAsrkqGLPEAPPFLPAAPSP 1022
Cdd:PHA03247 2767 PAPAPPAAPAAGPPRRLTRPAVASLSE-SRESLPSPWDPADPPAAVLAPAAALPPAASPAG----PLPPPTSAQPTAPPP 2841
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1023 TPLPVQPlSLTHIGGphVATSVPlpVTWVLTAQGLLPVPV----PAVVSLPRPAGTPGPAgLLATLLPPLTETRAAQGPR 1098
Cdd:PHA03247 2842 PPGPPPP-SLPLGGS--VAPGGD--VRRRPPSRSPAAKPAaparPPVRRLARPAVSRSTE-SFALPPDQPERPPQPQAPP 2915
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1099 APALSSSWQPPANMNREPEPSCRTDTPAPPThALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1178
Cdd:PHA03247 2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT-TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994
|
....*....
gi 2022781840 1179 PAFGGVIPA 1187
Cdd:PHA03247 2995 PLTGHSLSR 3003
|
|
| SANT |
smart00717 |
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains; |
401-448 |
4.37e-14 |
|
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 67.63 E-value: 4.37e-14
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2022781840 401 KGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLH 448
Cdd:smart00717 1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
|
|
| SANT |
cd00167 |
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ... |
404-447 |
1.13e-13 |
|
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Pssm-ID: 238096 [Multi-domain] Cd Length: 45 Bit Score: 66.44 E-value: 1.13e-13
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 2022781840 404 WAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:cd00167 2 WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45
|
|
| Myb_DNA-binding |
pfam00249 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
401-447 |
2.64e-12 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 459731 [Multi-domain] Cd Length: 46 Bit Score: 62.52 E-value: 2.64e-12
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 2022781840 401 KGYWAPEEDAKLLQAVAKYGEqDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:pfam00249 1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPGRTDNQCKNRWQNYL 46
|
|
| PLN03091 |
PLN03091 |
hypothetical protein; Provisional |
399-502 |
2.21e-08 |
|
hypothetical protein; Provisional
Pssm-ID: 215570 [Multi-domain] Cd Length: 459 Bit Score: 58.45 E-value: 2.21e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 399 LKKGYWAPEEDAKLLQAVAKYGEQDWfkirEEVPGRSDAQ-----CRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGv 473
Cdd:PLN03091 12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
|
90 100
....*....|....*....|....*....
gi 2022781840 474 GHWAKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03091 87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK 115
|
|
| Myb_DNA-bind_6 |
pfam13921 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
297-357 |
2.51e-08 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 372817 [Multi-domain] Cd Length: 60 Bit Score: 51.54 E-value: 2.51e-08
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022781840 297 WSREEEERLQAIAAAHGhLEWQKIAEELGtSRSAFQCLQKFQQHNKA-LKRKEWTEEEDRML 357
Cdd:pfam13921 1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKLNPkISRGPWSKEEDQRL 60
|
|
| SANT |
smart00717 |
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains; |
294-342 |
2.86e-08 |
|
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 51.07 E-value: 2.86e-08
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 2022781840 294 KQEWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQHNK 342
Cdd:smart00717 1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPG-RTAEQCRERWRNLLK 48
|
|
| SANT |
smart00717 |
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains; |
346-397 |
1.51e-07 |
|
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 49.14 E-value: 1.51e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 2022781840 346 RKEWTEEEDRMLTQLVQEMRVGShipYRRIVYYMEGRDSMQLIYRWTKSLDP 397
Cdd:smart00717 1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
|
|
| SANT_CDC5_II |
cd11659 |
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ... |
290-339 |
1.77e-07 |
|
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.
Pssm-ID: 212557 [Multi-domain] Cd Length: 53 Bit Score: 49.23 E-value: 1.77e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 2022781840 290 PSINKQEWSREEEERLQAIaAAHGHLEWQKIAEELGtsRSAFQCLQKFQQ 339
Cdd:cd11659 1 PSIKKTEWTREEDEKLLHL-AKLLPTQWRTIAPIVG--RTAQQCLERYNK 47
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
808-1233 |
2.78e-07 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 54.97 E-value: 2.78e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 808 SSAQSTPGHLFPNVPAQ-EASKSASHKGSRRLASSRVER----TLPQASLLASTGPrPKPKTVSELLQEKRLQEAR--AR 880
Cdd:pfam17823 11 FSLPLSESHAAPADPRHfVLNKMWNGAGKQNASGDAVPRadnkSSEQ*NFCAATAA-PAPVTLTKGTSAAHLNSTEvtAE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 881 EATRG-----PVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAAKPGTSGSwqeAGTSAKDKRLS 954
Cdd:pfam17823 90 HTPHGtdlsePATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSeAFSAPRAAACRANAS---AAPRAAIAAAS 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 955 TMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1034
Cdd:pfam17823 167 APHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVG 246
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1035 IGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETraaQGPRAPAlsSSWQPPANMNR 1114
Cdd:pfam17823 247 TVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQA---QGPIIQV--STDQPVHNTAG 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1115 EPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSS----HADPPEAEPpwSGRLPAFG----GVIP 1186
Cdd:pfam17823 322 EPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMipevEATSPTTQP--SPLLPTQGaagpGILL 399
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|..
gi 2022781840 1187 ATEPRG---TPGSPSGTQEPRGPLGLEKLPLR--QPGPEKGALDLEKPPLPQ 1233
Cdd:pfam17823 400 APEQVAteaTAGTASAGPTPRSSGDPKTLAMAscQLSTQGQYLVVTTDPLTP 451
|
|
| REB1 |
COG5147 |
Myb superfamily proteins, including transcription factors and mRNA splicing factors ... |
256-357 |
2.90e-07 |
|
Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];
Pssm-ID: 227476 [Multi-domain] Cd Length: 512 Bit Score: 54.79 E-value: 2.90e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 256 NRLDShDWEKISNINFE------GSRSAEEIRKFWQNSEHPSINKQEWSREEEERLQAIAAAHGHLeWQKIAEELGtSRS 329
Cdd:COG5147 29 EDLKA-LVKKLGPNNWSkvasllISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGTQ-WSTIADYKD-RRT 105
|
90 100
....*....|....*....|....*...
gi 2022781840 330 AFQCLQKFQQHNKALKRKEWTEEEDRML 357
Cdd:COG5147 106 AQQCVERYVNTLEDLSSTHDSKLQRRNE 133
|
|
| REB1 |
COG5147 |
Myb superfamily proteins, including transcription factors and mRNA splicing factors ... |
334-453 |
5.02e-07 |
|
Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];
Pssm-ID: 227476 [Multi-domain] Cd Length: 512 Bit Score: 54.02 E-value: 5.02e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 334 LQKFQQHNKALKRKE--WTEEEDRMLTQLVQEM------RVGSHIPYRrivyyMEGRDSMqliyRWTKSLDPGLKKGYWA 405
Cdd:COG5147 6 NKELQIKLMQTKRKGgsWKRTEDEDLKALVKKLgpnnwsKVASLLISS-----TGKQSSN----RWNNHLNPQLKKKNWS 76
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 2022781840 406 PEEDAKLLQAVAKYGEQdWFKIREEVPGRSDAQCRDRYLRRLHFSLKK 453
Cdd:COG5147 77 EEEDEQLIDLDKELGTQ-WSTIADYKDRRTAQQCVERYVNTLEDLSST 123
|
|
| Myb_DNA-bind_6 |
pfam13921 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
349-412 |
3.58e-06 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 372817 [Multi-domain] Cd Length: 60 Bit Score: 45.76 E-value: 3.58e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022781840 349 WTEEEDRMLTQLVQEMrvgsHIPYRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAKL 412
Cdd:pfam13921 1 WTEEEDEKLLKLVEKY----GNDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
|
|
| SANT |
cd00167 |
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ... |
470-496 |
4.23e-06 |
|
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Pssm-ID: 238096 [Multi-domain] Cd Length: 45 Bit Score: 44.87 E-value: 4.23e-06
10 20
....*....|....*....|....*..
gi 2022781840 470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:cd00167 16 KYGKNNWEKIAKELPGRTPKQCRERWR 42
|
|
| SANT |
smart00717 |
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains; |
470-496 |
5.99e-06 |
|
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 44.52 E-value: 5.99e-06
|
| REB1 |
COG5147 |
Myb superfamily proteins, including transcription factors and mRNA splicing factors ... |
399-495 |
4.50e-05 |
|
Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];
Pssm-ID: 227476 [Multi-domain] Cd Length: 512 Bit Score: 47.86 E-value: 4.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 399 LKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGVgHWAK 478
Cdd:COG5147 18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLLISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGT-QWST 96
|
90
....*....|....*..
gi 2022781840 479 IASELPHRSGSQCLSKW 495
Cdd:COG5147 97 IADYKDRRTAQQCVERY 113
|
|
| Myb_DNA-bind_6 |
pfam13921 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
262-305 |
1.40e-04 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 372817 [Multi-domain] Cd Length: 60 Bit Score: 41.14 E-value: 1.40e-04
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 2022781840 262 DWEKISNInfEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERL 305
Cdd:pfam13921 19 DWKQIAKE--LGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
193-359 |
1.91e-03 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 43.03 E-value: 1.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 193 RKSVVSDRLQRL---LQPKLLKLEYLHQKQSKVSSELERQALEKQGREAEKEIQdiNQLPEEALLGNRLD-SHDWEKISN 268
Cdd:TIGR00618 220 RKQVLEKELKHLreaLQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELR--AQEAVLEETQERINrARKAAPLAA 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 269 InfegSRSAEEIRKFWQNSeHPSINKQEWSReEEERLQAIAAAHGHLEWQKIAEELGTSRSafQCLQKFQQHNKALKRKE 348
Cdd:TIGR00618 298 H----IKAVTQIEQQAQRI-HTELQSKMRSR-AKLLMKRAAHVKQQSSIEEQRRLLQTLHS--QEIHIRDAHEVATSIRE 369
|
170
....*....|....*
gi 2022781840 349 ----WTEEEDRMLTQ 359
Cdd:TIGR00618 370 iscqQHTLTQHIHTL 384
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
789-1187 |
1.61e-14 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 79.60 E-value: 1.61e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 789 PRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSEL 868
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 869 LQ------EKRLQEARAREATRGpvvLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAakpgTSGSwq 942
Cdd:PHA03247 2696 TSladpppPPPTPEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT----TAGP-- 2766
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 943 EAGTSAKDKRLSTMQALPLAPVFSEAEgTAPAASQAPALGPGQISVSCPESGLGQSQAPAAsrkqGLPEAPPFLPAAPSP 1022
Cdd:PHA03247 2767 PAPAPPAAPAAGPPRRLTRPAVASLSE-SRESLPSPWDPADPPAAVLAPAAALPPAASPAG----PLPPPTSAQPTAPPP 2841
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1023 TPLPVQPlSLTHIGGphVATSVPlpVTWVLTAQGLLPVPV----PAVVSLPRPAGTPGPAgLLATLLPPLTETRAAQGPR 1098
Cdd:PHA03247 2842 PPGPPPP-SLPLGGS--VAPGGD--VRRRPPSRSPAAKPAaparPPVRRLARPAVSRSTE-SFALPPDQPERPPQPQAPP 2915
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1099 APALSSSWQPPANMNREPEPSCRTDTPAPPThALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1178
Cdd:PHA03247 2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT-TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994
|
....*....
gi 2022781840 1179 PAFGGVIPA 1187
Cdd:PHA03247 2995 PLTGHSLSR 3003
|
|
| SANT |
smart00717 |
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains; |
401-448 |
4.37e-14 |
|
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 67.63 E-value: 4.37e-14
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2022781840 401 KGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLH 448
Cdd:smart00717 1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
|
|
| SANT |
cd00167 |
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ... |
404-447 |
1.13e-13 |
|
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Pssm-ID: 238096 [Multi-domain] Cd Length: 45 Bit Score: 66.44 E-value: 1.13e-13
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 2022781840 404 WAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:cd00167 2 WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
784-1204 |
2.97e-13 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 75.36 E-value: 2.97e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 784 RKALPPRLPQAGARD--PPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPK 861
Cdd:PHA03247 2572 RPAPRPSEPAVTSRArrPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPER 2651
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 862 PKTVSELLQEKRLQEAR-------AREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAA 933
Cdd:PHA03247 2652 PRDDPAPGRVSRPRRARrlgraaqASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPpGPAAARQA 2731
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 934 KPGTSGSWQEAGTSAKdkrlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLgqSQAPAASRKQGLPEAP 1013
Cdd:PHA03247 2732 SPALPAAPAPPAVPAG----------PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL--TRPAVASLSESRESLP 2799
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1014 pfLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSlprPAGTPGPAGLLATLLPPLTETRA 1093
Cdd:PHA03247 2800 --SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP---LGGSVAPGGDVRRRPPSRSPAAK 2874
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1094 AQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEAdgsvafvPGEAQVAREIPEPRTSSHADPPEAEPP 1173
Cdd:PHA03247 2875 PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQ-------PQPPPPPQPQPPPPPPPRPQPPLAPTT 2947
|
410 420 430
....*....|....*....|....*....|.
gi 2022781840 1174 WSGRLPAFGGVIPAtePRGTPGSPSGTQEPR 1204
Cdd:PHA03247 2948 DPAGAGEPSGAVPQ--PWLGALVPGRVAVPR 2976
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
784-1279 |
7.00e-13 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 74.20 E-value: 7.00e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 784 RKALPPRLPQA--GARDPPvHLLQASSSAQSTPGHLFPNVPAQEASKSASH-------KGSRRLASSRV---ERTLPQAS 851
Cdd:PHA03247 2481 RRPAEARFPFAagAAPDPG-GGGPPDPDAPPAPSRLAPAILPDEPVGEPVHprmltwiRGLEELASDDAgdpPPPLPPAA 2559
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 852 LLASTG-----PRPKPKTVSELLQEKRLQEARAREATRG--PVVLPSQLLVSSSVILQPPLPHTPHGrPAPGPTVLNVPL 924
Cdd:PHA03247 2560 PPAAPDrsvppPRPAPRPSEPAVTSRARRPDAPPQSARPraPVDDRGDPRGPAPPSPLPPDTHAPDP-PPPSPSPAANEP 2638
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 925 SGPGaPAAAKPGTSGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASqaPALGPGQISVSCPESGLGQSQAP-AA 1003
Cdd:PHA03247 2639 DPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAAR--PTVGSLTSLADPPPPPPTPEPAPhAL 2715
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1004 SRKQGLPEAP-------PFLPAAPSPTPLPVQPLSlthIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVvSLPRPAGTP- 1075
Cdd:PHA03247 2716 VSATPLPPGPaaarqasPALPAAPAPPAVPAGPAT---PGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASl 2791
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1076 GPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANmnrEPEPSCRTDTPAPPTHALSQSPAEADGSVAfvPGeAQVARE 1155
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP---LPPPTSAQPTAPPPPPGPPPPSLPLGGSVA--PG-GDVRRR 2865
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1156 IP------------EPRTSSHADPPEAEPPWSGRLPAFGgviPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGA 1223
Cdd:PHA03247 2866 PPsrspaakpaapaRPPVRRLARPAVSRSTESFALPPDQ---PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*.
gi 2022781840 1224 LDLEKPPLPQPGPEKGALDlgllsqegeaatqQWLGGQRGVRVPLLGSRLPYQPPA 1279
Cdd:PHA03247 2943 LAPTTDPAGAGEPSGAVPQ-------------PWLGALVPGRVAVPRFRVPQPAPS 2985
|
|
| Myb_DNA-binding |
pfam00249 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
401-447 |
2.64e-12 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 459731 [Multi-domain] Cd Length: 46 Bit Score: 62.52 E-value: 2.64e-12
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 2022781840 401 KGYWAPEEDAKLLQAVAKYGEqDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:pfam00249 1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPGRTDNQCKNRWQNYL 46
|
|
| Myb_DNA-bind_6 |
pfam13921 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
404-457 |
3.44e-11 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 372817 [Multi-domain] Cd Length: 60 Bit Score: 59.63 E-value: 3.44e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 2022781840 404 WAPEEDAKLLQAVAKYGeQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWN 457
Cdd:pfam13921 1 WTEEEDEKLLKLVEKYG-NDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWS 53
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
873-1239 |
3.81e-10 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 64.70 E-value: 3.81e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 873 RLQEARAREATRGPVVL----PSQLLVSSSVILQPPLPHTPHgRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSA 948
Cdd:PHA03378 437 RTEQPRATPHSQAPTVVlhrpPTQPLEGPTGPLSVQAPLEPW-QPLPHPQVTPVILHQPPAQGVQAHGSMLDLLEKDDED 515
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 949 KDKRLSTMQALPLAP----------VFSE---AEGTAPAASQA------PALGPGQISV-------------SCPESGLG 996
Cdd:PHA03378 516 MEQRVMATLLPPSPPqpragrrapcVYTEdldIESDEPASTEPvhdqllPAPGLGPLQIqpltspttsqlasSAPSYAQT 595
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 997 QSQAPAASRKQGLPEAPPFLPA--APSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGT 1074
Cdd:PHA03378 596 PWPVPHPSQTPEPPTTQSHIPEtsAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQ 675
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1075 PGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSV---AFVPGEAQ 1151
Cdd:PHA03378 676 PSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRArppAAAPGRAR 755
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1152 VAREIPEPRTSSHADPPEAEPpwsgRLPAFGGVIPATEPRgtpGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPL 1231
Cdd:PHA03378 756 PPAAAPGRARPPAAAPGAPTP----QPPPQAPPAPQQRPR---GAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQL 828
|
....*...
gi 2022781840 1232 PQPGPEKG 1239
Cdd:PHA03378 829 LTGGVKRG 836
|
|
| PLN03091 |
PLN03091 |
hypothetical protein; Provisional |
399-502 |
2.21e-08 |
|
hypothetical protein; Provisional
Pssm-ID: 215570 [Multi-domain] Cd Length: 459 Bit Score: 58.45 E-value: 2.21e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 399 LKKGYWAPEEDAKLLQAVAKYGEQDWfkirEEVPGRSDAQ-----CRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGv 473
Cdd:PLN03091 12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
|
90 100
....*....|....*....|....*....
gi 2022781840 474 GHWAKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03091 87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK 115
|
|
| Myb_DNA-bind_6 |
pfam13921 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
297-357 |
2.51e-08 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 372817 [Multi-domain] Cd Length: 60 Bit Score: 51.54 E-value: 2.51e-08
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022781840 297 WSREEEERLQAIAAAHGhLEWQKIAEELGtSRSAFQCLQKFQQHNKA-LKRKEWTEEEDRML 357
Cdd:pfam13921 1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKLNPkISRGPWSKEEDQRL 60
|
|
| SANT |
smart00717 |
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains; |
294-342 |
2.86e-08 |
|
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 51.07 E-value: 2.86e-08
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 2022781840 294 KQEWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQHNK 342
Cdd:smart00717 1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPG-RTAEQCRERWRNLLK 48
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
786-1172 |
4.63e-08 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 57.69 E-value: 4.63e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 786 ALPPRLPQAGARDPPVhlLQASSSAQSTPghlfPNVPAQEASKSASHKGSRRLASSrvertlPQASLLASTGPRPKPKTV 865
Cdd:PRK07764 380 RLERRLGVAGGAGAPA--AAAPSAAAAAP----AAAPAPAAAAPAAAAAPAPAAAP------QPAPAPAPAPAPPSPAGN 447
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 866 SELLQEKRLQEARAREATRGPVVLPSQllvssSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPgtSGSWQEAG 945
Cdd:PRK07764 448 APAGGAPSPPPAAAPSAQPAPAPAAAP-----EPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATL--RERWPEIL 520
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 946 TSAKDKRLSTMQAL---------------------PLAPVFSEAE-----------------------GTAPAASQAPAL 981
Cdd:PRK07764 521 AAVPKRSRKTWAILlpeatvlgvrgdtlvlgfstgGLARRFASPGnaevlvtalaeelggdwqveavvGPAPGAAGGEGP 600
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 982 GPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPV- 1060
Cdd:PRK07764 601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAa 680
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1061 PVPAVVSLPRPAGTPGPAGLLATLLPPlteTRAAQGPRAPAlsSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEAD 1140
Cdd:PRK07764 681 PPPAPAPAAPAAPAGAAPAQPAPAPAA---TPPAGQADDPA--AQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAP 755
|
410 420 430
....*....|....*....|....*....|..
gi 2022781840 1141 GSVAFVPGEAQVAREIPEPRTSSHADPPEAEP 1172
Cdd:PRK07764 756 AQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAE 787
|
|
| SANT |
smart00717 |
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains; |
346-397 |
1.51e-07 |
|
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 49.14 E-value: 1.51e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 2022781840 346 RKEWTEEEDRMLTQLVQEMRVGShipYRRIVYYMEGRDSMQLIYRWTKSLDP 397
Cdd:smart00717 1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
|
|
| SANT_CDC5_II |
cd11659 |
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ... |
290-339 |
1.77e-07 |
|
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.
Pssm-ID: 212557 [Multi-domain] Cd Length: 53 Bit Score: 49.23 E-value: 1.77e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 2022781840 290 PSINKQEWSREEEERLQAIaAAHGHLEWQKIAEELGtsRSAFQCLQKFQQ 339
Cdd:cd11659 1 PSIKKTEWTREEDEKLLHL-AKLLPTQWRTIAPIVG--RTAQQCLERYNK 47
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
792-1199 |
2.66e-07 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 55.56 E-value: 2.66e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 792 PQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRvERTLPQASLLASTGPRPKPKTVSELLQE 871
Cdd:PHA03307 24 PPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPG-PGTEAPANESRSTPTWSLSTLAPASPAR 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 872 KRLQEARAREATRGPVVLPSQLLVSSSVilQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDK 951
Cdd:PHA03307 103 EGSPTPPGPSSPDPPPPTPPPASPPPSP--APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE 180
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 952 RLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQA---------PAASRKQGLPEAPPFLPAAPSP 1022
Cdd:PHA03307 181 ETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAAddagasssdSSSSESSGCGWGPENECPLPRP 260
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1023 TPLPVQPLSLTHIgGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGP-RAPA 1101
Cdd:PHA03307 261 APITLPTRIWEAS-GWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSEsSRGA 339
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1102 LSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRLPAF 1181
Cdd:PHA03307 340 AVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDA 419
|
410
....*....|....*...
gi 2022781840 1182 GGVIPATEPRGTPGSPSG 1199
Cdd:PHA03307 420 GAASGAFYARYPLLTPSG 437
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
808-1233 |
2.78e-07 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 54.97 E-value: 2.78e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 808 SSAQSTPGHLFPNVPAQ-EASKSASHKGSRRLASSRVER----TLPQASLLASTGPrPKPKTVSELLQEKRLQEAR--AR 880
Cdd:pfam17823 11 FSLPLSESHAAPADPRHfVLNKMWNGAGKQNASGDAVPRadnkSSEQ*NFCAATAA-PAPVTLTKGTSAAHLNSTEvtAE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 881 EATRG-----PVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAAKPGTSGSwqeAGTSAKDKRLS 954
Cdd:pfam17823 90 HTPHGtdlsePATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSeAFSAPRAAACRANAS---AAPRAAIAAAS 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 955 TMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1034
Cdd:pfam17823 167 APHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVG 246
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1035 IGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETraaQGPRAPAlsSSWQPPANMNR 1114
Cdd:pfam17823 247 TVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQA---QGPIIQV--STDQPVHNTAG 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1115 EPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSS----HADPPEAEPpwSGRLPAFG----GVIP 1186
Cdd:pfam17823 322 EPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMipevEATSPTTQP--SPLLPTQGaagpGILL 399
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|..
gi 2022781840 1187 ATEPRG---TPGSPSGTQEPRGPLGLEKLPLR--QPGPEKGALDLEKPPLPQ 1233
Cdd:pfam17823 400 APEQVAteaTAGTASAGPTPRSSGDPKTLAMAscQLSTQGQYLVVTTDPLTP 451
|
|
| REB1 |
COG5147 |
Myb superfamily proteins, including transcription factors and mRNA splicing factors ... |
256-357 |
2.90e-07 |
|
Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];
Pssm-ID: 227476 [Multi-domain] Cd Length: 512 Bit Score: 54.79 E-value: 2.90e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 256 NRLDShDWEKISNINFE------GSRSAEEIRKFWQNSEHPSINKQEWSREEEERLQAIAAAHGHLeWQKIAEELGtSRS 329
Cdd:COG5147 29 EDLKA-LVKKLGPNNWSkvasllISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGTQ-WSTIADYKD-RRT 105
|
90 100
....*....|....*....|....*...
gi 2022781840 330 AFQCLQKFQQHNKALKRKEWTEEEDRML 357
Cdd:COG5147 106 AQQCVERYVNTLEDLSSTHDSKLQRRNE 133
|
|
| SANT |
cd00167 |
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ... |
296-340 |
3.08e-07 |
|
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Pssm-ID: 238096 [Multi-domain] Cd Length: 45 Bit Score: 47.96 E-value: 3.08e-07
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 2022781840 296 EWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQH 340
Cdd:cd00167 1 PWTEEEDELLLEAVKKYGKNNWEKIAKELPG-RTPKQCRERWRNL 44
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
848-1220 |
3.43e-07 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 55.16 E-value: 3.43e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 848 PQASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLpsqllVSSSVILQP---PLPHTP-HGRPAPGPTVLNVP 923
Cdd:pfam03154 189 PGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTL-----IQQTPTLHPqrlPSPHPPlQPMTQPPPPSQVSP 263
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 924 LSGPGAPAAAKPGTSGSWQEAGTSAKDKRLSTmQALPLAPVFSEAEGTAPAASQAPalgpgqisvscpesglGQSQApaa 1003
Cdd:pfam03154 264 QPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPP-QPFPLTPQSSQSQVPPGPSPAAP----------------GQSQQ--- 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1004 srkqgLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQG-LLPVPVPAVVSLPRPAGTPGPAGLLA 1082
Cdd:pfam03154 324 -----RIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQShKHPPHLSGPSPFQMNSNLPPPPALKP 398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1083 TLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPA-----PPTHALSQSPAEAD-GSVAFVPGEAQVAREI 1156
Cdd:pfam03154 399 LSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPpaashPPTSGLHQVPSQSPfPQHPFVPGGPPPITPP 478
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1157 PEPRTSSHADPPEAEPPWSGRlPAFGGVIPATEPRGTPG------SPSGTQEPRGPlgleKLPLRQPGPE 1220
Cdd:pfam03154 479 SGPPTSTSSAMPGIQPPSSAS-VSSSGPVPAAVSCPLPPvqikeeALDEAEEPESP----PPPPRSPSPE 543
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
902-1236 |
4.58e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 54.61 E-value: 4.58e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 902 QPPLPHTPHGRPAPGPtvlnvplSGPGAPAAAKPGTSGSWQEAGT-SAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPA 980
Cdd:PRK07764 397 AAPSAAAAAPAAAPAP-------AAAAPAAAAAPAPAAAPQPAPApAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPA 469
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 981 LGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAA---------------------------PSPTPLPVQP--LS 1031
Cdd:PRK07764 470 PAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDaatlrerwpeilaavpkrsrktwaillPEATVLGVRGdtLV 549
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1032 LTH--------IGGPHVATSVplpVTwVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALS 1103
Cdd:PRK07764 550 LGFstgglarrFASPGNAEVL---VT-ALAEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPA 625
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1104 SSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEA-----DGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1178
Cdd:PRK07764 626 APAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDasdggDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAP 705
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*...
gi 2022781840 1179 PAFGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGP 1236
Cdd:PRK07764 706 AATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPA 763
|
|
| REB1 |
COG5147 |
Myb superfamily proteins, including transcription factors and mRNA splicing factors ... |
334-453 |
5.02e-07 |
|
Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];
Pssm-ID: 227476 [Multi-domain] Cd Length: 512 Bit Score: 54.02 E-value: 5.02e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 334 LQKFQQHNKALKRKE--WTEEEDRMLTQLVQEM------RVGSHIPYRrivyyMEGRDSMqliyRWTKSLDPGLKKGYWA 405
Cdd:COG5147 6 NKELQIKLMQTKRKGgsWKRTEDEDLKALVKKLgpnnwsKVASLLISS-----TGKQSSN----RWNNHLNPQLKKKNWS 76
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 2022781840 406 PEEDAKLLQAVAKYGEQdWFKIREEVPGRSDAQCRDRYLRRLHFSLKK 453
Cdd:COG5147 77 EEEDEQLIDLDKELGTQ-WSTIADYKDRRTAQQCVERYVNTLEDLSST 123
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
850-1206 |
1.28e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 53.25 E-value: 1.28e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 850 ASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLPSQLLVSSSVIlqPPLPHTPHGRPAPGPTVLNVPLSGPGA 929
Cdd:PHA03307 57 AGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPT--PPGPSSPDPPPPTPPPASPPPSPAPDL 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 930 PAAAKPGTSGSwqeagtsakdKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESG---LGQSQAPAASRK 1006
Cdd:PHA03307 135 SEMLRPVGSPG----------PPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPpaePPPSTPPAAASP 204
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1007 QGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTwVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLP 1086
Cdd:PHA03307 205 RPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGC-GWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPG 283
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1087 PLTETRAAQGPRAPALSSSwqppanmnrepepSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHAD 1166
Cdd:PHA03307 284 PASSSSSPRERSPSPSPSS-------------PGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRS 350
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 2022781840 1167 PPEAEPPwsgrlPAFGGVIPATEPRGTPGSPSGTQEPRGP 1206
Cdd:PHA03307 351 PSPSRPP-----PPADPSSPRKRPRPSRAPSSPAASAGRP 385
|
|
| Myb_DNA-bind_6 |
pfam13921 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
349-412 |
3.58e-06 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 372817 [Multi-domain] Cd Length: 60 Bit Score: 45.76 E-value: 3.58e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022781840 349 WTEEEDRMLTQLVQEMrvgsHIPYRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAKL 412
Cdd:pfam13921 1 WTEEEDEKLLKLVEKY----GNDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
932-1142 |
3.91e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 51.42 E-value: 3.91e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 932 AAKPGTSGSWQEAGTSAKDKRLSTMQAL----PLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQ 1007
Cdd:PRK12323 362 AFRPGQSGGGAGPATAAAAPVAQPAPAAaapaAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASA 441
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1008 GLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGP----AGLLAT 1083
Cdd:PRK12323 442 RGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPaqpdAAPAGW 521
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 2022781840 1084 LLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGS 1142
Cdd:PRK12323 522 VAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
|
|
| SANT |
cd00167 |
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ... |
470-496 |
4.23e-06 |
|
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Pssm-ID: 238096 [Multi-domain] Cd Length: 45 Bit Score: 44.87 E-value: 4.23e-06
10 20
....*....|....*....|....*..
gi 2022781840 470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:cd00167 16 KYGKNNWEKIAKELPGRTPKQCRERWR 42
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
972-1242 |
4.34e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 51.39 E-value: 4.34e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 972 APAASQAPAlGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVAtsvPLPVTWV 1051
Cdd:PRK07003 361 AVTGGGAPG-GGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAA---PAPPATA 436
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1052 LTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSswqppanmnREPEPSCrtdtpAPPTHA 1131
Cdd:PRK07003 437 DRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAA---------FEPAPRA-----AAPSAA 502
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1132 LSQSPAEADGSVAFVPGEAQVAREIPEPRTSShADPPEAEPPWSGrlpafGGVIPATEPRGTPGSPSGTQEPRGPLGLEK 1211
Cdd:PRK07003 503 TPAAVPDARAPAAASREDAPAAAAPPAPEARP-PTPAAAAPAARA-----GGAAAALDVLRNAGMRVSSDRGARAAAAAK 576
|
250 260 270
....*....|....*....|....*....|.
gi 2022781840 1212 LPLRQPGPEKGALDLEKPPLPQPGPEKGALD 1242
Cdd:PRK07003 577 PAAAPAAAPKPAAPRVAVQVPTPRARAATGD 607
|
|
| SANT |
smart00717 |
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains; |
470-496 |
5.99e-06 |
|
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
Pssm-ID: 197842 [Multi-domain] Cd Length: 49 Bit Score: 44.52 E-value: 5.99e-06
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
959-1187 |
8.40e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 50.26 E-value: 8.40e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 959 LPLAPVFSEAEGTAPAASQAPALGPG----QISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1034
Cdd:PRK12323 361 LAFRPGQSGGGAGPATAAAAPVAQPApaaaAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQAS 440
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1035 IGGPhVATSVPLPVtwvltaqgllPVPVPAVVSLPRPAGTPGPAglLATLLPPLTETRAAQGPRAPALSSSWQPPANMNR 1114
Cdd:PRK12323 441 ARGP-GGAPAPAPA----------PAAAPAAAARPAAAGPRPVA--AAAAAAPARAAPAAAPAPADDDPPPWEELPPEFA 507
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022781840 1115 EPEPSCRTDTPAPPTHALSQSPAEADGSVAF-VPGEAQVAREIPEPRTSSHADPPEAEPPWS--GRLPAFGGVIPA 1187
Cdd:PRK12323 508 SPAPAQPDAAPAGWVAESIPDPATADPDDAFeTLAPAPAAAPAPRAAAATEPVVAPRPPRASasGLPDMFDGDWPA 583
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
903-1110 |
9.71e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 50.26 E-value: 9.71e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 903 PPLPHTPHGRPAPG---PTVLNVPLSGPGAPAAAKPGTSGSWQEAGtSAKDKRLSTMQALPLApvfSEAEGTAPAASQAP 979
Cdd:PRK12323 375 ATAAAAPVAQPAPAaaaPAAAAPAPAAPPAAPAAAPAAAAAARAVA-AAPARRSPAPEALAAA---RQASARGPGGAPAP 450
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 980 ALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAqglLP 1059
Cdd:PRK12323 451 APAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAES---IP 527
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 2022781840 1060 VPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAqgPRAPALSSSWQPPA 1110
Cdd:PRK12323 528 DPATADPDDAFETLAPAPAAAPAPRAAAATEPVVA--PRPPRASASGLPDM 576
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
902-1244 |
1.28e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 1.28e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 902 QPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGT---SAKDKRLSTMQALPLAPVFSEaegtaPAASQA 978
Cdd:PHA03247 2414 QPDPPGPPDVRFVGSEEIEELPFVSPGGDVLAGLAADGDPFFARTilgAPFSLSLLLGELFPGAPVYRR-----PAEARF 2488
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 979 P-ALGPGqisvscPESGLGQSQAPAASRKQGLPeAPPFLPAAPSPTPLPVQPLSLTH---------IGGPhvatSVPLPv 1048
Cdd:PHA03247 2489 PfAAGAA------PDPGGGGPPDPDAPPAPSRL-APAILPDEPVGEPVHPRMLTWIRgleelasddAGDP----PPPLP- 2556
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1049 twvltaqgllPVPVPAVV--SLPRPAGTPGPAGLLAtllpplteTRAAQGPRAPALSSSWQPPANmNREPEPSCRTDTPA 1126
Cdd:PHA03247 2557 ----------PAAPPAAPdrSVPPPRPAPRPSEPAV--------TSRARRPDAPPQSARPRAPVD-DRGDPRGPAPPSPL 2617
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1127 PPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRLP--AFGGVIPATEPR------------- 1191
Cdd:PHA03247 2618 PPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLgrAAQASSPPQRPRrraarptvgslts 2697
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*..
gi 2022781840 1192 -GTPGSPSGTQEPRGPLGLEKLPLrQPGPEKGALDLEKPPL---PQPGPEKGALDLG 1244
Cdd:PHA03247 2698 lADPPPPPPTPEPAPHALVSATPL-PPGPAAARQASPALPAapaPPAVPAGPATPGG 2753
|
|
| Myb_DNA-binding |
pfam00249 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
294-340 |
2.42e-05 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 459731 [Multi-domain] Cd Length: 46 Bit Score: 42.88 E-value: 2.42e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 2022781840 294 KQEWSREEEERLQAIAAAHGHlEWQKIAEELGTsRSAFQCLQKFQQH 340
Cdd:pfam00249 1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPG-RTDNQCKNRWQNY 45
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
875-1174 |
2.88e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 48.69 E-value: 2.88e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 875 QEARAREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEaGTSAKDKRLS 954
Cdd:PRK07003 372 VPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGD-DAADGDAPVP 450
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 955 TMQALPLAPVFSEAEGTA-PAASQAPALGPGqisvscpesglgqSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLT 1033
Cdd:PRK07003 451 AKANARASADSRCDERDAqPPADSGSASAPA-------------SDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAAS 517
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1034 HIGGPHVAtSVPLPvtwvltaqgLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMN 1113
Cdd:PRK07003 518 REDAPAAA-APPAP---------EARPPTPAAAAPAARAGGAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKP 587
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022781840 1114 REPEPSCRTDTPAPPTHALSQSPAEAdgsvafvpgeaqvareipePRTSSHADPPEAEPPW 1174
Cdd:PRK07003 588 AAPRVAVQVPTPRARAATGDAPPNGA-------------------ARAEQAAESRGAPPPW 629
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
903-1188 |
3.57e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 48.37 E-value: 3.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 903 PPLPHTPHGRPAP---GPTVLNVPLSGP---GAPAAAKPGT-SGSWQEAGTSAKDKRLSTmqalplaPVFSEAEGTAPAA 975
Cdd:pfam05109 449 PSSTHVPTNLTAPastGPTVSTADVTSPtpaGTTSGASPVTpSPSPRDNGTESKAPDMTS-------PTSAVTTPTPNAT 521
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 976 SQAPALGPGQISVSCPESGlgqSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQP-LSLThigGPHVATSVPLPVTWVLTA 1054
Cdd:pfam05109 522 SPTPAVTTPTPNATSPTLG---KTSPTSAVTTPTPNATSPTPAVTTPTPNATIPtLGKT---SPTSAVTTPTPNATSPTV 595
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1055 QGLLP------------VPVPAVVSLPRPAGTPGPAGLLATLLppltETRAAQGPRAPALSSSWQPPANMNR-------- 1114
Cdd:pfam05109 596 GETSPqanttnhtlggtSSTPVVTSPPKNATSAVTTGQHNITS----SSTSSMSLRPSSISETLSPSTSDNStshmpllt 671
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1115 EPEPS-----------------CRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPePRTSSHADPPEAEPPWSGR 1177
Cdd:pfam05109 672 SAHPTggenitqvtpaststhhVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTP-PKNATSPQAPSGQKTAVPT 750
|
330
....*....|.
gi 2022781840 1178 LPAFGGVIPAT 1188
Cdd:pfam05109 751 VTSTGGKANST 761
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1036-1274 |
3.63e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 48.33 E-value: 3.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1036 GGPHVATSVPLPVTWVLtaqgllPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNRE 1115
Cdd:PRK12323 370 GGAGPATAAAAPVAQPA------PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARG 443
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1116 PEPSCRTDTPAPPTHALSQSPAEADgsvafvpgeaqvareiPEPRTSSHADPPEAEPPWSGRLPAFGGVIPATEPRGTPG 1195
Cdd:PRK12323 444 PGGAPAPAPAPAAAPAAAARPAAAG----------------PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFA 507
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1196 SPSGTQEPRGPLGLEKLPLRQPG---PEKGALDLEKPPLPQPGPEKGALDLGLLSQEGEAATQQWLGGQRGVRVPLLGSR 1272
Cdd:PRK12323 508 SPAPAQPDAAPAGWVAESIPDPAtadPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAAR 587
|
..
gi 2022781840 1273 LP 1274
Cdd:PRK12323 588 LP 589
|
|
| REB1 |
COG5147 |
Myb superfamily proteins, including transcription factors and mRNA splicing factors ... |
399-495 |
4.50e-05 |
|
Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];
Pssm-ID: 227476 [Multi-domain] Cd Length: 512 Bit Score: 47.86 E-value: 4.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 399 LKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGVgHWAK 478
Cdd:COG5147 18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLLISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGT-QWST 96
|
90
....*....|....*..
gi 2022781840 479 IASELPHRSGSQCLSKW 495
Cdd:COG5147 97 IADYKDRRTAQQCVERY 113
|
|
| SANT_CDC5_II |
cd11659 |
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ... |
397-443 |
5.86e-05 |
|
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.
Pssm-ID: 212557 [Multi-domain] Cd Length: 53 Bit Score: 41.91 E-value: 5.86e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 2022781840 397 PGLKKGYWAPEEDAKLLQAVAKYGEQdWFKIREEVpGRSDAQCRDRY 443
Cdd:cd11659 1 PSIKKTEWTREEDEKLLHLAKLLPTQ-WRTIAPIV-GRTAQQCLERY 45
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
910-1128 |
6.90e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 47.67 E-value: 6.90e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 910 HGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDKRlstmqalplAPVFSEAEGTAPAASQAPALGPGQISVS 989
Cdd:PRK07764 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGA---------AAAPAEASAAPAPGVAAPEHHPKHVAVP 660
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 990 CPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLP 1069
Cdd:PRK07764 661 DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 2022781840 1070 RPaGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPAnmnrEPEPSCRTDTPAPP 1128
Cdd:PRK07764 741 LP-PEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS----EEEEMAEDDAPSMD 794
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
846-1181 |
8.35e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 47.37 E-value: 8.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 846 TLPQASLLASTGP----RPKPKTVSELLQEKRLQEARAREAT---RGPVVL---PSQLLVSSSVILQPPLPHTPHGRPAP 915
Cdd:PHA03378 578 TSPTTSQLASSAPsyaqTPWPVPHPSQTPEPPTTQSHIPETSaprQWPMPLrpiPMRPLRMQPITFNVLVFPTPHQPPQV 657
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 916 GPTVLNV----PLSGPGAPAAAKPGTSGSWQEAGTsakdkrlsTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCP 991
Cdd:PHA03378 658 EITPYKPtwtqIGHIPYQPSPTGANTMLPIQWAPG--------TMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAA 729
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 992 ESGLGQSQAPAASRKQGlPEAPPFLPAAPSPTPLPVQPLSlthiGGPHVATSVPLPVTWVLTAQ----GLLPVPVPAV-- 1065
Cdd:PHA03378 730 APGRARPPAAAPGRARP-PAAAPGRARPPAAAPGRARPPA----AAPGAPTPQPPPQAPPAPQQrprgAPTPQPPPQAgp 804
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1066 ----VSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSwQPPANMNREPEPSCRTDT-PAPPTHALSQSPAEAD 1140
Cdd:PHA03378 805 tsmqLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALER-QAAAGPTPSPGSGTSDKIvQAPVFYPPVLQPIQVM 883
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022781840 1141 GSVAFV---------------PGEAQVA-----REIPEPRTSSHADPPEAEPPWSGRLPAF 1181
Cdd:PHA03378 884 RQLGSVraaaastvtqapteyTGERRGVgpmhpTDIPPSKRAKTDAYVESQPPHGGQSHSF 944
|
|
| SANT_TRF |
cd11660 |
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human ... |
404-443 |
1.33e-04 |
|
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human telomere repeat binding factors, TRF1 and TRF2, function as part of the 6 component shelterin complex. TRF2 binds DNA and recruits RAP1 (via binding to the RAP1 protein c-terminal (RCT)) and TIN2 in the protection of telomeres from DNA repair machinery. Metazoan shelterin consists of 3 DNA binding proteins (TRF2, TRF1, and POT1) and 3 recruited proteins that bind to one or more of these DNA-binding proteins (RAP1, TIN2, TPP1). Schizosaccharomyces pombe TAZ1 is an orthlog and binds RAP1. Human TRF1 and TRF2 bind double-stranded DNA. hTRF2 consists of a basic N-terminus, a TRF homology domain, the RAP1 binding motif (RBM), the TIN2 binding motif (TBM) and a myb-like DNA binding domain, SANT, named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. The single myb-like domain of TRF-type proteins is similar to the tandem myb_like domains found in yeast RAP1.
Pssm-ID: 212558 [Multi-domain] Cd Length: 50 Bit Score: 41.01 E-value: 1.33e-04
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 2022781840 404 WAPEEDAKLLQAVAKYGEQDWFKIREE---VPGRSDAQCRDRY 443
Cdd:cd11660 3 WTDEEDEALVEGVEKYGVGNWAKILKDyffVNNRTSVDLKDKW 45
|
|
| Myb_DNA-bind_6 |
pfam13921 |
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ... |
262-305 |
1.40e-04 |
|
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
Pssm-ID: 372817 [Multi-domain] Cd Length: 60 Bit Score: 41.14 E-value: 1.40e-04
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 2022781840 262 DWEKISNInfEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERL 305
Cdd:pfam13921 19 DWKQIAKE--LGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
|
|
| REB1 |
COG5147 |
Myb superfamily proteins, including transcription factors and mRNA splicing factors ... |
292-411 |
2.99e-04 |
|
Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];
Pssm-ID: 227476 [Multi-domain] Cd Length: 512 Bit Score: 45.16 E-value: 2.99e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 292 INKQEWSREEEERLQAIAAAHGHLEWQKIAEELgTSRSAFQC-LQKFQQHNKALKRKEWTEEEDRMLTQLVQEMrvGSHI 370
Cdd:COG5147 18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLL-ISSTGKQSsNRWNNHLNPQLKKKNWSEEEDEQLIDLDKEL--GTQW 94
|
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 2022781840 371 pyRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAK 411
Cdd:COG5147 95 --STIADYKDRRTAQQCVERYVNTLEDLSSTHDSKLQRRNE 133
|
|
| REB1 |
COG5147 |
Myb superfamily proteins, including transcription factors and mRNA splicing factors ... |
233-375 |
3.43e-04 |
|
Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];
Pssm-ID: 227476 [Multi-domain] Cd Length: 512 Bit Score: 45.16 E-value: 3.43e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 233 KQGREAEKEiQDINQLPE-----EALLGNRLDSHDWEKISNINFE----GSRSAEEIRKFWQNSEHPSINKQEWSREEEE 303
Cdd:COG5147 222 KKGETLALE-QEINEYKEkkglsRKQFCERIWSTDRDEDKFWPNIykklPYRDKKSIYKHLRRKYNIFEQRGKWTKEEEQ 300
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2022781840 304 RLQAIAAAHGHLeWQKIAEELGTSRSafQCLQKFQQHNK---ALKRKEWTEEEDRMLTQLVQEMRVGSHiPYRRI 375
Cdd:COG5147 301 ELAKLVVEHGGS-WTEIGKLLGRMPN--DCRDRWRDYVKcgdTLKRNRWSIEEEELLDKVVNEMRLEAQ-QSSRI 371
|
|
| PksD |
COG3321 |
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ... |
764-1291 |
5.93e-04 |
|
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];
Pssm-ID: 442550 [Multi-domain] Cd Length: 1386 Bit Score: 44.48 E-value: 5.93e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 764 TLFTQLFHIDTAGCLEVVRERKALPPRLP----QAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLA 839
Cdd:COG3321 839 QLWVAGVPVDWSALYPGRGRRRVPLPTYPfqreDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAA 918
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 840 SSRVERTLPQASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTV 919
Cdd:COG3321 919 LALAAAALAALLALVALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAA 998
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 920 LNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQ 999
Cdd:COG3321 999 AAALALLAAAALLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELA 1078
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1000 APAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAG 1079
Cdd:COG3321 1079 LAAAALALAAALAAAALALALAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAA 1158
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1080 LLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEP 1159
Cdd:COG3321 1159 LAAALAAALLAAAALLLALALALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLAL 1238
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1160 RTSSHADPPEAE-PPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGPEK 1238
Cdd:COG3321 1239 AAAAAAVAALAAaAAALLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAA 1318
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|...
gi 2022781840 1239 GALDLGLLSQEGEAATQQWLGGQRGVRVPLLGSRLPYQPPALCSLRALSGLLL 1291
Cdd:COG3321 1319 AALAAALLAAALAALAAAVAAALALAAAAAAAAAAAAAAAAAAALAAAAGAAA 1371
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
938-1207 |
6.02e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 44.39 E-value: 6.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 938 SGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGlgqSQAPAASRKQGLPEAPPflP 1017
Cdd:PHA03307 37 SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLS---TLAPASPAREGSPTPPG--P 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1018 AAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGllpVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGP 1097
Cdd:PHA03307 112 SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAA---SPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSS 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1098 RAPALSSSWQPPANMNREPEP------SCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVARE----IPEPRTSSHADP 1167
Cdd:PHA03307 189 PPAEPPPSTPPAAASPRPPRRsspisaSASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPEnecpLPRPAPITLPTR 268
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 2022781840 1168 PEAEPPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGPL 1207
Cdd:PHA03307 269 IWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPA 308
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
978-1279 |
9.65e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 43.82 E-value: 9.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 978 APALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPflPAAPSPTPLPVQplslthigGPHVATSVPLPVTWVLTAQGL 1057
Cdd:PRK07764 385 LGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAA--APAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAP 454
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1058 LPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPAlSSSWQPPANMNREPEPS-----------CRTDTPA 1126
Cdd:PRK07764 455 SPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA-APAAPAGADDAATLRERwpeilaavpkrSRKTWAI 533
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1127 PPTHAlsqSPAEADGSV---AFV----------PGEAQVAREIPEPRT-------------SSHADPPEAEPPWSGRLPA 1180
Cdd:PRK07764 534 LLPEA---TVLGVRGDTlvlGFStgglarrfasPGNAEVLVTALAEELggdwqveavvgpaPGAAGGEGPPAPASSGPPE 610
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1181 FGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGPEKGALDLGLLSQEGEAATQQWLG- 1259
Cdd:PRK07764 611 EAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAp 690
|
330 340
....*....|....*....|.
gi 2022781840 1260 -GQRGVRVPLLGSRLPYQPPA 1279
Cdd:PRK07764 691 aAPAGAAPAQPAPAPAATPPA 711
|
|
| PLN03212 |
PLN03212 |
Transcription repressor MYB5; Provisional |
398-502 |
1.08e-03 |
|
Transcription repressor MYB5; Provisional
Pssm-ID: 178751 [Multi-domain] Cd Length: 249 Bit Score: 42.37 E-value: 1.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 398 GLKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVP-GRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGvGHW 476
Cdd:PLN03212 22 GMKRGPWTVEEDEILVSFIKKEGEGRWRSLPKRAGlLRCGKSCRLRWMNYLRPSVKRGGITSDEEDLILRLHRLLG-NRW 100
|
90 100
....*....|....*....|....*.
gi 2022781840 477 AKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03212 101 SLIAGRIPGRTDNEIKNYWNTHLRKK 126
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
914-1158 |
1.10e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 1.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 914 APGPTVLNVPLSGpGAPAAAKPGTSGSWQ-EAGTSAKDKRlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPE 992
Cdd:PHA03247 254 APAPPPVVGEGAD-RAPETARGATGPPPPpEAAAPNGAAA-------PPDGVWGAALAGAPLALPAPPDPPPPAPAGDAE 325
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 993 SGLGQSQAPAASRKQGLPEA--PPFLPAAPSPTPLPvqPLSLTHI-GGPHVATSVPLPVTWVLTA--------------- 1054
Cdd:PHA03247 326 EEDDEDGAMEVVSPLPRPRQhyPLGFPKRRRPTWTP--PSSLEDLsAGRHHPKRASLPTRKRRSArhaatpfargpggdd 403
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1055 QGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTEtraAQGPRAPALSSSWQPPANMNREPEPSCRTDT---------- 1124
Cdd:PHA03247 404 QTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAE---PGSDDGPAPPPERQPPAPATEPAPDDPDDATrkaldalrer 480
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 2022781840 1125 --PAPPTHALSQ----SPAEADGSVAFVPGEAQVAREIPE 1158
Cdd:PHA03247 481 rpPEPPGADLAEllgrHPDTAGTVVRLAAREAAIAREVAE 520
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
193-359 |
1.91e-03 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 43.03 E-value: 1.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 193 RKSVVSDRLQRL---LQPKLLKLEYLHQKQSKVSSELERQALEKQGREAEKEIQdiNQLPEEALLGNRLD-SHDWEKISN 268
Cdd:TIGR00618 220 RKQVLEKELKHLreaLQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELR--AQEAVLEETQERINrARKAAPLAA 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 269 InfegSRSAEEIRKFWQNSeHPSINKQEWSReEEERLQAIAAAHGHLEWQKIAEELGTSRSafQCLQKFQQHNKALKRKE 348
Cdd:TIGR00618 298 H----IKAVTQIEQQAQRI-HTELQSKMRSR-AKLLMKRAAHVKQQSSIEEQRRLLQTLHS--QEIHIRDAHEVATSIRE 369
|
170
....*....|....*
gi 2022781840 349 ----WTEEEDRMLTQ 359
Cdd:TIGR00618 370 iscqQHTLTQHIHTL 384
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
854-1206 |
2.43e-03 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 42.35 E-value: 2.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 854 ASTGPRPkPKTVSELLQEKRLQEARA-REATRGPV-VLPSQL-LVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAP 930
Cdd:PHA03379 573 APWTPNP-PRSPSQMSVRDRLARLRAeAQPYQASVeVQPPQLtQVSPQQPMEYPLEPEQQMFPGSPFSQVADVMRAGGVP 651
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 931 AAAKPGTSGSWQEAGTsakdkrlstmQALPLAPVFSEAEGTAPAASQAPALgpgqisvscPESGLGQSQAPAASRKQGLP 1010
Cdd:PHA03379 652 AMQPQYFDLPLQQPIS----------QGAPLAPLRASMGPVPPVPATQPQY---------FDIPLTEPINQGASAAHFLP 712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1011 EAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGllpvpvPAVVSLPRPAgTPGP-AGLLATLLPPLT 1089
Cdd:PHA03379 713 QQPMEGPLVPERWMFQGATLSQSVRPGVAQSQYFDLPLTQPINHGA------PAAHFLHQPP-MEGPwVPEQWMFQGAPP 785
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1090 ETRAAQGPRapALSSSWQPPANMNREPEPScrtdTPAPPTHALSQS----PAEADGSvafvpGEAQVAREIPEP-RTSSH 1164
Cdd:PHA03379 786 SQGTDVVQH--QLDALGYVLHVLNHPGVPV----SPAVNQYHVSQAafglPIDEDES-----GEGSDTSEPCEAlDLSIH 854
|
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 2022781840 1165 ADPPEAEPPWSGRLPafgGVIPATEPRGTpgSPSGTQEPRGP 1206
Cdd:PHA03379 855 GRPCPQAPEWPVQGE---GGQDATEVLDL--SIHGRPRPRTP 891
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
926-1206 |
2.71e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 42.14 E-value: 2.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 926 GPGAPAAAKPGtsgswqeagtsakdkrlstmqALPlapvfseaegtAPAASQAPALGPGQISVSCPESGlgqsQAPAASR 1005
Cdd:PRK07003 368 PGGGVPARVAG---------------------AVP-----------APGARAAAAVGASAVPAVTAVTG----AAGAALA 411
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1006 KQGLPEAPPFLPAAPSPTPLPVQplslthiggphVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTP--GPAGLLAT 1083
Cdd:PRK07003 412 PKAAAAAAATRAEAPPAAPAPPA-----------TADRGDDAADGDAPVPAKANARASADSRCDERDAQPpaDSGSASAP 480
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1084 LLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHA--LSQSPAEADGSVAFVPGEAQVAREI----- 1156
Cdd:PRK07003 481 ASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPapEARPPTPAAAAPAARAGGAAAALDVlrnag 560
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 2022781840 1157 -----PEPRTSSHADPPEAEPPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGP 1206
Cdd:PRK07003 561 mrvssDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAAR 615
|
|
| SANT_TRF |
cd11660 |
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human ... |
470-499 |
4.30e-03 |
|
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human telomere repeat binding factors, TRF1 and TRF2, function as part of the 6 component shelterin complex. TRF2 binds DNA and recruits RAP1 (via binding to the RAP1 protein c-terminal (RCT)) and TIN2 in the protection of telomeres from DNA repair machinery. Metazoan shelterin consists of 3 DNA binding proteins (TRF2, TRF1, and POT1) and 3 recruited proteins that bind to one or more of these DNA-binding proteins (RAP1, TIN2, TPP1). Schizosaccharomyces pombe TAZ1 is an orthlog and binds RAP1. Human TRF1 and TRF2 bind double-stranded DNA. hTRF2 consists of a basic N-terminus, a TRF homology domain, the RAP1 binding motif (RBM), the TIN2 binding motif (TBM) and a myb-like DNA binding domain, SANT, named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. The single myb-like domain of TRF-type proteins is similar to the tandem myb_like domains found in yeast RAP1.
Pssm-ID: 212558 [Multi-domain] Cd Length: 50 Bit Score: 36.78 E-value: 4.30e-03
10 20 30
....*....|....*....|....*....|...
gi 2022781840 470 KYGVGHWAKIASELP---HRSGSQCLSKWKIMM 499
Cdd:cd11660 17 KYGVGNWAKILKDYFfvnNRTSVDLKDKWRNLK 49
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
877-1204 |
6.70e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.22 E-value: 6.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 877 ARAREATRGPVVLPSQLLVSSSVILQP-------PLPHTPHGRPAPGPTvlnvplSGPGAPAAAKPGTSGS--WQEagts 947
Cdd:PRK10263 330 TQSWAAPVEPVTQTPPVASVDVPPAQPtvawqpvPGPQTGEPVIAPAPE------GYPQQSQYAQPAVQYNepLQQ---- 399
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 948 akdkrlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPflPAAPSPTPLPV 1027
Cdd:PRK10263 400 ------------PVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQS--TFAPQSTYQTE 465
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1028 QPlslthiggphvatsVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRapaLSSSWQ 1107
Cdd:PRK10263 466 QT--------------YQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQ---LAAWYQ 528
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1108 PPANMNREPEPSCRTdtpAPPTHALSQSPAEAdgsvafVPGEAQVAREIPEPRTSSHADPPEAEPPWSgrlPAFGGVipa 1187
Cdd:PRK10263 529 PIPEPVKEPEPIKSS---LKAPSVAAVPPVEA------AAAVSPLASGVKKATLATGAAATVAAPVFS---LANSGG--- 593
|
330
....*....|....*..
gi 2022781840 1188 tePRGTPGSPSGTQEPR 1204
Cdd:PRK10263 594 --PRPQVKEGIGPQLPR 608
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
911-1033 |
8.78e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 40.47 E-value: 8.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 911 GRPAPGPTVLNVPLSGPGAPAAAKPGTSGSwQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSC 990
Cdd:PRK14951 369 AAEAAAPAEKKTPARPEAAAPAAAPVAQAA-AAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVAL 447
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 2022781840 991 PESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLT 1033
Cdd:PRK14951 448 APAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
1077-1208 |
9.89e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 40.47 E-value: 9.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1077 PAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREI 1156
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAV 445
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 2022781840 1157 PEPRTSSHADPPEAEPPWSGRLPAFGGVIPATEPrgtPGSPSGTQEPRGPLG 1208
Cdd:PRK14951 446 ALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAP---AAAPAAARLTPTEEG 494
|
|
|