NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2022781840|ref|NP_001381132|]
View 

snRNA-activating protein complex subunit 4 isoform b [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
789-1187 1.61e-14

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.61e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  789 PRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSEL 868
Cdd:PHA03247  2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  869 LQ------EKRLQEARAREATRGpvvLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAakpgTSGSwq 942
Cdd:PHA03247  2696 TSladpppPPPTPEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT----TAGP-- 2766
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  943 EAGTSAKDKRLSTMQALPLAPVFSEAEgTAPAASQAPALGPGQISVSCPESGLGQSQAPAAsrkqGLPEAPPFLPAAPSP 1022
Cdd:PHA03247  2767 PAPAPPAAPAAGPPRRLTRPAVASLSE-SRESLPSPWDPADPPAAVLAPAAALPPAASPAG----PLPPPTSAQPTAPPP 2841
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1023 TPLPVQPlSLTHIGGphVATSVPlpVTWVLTAQGLLPVPV----PAVVSLPRPAGTPGPAgLLATLLPPLTETRAAQGPR 1098
Cdd:PHA03247  2842 PPGPPPP-SLPLGGS--VAPGGD--VRRRPPSRSPAAKPAaparPPVRRLARPAVSRSTE-SFALPPDQPERPPQPQAPP 2915
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1099 APALSSSWQPPANMNREPEPSCRTDTPAPPThALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1178
Cdd:PHA03247  2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT-TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                   ....*....
gi 2022781840 1179 PAFGGVIPA 1187
Cdd:PHA03247  2995 PLTGHSLSR 3003
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
401-448 4.37e-14

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


:

Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 67.63  E-value: 4.37e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 2022781840   401 KGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLH 448
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
PLN03091 super family cl33633
hypothetical protein; Provisional
399-502 2.21e-08

hypothetical protein; Provisional


The actual alignment was detected with superfamily member PLN03091:

Pssm-ID: 215570 [Multi-domain]  Cd Length: 459  Bit Score: 58.45  E-value: 2.21e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  399 LKKGYWAPEEDAKLLQAVAKYGEQDWfkirEEVPGRSDAQ-----CRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGv 473
Cdd:PLN03091    12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
                           90       100
                   ....*....|....*....|....*....
gi 2022781840  474 GHWAKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03091    87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK 115
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
297-357 2.51e-08

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


:

Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 51.54  E-value: 2.51e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022781840  297 WSREEEERLQAIAAAHGhLEWQKIAEELGtSRSAFQCLQKFQQHNKA-LKRKEWTEEEDRML 357
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKLNPkISRGPWSKEEDQRL 60
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
346-397 1.51e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


:

Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.14  E-value: 1.51e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 2022781840   346 RKEWTEEEDRMLTQLVQEMRVGShipYRRIVYYMEGRDSMQLIYRWTKSLDP 397
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
SANT super family cl21498
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
262-305 1.40e-04

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


The actual alignment was detected with superfamily member pfam13921:

Pssm-ID: 473887 [Multi-domain]  Cd Length: 60  Bit Score: 41.14  E-value: 1.40e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 2022781840  262 DWEKISNInfEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERL 305
Cdd:pfam13921   19 DWKQIAKE--LGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
789-1187 1.61e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.61e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  789 PRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSEL 868
Cdd:PHA03247  2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  869 LQ------EKRLQEARAREATRGpvvLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAakpgTSGSwq 942
Cdd:PHA03247  2696 TSladpppPPPTPEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT----TAGP-- 2766
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  943 EAGTSAKDKRLSTMQALPLAPVFSEAEgTAPAASQAPALGPGQISVSCPESGLGQSQAPAAsrkqGLPEAPPFLPAAPSP 1022
Cdd:PHA03247  2767 PAPAPPAAPAAGPPRRLTRPAVASLSE-SRESLPSPWDPADPPAAVLAPAAALPPAASPAG----PLPPPTSAQPTAPPP 2841
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1023 TPLPVQPlSLTHIGGphVATSVPlpVTWVLTAQGLLPVPV----PAVVSLPRPAGTPGPAgLLATLLPPLTETRAAQGPR 1098
Cdd:PHA03247  2842 PPGPPPP-SLPLGGS--VAPGGD--VRRRPPSRSPAAKPAaparPPVRRLARPAVSRSTE-SFALPPDQPERPPQPQAPP 2915
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1099 APALSSSWQPPANMNREPEPSCRTDTPAPPThALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1178
Cdd:PHA03247  2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT-TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                   ....*....
gi 2022781840 1179 PAFGGVIPA 1187
Cdd:PHA03247  2995 PLTGHSLSR 3003
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
401-448 4.37e-14

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 67.63  E-value: 4.37e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 2022781840   401 KGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLH 448
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
404-447 1.13e-13

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 66.44  E-value: 1.13e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 2022781840  404 WAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:cd00167      2 WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
401-447 2.64e-12

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 62.52  E-value: 2.64e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2022781840  401 KGYWAPEEDAKLLQAVAKYGEqDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPGRTDNQCKNRWQNYL 46
PLN03091 PLN03091
hypothetical protein; Provisional
399-502 2.21e-08

hypothetical protein; Provisional


Pssm-ID: 215570 [Multi-domain]  Cd Length: 459  Bit Score: 58.45  E-value: 2.21e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  399 LKKGYWAPEEDAKLLQAVAKYGEQDWfkirEEVPGRSDAQ-----CRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGv 473
Cdd:PLN03091    12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
                           90       100
                   ....*....|....*....|....*....
gi 2022781840  474 GHWAKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03091    87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK 115
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
297-357 2.51e-08

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 51.54  E-value: 2.51e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022781840  297 WSREEEERLQAIAAAHGhLEWQKIAEELGtSRSAFQCLQKFQQHNKA-LKRKEWTEEEDRML 357
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKLNPkISRGPWSKEEDQRL 60
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
294-342 2.86e-08

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 51.07  E-value: 2.86e-08
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 2022781840   294 KQEWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQHNK 342
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPG-RTAEQCRERWRNLLK 48
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
346-397 1.51e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.14  E-value: 1.51e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 2022781840   346 RKEWTEEEDRMLTQLVQEMRVGShipYRRIVYYMEGRDSMQLIYRWTKSLDP 397
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
SANT_CDC5_II cd11659
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ...
290-339 1.77e-07

SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 212557 [Multi-domain]  Cd Length: 53  Bit Score: 49.23  E-value: 1.77e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2022781840  290 PSINKQEWSREEEERLQAIaAAHGHLEWQKIAEELGtsRSAFQCLQKFQQ 339
Cdd:cd11659      1 PSIKKTEWTREEDEKLLHL-AKLLPTQWRTIAPIVG--RTAQQCLERYNK 47
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
808-1233 2.78e-07

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 54.97  E-value: 2.78e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  808 SSAQSTPGHLFPNVPAQ-EASKSASHKGSRRLASSRVER----TLPQASLLASTGPrPKPKTVSELLQEKRLQEAR--AR 880
Cdd:pfam17823   11 FSLPLSESHAAPADPRHfVLNKMWNGAGKQNASGDAVPRadnkSSEQ*NFCAATAA-PAPVTLTKGTSAAHLNSTEvtAE 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  881 EATRG-----PVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAAKPGTSGSwqeAGTSAKDKRLS 954
Cdd:pfam17823   90 HTPHGtdlsePATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSeAFSAPRAAACRANAS---AAPRAAIAAAS 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  955 TMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1034
Cdd:pfam17823  167 APHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVG 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1035 IGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETraaQGPRAPAlsSSWQPPANMNR 1114
Cdd:pfam17823  247 TVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQA---QGPIIQV--STDQPVHNTAG 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1115 EPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSS----HADPPEAEPpwSGRLPAFG----GVIP 1186
Cdd:pfam17823  322 EPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMipevEATSPTTQP--SPLLPTQGaagpGILL 399
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2022781840 1187 ATEPRG---TPGSPSGTQEPRGPLGLEKLPLR--QPGPEKGALDLEKPPLPQ 1233
Cdd:pfam17823  400 APEQVAteaTAGTASAGPTPRSSGDPKTLAMAscQLSTQGQYLVVTTDPLTP 451
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
256-357 2.90e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.79  E-value: 2.90e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  256 NRLDShDWEKISNINFE------GSRSAEEIRKFWQNSEHPSINKQEWSREEEERLQAIAAAHGHLeWQKIAEELGtSRS 329
Cdd:COG5147     29 EDLKA-LVKKLGPNNWSkvasllISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGTQ-WSTIADYKD-RRT 105
                           90       100
                   ....*....|....*....|....*...
gi 2022781840  330 AFQCLQKFQQHNKALKRKEWTEEEDRML 357
Cdd:COG5147    106 AQQCVERYVNTLEDLSSTHDSKLQRRNE 133
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
334-453 5.02e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.02  E-value: 5.02e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  334 LQKFQQHNKALKRKE--WTEEEDRMLTQLVQEM------RVGSHIPYRrivyyMEGRDSMqliyRWTKSLDPGLKKGYWA 405
Cdd:COG5147      6 NKELQIKLMQTKRKGgsWKRTEDEDLKALVKKLgpnnwsKVASLLISS-----TGKQSSN----RWNNHLNPQLKKKNWS 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 2022781840  406 PEEDAKLLQAVAKYGEQdWFKIREEVPGRSDAQCRDRYLRRLHFSLKK 453
Cdd:COG5147     77 EEEDEQLIDLDKELGTQ-WSTIADYKDRRTAQQCVERYVNTLEDLSST 123
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
349-412 3.58e-06

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 45.76  E-value: 3.58e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022781840  349 WTEEEDRMLTQLVQEMrvgsHIPYRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAKL 412
Cdd:pfam13921    1 WTEEEDEKLLKLVEKY----GNDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
470-496 4.23e-06

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 44.87  E-value: 4.23e-06
                           10        20
                   ....*....|....*....|....*..
gi 2022781840  470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:cd00167     16 KYGKNNWEKIAKELPGRTPKQCRERWR 42
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
470-496 5.99e-06

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 44.52  E-value: 5.99e-06
                            10        20
                    ....*....|....*....|....*..
gi 2022781840   470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:smart00717   18 KYGKNNWEKIAKELPGRTAEQCRERWR 44
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
399-495 4.50e-05

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 47.86  E-value: 4.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  399 LKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGVgHWAK 478
Cdd:COG5147     18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLLISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGT-QWST 96
                           90
                   ....*....|....*..
gi 2022781840  479 IASELPHRSGSQCLSKW 495
Cdd:COG5147     97 IADYKDRRTAQQCVERY 113
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
262-305 1.40e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 41.14  E-value: 1.40e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 2022781840  262 DWEKISNInfEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERL 305
Cdd:pfam13921   19 DWKQIAKE--LGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
193-359 1.91e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.03  E-value: 1.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  193 RKSVVSDRLQRL---LQPKLLKLEYLHQKQSKVSSELERQALEKQGREAEKEIQdiNQLPEEALLGNRLD-SHDWEKISN 268
Cdd:TIGR00618  220 RKQVLEKELKHLreaLQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELR--AQEAVLEETQERINrARKAAPLAA 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  269 InfegSRSAEEIRKFWQNSeHPSINKQEWSReEEERLQAIAAAHGHLEWQKIAEELGTSRSafQCLQKFQQHNKALKRKE 348
Cdd:TIGR00618  298 H----IKAVTQIEQQAQRI-HTELQSKMRSR-AKLLMKRAAHVKQQSSIEEQRRLLQTLHS--QEIHIRDAHEVATSIRE 369
                          170
                   ....*....|....*
gi 2022781840  349 ----WTEEEDRMLTQ 359
Cdd:TIGR00618  370 iscqQHTLTQHIHTL 384
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
789-1187 1.61e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.60  E-value: 1.61e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  789 PRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSEL 868
Cdd:PHA03247  2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  869 LQ------EKRLQEARAREATRGpvvLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAakpgTSGSwq 942
Cdd:PHA03247  2696 TSladpppPPPTPEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT----TAGP-- 2766
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  943 EAGTSAKDKRLSTMQALPLAPVFSEAEgTAPAASQAPALGPGQISVSCPESGLGQSQAPAAsrkqGLPEAPPFLPAAPSP 1022
Cdd:PHA03247  2767 PAPAPPAAPAAGPPRRLTRPAVASLSE-SRESLPSPWDPADPPAAVLAPAAALPPAASPAG----PLPPPTSAQPTAPPP 2841
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1023 TPLPVQPlSLTHIGGphVATSVPlpVTWVLTAQGLLPVPV----PAVVSLPRPAGTPGPAgLLATLLPPLTETRAAQGPR 1098
Cdd:PHA03247  2842 PPGPPPP-SLPLGGS--VAPGGD--VRRRPPSRSPAAKPAaparPPVRRLARPAVSRSTE-SFALPPDQPERPPQPQAPP 2915
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1099 APALSSSWQPPANMNREPEPSCRTDTPAPPThALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1178
Cdd:PHA03247  2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT-TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                   ....*....
gi 2022781840 1179 PAFGGVIPA 1187
Cdd:PHA03247  2995 PLTGHSLSR 3003
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
401-448 4.37e-14

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 67.63  E-value: 4.37e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 2022781840   401 KGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLH 448
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
404-447 1.13e-13

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 66.44  E-value: 1.13e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 2022781840  404 WAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:cd00167      2 WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45
PHA03247 PHA03247
large tegument protein UL36; Provisional
784-1204 2.97e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 75.36  E-value: 2.97e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  784 RKALPPRLPQAGARD--PPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPK 861
Cdd:PHA03247  2572 RPAPRPSEPAVTSRArrPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPER 2651
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  862 PKTVSELLQEKRLQEAR-------AREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAA 933
Cdd:PHA03247  2652 PRDDPAPGRVSRPRRARrlgraaqASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPpGPAAARQA 2731
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  934 KPGTSGSWQEAGTSAKdkrlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLgqSQAPAASRKQGLPEAP 1013
Cdd:PHA03247  2732 SPALPAAPAPPAVPAG----------PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL--TRPAVASLSESRESLP 2799
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1014 pfLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSlprPAGTPGPAGLLATLLPPLTETRA 1093
Cdd:PHA03247  2800 --SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP---LGGSVAPGGDVRRRPPSRSPAAK 2874
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1094 AQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEAdgsvafvPGEAQVAREIPEPRTSSHADPPEAEPP 1173
Cdd:PHA03247  2875 PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQ-------PQPPPPPQPQPPPPPPPRPQPPLAPTT 2947
                          410       420       430
                   ....*....|....*....|....*....|.
gi 2022781840 1174 WSGRLPAFGGVIPAtePRGTPGSPSGTQEPR 1204
Cdd:PHA03247  2948 DPAGAGEPSGAVPQ--PWLGALVPGRVAVPR 2976
PHA03247 PHA03247
large tegument protein UL36; Provisional
784-1279 7.00e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 74.20  E-value: 7.00e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  784 RKALPPRLPQA--GARDPPvHLLQASSSAQSTPGHLFPNVPAQEASKSASH-------KGSRRLASSRV---ERTLPQAS 851
Cdd:PHA03247  2481 RRPAEARFPFAagAAPDPG-GGGPPDPDAPPAPSRLAPAILPDEPVGEPVHprmltwiRGLEELASDDAgdpPPPLPPAA 2559
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  852 LLASTG-----PRPKPKTVSELLQEKRLQEARAREATRG--PVVLPSQLLVSSSVILQPPLPHTPHGrPAPGPTVLNVPL 924
Cdd:PHA03247  2560 PPAAPDrsvppPRPAPRPSEPAVTSRARRPDAPPQSARPraPVDDRGDPRGPAPPSPLPPDTHAPDP-PPPSPSPAANEP 2638
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  925 SGPGaPAAAKPGTSGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASqaPALGPGQISVSCPESGLGQSQAP-AA 1003
Cdd:PHA03247  2639 DPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAAR--PTVGSLTSLADPPPPPPTPEPAPhAL 2715
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1004 SRKQGLPEAP-------PFLPAAPSPTPLPVQPLSlthIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVvSLPRPAGTP- 1075
Cdd:PHA03247  2716 VSATPLPPGPaaarqasPALPAAPAPPAVPAGPAT---PGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASl 2791
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1076 GPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANmnrEPEPSCRTDTPAPPTHALSQSPAEADGSVAfvPGeAQVARE 1155
Cdd:PHA03247  2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP---LPPPTSAQPTAPPPPPGPPPPSLPLGGSVA--PG-GDVRRR 2865
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1156 IP------------EPRTSSHADPPEAEPPWSGRLPAFGgviPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGA 1223
Cdd:PHA03247  2866 PPsrspaakpaapaRPPVRRLARPAVSRSTESFALPPDQ---PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2022781840 1224 LDLEKPPLPQPGPEKGALDlgllsqegeaatqQWLGGQRGVRVPLLGSRLPYQPPA 1279
Cdd:PHA03247  2943 LAPTTDPAGAGEPSGAVPQ-------------PWLGALVPGRVAVPRFRVPQPAPS 2985
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
401-447 2.64e-12

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 62.52  E-value: 2.64e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2022781840  401 KGYWAPEEDAKLLQAVAKYGEqDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPGRTDNQCKNRWQNYL 46
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
404-457 3.44e-11

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 59.63  E-value: 3.44e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2022781840  404 WAPEEDAKLLQAVAKYGeQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWN 457
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWS 53
PHA03378 PHA03378
EBNA-3B; Provisional
873-1239 3.81e-10

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 64.70  E-value: 3.81e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  873 RLQEARAREATRGPVVL----PSQLLVSSSVILQPPLPHTPHgRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSA 948
Cdd:PHA03378   437 RTEQPRATPHSQAPTVVlhrpPTQPLEGPTGPLSVQAPLEPW-QPLPHPQVTPVILHQPPAQGVQAHGSMLDLLEKDDED 515
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  949 KDKRLSTMQALPLAP----------VFSE---AEGTAPAASQA------PALGPGQISV-------------SCPESGLG 996
Cdd:PHA03378   516 MEQRVMATLLPPSPPqpragrrapcVYTEdldIESDEPASTEPvhdqllPAPGLGPLQIqpltspttsqlasSAPSYAQT 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  997 QSQAPAASRKQGLPEAPPFLPA--APSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGT 1074
Cdd:PHA03378   596 PWPVPHPSQTPEPPTTQSHIPEtsAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQ 675
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1075 PGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSV---AFVPGEAQ 1151
Cdd:PHA03378   676 PSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRArppAAAPGRAR 755
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1152 VAREIPEPRTSSHADPPEAEPpwsgRLPAFGGVIPATEPRgtpGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPL 1231
Cdd:PHA03378   756 PPAAAPGRARPPAAAPGAPTP----QPPPQAPPAPQQRPR---GAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQL 828

                   ....*...
gi 2022781840 1232 PQPGPEKG 1239
Cdd:PHA03378   829 LTGGVKRG 836
PLN03091 PLN03091
hypothetical protein; Provisional
399-502 2.21e-08

hypothetical protein; Provisional


Pssm-ID: 215570 [Multi-domain]  Cd Length: 459  Bit Score: 58.45  E-value: 2.21e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  399 LKKGYWAPEEDAKLLQAVAKYGEQDWfkirEEVPGRSDAQ-----CRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGv 473
Cdd:PLN03091    12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
                           90       100
                   ....*....|....*....|....*....
gi 2022781840  474 GHWAKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03091    87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK 115
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
297-357 2.51e-08

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 51.54  E-value: 2.51e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2022781840  297 WSREEEERLQAIAAAHGhLEWQKIAEELGtSRSAFQCLQKFQQHNKA-LKRKEWTEEEDRML 357
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKLNPkISRGPWSKEEDQRL 60
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
294-342 2.86e-08

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 51.07  E-value: 2.86e-08
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 2022781840   294 KQEWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQHNK 342
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPG-RTAEQCRERWRNLLK 48
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
786-1172 4.63e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 57.69  E-value: 4.63e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  786 ALPPRLPQAGARDPPVhlLQASSSAQSTPghlfPNVPAQEASKSASHKGSRRLASSrvertlPQASLLASTGPRPKPKTV 865
Cdd:PRK07764   380 RLERRLGVAGGAGAPA--AAAPSAAAAAP----AAAPAPAAAAPAAAAAPAPAAAP------QPAPAPAPAPAPPSPAGN 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  866 SELLQEKRLQEARAREATRGPVVLPSQllvssSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPgtSGSWQEAG 945
Cdd:PRK07764   448 APAGGAPSPPPAAAPSAQPAPAPAAAP-----EPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATL--RERWPEIL 520
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  946 TSAKDKRLSTMQAL---------------------PLAPVFSEAE-----------------------GTAPAASQAPAL 981
Cdd:PRK07764   521 AAVPKRSRKTWAILlpeatvlgvrgdtlvlgfstgGLARRFASPGnaevlvtalaeelggdwqveavvGPAPGAAGGEGP 600
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  982 GPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPV- 1060
Cdd:PRK07764   601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAa 680
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1061 PVPAVVSLPRPAGTPGPAGLLATLLPPlteTRAAQGPRAPAlsSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEAD 1140
Cdd:PRK07764   681 PPPAPAPAAPAAPAGAAPAQPAPAPAA---TPPAGQADDPA--AQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAP 755
                          410       420       430
                   ....*....|....*....|....*....|..
gi 2022781840 1141 GSVAFVPGEAQVAREIPEPRTSSHADPPEAEP 1172
Cdd:PRK07764   756 AQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAE 787
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
346-397 1.51e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.14  E-value: 1.51e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 2022781840   346 RKEWTEEEDRMLTQLVQEMRVGShipYRRIVYYMEGRDSMQLIYRWTKSLDP 397
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
SANT_CDC5_II cd11659
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ...
290-339 1.77e-07

SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 212557 [Multi-domain]  Cd Length: 53  Bit Score: 49.23  E-value: 1.77e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2022781840  290 PSINKQEWSREEEERLQAIaAAHGHLEWQKIAEELGtsRSAFQCLQKFQQ 339
Cdd:cd11659      1 PSIKKTEWTREEDEKLLHL-AKLLPTQWRTIAPIVG--RTAQQCLERYNK 47
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
792-1199 2.66e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 55.56  E-value: 2.66e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  792 PQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRvERTLPQASLLASTGPRPKPKTVSELLQE 871
Cdd:PHA03307    24 PPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPG-PGTEAPANESRSTPTWSLSTLAPASPAR 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  872 KRLQEARAREATRGPVVLPSQLLVSSSVilQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDK 951
Cdd:PHA03307   103 EGSPTPPGPSSPDPPPPTPPPASPPPSP--APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  952 RLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQA---------PAASRKQGLPEAPPFLPAAPSP 1022
Cdd:PHA03307   181 ETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAAddagasssdSSSSESSGCGWGPENECPLPRP 260
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1023 TPLPVQPLSLTHIgGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGP-RAPA 1101
Cdd:PHA03307   261 APITLPTRIWEAS-GWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSEsSRGA 339
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1102 LSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRLPAF 1181
Cdd:PHA03307   340 AVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDA 419
                          410
                   ....*....|....*...
gi 2022781840 1182 GGVIPATEPRGTPGSPSG 1199
Cdd:PHA03307   420 GAASGAFYARYPLLTPSG 437
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
808-1233 2.78e-07

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 54.97  E-value: 2.78e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  808 SSAQSTPGHLFPNVPAQ-EASKSASHKGSRRLASSRVER----TLPQASLLASTGPrPKPKTVSELLQEKRLQEAR--AR 880
Cdd:pfam17823   11 FSLPLSESHAAPADPRHfVLNKMWNGAGKQNASGDAVPRadnkSSEQ*NFCAATAA-PAPVTLTKGTSAAHLNSTEvtAE 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  881 EATRG-----PVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAAKPGTSGSwqeAGTSAKDKRLS 954
Cdd:pfam17823   90 HTPHGtdlsePATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSeAFSAPRAAACRANAS---AAPRAAIAAAS 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  955 TMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1034
Cdd:pfam17823  167 APHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVG 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1035 IGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETraaQGPRAPAlsSSWQPPANMNR 1114
Cdd:pfam17823  247 TVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQA---QGPIIQV--STDQPVHNTAG 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1115 EPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSS----HADPPEAEPpwSGRLPAFG----GVIP 1186
Cdd:pfam17823  322 EPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMipevEATSPTTQP--SPLLPTQGaagpGILL 399
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2022781840 1187 ATEPRG---TPGSPSGTQEPRGPLGLEKLPLR--QPGPEKGALDLEKPPLPQ 1233
Cdd:pfam17823  400 APEQVAteaTAGTASAGPTPRSSGDPKTLAMAscQLSTQGQYLVVTTDPLTP 451
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
256-357 2.90e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.79  E-value: 2.90e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  256 NRLDShDWEKISNINFE------GSRSAEEIRKFWQNSEHPSINKQEWSREEEERLQAIAAAHGHLeWQKIAEELGtSRS 329
Cdd:COG5147     29 EDLKA-LVKKLGPNNWSkvasllISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGTQ-WSTIADYKD-RRT 105
                           90       100
                   ....*....|....*....|....*...
gi 2022781840  330 AFQCLQKFQQHNKALKRKEWTEEEDRML 357
Cdd:COG5147    106 AQQCVERYVNTLEDLSSTHDSKLQRRNE 133
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
296-340 3.08e-07

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 47.96  E-value: 3.08e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 2022781840  296 EWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQH 340
Cdd:cd00167      1 PWTEEEDELLLEAVKKYGKNNWEKIAKELPG-RTPKQCRERWRNL 44
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
848-1220 3.43e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 55.16  E-value: 3.43e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  848 PQASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLpsqllVSSSVILQP---PLPHTP-HGRPAPGPTVLNVP 923
Cdd:pfam03154  189 PGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTL-----IQQTPTLHPqrlPSPHPPlQPMTQPPPPSQVSP 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  924 LSGPGAPAAAKPGTSGSWQEAGTSAKDKRLSTmQALPLAPVFSEAEGTAPAASQAPalgpgqisvscpesglGQSQApaa 1003
Cdd:pfam03154  264 QPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPP-QPFPLTPQSSQSQVPPGPSPAAP----------------GQSQQ--- 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1004 srkqgLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQG-LLPVPVPAVVSLPRPAGTPGPAGLLA 1082
Cdd:pfam03154  324 -----RIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQShKHPPHLSGPSPFQMNSNLPPPPALKP 398
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1083 TLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPA-----PPTHALSQSPAEAD-GSVAFVPGEAQVAREI 1156
Cdd:pfam03154  399 LSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPpaashPPTSGLHQVPSQSPfPQHPFVPGGPPPITPP 478
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1157 PEPRTSSHADPPEAEPPWSGRlPAFGGVIPATEPRGTPG------SPSGTQEPRGPlgleKLPLRQPGPE 1220
Cdd:pfam03154  479 SGPPTSTSSAMPGIQPPSSAS-VSSSGPVPAAVSCPLPPvqikeeALDEAEEPESP----PPPPRSPSPE 543
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
902-1236 4.58e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.61  E-value: 4.58e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  902 QPPLPHTPHGRPAPGPtvlnvplSGPGAPAAAKPGTSGSWQEAGT-SAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPA 980
Cdd:PRK07764   397 AAPSAAAAAPAAAPAP-------AAAAPAAAAAPAPAAAPQPAPApAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPA 469
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  981 LGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAA---------------------------PSPTPLPVQP--LS 1031
Cdd:PRK07764   470 PAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDaatlrerwpeilaavpkrsrktwaillPEATVLGVRGdtLV 549
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1032 LTH--------IGGPHVATSVplpVTwVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALS 1103
Cdd:PRK07764   550 LGFstgglarrFASPGNAEVL---VT-ALAEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPA 625
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1104 SSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEA-----DGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1178
Cdd:PRK07764   626 APAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDasdggDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAP 705
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2022781840 1179 PAFGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGP 1236
Cdd:PRK07764   706 AATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPA 763
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
334-453 5.02e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.02  E-value: 5.02e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  334 LQKFQQHNKALKRKE--WTEEEDRMLTQLVQEM------RVGSHIPYRrivyyMEGRDSMqliyRWTKSLDPGLKKGYWA 405
Cdd:COG5147      6 NKELQIKLMQTKRKGgsWKRTEDEDLKALVKKLgpnnwsKVASLLISS-----TGKQSSN----RWNNHLNPQLKKKNWS 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 2022781840  406 PEEDAKLLQAVAKYGEQdWFKIREEVPGRSDAQCRDRYLRRLHFSLKK 453
Cdd:COG5147     77 EEEDEQLIDLDKELGTQ-WSTIADYKDRRTAQQCVERYVNTLEDLSST 123
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
850-1206 1.28e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.25  E-value: 1.28e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  850 ASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLPSQLLVSSSVIlqPPLPHTPHGRPAPGPTVLNVPLSGPGA 929
Cdd:PHA03307    57 AGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPT--PPGPSSPDPPPPTPPPASPPPSPAPDL 134
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  930 PAAAKPGTSGSwqeagtsakdKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESG---LGQSQAPAASRK 1006
Cdd:PHA03307   135 SEMLRPVGSPG----------PPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPpaePPPSTPPAAASP 204
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1007 QGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTwVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLP 1086
Cdd:PHA03307   205 RPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGC-GWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPG 283
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1087 PLTETRAAQGPRAPALSSSwqppanmnrepepSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHAD 1166
Cdd:PHA03307   284 PASSSSSPRERSPSPSPSS-------------PGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRS 350
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|
gi 2022781840 1167 PPEAEPPwsgrlPAFGGVIPATEPRGTPGSPSGTQEPRGP 1206
Cdd:PHA03307   351 PSPSRPP-----PPADPSSPRKRPRPSRAPSSPAASAGRP 385
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
349-412 3.58e-06

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 45.76  E-value: 3.58e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2022781840  349 WTEEEDRMLTQLVQEMrvgsHIPYRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAKL 412
Cdd:pfam13921    1 WTEEEDEKLLKLVEKY----GNDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
932-1142 3.91e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 51.42  E-value: 3.91e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  932 AAKPGTSGSWQEAGTSAKDKRLSTMQAL----PLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQ 1007
Cdd:PRK12323   362 AFRPGQSGGGAGPATAAAAPVAQPAPAAaapaAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASA 441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1008 GLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGP----AGLLAT 1083
Cdd:PRK12323   442 RGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPaqpdAAPAGW 521
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2022781840 1084 LLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGS 1142
Cdd:PRK12323   522 VAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
470-496 4.23e-06

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 44.87  E-value: 4.23e-06
                           10        20
                   ....*....|....*....|....*..
gi 2022781840  470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:cd00167     16 KYGKNNWEKIAKELPGRTPKQCRERWR 42
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
972-1242 4.34e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 4.34e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  972 APAASQAPAlGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVAtsvPLPVTWV 1051
Cdd:PRK07003   361 AVTGGGAPG-GGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAA---PAPPATA 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1052 LTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSswqppanmnREPEPSCrtdtpAPPTHA 1131
Cdd:PRK07003   437 DRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAA---------FEPAPRA-----AAPSAA 502
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1132 LSQSPAEADGSVAFVPGEAQVAREIPEPRTSShADPPEAEPPWSGrlpafGGVIPATEPRGTPGSPSGTQEPRGPLGLEK 1211
Cdd:PRK07003   503 TPAAVPDARAPAAASREDAPAAAAPPAPEARP-PTPAAAAPAARA-----GGAAAALDVLRNAGMRVSSDRGARAAAAAK 576
                          250       260       270
                   ....*....|....*....|....*....|.
gi 2022781840 1212 LPLRQPGPEKGALDLEKPPLPQPGPEKGALD 1242
Cdd:PRK07003   577 PAAAPAAAPKPAAPRVAVQVPTPRARAATGD 607
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
470-496 5.99e-06

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 44.52  E-value: 5.99e-06
                            10        20
                    ....*....|....*....|....*..
gi 2022781840   470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:smart00717   18 KYGKNNWEKIAKELPGRTAEQCRERWR 44
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
959-1187 8.40e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 50.26  E-value: 8.40e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  959 LPLAPVFSEAEGTAPAASQAPALGPG----QISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1034
Cdd:PRK12323   361 LAFRPGQSGGGAGPATAAAAPVAQPApaaaAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQAS 440
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1035 IGGPhVATSVPLPVtwvltaqgllPVPVPAVVSLPRPAGTPGPAglLATLLPPLTETRAAQGPRAPALSSSWQPPANMNR 1114
Cdd:PRK12323   441 ARGP-GGAPAPAPA----------PAAAPAAAARPAAAGPRPVA--AAAAAAPARAAPAAAPAPADDDPPPWEELPPEFA 507
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2022781840 1115 EPEPSCRTDTPAPPTHALSQSPAEADGSVAF-VPGEAQVAREIPEPRTSSHADPPEAEPPWS--GRLPAFGGVIPA 1187
Cdd:PRK12323   508 SPAPAQPDAAPAGWVAESIPDPATADPDDAFeTLAPAPAAAPAPRAAAATEPVVAPRPPRASasGLPDMFDGDWPA 583
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
903-1110 9.71e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 50.26  E-value: 9.71e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  903 PPLPHTPHGRPAPG---PTVLNVPLSGPGAPAAAKPGTSGSWQEAGtSAKDKRLSTMQALPLApvfSEAEGTAPAASQAP 979
Cdd:PRK12323   375 ATAAAAPVAQPAPAaaaPAAAAPAPAAPPAAPAAAPAAAAAARAVA-AAPARRSPAPEALAAA---RQASARGPGGAPAP 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  980 ALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAqglLP 1059
Cdd:PRK12323   451 APAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAES---IP 527
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2022781840 1060 VPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAqgPRAPALSSSWQPPA 1110
Cdd:PRK12323   528 DPATADPDDAFETLAPAPAAAPAPRAAAATEPVVA--PRPPRASASGLPDM 576
PHA03247 PHA03247
large tegument protein UL36; Provisional
902-1244 1.28e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 1.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  902 QPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGT---SAKDKRLSTMQALPLAPVFSEaegtaPAASQA 978
Cdd:PHA03247  2414 QPDPPGPPDVRFVGSEEIEELPFVSPGGDVLAGLAADGDPFFARTilgAPFSLSLLLGELFPGAPVYRR-----PAEARF 2488
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  979 P-ALGPGqisvscPESGLGQSQAPAASRKQGLPeAPPFLPAAPSPTPLPVQPLSLTH---------IGGPhvatSVPLPv 1048
Cdd:PHA03247  2489 PfAAGAA------PDPGGGGPPDPDAPPAPSRL-APAILPDEPVGEPVHPRMLTWIRgleelasddAGDP----PPPLP- 2556
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1049 twvltaqgllPVPVPAVV--SLPRPAGTPGPAGLLAtllpplteTRAAQGPRAPALSSSWQPPANmNREPEPSCRTDTPA 1126
Cdd:PHA03247  2557 ----------PAAPPAAPdrSVPPPRPAPRPSEPAV--------TSRARRPDAPPQSARPRAPVD-DRGDPRGPAPPSPL 2617
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1127 PPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRLP--AFGGVIPATEPR------------- 1191
Cdd:PHA03247  2618 PPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLgrAAQASSPPQRPRrraarptvgslts 2697
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2022781840 1192 -GTPGSPSGTQEPRGPLGLEKLPLrQPGPEKGALDLEKPPL---PQPGPEKGALDLG 1244
Cdd:PHA03247  2698 lADPPPPPPTPEPAPHALVSATPL-PPGPAAARQASPALPAapaPPAVPAGPATPGG 2753
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
294-340 2.42e-05

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 42.88  E-value: 2.42e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2022781840  294 KQEWSREEEERLQAIAAAHGHlEWQKIAEELGTsRSAFQCLQKFQQH 340
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPG-RTDNQCKNRWQNY 45
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
875-1174 2.88e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 48.69  E-value: 2.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  875 QEARAREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEaGTSAKDKRLS 954
Cdd:PRK07003   372 VPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGD-DAADGDAPVP 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  955 TMQALPLAPVFSEAEGTA-PAASQAPALGPGqisvscpesglgqSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLT 1033
Cdd:PRK07003   451 AKANARASADSRCDERDAqPPADSGSASAPA-------------SDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAAS 517
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1034 HIGGPHVAtSVPLPvtwvltaqgLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMN 1113
Cdd:PRK07003   518 REDAPAAA-APPAP---------EARPPTPAAAAPAARAGGAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKP 587
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022781840 1114 REPEPSCRTDTPAPPTHALSQSPAEAdgsvafvpgeaqvareipePRTSSHADPPEAEPPW 1174
Cdd:PRK07003   588 AAPRVAVQVPTPRARAATGDAPPNGA-------------------ARAEQAAESRGAPPPW 629
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
903-1188 3.57e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 48.37  E-value: 3.57e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  903 PPLPHTPHGRPAP---GPTVLNVPLSGP---GAPAAAKPGT-SGSWQEAGTSAKDKRLSTmqalplaPVFSEAEGTAPAA 975
Cdd:pfam05109  449 PSSTHVPTNLTAPastGPTVSTADVTSPtpaGTTSGASPVTpSPSPRDNGTESKAPDMTS-------PTSAVTTPTPNAT 521
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  976 SQAPALGPGQISVSCPESGlgqSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQP-LSLThigGPHVATSVPLPVTWVLTA 1054
Cdd:pfam05109  522 SPTPAVTTPTPNATSPTLG---KTSPTSAVTTPTPNATSPTPAVTTPTPNATIPtLGKT---SPTSAVTTPTPNATSPTV 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1055 QGLLP------------VPVPAVVSLPRPAGTPGPAGLLATLLppltETRAAQGPRAPALSSSWQPPANMNR-------- 1114
Cdd:pfam05109  596 GETSPqanttnhtlggtSSTPVVTSPPKNATSAVTTGQHNITS----SSTSSMSLRPSSISETLSPSTSDNStshmpllt 671
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1115 EPEPS-----------------CRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPePRTSSHADPPEAEPPWSGR 1177
Cdd:pfam05109  672 SAHPTggenitqvtpaststhhVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTP-PKNATSPQAPSGQKTAVPT 750
                          330
                   ....*....|.
gi 2022781840 1178 LPAFGGVIPAT 1188
Cdd:pfam05109  751 VTSTGGKANST 761
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1036-1274 3.63e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 48.33  E-value: 3.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1036 GGPHVATSVPLPVTWVLtaqgllPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNRE 1115
Cdd:PRK12323   370 GGAGPATAAAAPVAQPA------PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARG 443
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1116 PEPSCRTDTPAPPTHALSQSPAEADgsvafvpgeaqvareiPEPRTSSHADPPEAEPPWSGRLPAFGGVIPATEPRGTPG 1195
Cdd:PRK12323   444 PGGAPAPAPAPAAAPAAAARPAAAG----------------PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFA 507
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1196 SPSGTQEPRGPLGLEKLPLRQPG---PEKGALDLEKPPLPQPGPEKGALDLGLLSQEGEAATQQWLGGQRGVRVPLLGSR 1272
Cdd:PRK12323   508 SPAPAQPDAAPAGWVAESIPDPAtadPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAAR 587

                   ..
gi 2022781840 1273 LP 1274
Cdd:PRK12323   588 LP 589
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
399-495 4.50e-05

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 47.86  E-value: 4.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  399 LKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGVgHWAK 478
Cdd:COG5147     18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLLISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGT-QWST 96
                           90
                   ....*....|....*..
gi 2022781840  479 IASELPHRSGSQCLSKW 495
Cdd:COG5147     97 IADYKDRRTAQQCVERY 113
SANT_CDC5_II cd11659
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ...
397-443 5.86e-05

SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 212557 [Multi-domain]  Cd Length: 53  Bit Score: 41.91  E-value: 5.86e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2022781840  397 PGLKKGYWAPEEDAKLLQAVAKYGEQdWFKIREEVpGRSDAQCRDRY 443
Cdd:cd11659      1 PSIKKTEWTREEDEKLLHLAKLLPTQ-WRTIAPIV-GRTAQQCLERY 45
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
910-1128 6.90e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.67  E-value: 6.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  910 HGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDKRlstmqalplAPVFSEAEGTAPAASQAPALGPGQISVS 989
Cdd:PRK07764   590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGA---------AAAPAEASAAPAPGVAAPEHHPKHVAVP 660
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  990 CPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLP 1069
Cdd:PRK07764   661 DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2022781840 1070 RPaGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPAnmnrEPEPSCRTDTPAPP 1128
Cdd:PRK07764   741 LP-PEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS----EEEEMAEDDAPSMD 794
PHA03378 PHA03378
EBNA-3B; Provisional
846-1181 8.35e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 47.37  E-value: 8.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  846 TLPQASLLASTGP----RPKPKTVSELLQEKRLQEARAREAT---RGPVVL---PSQLLVSSSVILQPPLPHTPHGRPAP 915
Cdd:PHA03378   578 TSPTTSQLASSAPsyaqTPWPVPHPSQTPEPPTTQSHIPETSaprQWPMPLrpiPMRPLRMQPITFNVLVFPTPHQPPQV 657
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  916 GPTVLNV----PLSGPGAPAAAKPGTSGSWQEAGTsakdkrlsTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCP 991
Cdd:PHA03378   658 EITPYKPtwtqIGHIPYQPSPTGANTMLPIQWAPG--------TMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAA 729
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  992 ESGLGQSQAPAASRKQGlPEAPPFLPAAPSPTPLPVQPLSlthiGGPHVATSVPLPVTWVLTAQ----GLLPVPVPAV-- 1065
Cdd:PHA03378   730 APGRARPPAAAPGRARP-PAAAPGRARPPAAAPGRARPPA----AAPGAPTPQPPPQAPPAPQQrprgAPTPQPPPQAgp 804
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1066 ----VSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSwQPPANMNREPEPSCRTDT-PAPPTHALSQSPAEAD 1140
Cdd:PHA03378   805 tsmqLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALER-QAAAGPTPSPGSGTSDKIvQAPVFYPPVLQPIQVM 883
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2022781840 1141 GSVAFV---------------PGEAQVA-----REIPEPRTSSHADPPEAEPPWSGRLPAF 1181
Cdd:PHA03378   884 RQLGSVraaaastvtqapteyTGERRGVgpmhpTDIPPSKRAKTDAYVESQPPHGGQSHSF 944
SANT_TRF cd11660
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human ...
404-443 1.33e-04

Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human telomere repeat binding factors, TRF1 and TRF2, function as part of the 6 component shelterin complex. TRF2 binds DNA and recruits RAP1 (via binding to the RAP1 protein c-terminal (RCT)) and TIN2 in the protection of telomeres from DNA repair machinery. Metazoan shelterin consists of 3 DNA binding proteins (TRF2, TRF1, and POT1) and 3 recruited proteins that bind to one or more of these DNA-binding proteins (RAP1, TIN2, TPP1). Schizosaccharomyces pombe TAZ1 is an orthlog and binds RAP1. Human TRF1 and TRF2 bind double-stranded DNA. hTRF2 consists of a basic N-terminus, a TRF homology domain, the RAP1 binding motif (RBM), the TIN2 binding motif (TBM) and a myb-like DNA binding domain, SANT, named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. The single myb-like domain of TRF-type proteins is similar to the tandem myb_like domains found in yeast RAP1.


Pssm-ID: 212558 [Multi-domain]  Cd Length: 50  Bit Score: 41.01  E-value: 1.33e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 2022781840  404 WAPEEDAKLLQAVAKYGEQDWFKIREE---VPGRSDAQCRDRY 443
Cdd:cd11660      3 WTDEEDEALVEGVEKYGVGNWAKILKDyffVNNRTSVDLKDKW 45
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
262-305 1.40e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 41.14  E-value: 1.40e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 2022781840  262 DWEKISNInfEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERL 305
Cdd:pfam13921   19 DWKQIAKE--LGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
292-411 2.99e-04

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 45.16  E-value: 2.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  292 INKQEWSREEEERLQAIAAAHGHLEWQKIAEELgTSRSAFQC-LQKFQQHNKALKRKEWTEEEDRMLTQLVQEMrvGSHI 370
Cdd:COG5147     18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLL-ISSTGKQSsNRWNNHLNPQLKKKNWSEEEDEQLIDLDKEL--GTQW 94
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 2022781840  371 pyRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAK 411
Cdd:COG5147     95 --STIADYKDRRTAQQCVERYVNTLEDLSSTHDSKLQRRNE 133
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
233-375 3.43e-04

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 45.16  E-value: 3.43e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  233 KQGREAEKEiQDINQLPE-----EALLGNRLDSHDWEKISNINFE----GSRSAEEIRKFWQNSEHPSINKQEWSREEEE 303
Cdd:COG5147    222 KKGETLALE-QEINEYKEkkglsRKQFCERIWSTDRDEDKFWPNIykklPYRDKKSIYKHLRRKYNIFEQRGKWTKEEEQ 300
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2022781840  304 RLQAIAAAHGHLeWQKIAEELGTSRSafQCLQKFQQHNK---ALKRKEWTEEEDRMLTQLVQEMRVGSHiPYRRI 375
Cdd:COG5147    301 ELAKLVVEHGGS-WTEIGKLLGRMPN--DCRDRWRDYVKcgdTLKRNRWSIEEEELLDKVVNEMRLEAQ-QSSRI 371
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
764-1291 5.93e-04

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 44.48  E-value: 5.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  764 TLFTQLFHIDTAGCLEVVRERKALPPRLP----QAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLA 839
Cdd:COG3321    839 QLWVAGVPVDWSALYPGRGRRRVPLPTYPfqreDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAA 918
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  840 SSRVERTLPQASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTV 919
Cdd:COG3321    919 LALAAAALAALLALVALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAA 998
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  920 LNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQ 999
Cdd:COG3321    999 AAALALLAAAALLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELA 1078
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1000 APAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAG 1079
Cdd:COG3321   1079 LAAAALALAAALAAAALALALAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAA 1158
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1080 LLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEP 1159
Cdd:COG3321   1159 LAAALAAALLAAAALLLALALALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLAL 1238
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1160 RTSSHADPPEAE-PPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGPEK 1238
Cdd:COG3321   1239 AAAAAAVAALAAaAAALLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAA 1318
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2022781840 1239 GALDLGLLSQEGEAATQQWLGGQRGVRVPLLGSRLPYQPPALCSLRALSGLLL 1291
Cdd:COG3321   1319 AALAAALLAAALAALAAAVAAALALAAAAAAAAAAAAAAAAAAALAAAAGAAA 1371
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
938-1207 6.02e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.39  E-value: 6.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  938 SGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGlgqSQAPAASRKQGLPEAPPflP 1017
Cdd:PHA03307    37 SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLS---TLAPASPAREGSPTPPG--P 111
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1018 AAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGllpVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGP 1097
Cdd:PHA03307   112 SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAA---SPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSS 188
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1098 RAPALSSSWQPPANMNREPEP------SCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVARE----IPEPRTSSHADP 1167
Cdd:PHA03307   189 PPAEPPPSTPPAAASPRPPRRsspisaSASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPEnecpLPRPAPITLPTR 268
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 2022781840 1168 PEAEPPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGPL 1207
Cdd:PHA03307   269 IWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPA 308
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
978-1279 9.65e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.82  E-value: 9.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  978 APALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPflPAAPSPTPLPVQplslthigGPHVATSVPLPVTWVLTAQGL 1057
Cdd:PRK07764   385 LGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAA--APAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAP 454
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1058 LPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPAlSSSWQPPANMNREPEPS-----------CRTDTPA 1126
Cdd:PRK07764   455 SPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA-APAAPAGADDAATLRERwpeilaavpkrSRKTWAI 533
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1127 PPTHAlsqSPAEADGSV---AFV----------PGEAQVAREIPEPRT-------------SSHADPPEAEPPWSGRLPA 1180
Cdd:PRK07764   534 LLPEA---TVLGVRGDTlvlGFStgglarrfasPGNAEVLVTALAEELggdwqveavvgpaPGAAGGEGPPAPASSGPPE 610
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1181 FGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGPEKGALDLGLLSQEGEAATQQWLG- 1259
Cdd:PRK07764   611 EAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAp 690
                          330       340
                   ....*....|....*....|.
gi 2022781840 1260 -GQRGVRVPLLGSRLPYQPPA 1279
Cdd:PRK07764   691 aAPAGAAPAQPAPAPAATPPA 711
PLN03212 PLN03212
Transcription repressor MYB5; Provisional
398-502 1.08e-03

Transcription repressor MYB5; Provisional


Pssm-ID: 178751 [Multi-domain]  Cd Length: 249  Bit Score: 42.37  E-value: 1.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  398 GLKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVP-GRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGvGHW 476
Cdd:PLN03212    22 GMKRGPWTVEEDEILVSFIKKEGEGRWRSLPKRAGlLRCGKSCRLRWMNYLRPSVKRGGITSDEEDLILRLHRLLG-NRW 100
                           90       100
                   ....*....|....*....|....*.
gi 2022781840  477 AKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03212   101 SLIAGRIPGRTDNEIKNYWNTHLRKK 126
PHA03247 PHA03247
large tegument protein UL36; Provisional
914-1158 1.10e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  914 APGPTVLNVPLSGpGAPAAAKPGTSGSWQ-EAGTSAKDKRlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPE 992
Cdd:PHA03247   254 APAPPPVVGEGAD-RAPETARGATGPPPPpEAAAPNGAAA-------PPDGVWGAALAGAPLALPAPPDPPPPAPAGDAE 325
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  993 SGLGQSQAPAASRKQGLPEA--PPFLPAAPSPTPLPvqPLSLTHI-GGPHVATSVPLPVTWVLTA--------------- 1054
Cdd:PHA03247   326 EEDDEDGAMEVVSPLPRPRQhyPLGFPKRRRPTWTP--PSSLEDLsAGRHHPKRASLPTRKRRSArhaatpfargpggdd 403
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1055 QGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTEtraAQGPRAPALSSSWQPPANMNREPEPSCRTDT---------- 1124
Cdd:PHA03247   404 QTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAE---PGSDDGPAPPPERQPPAPATEPAPDDPDDATrkaldalrer 480
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 2022781840 1125 --PAPPTHALSQ----SPAEADGSVAFVPGEAQVAREIPE 1158
Cdd:PHA03247   481 rpPEPPGADLAEllgrHPDTAGTVVRLAAREAAIAREVAE 520
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
193-359 1.91e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.03  E-value: 1.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  193 RKSVVSDRLQRL---LQPKLLKLEYLHQKQSKVSSELERQALEKQGREAEKEIQdiNQLPEEALLGNRLD-SHDWEKISN 268
Cdd:TIGR00618  220 RKQVLEKELKHLreaLQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELR--AQEAVLEETQERINrARKAAPLAA 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  269 InfegSRSAEEIRKFWQNSeHPSINKQEWSReEEERLQAIAAAHGHLEWQKIAEELGTSRSafQCLQKFQQHNKALKRKE 348
Cdd:TIGR00618  298 H----IKAVTQIEQQAQRI-HTELQSKMRSR-AKLLMKRAAHVKQQSSIEEQRRLLQTLHS--QEIHIRDAHEVATSIRE 369
                          170
                   ....*....|....*
gi 2022781840  349 ----WTEEEDRMLTQ 359
Cdd:TIGR00618  370 iscqQHTLTQHIHTL 384
PHA03379 PHA03379
EBNA-3A; Provisional
854-1206 2.43e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 42.35  E-value: 2.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  854 ASTGPRPkPKTVSELLQEKRLQEARA-REATRGPV-VLPSQL-LVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAP 930
Cdd:PHA03379   573 APWTPNP-PRSPSQMSVRDRLARLRAeAQPYQASVeVQPPQLtQVSPQQPMEYPLEPEQQMFPGSPFSQVADVMRAGGVP 651
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  931 AAAKPGTSGSWQEAGTsakdkrlstmQALPLAPVFSEAEGTAPAASQAPALgpgqisvscPESGLGQSQAPAASRKQGLP 1010
Cdd:PHA03379   652 AMQPQYFDLPLQQPIS----------QGAPLAPLRASMGPVPPVPATQPQY---------FDIPLTEPINQGASAAHFLP 712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1011 EAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGllpvpvPAVVSLPRPAgTPGP-AGLLATLLPPLT 1089
Cdd:PHA03379   713 QQPMEGPLVPERWMFQGATLSQSVRPGVAQSQYFDLPLTQPINHGA------PAAHFLHQPP-MEGPwVPEQWMFQGAPP 785
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1090 ETRAAQGPRapALSSSWQPPANMNREPEPScrtdTPAPPTHALSQS----PAEADGSvafvpGEAQVAREIPEP-RTSSH 1164
Cdd:PHA03379   786 SQGTDVVQH--QLDALGYVLHVLNHPGVPV----SPAVNQYHVSQAafglPIDEDES-----GEGSDTSEPCEAlDLSIH 854
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|..
gi 2022781840 1165 ADPPEAEPPWSGRLPafgGVIPATEPRGTpgSPSGTQEPRGP 1206
Cdd:PHA03379   855 GRPCPQAPEWPVQGE---GGQDATEVLDL--SIHGRPRPRTP 891
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
926-1206 2.71e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.14  E-value: 2.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  926 GPGAPAAAKPGtsgswqeagtsakdkrlstmqALPlapvfseaegtAPAASQAPALGPGQISVSCPESGlgqsQAPAASR 1005
Cdd:PRK07003   368 PGGGVPARVAG---------------------AVP-----------APGARAAAAVGASAVPAVTAVTG----AAGAALA 411
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1006 KQGLPEAPPFLPAAPSPTPLPVQplslthiggphVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTP--GPAGLLAT 1083
Cdd:PRK07003   412 PKAAAAAAATRAEAPPAAPAPPA-----------TADRGDDAADGDAPVPAKANARASADSRCDERDAQPpaDSGSASAP 480
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1084 LLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHA--LSQSPAEADGSVAFVPGEAQVAREI----- 1156
Cdd:PRK07003   481 ASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPapEARPPTPAAAAPAARAGGAAAALDVlrnag 560
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2022781840 1157 -----PEPRTSSHADPPEAEPPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGP 1206
Cdd:PRK07003   561 mrvssDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAAR 615
SANT_TRF cd11660
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human ...
470-499 4.30e-03

Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human telomere repeat binding factors, TRF1 and TRF2, function as part of the 6 component shelterin complex. TRF2 binds DNA and recruits RAP1 (via binding to the RAP1 protein c-terminal (RCT)) and TIN2 in the protection of telomeres from DNA repair machinery. Metazoan shelterin consists of 3 DNA binding proteins (TRF2, TRF1, and POT1) and 3 recruited proteins that bind to one or more of these DNA-binding proteins (RAP1, TIN2, TPP1). Schizosaccharomyces pombe TAZ1 is an orthlog and binds RAP1. Human TRF1 and TRF2 bind double-stranded DNA. hTRF2 consists of a basic N-terminus, a TRF homology domain, the RAP1 binding motif (RBM), the TIN2 binding motif (TBM) and a myb-like DNA binding domain, SANT, named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. The single myb-like domain of TRF-type proteins is similar to the tandem myb_like domains found in yeast RAP1.


Pssm-ID: 212558 [Multi-domain]  Cd Length: 50  Bit Score: 36.78  E-value: 4.30e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 2022781840  470 KYGVGHWAKIASELP---HRSGSQCLSKWKIMM 499
Cdd:cd11660     17 KYGVGNWAKILKDYFfvnNRTSVDLKDKWRNLK 49
PRK10263 PRK10263
DNA translocase FtsK; Provisional
877-1204 6.70e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 6.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  877 ARAREATRGPVVLPSQLLVSSSVILQP-------PLPHTPHGRPAPGPTvlnvplSGPGAPAAAKPGTSGS--WQEagts 947
Cdd:PRK10263   330 TQSWAAPVEPVTQTPPVASVDVPPAQPtvawqpvPGPQTGEPVIAPAPE------GYPQQSQYAQPAVQYNepLQQ---- 399
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  948 akdkrlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPflPAAPSPTPLPV 1027
Cdd:PRK10263   400 ------------PVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQS--TFAPQSTYQTE 465
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1028 QPlslthiggphvatsVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRapaLSSSWQ 1107
Cdd:PRK10263   466 QT--------------YQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQ---LAAWYQ 528
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1108 PPANMNREPEPSCRTdtpAPPTHALSQSPAEAdgsvafVPGEAQVAREIPEPRTSSHADPPEAEPPWSgrlPAFGGVipa 1187
Cdd:PRK10263   529 PIPEPVKEPEPIKSS---LKAPSVAAVPPVEA------AAAVSPLASGVKKATLATGAAATVAAPVFS---LANSGG--- 593
                          330
                   ....*....|....*..
gi 2022781840 1188 tePRGTPGSPSGTQEPR 1204
Cdd:PRK10263   594 --PRPQVKEGIGPQLPR 608
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
911-1033 8.78e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.47  E-value: 8.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840  911 GRPAPGPTVLNVPLSGPGAPAAAKPGTSGSwQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSC 990
Cdd:PRK14951   369 AAEAAAPAEKKTPARPEAAAPAAAPVAQAA-AAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVAL 447
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 2022781840  991 PESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLT 1033
Cdd:PRK14951   448 APAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1077-1208 9.89e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.47  E-value: 9.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2022781840 1077 PAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREI 1156
Cdd:PRK14951   366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAV 445
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2022781840 1157 PEPRTSSHADPPEAEPPWSGRLPAFGGVIPATEPrgtPGSPSGTQEPRGPLG 1208
Cdd:PRK14951   446 ALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAP---AAAPAAARLTPTEEG 494
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH