Conserved domains on  [gi|1622920921|ref|XP_014988452|]

protein SON isoform X4 [Macaca mulatta]

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
159-460 1.30e-07

The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.26  E-value: 1.30e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  159 DSEPSAMALELPTRAFGLSETNESPAVVLEPpvvsveVPEPHILETLKPATKTAELSVASTSVISEQSEQSV-AVTPEPS 237
Cdd:PHA03247  2709 EPAPHALVSATPLPPGPAAARQASPALPAAP------APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAApAAGPPRR 2782
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  238 MTKILDSFAAAPVPTTTVVLKSSEPVVTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAK-VLEPSETLVVSSETPTEV 316
Cdd:PHA03247  2783 LTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPgPPPPSLPLGGSVAPGGDV 2862
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  317 YPEPSTSTTMDFPESSA-IEALRLPEQPVDVPSEiadsSMTRPQELPELPKTTalelqessvasamELPGPPATSMPELQ 395
Cdd:PHA03247  2863 RRRPPSRSPAAKPAAPArPPVRRLARPAVSRSTE----SFALPPDQPERPPQP-------------QAPPPPQPQPQPPP 2925
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622920921  396 GPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAPS 460
Cdd:PHA03247  2926 PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
PHA03379 super family cl33730
EBNA-3A; Provisional
340-673 1.56e-06

The actual alignment was detected with superfamily member PHA03379:

Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 53.52  E-value: 1.56e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  340 PEQPVDVPSeiadssmtrpqelPELPKttalELQESSVASAMELPGPPATSMPElQGPPVTPVPELPGPSA----TPVPE 415
Cdd:PHA03379   416 PRPPVEKPR-------------PEVPQ----SLETATSHGSAQVPEPPPVHDLE-PGPLHDQHSMAPCPVAqlppGPLQD 477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  416 L-PGPLSTPVPELPGPPATAVPELPGPSVTP-VPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVT-AAVELP 492
Cdd:PHA03379   478 LePGDQLPGVVQDGRPACAPVPAPAGPIVRPwEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALERPVCPAPPlIAMQGP 557
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  493 EQPA--VTVAMELTEQPVTTTELEQPVGMTTVEHP--GHPE--VTTATGLLGQPEATMV-----LELPGQPVATTaleLP 561
Cdd:PHA03379   558 GETSgiVRVRERWRPAPWTPNPPRSPSQMSVRDRLarLRAEaqPYQASVEVQPPQLTQVspqqpMEYPLEPEQQM---FP 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  562 GQP--SVTGVPELPGLPS---ATRALELSgQPVATGAlelPGPLMAAGALEFS--GQSGAAGALELLGQPLATGVLE--- 631
Cdd:PHA03379   635 GSPfsQVADVMRAGGVPAmqpQYFDLPLQ-QPISQGA---PLAPLRASMGPVPpvPATQPQYFDIPLTEPINQGASAahf 710
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1622920921  632 LPGQPGAPEL--PGQPVATVALEISVQSVVTTSELSTMTVSQSL 673
Cdd:PHA03379   711 LPQQPMEGPLvpERWMFQGATLSQSVRPGVAQSQYFDLPLTQPI 754
rne super family cl35953
ribonuclease E; Reviewed
1220-1381 8.65e-05

The actual alignment was detected with superfamily member PRK10811:

Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 47.73  E-value: 8.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1220 ISEPSAVPTDYSMSASDPSVLVSEATVTVPEPPPEpessiTSTPVESAVVAEEHEVVPERPVtcmVSETPTVSAEPTVVA 1299
Cdd:PRK10811   843 IRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAA-----AVEPVVSAPVVEAVAEVVEEPV---VVAEPQPEEVVVVET 914
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1300 SEPPVLSETA-ETFESMRASGYVASEVSTSLLEPAVTTPVLAESILEPPDMAVPESSAMAVLESSAVTVLESSTVTVLES 1378
Cdd:PRK10811   915 THPEVIAAPVtEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETV 994

                   ...
gi 1622920921 1379 STV 1381
Cdd:PRK10811   995 TAV 997
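
Each hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ..." and an interval such as 159-460. The following is a minimal sketch, not part of the NCBI output or any NCBI tool, of how those fields could be pulled out of the text for downstream filtering. The names `HitSummary`, `parse_summary`, and `interval_length` are hypothetical, and the regular expression assumes the field order and spelling shown on this page.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    """Fields from one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the layout seen on this page, e.g.
# "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.26  E-value: 1.30e-07"
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if it does not match."""
    match = _SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return HitSummary(
        pssm_id=int(match["pssm"]),
        multi_domain=match["multi"] is not None,
        cd_length=int(match["length"]),
        bit_score=float(match["bits"]),
        e_value=float(match["evalue"]),
    )


def interval_length(interval: str) -> int:
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1
```

Applied to the first hit, `parse_summary` would read Pssm-ID 223021 with a model length of 3151, and `interval_length("159-460")` gives 302 query residues.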
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
159-460 1.30e-07

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.26  E-value: 1.30e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  159 DSEPSAMALELPTRAFGLSETNESPAVVLEPpvvsveVPEPHILETLKPATKTAELSVASTSVISEQSEQSV-AVTPEPS 237
Cdd:PHA03247  2709 EPAPHALVSATPLPPGPAAARQASPALPAAP------APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAApAAGPPRR 2782
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  238 MTKILDSFAAAPVPTTTVVLKSSEPVVTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAK-VLEPSETLVVSSETPTEV 316
Cdd:PHA03247  2783 LTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPgPPPPSLPLGGSVAPGGDV 2862
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  317 YPEPSTSTTMDFPESSA-IEALRLPEQPVDVPSEiadsSMTRPQELPELPKTTalelqessvasamELPGPPATSMPELQ 395
Cdd:PHA03247  2863 RRRPPSRSPAAKPAAPArPPVRRLARPAVSRSTE----SFALPPDQPERPPQP-------------QAPPPPQPQPQPPP 2925
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622920921  396 GPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAPS 460
Cdd:PHA03247  2926 PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
188-493 3.00e-07

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 55.55  E-value: 3.00e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  188 EPPVVSVEVPEPHILETLKPATKTAELSVASTSVISEQSEQSVAVTPE--PSMTKILDSFaaapvptTTVVLKSSEPVVT 265
Cdd:NF033839   158 KPETPQPENPEHQKPTTPAPDTKPSPQPEGKKPSVPDINQEKEKAKLAvaTYMSKILDDI-------QKHHLQKEKHRQI 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  266 MSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAKVLEPSETLV------VSSETPTE-VYPEPSTSTT--MDFPESSAIEA 336
Cdd:NF033839   231 VALIKELDELKKQALSEIDNVNTKVEIENTVHKIFADMDAVVtkfkkgLTQDTPKEpGNKKPSAPKPgmQPSPQPEKKEV 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  337 LRLPEQPVDVPSEIADSSMTRPQELPELPKTTA---LELQESSVASAMELPGPPATSMPELQGPPVTPVPELPGPSATPV 413
Cdd:NF033839   311 KPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVkpqLETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQ 390
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  414 PELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQE--LPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVEL 491
Cdd:NF033839   391 PEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPevKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEK 470

                   ..
gi 1622920921  492 PE 493
Cdd:NF033839   471 PK 472
PHA03379 PHA03379
EBNA-3A; Provisional
340-673 1.56e-06

Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 53.52  E-value: 1.56e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  340 PEQPVDVPSeiadssmtrpqelPELPKttalELQESSVASAMELPGPPATSMPElQGPPVTPVPELPGPSA----TPVPE 415
Cdd:PHA03379   416 PRPPVEKPR-------------PEVPQ----SLETATSHGSAQVPEPPPVHDLE-PGPLHDQHSMAPCPVAqlppGPLQD 477
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  416 L-PGPLSTPVPELPGPPATAVPELPGPSVTP-VPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVT-AAVELP 492
Cdd:PHA03379   478 LePGDQLPGVVQDGRPACAPVPAPAGPIVRPwEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALERPVCPAPPlIAMQGP 557
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  493 EQPA--VTVAMELTEQPVTTTELEQPVGMTTVEHP--GHPE--VTTATGLLGQPEATMV-----LELPGQPVATTaleLP 561
Cdd:PHA03379   558 GETSgiVRVRERWRPAPWTPNPPRSPSQMSVRDRLarLRAEaqPYQASVEVQPPQLTQVspqqpMEYPLEPEQQM---FP 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  562 GQP--SVTGVPELPGLPS---ATRALELSgQPVATGAlelPGPLMAAGALEFS--GQSGAAGALELLGQPLATGVLE--- 631
Cdd:PHA03379   635 GSPfsQVADVMRAGGVPAmqpQYFDLPLQ-QPISQGA---PLAPLRASMGPVPpvPATQPQYFDIPLTEPINQGASAahf 710
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1622920921  632 LPGQPGAPEL--PGQPVATVALEISVQSVVTTSELSTMTVSQSL 673
Cdd:PHA03379   711 LPQQPMEGPLvpERWMFQGATLSQSVRPGVAQSQYFDLPLTQPI 754
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
82-579 3.48e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 49.00  E-value: 3.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921   82 DLKEASRKSRCVSV--QTDPTDEIPTKKSKKHKKHKNKKKKKKKEKEKKYKRQPEESE----AKTKSHHDGNIDLESDSF 155
Cdd:pfam03154   38 DLRSSGRNSPSAAStsSNDSKAESMKKSSKKIKEEAPSPLKSAKRQREKGASDTEEPErataKKSKTQEISRPNSPSEGE 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  156 LKfDSEPSAMALELPTRAFGLSETNESpavvleppvVSVEVPEPHILETlkPATKTAELSVASTSVISEQSEQSVAVTPE 235
Cdd:pfam03154  118 GE-SSDGRSVNDEGSSDPKDIDQDNRS---------TSPSIPSPQDNES--DSDSSAQQQILQTQPPVLQAQSGAASPPS 185
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  236 PSMTKILDSFAAAPVPTTTVVLKSSEPVVTMSVEYQMKSV--LKSVEST-SPEPSKIMLVEPPVAKVLEPSEtlvvSSET 312
Cdd:pfam03154  186 PPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAapHTLIQQTpTLHPQRLPSPHPPLQPMTQPPP----PSQV 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  313 PTEVYPEPSTSTTMDfPESSAIEA------LRLPEQPVDVPSEIADS------SMTRPQELPELPKTTALELQESSVASA 380
Cdd:pfam03154  262 SPQPLPQPSLHGQMP-PMPHSLQTgpshmqHPVPPQPFPLTPQSSQSqvppgpSPAAPGQSQQRIHTPPSQSQLQSQQPP 340
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  381 MELPGPPA-TSMPELQGPPVTPVPELPGPSATPVP---ELPGPLSTPvPELPGPPA----TAVPELPGPSVTPVP-QL-- 449
Cdd:pfam03154  341 REQPLPPApLSMPHIKPPPTTPIPQLPNPQSHKHPphlSGPSPFQMN-SNLPPPPAlkplSSLSTHHPPSAHPPPlQLmp 419
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  450 -SQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPvTTTELEQPVGMTTVEHPGHP 528
Cdd:pfam03154  420 qSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGIQPPSSA 498
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1622920921  529 EVTTATGLLGQPEATmvleLPGQPVATTALELPGQPSVTGVPELPGLPSAT 579
Cdd:pfam03154  499 SVSSSGPVPAAVSCP----LPPVQIKEEALDEAEEPESPPPPPRSPSPEPT 545
rne PRK10811
ribonuclease E; Reviewed
1220-1381 8.65e-05

Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 47.73  E-value: 8.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1220 ISEPSAVPTDYSMSASDPSVLVSEATVTVPEPPPEpessiTSTPVESAVVAEEHEVVPERPVtcmVSETPTVSAEPTVVA 1299
Cdd:PRK10811   843 IRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAA-----AVEPVVSAPVVEAVAEVVEEPV---VVAEPQPEEVVVVET 914
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1300 SEPPVLSETA-ETFESMRASGYVASEVSTSLLEPAVTTPVLAESILEPPDMAVPESSAMAVLESSAVTVLESSTVTVLES 1378
Cdd:PRK10811   915 THPEVIAAPVtEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETV 994

                   ...
gi 1622920921 1379 STV 1381
Cdd:PRK10811   995 TAV 997
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
249-430 1.78e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.22  E-value: 1.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  249 PVPTTTVVLKSSEPVVTMSVEYQMKSVLKSVESTSPE-PSKIMLVEPPVAKVLE-PSETLVVSSETPT-EVYPEPSTSTT 325
Cdd:NF033839   306 EKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEvKPQLETPKPEVKPQPEkPKPEVKPQPEKPKpEVKPQPETPKP 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  326 MDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKTTalelqeSSVASAMELPGPPATSMPELQGPPVTPVPEL 405
Cdd:NF033839   386 EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPK------PEVKPQPEKPKPEVKPQPEKPKPEVKPQPET 459
                          170       180
                   ....*....|....*....|....*
gi 1622920921  406 PGPSATPVPELPGPLSTPVPELPGP 430
Cdd:NF033839   460 PKPEVKPQPEKPKPEVKPQPEKPKP 484
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
312-524 5.92e-03

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. PUF60 (GP|6176532), in complex with p54 and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 41.59  E-value: 5.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  312 TPTEVYPEPSTSTTMdfPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKttalelqeSSVASAMELPG--PPAT 389
Cdd:TIGR01645  283 TPPDALLQPATVSAI--PAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATPS--------SSLPTDIGNKAvvSSAK 352
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  390 SMPELQG--PPVTPVPELPGPSATPVPELPGPLstPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPA------PSM 461
Cdd:TIGR01645  353 KEAEEVPplPQAAPAVVKPGPMEIPTPVPPPGL--AIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVtfgaldDTL 430
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622920921  462 GLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPVTTTElEQPVGMTTVEH 524
Cdd:TIGR01645  431 AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMN-SEDASLASQEG 492
 
PHA03247 PHA03247
large tegument protein UL36; Provisional
318-642 5.48e-06

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 5.48e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  318 PEPSTSTTMDFPESSAIEALRLPEQPVDVPS---------EIADSSMTRPQELPELPKTTALELQESSVASAMELPGPPA 388
Cdd:PHA03247  2627 PPPSPSPAANEPDPHPPPTVPPPERPRDDPApgrvsrprrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPP 2706
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  389 TSMPElqgppvtPVPELPGPSATPVPELPGPLSTPVPELPGPPATavpelPGPSVTPVPQLSQELPGLPA---------- 458
Cdd:PHA03247  2707 TPEPA-------PHALVSATPLPPGPAAARQASPALPAAPAPPAV-----PAGPATPGGPARPARPPTTAgppapappaa 2774
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  459 PSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQ-PAVTVAMELTEQPVTtteleqPVGMTTVEHPGHPevTTATGLL 537
Cdd:PHA03247  2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAG------PLPPPTSAQPTAP--PPPPGPP 2846
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  538 GQPEATMVLELPGQPVATTAlelPGQPSVTgVPELPGLPSATR----ALELSGQPVATGALELPGPLMAAGALEFSGQSg 613
Cdd:PHA03247  2847 PPSLPLGGSVAPGGDVRRRP---PSRSPAA-KPAAPARPPVRRlarpAVSRSTESFALPPDQPERPPQPQAPPPPQPQP- 2921
                          330       340
                   ....*....|....*....|....*....
gi 1622920921  614 aagalELLGQPLATGVLELPGQPGAPELP 642
Cdd:PHA03247  2922 -----QPPPPPQPQPPPPPPPRPQPPLAP 2945
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
82-579 3.48e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 49.00  E-value: 3.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921   82 DLKEASRKSRCVSV--QTDPTDEIPTKKSKKHKKHKNKKKKKKKEKEKKYKRQPEESE----AKTKSHHDGNIDLESDSF 155
Cdd:pfam03154   38 DLRSSGRNSPSAAStsSNDSKAESMKKSSKKIKEEAPSPLKSAKRQREKGASDTEEPErataKKSKTQEISRPNSPSEGE 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  156 LKfDSEPSAMALELPTRAFGLSETNESpavvleppvVSVEVPEPHILETlkPATKTAELSVASTSVISEQSEQSVAVTPE 235
Cdd:pfam03154  118 GE-SSDGRSVNDEGSSDPKDIDQDNRS---------TSPSIPSPQDNES--DSDSSAQQQILQTQPPVLQAQSGAASPPS 185
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  236 PSMTKILDSFAAAPVPTTTVVLKSSEPVVTMSVEYQMKSV--LKSVEST-SPEPSKIMLVEPPVAKVLEPSEtlvvSSET 312
Cdd:pfam03154  186 PPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAapHTLIQQTpTLHPQRLPSPHPPLQPMTQPPP----PSQV 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  313 PTEVYPEPSTSTTMDfPESSAIEA------LRLPEQPVDVPSEIADS------SMTRPQELPELPKTTALELQESSVASA 380
Cdd:pfam03154  262 SPQPLPQPSLHGQMP-PMPHSLQTgpshmqHPVPPQPFPLTPQSSQSqvppgpSPAAPGQSQQRIHTPPSQSQLQSQQPP 340
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  381 MELPGPPA-TSMPELQGPPVTPVPELPGPSATPVP---ELPGPLSTPvPELPGPPA----TAVPELPGPSVTPVP-QL-- 449
Cdd:pfam03154  341 REQPLPPApLSMPHIKPPPTTPIPQLPNPQSHKHPphlSGPSPFQMN-SNLPPPPAlkplSSLSTHHPPSAHPPPlQLmp 419
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  450 -SQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPvTTTELEQPVGMTTVEHPGHP 528
Cdd:pfam03154  420 qSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGIQPPSSA 498
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1622920921  529 EVTTATGLLGQPEATmvleLPGQPVATTALELPGQPSVTGVPELPGLPSAT 579
Cdd:pfam03154  499 SVSSSGPVPAAVSCP----LPPVQIKEEALDEAEEPESPPPPPRSPSPEPT 545
rne PRK10811
ribonuclease E; Reviewed
1220-1381 8.65e-05

Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 47.73  E-value: 8.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1220 ISEPSAVPTDYSMSASDPSVLVSEATVTVPEPPPEpessiTSTPVESAVVAEEHEVVPERPVtcmVSETPTVSAEPTVVA 1299
Cdd:PRK10811   843 IRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAA-----AVEPVVSAPVVEAVAEVVEEPV---VVAEPQPEEVVVVET 914
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1300 SEPPVLSETA-ETFESMRASGYVASEVSTSLLEPAVTTPVLAESILEPPDMAVPESSAMAVLESSAVTVLESSTVTVLES 1378
Cdd:PRK10811   915 THPEVIAAPVtEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETV 994

                   ...
gi 1622920921 1379 STV 1381
Cdd:PRK10811   995 TAV 997
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
387-558 1.77e-04

Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 46.78  E-value: 1.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  387 PATSMPELQGPPVTPVPeLPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAPSMGLEPP 466
Cdd:PRK07994   361 PAAPLPEPEVPPQSAAP-AASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAK 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  467 QEVPEPPVMAQELPGL-----PLVTAAVELPEQPAVTVAMELTEQPVTTTELEQPVGMT----TVEHPGHPEVTTATGLL 537
Cdd:PRK07994   440 KSEPAAASRARPVNSAlerlaSVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKalkkALEHEKTPELAAKLAAE 519
                          170       180
                   ....*....|....*....|....*.
gi 1622920921  538 GQPE---ATMV--LELPGqPVATTAL 558
Cdd:PRK07994   520 AIERdpwAALVsqLGLPG-LVEQLAL 544
rne PRK10811
ribonuclease E; Reviewed
1196-1509 2.06e-04

Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 46.57  E-value: 2.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1196 PAEVPSLPSEESVSQPEPPVSQSEISEP--------------SAVPTDYSMSASDPSVLVSEATVTVPEPPPEPESSITS 1261
Cdd:PRK10811   691 QQEAKALNVEEQSVQETEQEERVQQVQPrrkqrqlnqkvrieQSVAEEAVAPVVEETVAAEPVVQEVPAPRTELVKVPLP 770
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1262 TPVESAVVAEEH--------EVVPER----PVTCMVS----------ETPTVSAEPTVVA-------------SEPPVLS 1306
Cdd:PRK10811   771 VVAQTAPEQDEEnnaenrdnNGMPRRsrrsPRHLRVSgqrrrryrdeRYPTQSPMPLTVAcaspemasgkvwiRYPVVRP 850
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1307 ETAETFESMRASGYVASEVSTSLLEPAVTTPVLAESILEPPDMAVPESSAMAVLESSAVTVLESSTVTVLESSTVTvlep 1386
Cdd:PRK10811   851 QDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPVTE---- 926
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1387 svvtvpePPVVAEPDYITIPVPVVSVLEPSVPVLEPAVSVLQPS----MIVSEPSVSVQESTVTVSEPAVTVSEQTQVIP 1462
Cdd:PRK10811   927 -------QPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAetaeVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEP 999
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 1622920921 1463 TEVAIESTPMILESSIMSSHVMKGinlPsgdqnlAPEIgMPEIPLHS 1509
Cdd:PRK10811  1000 EVAPAQVPEATVEHNHATAPMTRA---P------APEY-VPEAPRHS 1036
PHA03247 PHA03247
large tegument protein UL36; Provisional
233-525 2.90e-04

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 2.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  233 TPEPSMTKILdsfAAAPVPTTTVVLKSSEPVVTMSveyqmksvlksveSTSPEPSKIMLVEPPVAKVLEPSETLVVSSET 312
Cdd:PHA03247  2707 TPEPAPHALV---SATPLPPGPAAARQASPALPAA-------------PAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  313 PTEV---YPEPSTSTTMDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQEL----PELPKTTALElqessvASAMELPG 385
Cdd:PHA03247  2771 PPAApaaGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAspagPLPPPTSAQP------TAPPPPPG 2844
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  386 PPATSMP-----------ELQGPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELP 454
Cdd:PHA03247  2845 PPPPSLPlggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP 2924
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1622920921  455 GLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPA-----VTVAMELTEQPVTTTELEQPVGMTTVEHP 525
Cdd:PHA03247  2925 PPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAlvpgrVAVPRFRVPQPAPSREAPASSTPPLTGHS 3000
PHA03378 PHA03378
EBNA-3B; Provisional
176-480 5.02e-04

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.44  E-value: 5.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  176 LSETNESPAVVLEPPVVSVEVPEPhILETLKPATKTAELSVASTSViseqseqsvavtpEPSMTKILDSFAAAPVPTTTV 255
Cdd:PHA03378   481 LPHPQVTPVILHQPPAQGVQAHGS-MLDLLEKDDEDMEQRVMATLL-------------PPSPPQPRAGRRAPCVYTEDL 546
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  256 VLKSSEPVVTMSVEYQMKSV-----LKSVESTSPEPSKIMLVEPPVAKVLEPSETLVVSSETPTEVYPEPSTSTTMDFPE 330
Cdd:PHA03378   547 DIESDEPASTEPVHDQLLPApglgpLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPM 626
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  331 SSAIEALRL---------------PEQPVDVPSEIADSSMTRPQELPELPKTT--ALELQESSVASAMELPGPPATSMPE 393
Cdd:PHA03378   627 PLRPIPMRPlrmqpitfnvlvfptPHQPPQVEITPYKPTWTQIGHIPYQPSPTgaNTMLPIQWAPGTMQPPPRAPTPMRP 706
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  394 LQGPPV-------TPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSqelPGLPAPsmglepp 466
Cdd:PHA03378   707 PAAPPGraqrpaaATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA---PGAPTP------- 776
                          330
                   ....*....|....
gi 1622920921  467 QEVPEPPVMAQELP 480
Cdd:PHA03378   777 QPPPQAPPAPQQRP 790
rne PRK10811
ribonuclease E; Reviewed
1180-1345 6.39e-04

Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 45.03  E-value: 6.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1180 PALPTEQSALTAENTWPAEVpslpSEESVSQPEPPVSQSEISEPSAVPtdysMSASDPSVL---VSEATVTVPEpppepe 1256
Cdd:PRK10811   868 PVVAEVPVAAAVEPVVSAPV----VEAVAEVVEEPVVVAEPQPEEVVV----VETTHPEVIaapVTEQPQVITE------ 933
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1257 ssiTSTPVESAVVAEEHEVVPERPVTcmVSETPTVSAEPTVVAsEPPVLSETAETFESMRAsgyvASEVSTSLLEPAVTT 1336
Cdd:PRK10811   934 ---SDVAVAQEVAEHAEPVVEPQDET--ADIEEAAETAEVVVA-EPEVVAQPAAPVVAEVA----AEVETVTAVEPEVAP 1003

                   ....*....
gi 1622920921 1337 PVLAESILE 1345
Cdd:PRK10811  1004 AQVPEATVE 1012
DUF3729 pfam12526
Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins ...
369-452 9.05e-04

Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins in this family are typically between 145 and 1707 amino acids in length. The family is found in association with pfam01443, pfam01661, pfam05417, pfam01660, pfam00978. There is a single completely conserved residue L that may be functionally important.


Pssm-ID: 372164 [Multi-domain]  Cd Length: 115  Bit Score: 40.83  E-value: 9.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  369 ALELQESSVASAMELPGPPATSMPelqgPPVTPVPELPGPSATPVPELPGPlsTPVPELPGPPATAVPELPGPSVTPVPQ 448
Cdd:pfam12526   31 PPESAHPDPPPPVGDPRPPVVDTP----PPVSAVWVLPPPSEPAAPEPDLV--PPVTGPAGPPSPLAPPAPAQKPPLPPP 104

                   ....
gi 1622920921  449 LSQE 452
Cdd:pfam12526  105 RPQR 108
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
319-677 1.21e-03

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.01  E-value: 1.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  319 EPSTSTTMDFPESSAIEALRLPeQPVDVPSEIADSSMTRPQEL---PELPKTTALELQESSVASAMELP-GPPATSMPEL 394
Cdd:PHA03307     1 SDNAPDLYDLIEAAAEGGEFFP-RPPATPGDAADDLLSGSQGQlvsDSAELAAVTVVAGAAACDRFEPPtGPPPGPGTEA 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  395 QGPPVTPVPELPGPSATPVPELPGPLSTP--VPELPGPPATAVPELPGPSvtPVPQLSQELPGLPAPSMGLEPPQEVPEP 472
Cdd:PHA03307    80 PANESRSTPTWSLSTLAPASPAREGSPTPpgPSSPDPPPPTPPPASPPPS--PAPDLSEMLRPVGSPGPPPAASPPAAGA 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  473 PVMAqelpglplVTAAVELPEQPAVTVAM-ELTEQPVTTTELEQPVGMTTVEHPGHPEVttatglLGQPEATMVLELPGQ 551
Cdd:PHA03307   158 SPAA--------VASDAASSRQAALPLSSpEETARAPSSPPAEPPPSTPPAAASPRPPR------RSSPISASASSPAPA 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  552 PVATTALELPGQPSVTGVPELPGLPSATRALELSGQPvatGALELPGPLMAA-----GALEFSGQSGAAGALELLGQPla 626
Cdd:PHA03307   224 PGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRP---APITLPTRIWEAsgwngPSSRPGPASSSSSPRERSPSP-- 298
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1622920921  627 tgvleLPGQPGAPELPGQPVAtVALEISVQSVVTTSELSTMTVSQSLEVPS 677
Cdd:PHA03307   299 -----SPSSPGSGPAPSSPRA-SSSSSSSRESSSSSTSSSSESSRGAAVSP 343
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
379-445 1.60e-03

Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 43.52  E-value: 1.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  379 SAMELPG--------PPATSMPELQGP-----PVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTP 445
Cdd:PRK14959   384 SAAEGPAsggaatipTPGTQGPQGTAPaagmtPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVP 463
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
249-430 1.78e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.22  E-value: 1.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  249 PVPTTTVVLKSSEPVVTMSVEYQMKSVLKSVESTSPE-PSKIMLVEPPVAKVLE-PSETLVVSSETPT-EVYPEPSTSTT 325
Cdd:NF033839   306 EKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEvKPQLETPKPEVKPQPEkPKPEVKPQPEKPKpEVKPQPETPKP 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  326 MDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKTTalelqeSSVASAMELPGPPATSMPELQGPPVTPVPEL 405
Cdd:NF033839   386 EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPK------PEVKPQPEKPKPEVKPQPEKPKPEVKPQPET 459
                          170       180
                   ....*....|....*....|....*
gi 1622920921  406 PGPSATPVPELPGPLSTPVPELPGP 430
Cdd:NF033839   460 PKPEVKPQPEKPKPEVKPQPEKPKP 484
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
332-520 3.07e-03

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.56  E-value: 3.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  332 SAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKTTALELQESSVASAMELPGP---PATSMPELQGPPVTPVPElPGP 408
Cdd:PRK12323   376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPealAAARQASARGPGGAPAPA-PAP 454
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  409 SATPVPELPGPLSTPVPelPGPPATAVPELPGPSVTPVPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAA 488
Cdd:PRK12323   455 AAAPAAAARPAAAGPRP--VAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATA 532
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1622920921  489 VELPEQPAVTVAMELTEQPVTTTELEQPVGMT 520
Cdd:PRK12323   533 DPDDAFETLAPAPAAAPAPRAAAATEPVVAPR 564
PHA03377 PHA03377
EBNA-3C; Provisional
159-445 3.82e-03

Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 42.35  E-value: 3.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  159 DSEPSAMALELPTRAFGLSETNESPAVVleppVVSVEVPEPHILETLKPATKTAELSVASTSVISEQSEQSVAVTPEPSM 238
Cdd:PHA03377   380 DVELESSDDELPYIDPNMEPVQQRPVMF----VSRVPWRKPRTLPWPTPKTHPVKRTLVKTSGRSDEAEQAQSTPERPGP 455
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  239 TKildsfaAAPVPTttvvlkssEPVVTMSVEYQMKSVLKSVESTSPEPskimlVEPPVAKVLEPSETLVVSSETPTEVyp 318
Cdd:PHA03377   456 SD------QPSVPV--------EPAHLTPVEHTTVILHQPPQSPPTVA-----IKPAPPPSRRRRGACVVYDDDIIEV-- 514
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  319 epststtMDFPESSAIEALRLPEQPVDVPSEIADSSmTRPQELPELPKTTALElqessvasamelPGPPATSmPELQGPP 398
Cdd:PHA03377   515 -------IDVETTEEEESVTQPAKPHRKVQDGFQRS-GRRQKRATPPKVSPSD------------RGPPKAS-PPVMAPP 573
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 1622920921  399 VTPVPELPGPSATPvPELPGPLSTP---VPELPGPPATAVPELPGPSVTP 445
Cdd:PHA03377   574 STGPRVMATPSTGP-RDMAPPSTGPrqqAKCKDGPPASGPHEKQPPSSAP 622
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
312-524 5.92e-03

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 41.59  E-value: 5.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  312 TPTEVYPEPSTSTTMdfPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKttalelqeSSVASAMELPG--PPAT 389
Cdd:TIGR01645  283 TPPDALLQPATVSAI--PAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATPS--------SSLPTDIGNKAvvSSAK 352
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  390 SMPELQG--PPVTPVPELPGPSATPVPELPGPLstPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPA------PSM 461
Cdd:TIGR01645  353 KEAEEVPplPQAAPAVVKPGPMEIPTPVPPPGL--AIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVtfgaldDTL 430
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622920921  462 GLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPVTTTElEQPVGMTTVEH 524
Cdd:TIGR01645  431 AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMN-SEDASLASQEG 492
PHA03379 PHA03379
EBNA-3A; Provisional
161-462 7.20e-03

Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 41.58  E-value: 7.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  161 EPSAMALELPTRAFGLSETNESPAVVLEPPVVSVEVPEP--HILETLKPAtktaeLSVASTSVISEQSEQSVAVTPEPSM 238
Cdd:PHA03379   463 APCPVAQLPPGPLQDLEPGDQLPGVVQDGRPACAPVPAPagPIVRPWEAS-----LSQVPGVAFAPVMPQPMPVEPVPVP 537
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  239 TKILDSfAAAPVPTTTVVLKSSEPvvTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAKV------------LEPSETL 306
Cdd:PHA03379   538 TVALER-PVCPAPPLIAMQGPGET--SGIVRVRERWRPAPWTPNPPRSPSQMSVRDRLARLraeaqpyqasveVQPPQLT 614
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  307 VVSSETPTEVYPEPSTSTTMDFPESSAIEALRLPEQPVDVPSEIaDSSMTRPQE--------------LPELPKTTALEL 372
Cdd:PHA03379   615 QVSPQQPMEYPLEPEQQMFPGSPFSQVADVMRAGGVPAMQPQYF-DLPLQQPISqgaplaplrasmgpVPPVPATQPQYF 693
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  373 Q-------ESSVASAMELPGPPATS--MPELQGPPVTPVPELPGPSATPVPELPGPLSTPVPE-LPGPPATAVPELPGPS 442
Cdd:PHA03379   694 DipltepiNQGASAAHFLPQQPMEGplVPERWMFQGATLSQSVRPGVAQSQYFDLPLTQPINHgAPAAHFLHQPPMEGPW 773
                          330       340
                   ....*....|....*....|
gi 1622920921  443 VtPVPQLSQELPGLPAPSMG 462
Cdd:PHA03379   774 V-PEQWMFQGAPPSQGTDVV 792
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
340-460 7.53e-03

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 41.24  E-value: 7.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  340 PEQPVDVPSEIADSSMTRPQELPELPKTTALELQESSVASAMELPGPPATSMPE-LQGPPVTPVPELPGPSATPVPELPG 418
Cdd:PRK14951   371 EAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPApVAAPAAAAPAAAPAAAPAAVALAPA 450
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1622920921  419 PLSTPVPELPGPPATAVPELPGPSVTPVPqlsqelPGLPAPS 460
Cdd:PRK14951   451 PPAQAAPETVAIPVRVAPEPAVASAAPAP------AAAPAAA 486
PHA02030 PHA02030
hypothetical protein
326-440 7.69e-03

Pssm-ID: 222843 [Multi-domain]  Cd Length: 336  Bit Score: 40.73  E-value: 7.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  326 MDFPeSSAIEALRLPEQPVDVPSEIADSSMTrpqeLPELPKTTAlelqessvaSAMELPGPPATSMPELQGPPVTPVPEL 405
Cdd:PHA02030   236 TDFP-GSALHILLGGGEDLIIKPKSKAAGSN----LPAVPNVAA---------DAGSAAAPAVPAAAAAVAQAAPSVPQV 301
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 1622920921  406 PGPSATPVPELPGPLSTP-VPELPGPPatAVPELPG 440
Cdd:PHA02030   302 PNVAVLPDVPQVAPVAAPaAPEVPAVP--VVPAAPQ 335
dnaA PRK14086
chromosomal replication initiator protein DnaA;
369-460 9.63e-03

Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 40.96  E-value: 9.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921  369 ALELQESSVASAMELPGPPATSMPELQGPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQ 448
Cdd:PRK14086    85 AITVDPSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADD 164
                           90
                   ....*....|..
gi 1622920921  449 LSQELPGLPAPS 460
Cdd:PRK14086   165 YGWQQQRLGFPP 176
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
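
The per-hit summary lines on this page (Pssm-ID, Cd Length, Bit Score, E-value) and the E-value threshold listed in the search parameters can be read programmatically. The following is a minimal Python sketch, assuming the summary-line layout shown on this page. The names `parse_summary` and `passes_threshold` are hypothetical, and treating a hit as reportable when its E-value is at or below the threshold is an assumption rather than documented CD-Search behavior.

```python
import re

# Matches the summary line printed above each alignment on this page, e.g.
# "Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 47.73  E-value: 8.65e-05"
# The layout is assumed from the page text, not from a published format specification.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\]\s*)?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),  # e.g. "Multi-domain"; None when the bracketed tag is absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumed rule: keep a hit whose E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 47.73  E-value: 8.65e-05"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

The default of 0.01 mirrors the "E-value threshold" value in the preset options above; every hit listed on this page has an E-value below it.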
