|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
159-460 |
1.30e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.26 E-value: 1.30e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 159 DSEPSAMALELPTRAFGLSETNESPAVVLEPpvvsveVPEPHILETLKPATKTAELSVASTSVISEQSEQSV-AVTPEPS 237
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPALPAAP------APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAApAAGPPRR 2782
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 238 MTKILDSFAAAPVPTTTVVLKSSEPVVTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAK-VLEPSETLVVSSETPTEV 316
Cdd:PHA03247 2783 LTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPgPPPPSLPLGGSVAPGGDV 2862
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 317 YPEPSTSTTMDFPESSA-IEALRLPEQPVDVPSEiadsSMTRPQELPELPKTTalelqessvasamELPGPPATSMPELQ 395
Cdd:PHA03247 2863 RRRPPSRSPAAKPAAPArPPVRRLARPAVSRSTE----SFALPPDQPERPPQP-------------QAPPPPQPQPQPPP 2925
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622920921 396 GPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAPS 460
Cdd:PHA03247 2926 PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
188-493 |
3.00e-07 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 55.55 E-value: 3.00e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 188 EPPVVSVEVPEPHILETLKPATKTAELSVASTSVISEQSEQSVAVTPE--PSMTKILDSFaaapvptTTVVLKSSEPVVT 265
Cdd:NF033839 158 KPETPQPENPEHQKPTTPAPDTKPSPQPEGKKPSVPDINQEKEKAKLAvaTYMSKILDDI-------QKHHLQKEKHRQI 230
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 266 MSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAKVLEPSETLV------VSSETPTE-VYPEPSTSTT--MDFPESSAIEA 336
Cdd:NF033839 231 VALIKELDELKKQALSEIDNVNTKVEIENTVHKIFADMDAVVtkfkkgLTQDTPKEpGNKKPSAPKPgmQPSPQPEKKEV 310
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 337 LRLPEQPVDVPSEIADSSMTRPQELPELPKTTA---LELQESSVASAMELPGPPATSMPELQGPPVTPVPELPGPSATPV 413
Cdd:NF033839 311 KPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVkpqLETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQ 390
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 414 PELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQE--LPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVEL 491
Cdd:NF033839 391 PEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPevKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEK 470
|
..
gi 1622920921 492 PE 493
Cdd:NF033839 471 PK 472
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
340-673 |
1.56e-06 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 53.52 E-value: 1.56e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 340 PEQPVDVPSeiadssmtrpqelPELPKttalELQESSVASAMELPGPPATSMPElQGPPVTPVPELPGPSA----TPVPE 415
Cdd:PHA03379 416 PRPPVEKPR-------------PEVPQ----SLETATSHGSAQVPEPPPVHDLE-PGPLHDQHSMAPCPVAqlppGPLQD 477
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 416 L-PGPLSTPVPELPGPPATAVPELPGPSVTP-VPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVT-AAVELP 492
Cdd:PHA03379 478 LePGDQLPGVVQDGRPACAPVPAPAGPIVRPwEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALERPVCPAPPlIAMQGP 557
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 493 EQPA--VTVAMELTEQPVTTTELEQPVGMTTVEHP--GHPE--VTTATGLLGQPEATMV-----LELPGQPVATTaleLP 561
Cdd:PHA03379 558 GETSgiVRVRERWRPAPWTPNPPRSPSQMSVRDRLarLRAEaqPYQASVEVQPPQLTQVspqqpMEYPLEPEQQM---FP 634
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 562 GQP--SVTGVPELPGLPS---ATRALELSgQPVATGAlelPGPLMAAGALEFS--GQSGAAGALELLGQPLATGVLE--- 631
Cdd:PHA03379 635 GSPfsQVADVMRAGGVPAmqpQYFDLPLQ-QPISQGA---PLAPLRASMGPVPpvPATQPQYFDIPLTEPINQGASAahf 710
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 1622920921 632 LPGQPGAPEL--PGQPVATVALEISVQSVVTTSELSTMTVSQSL 673
Cdd:PHA03379 711 LPQQPMEGPLvpERWMFQGATLSQSVRPGVAQSQYFDLPLTQPI 754
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
82-579 |
3.48e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 49.00 E-value: 3.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 82 DLKEASRKSRCVSV--QTDPTDEIPTKKSKKHKKHKNKKKKKKKEKEKKYKRQPEESE----AKTKSHHDGNIDLESDSF 155
Cdd:pfam03154 38 DLRSSGRNSPSAAStsSNDSKAESMKKSSKKIKEEAPSPLKSAKRQREKGASDTEEPErataKKSKTQEISRPNSPSEGE 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 156 LKfDSEPSAMALELPTRAFGLSETNESpavvleppvVSVEVPEPHILETlkPATKTAELSVASTSVISEQSEQSVAVTPE 235
Cdd:pfam03154 118 GE-SSDGRSVNDEGSSDPKDIDQDNRS---------TSPSIPSPQDNES--DSDSSAQQQILQTQPPVLQAQSGAASPPS 185
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 236 PSMTKILDSFAAAPVPTTTVVLKSSEPVVTMSVEYQMKSV--LKSVEST-SPEPSKIMLVEPPVAKVLEPSEtlvvSSET 312
Cdd:pfam03154 186 PPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAapHTLIQQTpTLHPQRLPSPHPPLQPMTQPPP----PSQV 261
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 313 PTEVYPEPSTSTTMDfPESSAIEA------LRLPEQPVDVPSEIADS------SMTRPQELPELPKTTALELQESSVASA 380
Cdd:pfam03154 262 SPQPLPQPSLHGQMP-PMPHSLQTgpshmqHPVPPQPFPLTPQSSQSqvppgpSPAAPGQSQQRIHTPPSQSQLQSQQPP 340
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 381 MELPGPPA-TSMPELQGPPVTPVPELPGPSATPVP---ELPGPLSTPvPELPGPPA----TAVPELPGPSVTPVP-QL-- 449
Cdd:pfam03154 341 REQPLPPApLSMPHIKPPPTTPIPQLPNPQSHKHPphlSGPSPFQMN-SNLPPPPAlkplSSLSTHHPPSAHPPPlQLmp 419
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 450 -SQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPvTTTELEQPVGMTTVEHPGHP 528
Cdd:pfam03154 420 qSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGIQPPSSA 498
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|.
gi 1622920921 529 EVTTATGLLGQPEATmvleLPGQPVATTALELPGQPSVTGVPELPGLPSAT 579
Cdd:pfam03154 499 SVSSSGPVPAAVSCP----LPPVQIKEEALDEAEEPESPPPPPRSPSPEPT 545
|
|
| rne |
PRK10811 |
ribonuclease E; Reviewed |
1220-1381 |
8.65e-05 |
|
ribonuclease E; Reviewed
Pssm-ID: 236766 [Multi-domain] Cd Length: 1068 Bit Score: 47.73 E-value: 8.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1220 ISEPSAVPTDYSMSASDPSVLVSEATVTVPEPPPEpessiTSTPVESAVVAEEHEVVPERPVtcmVSETPTVSAEPTVVA 1299
Cdd:PRK10811 843 IRYPVVRPQDVQVEEQREAEEVQVQPVVAEVPVAA-----AVEPVVSAPVVEAVAEVVEEPV---VVAEPQPEEVVVVET 914
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1300 SEPPVLSETA-ETFESMRASGYVASEVSTSLLEPAVTTPVLAESILEPPDMAVPESSAMAVLESSAVTVLESSTVTVLES 1378
Cdd:PRK10811 915 THPEVIAAPVtEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETV 994
|
...
gi 1622920921 1379 STV 1381
Cdd:PRK10811 995 TAV 997
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
249-430 |
1.78e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 43.22 E-value: 1.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 249 PVPTTTVVLKSSEPVVTMSVEYQMKSVLKSVESTSPE-PSKIMLVEPPVAKVLE-PSETLVVSSETPT-EVYPEPSTSTT 325
Cdd:NF033839 306 EKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEvKPQLETPKPEVKPQPEkPKPEVKPQPEKPKpEVKPQPETPKP 385
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 326 MDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKTTalelqeSSVASAMELPGPPATSMPELQGPPVTPVPEL 405
Cdd:NF033839 386 EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPK------PEVKPQPEKPKPEVKPQPEKPKPEVKPQPET 459
|
170 180
....*....|....*....|....*
gi 1622920921 406 PGPSATPVPELPGPLSTPVPELPGP 430
Cdd:NF033839 460 PKPEVKPQPEKPKPEVKPQPEKPKP 484
|
|
| half-pint |
TIGR01645 |
poly-U binding splicing factor, half-pint family; The proteins represented by this model ... |
312-524 |
5.92e-03 |
|
poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. PUF60 (GP|6176532), in complex with p54 and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Pssm-ID: 130706 [Multi-domain] Cd Length: 612 Bit Score: 41.59 E-value: 5.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 312 TPTEVYPEPSTSTTMdfPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKttalelqeSSVASAMELPG--PPAT 389
Cdd:TIGR01645 283 TPPDALLQPATVSAI--PAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATPS--------SSLPTDIGNKAvvSSAK 352
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 390 SMPELQG--PPVTPVPELPGPSATPVPELPGPLstPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPA------PSM 461
Cdd:TIGR01645 353 KEAEEVPplPQAAPAVVKPGPMEIPTPVPPPGL--AIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVtfgaldDTL 430
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622920921 462 GLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPVTTTElEQPVGMTTVEH 524
Cdd:TIGR01645 431 AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMN-SEDASLASQEG 492
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
318-642 |
5.48e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 5.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 318 PEPSTSTTMDFPESSAIEALRLPEQPVDVPS---------EIADSSMTRPQELPELPKTTALELQESSVASAMELPGPPA 388
Cdd:PHA03247 2627 PPPSPSPAANEPDPHPPPTVPPPERPRDDPApgrvsrprrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPP 2706
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 389 TSMPElqgppvtPVPELPGPSATPVPELPGPLSTPVPELPGPPATavpelPGPSVTPVPQLSQELPGLPA---------- 458
Cdd:PHA03247 2707 TPEPA-------PHALVSATPLPPGPAAARQASPALPAAPAPPAV-----PAGPATPGGPARPARPPTTAgppapappaa 2774
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 459 PSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQ-PAVTVAMELTEQPVTtteleqPVGMTTVEHPGHPevTTATGLL 537
Cdd:PHA03247 2775 PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAG------PLPPPTSAQPTAP--PPPPGPP 2846
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 538 GQPEATMVLELPGQPVATTAlelPGQPSVTgVPELPGLPSATR----ALELSGQPVATGALELPGPLMAAGALEFSGQSg 613
Cdd:PHA03247 2847 PPSLPLGGSVAPGGDVRRRP---PSRSPAA-KPAAPARPPVRRlarpAVSRSTESFALPPDQPERPPQPQAPPPPQPQP- 2921
|
330 340
....*....|....*....|....*....
gi 1622920921 614 aagalELLGQPLATGVLELPGQPGAPELP 642
Cdd:PHA03247 2922 -----QPPPPPQPQPPPPPPPRPQPPLAP 2945
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
387-558 |
1.77e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 46.78 E-value: 1.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 387 PATSMPELQGPPVTPVPeLPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPAPSMGLEPP 466
Cdd:PRK07994 361 PAAPLPEPEVPPQSAAP-AASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAK 439
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 467 QEVPEPPVMAQELPGL-----PLVTAAVELPEQPAVTVAMELTEQPVTTTELEQPVGMT----TVEHPGHPEVTTATGLL 537
Cdd:PRK07994 440 KSEPAAASRARPVNSAlerlaSVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKalkkALEHEKTPELAAKLAAE 519
|
170 180
....*....|....*....|....*.
gi 1622920921 538 GQPE---ATMV--LELPGqPVATTAL 558
Cdd:PRK07994 520 AIERdpwAALVsqLGLPG-LVEQLAL 544
|
|
| rne |
PRK10811 |
ribonuclease E; Reviewed |
1196-1509 |
2.06e-04 |
|
ribonuclease E; Reviewed
Pssm-ID: 236766 [Multi-domain] Cd Length: 1068 Bit Score: 46.57 E-value: 2.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1196 PAEVPSLPSEESVSQPEPPVSQSEISEP--------------SAVPTDYSMSASDPSVLVSEATVTVPEPPPEPESSITS 1261
Cdd:PRK10811 691 QQEAKALNVEEQSVQETEQEERVQQVQPrrkqrqlnqkvrieQSVAEEAVAPVVEETVAAEPVVQEVPAPRTELVKVPLP 770
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1262 TPVESAVVAEEH--------EVVPER----PVTCMVS----------ETPTVSAEPTVVA-------------SEPPVLS 1306
Cdd:PRK10811 771 VVAQTAPEQDEEnnaenrdnNGMPRRsrrsPRHLRVSgqrrrryrdeRYPTQSPMPLTVAcaspemasgkvwiRYPVVRP 850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1307 ETAETFESMRASGYVASEVSTSLLEPAVTTPVLAESILEPPDMAVPESSAMAVLESSAVTVLESSTVTVLESSTVTvlep 1386
Cdd:PRK10811 851 QDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPVTE---- 926
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1387 svvtvpePPVVAEPDYITIPVPVVSVLEPSVPVLEPAVSVLQPS----MIVSEPSVSVQESTVTVSEPAVTVSEQTQVIP 1462
Cdd:PRK10811 927 -------QPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAetaeVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEP 999
|
330 340 350 360
....*....|....*....|....*....|....*....|....*..
gi 1622920921 1463 TEVAIESTPMILESSIMSSHVMKGinlPsgdqnlAPEIgMPEIPLHS 1509
Cdd:PRK10811 1000 EVAPAQVPEATVEHNHATAPMTRA---P------APEY-VPEAPRHS 1036
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
233-525 |
2.90e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.47 E-value: 2.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 233 TPEPSMTKILdsfAAAPVPTTTVVLKSSEPVVTMSveyqmksvlksveSTSPEPSKIMLVEPPVAKVLEPSETLVVSSET 312
Cdd:PHA03247 2707 TPEPAPHALV---SATPLPPGPAAARQASPALPAA-------------PAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 313 PTEV---YPEPSTSTTMDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQEL----PELPKTTALElqessvASAMELPG 385
Cdd:PHA03247 2771 PPAApaaGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAspagPLPPPTSAQP------TAPPPPPG 2844
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 386 PPATSMP-----------ELQGPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSQELP 454
Cdd:PHA03247 2845 PPPPSLPlggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP 2924
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1622920921 455 GLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPA-----VTVAMELTEQPVTTTELEQPVGMTTVEHP 525
Cdd:PHA03247 2925 PPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAlvpgrVAVPRFRVPQPAPSREAPASSTPPLTGHS 3000
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
176-480 |
5.02e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 45.44 E-value: 5.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 176 LSETNESPAVVLEPPVVSVEVPEPhILETLKPATKTAELSVASTSViseqseqsvavtpEPSMTKILDSFAAAPVPTTTV 255
Cdd:PHA03378 481 LPHPQVTPVILHQPPAQGVQAHGS-MLDLLEKDDEDMEQRVMATLL-------------PPSPPQPRAGRRAPCVYTEDL 546
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 256 VLKSSEPVVTMSVEYQMKSV-----LKSVESTSPEPSKIMLVEPPVAKVLEPSETLVVSSETPTEVYPEPSTSTTMDFPE 330
Cdd:PHA03378 547 DIESDEPASTEPVHDQLLPApglgpLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPM 626
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 331 SSAIEALRL---------------PEQPVDVPSEIADSSMTRPQELPELPKTT--ALELQESSVASAMELPGPPATSMPE 393
Cdd:PHA03378 627 PLRPIPMRPlrmqpitfnvlvfptPHQPPQVEITPYKPTWTQIGHIPYQPSPTgaNTMLPIQWAPGTMQPPPRAPTPMRP 706
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 394 LQGPPV-------TPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQLSqelPGLPAPsmglepp 466
Cdd:PHA03378 707 PAAPPGraqrpaaATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA---PGAPTP------- 776
|
330
....*....|....
gi 1622920921 467 QEVPEPPVMAQELP 480
Cdd:PHA03378 777 QPPPQAPPAPQQRP 790
|
|
| rne |
PRK10811 |
ribonuclease E; Reviewed |
1180-1345 |
6.39e-04 |
|
ribonuclease E; Reviewed
Pssm-ID: 236766 [Multi-domain] Cd Length: 1068 Bit Score: 45.03 E-value: 6.39e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1180 PALPTEQSALTAENTWPAEVpslpSEESVSQPEPPVSQSEISEPSAVPtdysMSASDPSVL---VSEATVTVPEpppepe 1256
Cdd:PRK10811 868 PVVAEVPVAAAVEPVVSAPV----VEAVAEVVEEPVVVAEPQPEEVVV----VETTHPEVIaapVTEQPQVITE------ 933
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 1257 ssiTSTPVESAVVAEEHEVVPERPVTcmVSETPTVSAEPTVVAsEPPVLSETAETFESMRAsgyvASEVSTSLLEPAVTT 1336
Cdd:PRK10811 934 ---SDVAVAQEVAEHAEPVVEPQDET--ADIEEAAETAEVVVA-EPEVVAQPAAPVVAEVA----AEVETVTAVEPEVAP 1003
|
....*....
gi 1622920921 1337 PVLAESILE 1345
Cdd:PRK10811 1004 AQVPEATVE 1012
|
|
| DUF3729 |
pfam12526 |
Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins ... |
369-452 |
9.05e-04 |
|
Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins in this family are typically between 145 and 1707 amino acids in length. The family is found in association with pfam01443, pfam01661, pfam05417, pfam01660, pfam00978. There is a single completely conserved residue L that may be functionally important.
Pssm-ID: 372164 [Multi-domain] Cd Length: 115 Bit Score: 40.83 E-value: 9.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 369 ALELQESSVASAMELPGPPATSMPelqgPPVTPVPELPGPSATPVPELPGPlsTPVPELPGPPATAVPELPGPSVTPVPQ 448
Cdd:pfam12526 31 PPESAHPDPPPPVGDPRPPVVDTP----PPVSAVWVLPPPSEPAAPEPDLV--PPVTGPAGPPSPLAPPAPAQKPPLPPP 104
|
....
gi 1622920921 449 LSQE 452
Cdd:pfam12526 105 RPQR 108
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
319-677 |
1.21e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 44.01 E-value: 1.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 319 EPSTSTTMDFPESSAIEALRLPeQPVDVPSEIADSSMTRPQEL---PELPKTTALELQESSVASAMELP-GPPATSMPEL 394
Cdd:PHA03307 1 SDNAPDLYDLIEAAAEGGEFFP-RPPATPGDAADDLLSGSQGQlvsDSAELAAVTVVAGAAACDRFEPPtGPPPGPGTEA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 395 QGPPVTPVPELPGPSATPVPELPGPLSTP--VPELPGPPATAVPELPGPSvtPVPQLSQELPGLPAPSMGLEPPQEVPEP 472
Cdd:PHA03307 80 PANESRSTPTWSLSTLAPASPAREGSPTPpgPSSPDPPPPTPPPASPPPS--PAPDLSEMLRPVGSPGPPPAASPPAAGA 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 473 PVMAqelpglplVTAAVELPEQPAVTVAM-ELTEQPVTTTELEQPVGMTTVEHPGHPEVttatglLGQPEATMVLELPGQ 551
Cdd:PHA03307 158 SPAA--------VASDAASSRQAALPLSSpEETARAPSSPPAEPPPSTPPAAASPRPPR------RSSPISASASSPAPA 223
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 552 PVATTALELPGQPSVTGVPELPGLPSATRALELSGQPvatGALELPGPLMAA-----GALEFSGQSGAAGALELLGQPla 626
Cdd:PHA03307 224 PGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRP---APITLPTRIWEAsgwngPSSRPGPASSSSSPRERSPSP-- 298
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 1622920921 627 tgvleLPGQPGAPELPGQPVAtVALEISVQSVVTTSELSTMTVSQSLEVPS 677
Cdd:PHA03307 299 -----SPSSPGSGPAPSSPRA-SSSSSSSRESSSSSTSSSSESSRGAAVSP 343
|
|
| PRK14959 |
PRK14959 |
DNA polymerase III subunits gamma and tau; Provisional |
379-445 |
1.60e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184923 [Multi-domain] Cd Length: 624 Bit Score: 43.52 E-value: 1.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 379 SAMELPG--------PPATSMPELQGP-----PVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTP 445
Cdd:PRK14959 384 SAAEGPAsggaatipTPGTQGPQGTAPaagmtPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVP 463
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
249-430 |
1.78e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 43.22 E-value: 1.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 249 PVPTTTVVLKSSEPVVTMSVEYQMKSVLKSVESTSPE-PSKIMLVEPPVAKVLE-PSETLVVSSETPT-EVYPEPSTSTT 325
Cdd:NF033839 306 EKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEvKPQLETPKPEVKPQPEkPKPEVKPQPEKPKpEVKPQPETPKP 385
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 326 MDFPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKTTalelqeSSVASAMELPGPPATSMPELQGPPVTPVPEL 405
Cdd:NF033839 386 EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPK------PEVKPQPEKPKPEVKPQPEKPKPEVKPQPET 459
|
170 180
....*....|....*....|....*
gi 1622920921 406 PGPSATPVPELPGPLSTPVPELPGP 430
Cdd:NF033839 460 PKPEVKPQPEKPKPEVKPQPEKPKP 484
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
332-520 |
3.07e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.56 E-value: 3.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 332 SAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKTTALELQESSVASAMELPGP---PATSMPELQGPPVTPVPElPGP 408
Cdd:PRK12323 376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPealAAARQASARGPGGAPAPA-PAP 454
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 409 SATPVPELPGPLSTPVPelPGPPATAVPELPGPSVTPVPQLSQELPGLPAPSMGLEPPQEVPEPPVMAQELPGLPLVTAA 488
Cdd:PRK12323 455 AAAPAAAARPAAAGPRP--VAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATA 532
|
170 180 190
....*....|....*....|....*....|..
gi 1622920921 489 VELPEQPAVTVAMELTEQPVTTTELEQPVGMT 520
Cdd:PRK12323 533 DPDDAFETLAPAPAAAPAPRAAAATEPVVAPR 564
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
159-445 |
3.82e-03 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 42.35 E-value: 3.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 159 DSEPSAMALELPTRAFGLSETNESPAVVleppVVSVEVPEPHILETLKPATKTAELSVASTSVISEQSEQSVAVTPEPSM 238
Cdd:PHA03377 380 DVELESSDDELPYIDPNMEPVQQRPVMF----VSRVPWRKPRTLPWPTPKTHPVKRTLVKTSGRSDEAEQAQSTPERPGP 455
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 239 TKildsfaAAPVPTttvvlkssEPVVTMSVEYQMKSVLKSVESTSPEPskimlVEPPVAKVLEPSETLVVSSETPTEVyp 318
Cdd:PHA03377 456 SD------QPSVPV--------EPAHLTPVEHTTVILHQPPQSPPTVA-----IKPAPPPSRRRRGACVVYDDDIIEV-- 514
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 319 epststtMDFPESSAIEALRLPEQPVDVPSEIADSSmTRPQELPELPKTTALElqessvasamelPGPPATSmPELQGPP 398
Cdd:PHA03377 515 -------IDVETTEEEESVTQPAKPHRKVQDGFQRS-GRRQKRATPPKVSPSD------------RGPPKAS-PPVMAPP 573
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1622920921 399 VTPVPELPGPSATPvPELPGPLSTP---VPELPGPPATAVPELPGPSVTP 445
Cdd:PHA03377 574 STGPRVMATPSTGP-RDMAPPSTGPrqqAKCKDGPPASGPHEKQPPSSAP 622
|
|
| half-pint |
TIGR01645 |
poly-U binding splicing factor, half-pint family; The proteins represented by this model ... |
312-524 |
5.92e-03 |
|
poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. PUF60 (GP|6176532), in complex with p54 and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Pssm-ID: 130706 [Multi-domain] Cd Length: 612 Bit Score: 41.59 E-value: 5.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 312 TPTEVYPEPSTSTTMdfPESSAIEALRLPEQPVDVPSEIADSSMTRPQELPELPKttalelqeSSVASAMELPG--PPAT 389
Cdd:TIGR01645 283 TPPDALLQPATVSAI--PAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATPS--------SSLPTDIGNKAvvSSAK 352
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 390 SMPELQG--PPVTPVPELPGPSATPVPELPGPLstPVPELPGPPATAVPELPGPSVTPVPQLSQELPGLPA------PSM 461
Cdd:TIGR01645 353 KEAEEVPplPQAAPAVVKPGPMEIPTPVPPPGL--AIPSLVAPPGLVAPTEINPSFLASPRKKMKREKLPVtfgaldDTL 430
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622920921 462 GLEPPQEVPEPPVMAQELPGLPLVTAAVELPEQPAVTVAMELTEQPVTTTElEQPVGMTTVEH 524
Cdd:TIGR01645 431 AWKEPSKEDQTSEDGKMLAIMGEAAAALALEPKKKKKEKEGEELQPKLVMN-SEDASLASQEG 492
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
161-462 |
7.20e-03 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 41.58 E-value: 7.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 161 EPSAMALELPTRAFGLSETNESPAVVLEPPVVSVEVPEP--HILETLKPAtktaeLSVASTSVISEQSEQSVAVTPEPSM 238
Cdd:PHA03379 463 APCPVAQLPPGPLQDLEPGDQLPGVVQDGRPACAPVPAPagPIVRPWEAS-----LSQVPGVAFAPVMPQPMPVEPVPVP 537
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 239 TKILDSfAAAPVPTTTVVLKSSEPvvTMSVEYQMKSVLKSVESTSPEPSKIMLVEPPVAKV------------LEPSETL 306
Cdd:PHA03379 538 TVALER-PVCPAPPLIAMQGPGET--SGIVRVRERWRPAPWTPNPPRSPSQMSVRDRLARLraeaqpyqasveVQPPQLT 614
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 307 VVSSETPTEVYPEPSTSTTMDFPESSAIEALRLPEQPVDVPSEIaDSSMTRPQE--------------LPELPKTTALEL 372
Cdd:PHA03379 615 QVSPQQPMEYPLEPEQQMFPGSPFSQVADVMRAGGVPAMQPQYF-DLPLQQPISqgaplaplrasmgpVPPVPATQPQYF 693
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 373 Q-------ESSVASAMELPGPPATS--MPELQGPPVTPVPELPGPSATPVPELPGPLSTPVPE-LPGPPATAVPELPGPS 442
Cdd:PHA03379 694 DipltepiNQGASAAHFLPQQPMEGplVPERWMFQGATLSQSVRPGVAQSQYFDLPLTQPINHgAPAAHFLHQPPMEGPW 773
|
330 340
....*....|....*....|
gi 1622920921 443 VtPVPQLSQELPGLPAPSMG 462
Cdd:PHA03379 774 V-PEQWMFQGAPPSQGTDVV 792
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
340-460 |
7.53e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 41.24 E-value: 7.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 340 PEQPVDVPSEIADSSMTRPQELPELPKTTALELQESSVASAMELPGPPATSMPE-LQGPPVTPVPELPGPSATPVPELPG 418
Cdd:PRK14951 371 EAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPApVAAPAAAAPAAAPAAAPAAVALAPA 450
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 1622920921 419 PLSTPVPELPGPPATAVPELPGPSVTPVPqlsqelPGLPAPS 460
Cdd:PRK14951 451 PPAQAAPETVAIPVRVAPEPAVASAAPAP------AAAPAAA 486
|
|
| PHA02030 |
PHA02030 |
hypothetical protein |
326-440 |
7.69e-03 |
|
hypothetical protein
Pssm-ID: 222843 [Multi-domain] Cd Length: 336 Bit Score: 40.73 E-value: 7.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 326 MDFPeSSAIEALRLPEQPVDVPSEIADSSMTrpqeLPELPKTTAlelqessvaSAMELPGPPATSMPELQGPPVTPVPEL 405
Cdd:PHA02030 236 TDFP-GSALHILLGGGEDLIIKPKSKAAGSN----LPAVPNVAA---------DAGSAAAPAVPAAAAAVAQAAPSVPQV 301
|
90 100 110
....*....|....*....|....*....|....*.
gi 1622920921 406 PGPSATPVPELPGPLSTP-VPELPGPPatAVPELPG 440
Cdd:PHA02030 302 PNVAVLPDVPQVAPVAAPaAPEVPAVP--VVPAAPQ 335
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
369-460 |
9.63e-03 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 40.96 E-value: 9.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622920921 369 ALELQESSVASAMELPGPPATSMPELQGPPVTPVPELPGPSATPVPELPGPLSTPVPELPGPPATAVPELPGPSVTPVPQ 448
Cdd:PRK14086 85 AITVDPSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADD 164
|
90
....*....|..
gi 1622920921 449 LSQELPGLPAPS 460
Cdd:PRK14086 165 YGWQQQRLGFPP 176
|
|
|