|
Name |
Accession |
Description |
Interval |
E-value |
| cwf21_SRRM2 |
cd21375 |
cwf21 domain found in serine/arginine repetitive matrix protein 2; Serine/arginine repetitive ... |
39-102 |
1.34e-30 |
|
cwf21 domain found in serine/arginine repetitive matrix protein 2; Serine/arginine repetitive matrix protein 2 (SRRM2) is also called 300 kDa nuclear matrix antigen, serine/arginine-rich splicing factor-related nuclear matrix protein of 300 kDa, SR-related nuclear matrix protein of 300 kDa, Ser/Arg-related nuclear matrix protein of 300 kDa, splicing coactivator subunit SRm300, or Tax-responsive enhancer element-binding protein 803 (TaxREB803). It is required for pre-mRNA splicing as a component of the spliceosome. It contains a cwf21 domain at the N-terminus. The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8.
Pssm-ID: 410601 [Multi-domain] Cd Length: 64 Bit Score: 115.88 E-value: 1.34e-30
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720391393 39 EEELRHLEAALVKRPNPDILDHERKRRVELRCLELEEMMEEQGYEEQQIQEKVATFRLMLLEKD 102
Cdd:cd21375 1 EEELRRLEAALVKKPNPDILDHERKRRVELKCLELEEMMEEQGYSEEEIQEKVATFRLMLLEKD 64
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
309-686 |
1.17e-09 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 64.42 E-value: 1.17e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 309 PSVEPGATNIQQPSSPAPSTKQSSSPyeDKDKKEKSAVRPSPSPERSSTGPELPAPTPLLVEQHVDSPRPLAAIPSSQEP 388
Cdd:PHA03307 75 PGTEAPANESRSTPTWSLSTLAPASP--AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASP 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 389 VNPSSEASPTRGCSPPKSPEKPPQSTSSES--CPPSPQPTKGSRHASSSPESLKPTPAPGSRREISSSPTSKNRSHGRAK 466
Cdd:PHA03307 153 PAAGASPAAVASDAASSRQAALPLSSPEETarAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDA 232
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 467 RDKSHSHTPSHRAGRSRSPATKRGRSRSRTPTKRGHSRSRSPQWRRSRSAQRWGKSRSPQRRGRSRSPQRPGwSRSRNTQ 546
Cdd:PHA03307 233 GASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG-SGPAPSS 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 547 RRGRSRSARRGRSHSRSPATRGRSRSRTPARRGRSRSRTPARRRSRSRTPARRRSRSRTPARRGRSRSRTPARRRSRTRS 626
Cdd:PHA03307 312 PRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRAR 391
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 627 pvrrrsrsrsqarrsgrsrsrtPARRSGRSRSRTPARRGRSRSRTPARRSARSRSRTPAR 686
Cdd:PHA03307 392 ----------------------AAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYAR 429
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1982-2466 |
2.20e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 63.80 E-value: 2.20e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 1982 SRSRTPLLPRKRSRSRSPLAIRRRSRSRTPraargkrsltrsPPAIRRRSASGSSSDRSRSATPPATRNHSGSRTPPVAL 2061
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPD------------PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSR 2663
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2062 sSSRMSCFSRPSMSPTPLDRCRSPGMLEPLGSArtpmsvlqqtgGSMMDGPGPRIPDHPRSsvpenhaqsrialalTAIS 2141
Cdd:PHA03247 2664 -PRRARRLGRAAQASSPPQRPRRRAARPTVGSL-----------TSLADPPPPPPTPEPAP---------------HALV 2716
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2142 LGTARPPPSMSAAGLAARMSQVPAPVPLMslrTAPAANLASRIPAASAAAMNLASARTSAIPASvnladsrTPAAAAAMN 2221
Cdd:PHA03247 2717 SATPLPPGPAAARQASPALPAAPAPPAVP---AGPATPGGPARPARPPTTAGPPAPAPPAAPAA-------GPPRRLTRP 2786
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2222 LASPRTAVAPSAVNLADPRTPAASAVNLAGARTPAALAAlsltgSGTPPTAANYPSSSRTPQAPTPANLVVGprsaHGTA 2301
Cdd:PHA03247 2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----GPLPPPTSAQPTAPPPPPGPPPPSLPLG----GSVA 2857
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2302 PVNIAGSRTPAGLAPTNLSSSRMAPALSganLTSPRVPLSAydrvsgRTSPLMLDRARSRTPPSAPSQSRMTSERERAPS 2381
Cdd:PHA03247 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRR---LARPAVSRST------ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2382 PASRM-VQASSQSLLPPAQDRPRSPVPS-AFSDQSRSVVQTTPVAGSQSLSSGTVAKSTSSASDHNGMLSGPAPGIS--- 2456
Cdd:PHA03247 2929 PQPPPpPPPRPQPPLAPTTDPAGAGEPSgAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSswa 3008
|
490
....*....|....*
gi 1720391393 2457 -----HAEGGEPPAS 2466
Cdd:PHA03247 3009 sslalHEETDPPPVS 3023
|
|
| cwf21 |
pfam08312 |
cwf21 domain; The cwf21 family is involved in mRNA splicing. It has been isolated as a ... |
58-101 |
1.45e-07 |
|
cwf21 domain; The cwf21 family is involved in mRNA splicing. It has been isolated as a subcomplex of the spliceosome in Schizosaccharomyces pombe. The function of the cwf21 domain is to bind directly to the spliceosomal protein Prp8. Mutations in the cwf21 domain prevent Prp8 from binding. The structure of this domain has recently been solved, showing that it is composed of two alpha helices.
Pssm-ID: 462421 [Multi-domain] Cd Length: 44 Bit Score: 49.73 E-value: 1.45e-07
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 1720391393 58 LDHERKRRVELRCLELEEMMEEQGYEEQQIQEKVATFRLMLLEK 101
Cdd:pfam08312 1 LEHERKREIEVKVLELRDELEEQGLSEEEIEEKVDELRKKLLAE 44
|
|
| U2AF_lg |
TIGR01642 |
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ... |
499-603 |
2.03e-05 |
|
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.
Pssm-ID: 273727 [Multi-domain] Cd Length: 509 Bit Score: 49.89 E-value: 2.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 499 KRGHSRSRSPQWRRSRSAQRWgKSRSPQRRGRSRSpqRPGWSRSRNtqrrgrSRSARRGRSHSRSPATRGRSRSRTPARR 578
Cdd:TIGR01642 12 SRGRDRDRSSERPRRRSRDRS-RFRDRHRRSRERS--YREDSRPRD------RRRYDSRSPRSLRYSSVRRSRDRPRRRS 82
|
90 100
....*....|....*....|....*
gi 1720391393 579 GRSRSRTPARRRSRSRTPARRRSRS 603
Cdd:TIGR01642 83 RSVRSIEQHRRRLRDRSPSNQWRKD 107
|
|
| RSRP |
pfam17069 |
Arginine/Serine-Rich protein 1; RSRP1 is a eukaryotic protein family. Its function is unknown. |
436-604 |
1.96e-03 |
|
Arginine/Serine-Rich protein 1; RSRP1 is a eukaryotic protein family. Its function is unknown.
Pssm-ID: 293674 [Multi-domain] Cd Length: 299 Bit Score: 42.84 E-value: 1.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 436 PESLKPTPAPGSRREISSS-PTSKNRSHGRAKRDKSHSHTPSHRAGRSRS-PATKRGRSRSRTPTKR---GHSRSRSPQW 510
Cdd:pfam17069 10 PGSPQEKKSPSTSSSGSSSrLSSRSRSRSSSRSSRSHSRSSSRFSSRSRSrPRRSRSRSRSRRRHQRkyrRYSRSYSRSR 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 511 RRSRSAQRWGKSRSPQRRGRSRSPQRpgwSRSRNTQRRGRSRSARRGRSHSRSpatrGRSRSRTPARRGRSRSRTpaRRR 590
Cdd:pfam17069 90 SRSRRRRYYRRSRYRYSRRYYRSPSR---SRSRSRSRSRGRSYYAIWRGSRYY----GFGRTVYPERSPRWRSRS--RTR 160
|
170
....*....|....
gi 1720391393 591 SRSRTPARRRSRSR 604
Cdd:pfam17069 161 SRSRTPFRLSEKER 174
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
2280-2690 |
5.32e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.47 E-value: 5.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2280 RTPQAPTPANLVVGPRSAHGTAPVNIAGSRTPAGLAPTNLSSSRMAPALSGANLTSPRVPLSAYDRVSGRTSPLMLDRAR 2359
Cdd:PHA03307 25 PATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREG 104
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2360 SRTPPSAPSqsrmTSERERAPSPASRmvqassqsllPPAQDRPRSPVPSAFSDQSRSVVQTTPVAGSQSLSSGTVAKSTS 2439
Cdd:PHA03307 105 SPTPPGPSS----PDPPPPTPPPASP----------PPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSR 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2440 SASDHNGMLSGPAPGIShaeggEPPASTGAQQPSTLAALQPAKERRSSSSSSSSSSSSSSSSSSSSSSSSSGSSSSDSEG 2519
Cdd:PHA03307 171 QAALPLSSPEETARAPS-----SPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESS 245
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2520 SSLPAQPEVALKRVPSPTPVPKEAIREGRPQEPTPAKRKRRSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 2599
Cdd:PHA03307 246 GCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESS 325
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2600 SSSPSPAKPGPQALPKPASPKKPPPGERRSRSPRKPIDSLRDSRSLSYSPVERRQPSPQPSPRDLQSSERVSWRGQRGDS 2679
Cdd:PHA03307 326 SSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATG 405
|
410
....*....|.
gi 1720391393 2680 HSPGHKRKETP 2690
Cdd:PHA03307 406 RFPAGRPRPSP 416
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| cwf21_SRRM2 |
cd21375 |
cwf21 domain found in serine/arginine repetitive matrix protein 2; Serine/arginine repetitive ... |
39-102 |
1.34e-30 |
|
cwf21 domain found in serine/arginine repetitive matrix protein 2; Serine/arginine repetitive matrix protein 2 (SRRM2) is also called 300 kDa nuclear matrix antigen, serine/arginine-rich splicing factor-related nuclear matrix protein of 300 kDa, SR-related nuclear matrix protein of 300 kDa, Ser/Arg-related nuclear matrix protein of 300 kDa, splicing coactivator subunit SRm300, or Tax-responsive enhancer element-binding protein 803 (TaxREB803). It is required for pre-mRNA splicing as a component of the spliceosome. It contains a cwf21 domain at the N-terminus. The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8.
Pssm-ID: 410601 [Multi-domain] Cd Length: 64 Bit Score: 115.88 E-value: 1.34e-30
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720391393 39 EEELRHLEAALVKRPNPDILDHERKRRVELRCLELEEMMEEQGYEEQQIQEKVATFRLMLLEKD 102
Cdd:cd21375 1 EEELRRLEAALVKKPNPDILDHERKRRVELKCLELEEMMEEQGYSEEEIQEKVATFRLMLLEKD 64
|
|
| cwf21_SRRM3 |
cd21376 |
cwf21 domain found in serine/arginine repetitive matrix protein 3 and similar proteins; Serine ... |
37-102 |
5.11e-23 |
|
cwf21 domain found in serine/arginine repetitive matrix protein 3 and similar proteins; Serine/arginine repetitive matrix protein 3 (SRRM3) may play a role in regulating breast cancer cell invasiveness. It may also be involved in RYBP-mediated breast cancer progression. SRRM3 contains a cwf21 domain at the N-terminus. The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8.
Pssm-ID: 410602 [Multi-domain] Cd Length: 68 Bit Score: 94.42 E-value: 5.11e-23
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720391393 37 KGEEELRHLEAALVKRPNPDILDHERKRRVELRCLELEEMMEEQGYEEQQIQEKVATFRLMLLEKD 102
Cdd:cd21376 1 KSEEEIKKLDAALVKKPNREILDHERKRKVELKCMEMQELMEEQGYTEEEIRQKVSTFRQMLMEKE 66
|
|
| cwf21_SRRM2-like |
cd21373 |
cwf21 domain found in serine/arginine repetitive matrix proteins, SRRM2, SRRM3 and similar ... |
53-102 |
6.06e-17 |
|
cwf21 domain found in serine/arginine repetitive matrix proteins, SRRM2, SRRM3 and similar proteins; This subfamily includes SRRM2 and SRRM3, both of which contain a cwf21 domain at the N-terminus. SRRM2, also called 300 kDa nuclear matrix antigen, serine/arginine-rich splicing factor-related nuclear matrix protein of 300 kDa, SR-related nuclear matrix protein of 300 kDa, Ser/Arg-related nuclear matrix protein of 300 kDa, splicing coactivator subunit SRm300, or Tax-responsive enhancer element-binding protein 803 (TaxREB803), is required for pre-mRNA splicing as a component of the spliceosome. SRRM3 may play a role in regulating breast cancer cell invasiveness. It may be involved in RYBP-mediated breast cancer progression. The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8.
Pssm-ID: 410600 [Multi-domain] Cd Length: 50 Bit Score: 76.46 E-value: 6.06e-17
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 1720391393 53 PNPDILDHERKRRVELRCLELEEMMEEQGYEEQQIQEKVATFRLMLLEKD 102
Cdd:cd21373 1 PNKEILDHERKRKIEVKCLELEDLLEEQGYTEEEIQAKVDEYRALLLEKD 50
|
|
| cwf21_CWC21-like |
cd21372 |
cwf21 domain found in fungal complexed with CEF1 protein 21 (CWC21) and similar proteins; This ... |
54-102 |
4.26e-10 |
|
cwf21 domain found in fungal complexed with CEF1 protein 21 (CWC21) and similar proteins; This subfamily includes complexed with CEF1 protein 21 (CWC21) from budding yeast, complexed with cdc5 protein 21 (CWF21) from fission yeast, as well as their orthologs, serine/arginine repetitive matrix proteins (SRRM2 and SRRM3) from vertebrates. Both CWC21 and CWF21 are pre-mRNA-splicing factors that may function at or prior to the first catalytic step of splicing at the catalytic center of the spliceosome, together with ISY1. SRRM2 is required for pre-mRNA splicing as a component of the spliceosome. SRRM3 may play a role in regulating breast cancer cell invasiveness. It may be involved in RYBP-mediated breast cancer progression. Members of this family contain a cwf21 domain at the N-terminus. The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8.
Pssm-ID: 410599 [Multi-domain] Cd Length: 49 Bit Score: 57.10 E-value: 4.26e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 1720391393 54 NPDILDHERKRRVELRCLELEEMMEEQGYEEQQIQEKVATFRLMLLEKD 102
Cdd:cd21372 1 DKEILEHERKRQIELKCLELRDELEDEGLSEEEIEEKVDELREKLLKEL 49
|
|
| cwf21 |
cd21369 |
cwf21 domain; The cwf21 domain is involved in mRNA splicing; it binds directly to the ... |
55-101 |
6.38e-10 |
|
cwf21 domain; The cwf21 domain is involved in mRNA splicing; it binds directly to the spliceosomal protein Prp8. Mutations in the cwf21 domain prevent its binding to Prp8. The domain is composed of two alpha helices. Proteins containing the cwf21 domain include complexed with CEF1 protein 21 (CWC21) from budding yeast, complexed with cdc5 protein 21 (CWF21) from fission yeast, as well as their orthologs, serine/arginine repetitive matrix proteins (SRRM2 and SRRM3) from vertebrates. This domain family also includes U2-associated protein SR140 from Eumetazoa, protein RRC1, and similar proteins from plants.
Pssm-ID: 410596 [Multi-domain] Cd Length: 48 Bit Score: 56.71 E-value: 6.38e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 1720391393 55 PDILDHERKRRVELRCLELEEMMEEQG-YEEQQIQEKVATFRLMLLEK 101
Cdd:cd21369 1 MDEEKRAKKREIELKVMELRDELEEQGrKPEQQIQEKVEHYRDKLLQR 48
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
309-686 |
1.17e-09 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 64.42 E-value: 1.17e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 309 PSVEPGATNIQQPSSPAPSTKQSSSPyeDKDKKEKSAVRPSPSPERSSTGPELPAPTPLLVEQHVDSPRPLAAIPSSQEP 388
Cdd:PHA03307 75 PGTEAPANESRSTPTWSLSTLAPASP--AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASP 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 389 VNPSSEASPTRGCSPPKSPEKPPQSTSSES--CPPSPQPTKGSRHASSSPESLKPTPAPGSRREISSSPTSKNRSHGRAK 466
Cdd:PHA03307 153 PAAGASPAAVASDAASSRQAALPLSSPEETarAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDA 232
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 467 RDKSHSHTPSHRAGRSRSPATKRGRSRSRTPTKRGHSRSRSPQWRRSRSAQRWGKSRSPQRRGRSRSPQRPGwSRSRNTQ 546
Cdd:PHA03307 233 GASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG-SGPAPSS 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 547 RRGRSRSARRGRSHSRSPATRGRSRSRTPARRGRSRSRTPARRRSRSRTPARRRSRSRTPARRGRSRSRTPARRRSRTRS 626
Cdd:PHA03307 312 PRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRAR 391
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 627 pvrrrsrsrsqarrsgrsrsrtPARRSGRSRSRTPARRGRSRSRTPARRSARSRSRTPAR 686
Cdd:PHA03307 392 ----------------------AAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYAR 429
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1982-2466 |
2.20e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 63.80 E-value: 2.20e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 1982 SRSRTPLLPRKRSRSRSPLAIRRRSRSRTPraargkrsltrsPPAIRRRSASGSSSDRSRSATPPATRNHSGSRTPPVAL 2061
Cdd:PHA03247 2596 ARPRAPVDDRGDPRGPAPPSPLPPDTHAPD------------PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSR 2663
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2062 sSSRMSCFSRPSMSPTPLDRCRSPGMLEPLGSArtpmsvlqqtgGSMMDGPGPRIPDHPRSsvpenhaqsrialalTAIS 2141
Cdd:PHA03247 2664 -PRRARRLGRAAQASSPPQRPRRRAARPTVGSL-----------TSLADPPPPPPTPEPAP---------------HALV 2716
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2142 LGTARPPPSMSAAGLAARMSQVPAPVPLMslrTAPAANLASRIPAASAAAMNLASARTSAIPASvnladsrTPAAAAAMN 2221
Cdd:PHA03247 2717 SATPLPPGPAAARQASPALPAAPAPPAVP---AGPATPGGPARPARPPTTAGPPAPAPPAAPAA-------GPPRRLTRP 2786
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2222 LASPRTAVAPSAVNLADPRTPAASAVNLAGARTPAALAAlsltgSGTPPTAANYPSSSRTPQAPTPANLVVGprsaHGTA 2301
Cdd:PHA03247 2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----GPLPPPTSAQPTAPPPPPGPPPPSLPLG----GSVA 2857
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2302 PVNIAGSRTPAGLAPTNLSSSRMAPALSganLTSPRVPLSAydrvsgRTSPLMLDRARSRTPPSAPSQSRMTSERERAPS 2381
Cdd:PHA03247 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRR---LARPAVSRST------ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2382 PASRM-VQASSQSLLPPAQDRPRSPVPS-AFSDQSRSVVQTTPVAGSQSLSSGTVAKSTSSASDHNGMLSGPAPGIS--- 2456
Cdd:PHA03247 2929 PQPPPpPPPRPQPPLAPTTDPAGAGEPSgAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSswa 3008
|
490
....*....|....*
gi 1720391393 2457 -----HAEGGEPPAS 2466
Cdd:PHA03247 3009 sslalHEETDPPPVS 3023
|
|
| cwf21 |
pfam08312 |
cwf21 domain; The cwf21 family is involved in mRNA splicing. It has been isolated as a ... |
58-101 |
1.45e-07 |
|
cwf21 domain; The cwf21 family is involved in mRNA splicing. It has been isolated as a subcomplex of the spliceosome in Schizosaccharomyces pombe. The function of the cwf21 domain is to bind directly to the spliceosomal protein Prp8. Mutations in the cwf21 domain prevent Prp8 from binding. The structure of this domain has recently been solved, showing that it is composed of two alpha helices.
Pssm-ID: 462421 [Multi-domain] Cd Length: 44 Bit Score: 49.73 E-value: 1.45e-07
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 1720391393 58 LDHERKRRVELRCLELEEMMEEQGYEEQQIQEKVATFRLMLLEK 101
Cdd:pfam08312 1 LEHERKREIEVKVLELRDELEEQGLSEEEIEEKVDELRKKLLAE 44
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
2147-2405 |
4.23e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 52.54 E-value: 4.23e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2147 PPPSMSAAGLAARMSQVPAPVPLMSLRTAPAANLASRIPAASAAAMNLASARTSAI---PASVNLADSRTPAAAAAMNLA 2223
Cdd:PRK07003 360 PAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAaaaAATRAEAPPAAPAPPATADRG 439
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2224 SPRTAVAPSAVNLADPRTPAASAVNLAGARTPAALAALSLTGSGTPPTAANYPSSSRTPQAPTPANLVVGPR-SAHGTAP 2302
Cdd:PRK07003 440 DDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARaPAAASRE 519
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2303 VNIAGSRTPAGLAPTNLSSSRMAPALSGANLTSPRVPLSAYDRVSGRTSPLMLDRARS-----RTPPSAPSQSRMTSERE 2377
Cdd:PRK07003 520 DAPAAAAPPAPEARPPTPAAAAPAARAGGAAAALDVLRNAGMRVSSDRGARAAAAAKPaaapaAAPKPAAPRVAVQVPTP 599
|
250 260
....*....|....*....|....*...
gi 1720391393 2378 RAPSPASRMVQASSQSLLPPAQDRPRSP 2405
Cdd:PRK07003 600 RARAATGDAPPNGAARAEQAAESRGAPP 627
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
294-600 |
1.96e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 50.55 E-value: 1.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 294 QSPPLASGHQGEGDAPSVEPGATNIQQPSSPAPSTKQSSSPYEDKDKKEKSAVRPSPSPERSSTGPELPAPTPLLVEQHV 373
Cdd:PHA03307 150 ASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAA 229
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 374 DSPRPLAAIPSSQEPvnPSSEASPTRGCSPPKSPEKPPQSTSSESCPP-SPQPTKGSRHASSSPESLKPTPAPGSRREIS 452
Cdd:PHA03307 230 DDAGASSSDSSSSES--SGCGWGPENECPLPRPAPITLPTRIWEASGWnGPSSRPGPASSSSSPRERSPSPSPSSPGSGP 307
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 453 SSPTSKNRSHGRAKRDkSHSHTPSHRAGRSRSPATKRGRSRSRTPtkrghSRSRSPQWRRSrsaqrwgkSRSPQRRGRSR 532
Cdd:PHA03307 308 APSSPRASSSSSSSRE-SSSSSTSSSSESSRGAAVSPGPSPSRSP-----SPSRPPPPADP--------SSPRKRPRPSR 373
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720391393 533 SPQRPGWSRSRNTqrrgrsrsarrGRSHSRSPATRGRSRSRTPAR-RGRSRSRTPARRRSRSRTPARRR 600
Cdd:PHA03307 374 APSSPAASAGRPT-----------RRRARAAVAGRARRRDATGRFpAGRPRPSPLDAGAASGAFYARYP 431
|
|
| U2AF_lg |
TIGR01642 |
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ... |
499-603 |
2.03e-05 |
|
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.
Pssm-ID: 273727 [Multi-domain] Cd Length: 509 Bit Score: 49.89 E-value: 2.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 499 KRGHSRSRSPQWRRSRSAQRWgKSRSPQRRGRSRSpqRPGWSRSRNtqrrgrSRSARRGRSHSRSPATRGRSRSRTPARR 578
Cdd:TIGR01642 12 SRGRDRDRSSERPRRRSRDRS-RFRDRHRRSRERS--YREDSRPRD------RRRYDSRSPRSLRYSSVRRSRDRPRRRS 82
|
90 100
....*....|....*....|....*
gi 1720391393 579 GRSRSRTPARRRSRSRTPARRRSRS 603
Cdd:TIGR01642 83 RSVRSIEQHRRRLRDRSPSNQWRKD 107
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
400-610 |
5.56e-05 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 48.75 E-value: 5.56e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 400 GCSPPKSPEKPPQSTSSESCPPSPQPTKGSRHASSSPESLKPTPAPGSRREISSSPTSKNRSHGRAKRDKSHSHTPSHRA 479
Cdd:PRK12678 62 GAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGA 141
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 480 GRSRSPATKRGRSRSRTPTKRGHSRSRSPQWRRSRSAQRWGKSRSPQRRGRSRSPQRPGWSRSRNTQRRGRSRSARRGRS 559
Cdd:PRK12678 142 ARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRR 221
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720391393 560 HSRSPATRGRSRSRTPARRGRSRSRTPARRR-----SRSRTPARRRSRSRTPARRG 610
Cdd:PRK12678 222 DGGDRRGRRRRRDRRDARGDDNREDRGDRDGddgegRGGRRGRRFRDRDRRGRRGG 277
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
2200-2425 |
3.05e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.41 E-value: 3.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2200 SAIPASVNLADSRTPAAAAAMNLASPRTAVAPSAVNLADPRTPAASAVNLAGARTPAALAAL------SLTGSGTPPTAA 2273
Cdd:PRK12323 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALaaarqaSARGPGGAPAPA 451
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2274 NYPSSSRTPQAPTPANLVVGPRSAHGTAPVNIAGSRTPAGLAPTNLSSSRMAPALSGAnltSPRVPLSAYDRVSGRTSPl 2353
Cdd:PRK12323 452 PAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASP---APAQPDAAPAGWVAESIP- 527
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720391393 2354 mldrarsRTPPSAPSQSRMTSERERAPSPASRMVQASSQSLLPPAQDRPRSPVPSAFSDQSRSVVQTTPVAG 2425
Cdd:PRK12323 528 -------DPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAARLPVRG 592
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
380-592 |
5.31e-04 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 45.67 E-value: 5.31e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 380 AAIPSSQEPVNPSSEASPTRGCSPPKSPEKPPQSTSSESCPPSPQPTKGSRHASSSPESLKPTPAPGSRREiSSSPTSKN 459
Cdd:PRK12678 65 AAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGE-AARRGAAR 143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 460 RSHGRAKRDKSHSHTPSHRAGRSRSPATKRGRSRSRTPTKRGHSRSRSPQWRRSRSAQRWG-KSRSPQRRGRSRSPQRPG 538
Cdd:PRK12678 144 KAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDrRDRREQGDRREERGRRDG 223
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1720391393 539 WSRSRNTQRRGRSRSARRGRSHSRSPATRGRSRSRTPARRGRSRSRTPARRRSR 592
Cdd:PRK12678 224 GDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGG 277
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
379-604 |
5.45e-04 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 45.28 E-value: 5.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 379 LAAIPSSQEP-VNPSSEASPTRGCSPPKSPEKPPQSTSSESCPPSPQPTKGSRHASSSPESLKPTPAPGSRREISSSPTS 457
Cdd:PRK12678 52 IAAIKEARGGgAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERR 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 458 KNRSHGRAKRDKSHSHTPSHRAGRSRSPATKRGRSRSRTPTKRGHSRSRSPQWRRSRSAQRWGKSRSPQRRGRSRSPQRP 537
Cdd:PRK12678 132 ERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQ 211
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720391393 538 GWSRSRNTQRRGRSRSARRGRSHSRSPATRGRSRSRTPARRGRSRSRTpARRRSRSRTPARRRSRSR 604
Cdd:PRK12678 212 GDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRG-GRRGRRFRDRDRRGRRGG 277
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
2143-2352 |
6.65e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 45.25 E-value: 6.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2143 GTARPPPSMSAAGLAARMSQVPAPVPLMSLRTAPAANLASRIPAASAAAMNLASARTSAIPASVNLADSRTPAAAAAMNL 2222
Cdd:PRK12323 370 GGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPA 449
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2223 ASPRTAVAPSAVN---LADPRTPAASAVNLAGARTPAALAALSltGSGTPP---TAANYPSSSRTPQAPTPANLVVGPRS 2296
Cdd:PRK12323 450 PAPAPAAAPAAAArpaAAGPRPVAAAAAAAPARAAPAAAPAPA--DDDPPPweeLPPEFASPAPAQPDAAPAGWVAESIP 527
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720391393 2297 AHGTAPvniagsrtPAGLAPTNLSSSRMAPALSGANLTSPRVPLSAYDRVSGRTSP 2352
Cdd:PRK12323 528 DPATAD--------PDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
2223-2431 |
1.33e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 44.10 E-value: 1.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2223 ASPRTAVAPSAVNLADPRTPAASAVnlAGARTPAALAALSLTGSGTPPTAANYPSSSRTPQAPTPANLVVGPRSAHGTAP 2302
Cdd:PRK12323 372 AGPATAAAAPVAQPAPAAAAPAAAA--PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPA 449
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2303 VNIAGSRTPAGLAPTNLSSSRMAPALSgANLTSPRVPLSAYDRVSGRTSPLM-LDRARSRTPPSAPSQSRMTSERERAPS 2381
Cdd:PRK12323 450 PAPAPAAAPAAAARPAAAGPRPVAAAA-AAAPARAAPAAAPAPADDDPPPWEeLPPEFASPAPAQPDAAPAGWVAESIPD 528
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2382 PASRMVQASSQSLLPPAqdrPRSPVPSAFSDQSRSVVQTTPVAGSQSLSS 2431
Cdd:PRK12323 529 PATADPDDAFETLAPAP---AAAPAPRAAAATEPVVAPRPPRASASGLPD 575
|
|
| SF-CC1 |
TIGR01622 |
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ... |
511-608 |
1.54e-03 |
|
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H. sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
Pssm-ID: 273721 [Multi-domain] Cd Length: 494 Bit Score: 43.75 E-value: 1.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 511 RRSRSAQRWGKSRSPQRRGRSRSPQRPGwSRSRNTQRRGRSRSARRgrshsrspatRGRSRSRTPARRGRSRSRTPaRRR 590
Cdd:TIGR01622 2 YRDRERERLRDSSSAGDRDRRRDKGRER-SRDRSRDRERSRSRRRD----------RHRDRDYYRGRERRSRSRRP-NRR 69
|
90
....*....|....*...
gi 1720391393 591 SRSRTPARRRSRSRTPAR 608
Cdd:TIGR01622 70 YRPREKRRRRGDSYRRRR 87
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
165-462 |
1.60e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.16 E-value: 1.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 165 PEPPKPYSLVRETSSSRSPTPKQKKKKKKKDRGRRSESSSPRRERKKSSKKKKHRSESESKKRKHRSPTPKSKRKSKDKK 244
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 245 RKRSRSTTPAPKSRRAHRSTSADSASSSDTSRSRSRSAAAKI------HTTALTGQSPPLASGHQGEGDAP--SVEPGAT 316
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASpagplpPPTSAQPTAPPPPPGPPPPSLPLggSVAPGGD 2861
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 317 NIQQPSSPAPSTKQSSSPYEDKDKKEKSAVRPSPSPERSSTGPELPAPTPllveqhvDSPRPLAAIPSSQEPVNPSSeAS 396
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQP-------QAPPPPQPQPQPPPPPQPQP-PP 2933
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720391393 397 PTRGCSPPKSPEKPPQSTSSESCPPSPQPTKGSRHASSSPESLKPTPAPGSRREISSSPTSKNRSH 462
Cdd:PHA03247 2934 PPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
2104-2382 |
1.76e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 44.07 E-value: 1.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2104 TGGSmmdGPGPRIPDHPRSSVPENHAQSRIALALTAISLGTARPPPSMSAAGLAARMSQVPAPVPLMSLRTAPAANLASR 2183
Cdd:PRK07003 363 TGGG---APGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRG 439
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2184 IPAASAAAMNLASARTSAIPASVNLADSRTPAAAAAMNlASPRTAVAPSAVNLADPRTPAASAVNLAGARTPAALAALSL 2263
Cdd:PRK07003 440 DDAADGDAPVPAKANARASADSRCDERDAQPPADSGSA-SAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASR 518
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2264 TGSGTPPTAANYPSSSRTPQAPTPANlvvgpRSAHGTAPVNI---AGSRTPAGlaptnlsSSRMAPALSGANLTSPRVPL 2340
Cdd:PRK07003 519 EDAPAAAAPPAPEARPPTPAAAAPAA-----RAGGAAAALDVlrnAGMRVSSD-------RGARAAAAAKPAAAPAAAPK 586
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 1720391393 2341 SAYDRVSGRTSPLMLDRARSRTPPSAPSQSRMTSERERAPSP 2382
Cdd:PRK07003 587 PAAPRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPPP 628
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
162-610 |
1.90e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.16 E-value: 1.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 162 QIAPEPPKPYSLVRETsssRSPTPKQKKKKKKKDRGRRSESSSPRRERKKSSKKKKHRSESESKKRKHRSPTPKSKRKSK 241
Cdd:PHA03247 2572 RPAPRPSEPAVTSRAR---RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPP 2648
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 242 DKKRKRSRSTTPAPKSRRAHRSTSADSASSSDTSRSRSRSAAAKIHTTALTGQSPPlasghqgeGDAPSVEPGATNIQQP 321
Cdd:PHA03247 2649 PERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP--------PPTPEPAPHALVSATP 2720
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 322 SSPAPSTKQSSSPyedkdkkeksAVRPSPSPERSSTGPELPAPtpllvEQHVDSPRPLAAIPSSQEPVNPSSEASPTrgc 401
Cdd:PHA03247 2721 LPPGPAAARQASP----------ALPAAPAPPAVPAGPATPGG-----PARPARPPTTAGPPAPAPPAAPAAGPPRR--- 2782
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 402 SPPKSPEKPPQSTSSESCPPSPQPTKGSRHASSSPESLKPTPAPGSRREISSSPTSKNRSHGRAKRDKSHSHT-----PS 476
Cdd:PHA03247 2783 LTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSvapggDV 2862
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 477 HRAGRSRSPATK---RGRSRSRTPTKRGHSRSRSPQWRRSRSAQRWGKSRSPQRRGRSRSPQRPGWSRSRNTQRRGRSRS 553
Cdd:PHA03247 2863 RRRPPSRSPAAKpaaPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*..
gi 1720391393 554 ARRGRSHSRSPATRGRSRSRTPARRGRSRSRTPaRRRSRSRTPARRRSRSRTPARRG 610
Cdd:PHA03247 2943 LAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP-RFRVPQPAPSREAPASSTPPLTG 2998
|
|
| RSRP |
pfam17069 |
Arginine/Serine-Rich protein 1; RSRP1 is a eukaryotic protein family. Its function is unknown. |
436-604 |
1.96e-03 |
|
Arginine/Serine-Rich protein 1; RSRP1 is a eukaryotic protein family. Its function is unknown.
Pssm-ID: 293674 [Multi-domain] Cd Length: 299 Bit Score: 42.84 E-value: 1.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 436 PESLKPTPAPGSRREISSS-PTSKNRSHGRAKRDKSHSHTPSHRAGRSRS-PATKRGRSRSRTPTKR---GHSRSRSPQW 510
Cdd:pfam17069 10 PGSPQEKKSPSTSSSGSSSrLSSRSRSRSSSRSSRSHSRSSSRFSSRSRSrPRRSRSRSRSRRRHQRkyrRYSRSYSRSR 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 511 RRSRSAQRWGKSRSPQRRGRSRSPQRpgwSRSRNTQRRGRSRSARRGRSHSRSpatrGRSRSRTPARRGRSRSRTpaRRR 590
Cdd:pfam17069 90 SRSRRRRYYRRSRYRYSRRYYRSPSR---SRSRSRSRSRGRSYYAIWRGSRYY----GFGRTVYPERSPRWRSRS--RTR 160
|
170
....*....|....
gi 1720391393 591 SRSRTPARRRSRSR 604
Cdd:pfam17069 161 SRSRTPFRLSEKER 174
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2164-2613 |
2.10e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 2.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2164 PAPVPLMSLRTAPAANLAsriPAASAAAMNLASARTSAIPASvnlADSRTPAAAAAMNLASPRTAVAPSAVNLADP--RT 2241
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPA---PRPSEPAVTSRARRPDAPPQS---ARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPppPS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2242 PAASAVNLAGARTPAALAALSLTGSGTPPTAANYPSSSRTPQAPTPANLVVGPRSAHGTAPVNIAGS--------RTPAG 2313
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpppppPTPEP 2710
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2314 LAPTNLSSSRMAPALSGANLTSPRVPLSAYDRVSGRTSPLMLDRARSRTPPSAPSQSRMTSERERAPSPASRMVQASSQS 2393
Cdd:PHA03247 2711 APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2394 LlppAQDRPRSPVPSAFSDQSRSVVQTTPVAGSQSLSSGTVAKSTSSASDHNGMLSGPAPGISHAEGGEPPASTGAQQ-P 2472
Cdd:PHA03247 2791 L---SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRpP 2867
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2473 STLAALQPAKERRSSSSSSSSSSSSSSSSSSSSSSSSSGSSSSDSEGSslPAQPEVALKRVPSPTPVPKEAIREGRPQEP 2552
Cdd:PHA03247 2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPP--PPQPQPQPPPPPQPQPPPPPPPRPQPPLAP 2945
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720391393 2553 TP------------------AKRKRRSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSPSPAKPGPQAL 2613
Cdd:PHA03247 2946 TTdpagagepsgavpqpwlgALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSL 3024
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
313-444 |
3.42e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 42.84 E-value: 3.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 313 PGATNIQQPSSPAPStkQSSSPYEdkdkkEKSAVRPSPSPERSSTGPELPAPTPllVEQHVDSPRPLAAIPSSQEPVNPS 392
Cdd:PRK14971 370 SGGRGPKQHIKPVFT--QPAAAPQ-----PSAAAAASPSPSQSSAAAQPSAPQS--ATQPAGTPPTVSVDPPAAVPVNPP 440
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1720391393 393 SEASPTRGCSPPKSPEKPPQSTSSESCPPSPQPTKGSRHASSSPESLKPTPA 444
Cdd:PRK14971 441 STAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPTGT 492
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
359-590 |
4.21e-03 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 42.58 E-value: 4.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 359 PELPAPTPLLVEQHVDSPRPLAAIPSSQEPVNPSSEASPTRGCSPPKSPEKPPQSTSSESCPPSPQPTKGSRHASSSPES 438
Cdd:PRK12678 68 ATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGE 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 439 LKPTPAPGSRREISSSPTSKNRSHGRAKRDKSHSHTPSHRAGRSRSPATKRGRSRSRTPTKRGHSRSRSPQWRRSRSAQR 518
Cdd:PRK12678 148 GGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRR 227
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720391393 519 WGKSRSPQRRGRSRSPQRPGWSRSRNTQRRGRSRSARrgrshsrspatRGRSRSRTPARRGRSRSRTPARRR 590
Cdd:PRK12678 228 GRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGR-----------RFRDRDRRGRRGGDGGNEREPELR 288
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2043-2619 |
4.47e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 4.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2043 ATPPATRNHS--GSRTPPVALSSSRMSCFSRPSMSPTPlDRCRSPGmlEPLGSARTPMsvlqQTGGSMMDGPGPRIPDHP 2120
Cdd:PHA03247 2558 AAPPAAPDRSvpPPRPAPRPSEPAVTSRARRPDAPPQS-ARPRAPV--DDRGDPRGPA----PPSPLPPDTHAPDPPPPS 2630
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2121 RSSVPENHAQSRIALA------LTAISLGTARPPPSMSAAGLAARMSQVPAPVPLMSLR--TAPAANLAsRIPAASAAAM 2192
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVppperpRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARptVGSLTSLA-DPPPPPPTPE 2709
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2193 NLASARTSAIPASVNLADSRTPAAAAAMNLASPRTAVAPSAVnlADPRTPAASAVNlAGARTPAALAALSLTGSGTPPTA 2272
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP--GGPARPARPPTT-AGPPAPAPPAAPAAGPPRRLTRP 2786
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2273 ANYPSSSRTPQAPTPANLVVGPRSAHGTAPVnIAGSRTPAGLAPTNLSSSRMAPALsganltsPRVPLSAYDRVSGRTSP 2352
Cdd:PHA03247 2787 AVASLSESRESLPSPWDPADPPAAVLAPAAA-LPPAASPAGPLPPPTSAQPTAPPP-------PPGPPPPSLPLGGSVAP 2858
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2353 --LMLDRARSRTPPSAPSQSRMTSERERAPSPASRmvQASSQSLLPPAQDRPRSPVPSAfsdqsrsvvQTTPVAGSQSLS 2430
Cdd:PHA03247 2859 ggDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR--STESFALPPDQPERPPQPQAPP---------PPQPQPQPPPPP 2927
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2431 SGTVAKSTSSASDhngmlSGPAPGISHAEGGEPpasTGAQQPSTLAALQPAKERRSSSSSSSSSSSSSSSSSSSSSSSSS 2510
Cdd:PHA03247 2928 QPQPPPPPPPRPQ-----PPLAPTTDPAGAGEP---SGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2511 GSSSSDSEGSSLPAQPEVAlkrvpsptpvpkeairegrpqePTPAKRKRRSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 2590
Cdd:PHA03247 3000 SLSRVSSWASSLALHEETD----------------------PPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDP 3057
|
570 580
....*....|....*....|....*....
gi 1720391393 2591 SSSSSSSSSSSSPSPAKPGPQALPKPASP 2619
Cdd:PHA03247 3058 LPPEPHDPFAHEPDPATPEAGARESPSSQ 3086
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
2280-2690 |
5.32e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.47 E-value: 5.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2280 RTPQAPTPANLVVGPRSAHGTAPVNIAGSRTPAGLAPTNLSSSRMAPALSGANLTSPRVPLSAYDRVSGRTSPLMLDRAR 2359
Cdd:PHA03307 25 PATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREG 104
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2360 SRTPPSAPSqsrmTSERERAPSPASRmvqassqsllPPAQDRPRSPVPSAFSDQSRSVVQTTPVAGSQSLSSGTVAKSTS 2439
Cdd:PHA03307 105 SPTPPGPSS----PDPPPPTPPPASP----------PPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSR 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2440 SASDHNGMLSGPAPGIShaeggEPPASTGAQQPSTLAALQPAKERRSSSSSSSSSSSSSSSSSSSSSSSSSGSSSSDSEG 2519
Cdd:PHA03307 171 QAALPLSSPEETARAPS-----SPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESS 245
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2520 SSLPAQPEVALKRVPSPTPVPKEAIREGRPQEPTPAKRKRRSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 2599
Cdd:PHA03307 246 GCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESS 325
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2600 SSSPSPAKPGPQALPKPASPKKPPPGERRSRSPRKPIDSLRDSRSLSYSPVERRQPSPQPSPRDLQSSERVSWRGQRGDS 2679
Cdd:PHA03307 326 SSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATG 405
|
410
....*....|.
gi 1720391393 2680 HSPGHKRKETP 2690
Cdd:PHA03307 406 RFPAGRPRPSP 416
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
309-462 |
6.09e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.21 E-value: 6.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 309 PSVEPGATNIQQPSSPAP--STKQSSSPYEDKDKKEKSAVRPSPSPERSST---GPELPAPTPLLV--EQHVDSPRPLAA 381
Cdd:pfam05109 449 PSSTHVPTNLTAPASTGPtvSTADVTSPTPAGTTSGASPVTPSPSPRDNGTeskAPDMTSPTSAVTtpTPNATSPTPAVT 528
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 382 IPSSQEPVNPSSEASPTRGCSPPKSPEKPPQSTSSESCPPSPQPTKGSRHASSSPESlkPTPAPGSRREISSSPTSKNRS 461
Cdd:pfam05109 529 TPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTT--PTPNATSPTVGETSPQANTTN 606
|
.
gi 1720391393 462 H 462
Cdd:pfam05109 607 H 607
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2230-2610 |
8.19e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 8.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2230 APSAVNLADPRTPAASAVNLAGARTPAALAAlslTGSGTPPTAANYPSSSRTPQAPtpanlvvgPRSAHGTAPVNIAGSR 2309
Cdd:PHA03247 2552 PPPLPPAAPPAAPDRSVPPPRPAPRPSEPAV---TSRARRPDAPPQSARPRAPVDD--------RGDPRGPAPPSPLPPD 2620
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2310 TPAGLAPTNLSSSRMAPALSGANLTSPRVPLSAYDRVSGRTSPLMLDRARSRTP-PSAPSQSrmtsERERAPSPASRMVQ 2388
Cdd:PHA03247 2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAqASSPPQR----PRRRAARPTVGSLT 2696
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2389 ASSQsllPPAQDRPRSPVPSAFSdqSRSVVQTTPVAGSQSLSSGTVAKSTSSASDHNGMLSGPAPGISHAEGGEPPASTG 2468
Cdd:PHA03247 2697 SLAD---PPPPPPTPEPAPHALV--SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP 2771
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 2469 AQQPST---LAALQPAKERRSSSSSSSSSSSSSSSSSSSSSSSSSGSSSSDSEGSSLPAQPEVALKRVPSPTPVPKEAIR 2545
Cdd:PHA03247 2772 PAAPAAgppRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720391393 2546 EGRPQEPTPAKRKRRSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSPSPAKPGP 2610
Cdd:PHA03247 2852 LGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPP 2916
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
350-726 |
8.46e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 41.70 E-value: 8.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 350 PSPERSSTGPELPAPTPLLVEQHVDSPRPLAAIPSSQEPVNPSSEASPTRGCSPPKSPEKPPQSTSSEScpPSPQPTKGS 429
Cdd:PHA03307 39 SQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPT--PPGPSSPDP 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 430 RHASSSPESLKPTPAPGSRREISSSPTSKNRSHGRAKR-DKSHSHTPSHRAG-RSRSPATKRGRSRSRTPTKRGHSR--S 505
Cdd:PHA03307 117 PPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAaGASPAAVASDAASsRQAALPLSSPEETARAPSSPPAEPppS 196
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 506 RSPQWRRSRSAQRWGKSRSPQRRGRSRSPQRPGWSRSRNTQRRGRSRSARRGRSHSRSPATRGRSRSRTPARRGRSRSRT 585
Cdd:PHA03307 197 TPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWN 276
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 586 PARRR---SRSRTPARRRSRSRTPARRGRSRSRTPARRRSRTRSPVRRRSRSRSQARRSGRSRSRTPArrsgrsrsrtpa 662
Cdd:PHA03307 277 GPSSRpgpASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPG------------ 344
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720391393 663 rRGRSRSRTPARRSARSRSRTPARRGRSRSRTPARRRSRSRSLVRRGRShSRTPQRRGRSGSSS 726
Cdd:PHA03307 345 -PSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARA-AVAGRARRRDATGR 406
|
|
| U2AF_lg |
TIGR01642 |
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of ... |
525-608 |
9.22e-03 |
|
U2 snRNP auxiliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxiliary components of the U2 small nuclear ribonucleoprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.
Pssm-ID: 273727 [Multi-domain] Cd Length: 509 Bit Score: 41.42 E-value: 9.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391393 525 PQRRGRSRSPQRP-GWSRSRNTQRRGRSRSARRGRSHSRSPATRGRSRSRTPA-----RRGRSRSRTPARRRSRSR-TPA 597
Cdd:TIGR01642 12 SRGRDRDRSSERPrRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRslrysSVRRSRDRPRRRSRSVRSiEQH 91
|
90
....*....|.
gi 1720391393 598 RRRSRSRTPAR 608
Cdd:TIGR01642 92 RRRLRDRSPSN 102
|
|
|