| Name | Accession | Description | Interval | E-value |
| Nucleic_acid_bd | pfam13820 | Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ... | 48-190 | 2.67e-60 |
|
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.
Pssm-ID: 463988 Cd Length: 143 Bit Score: 203.43 E-value: 2.67e-60
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 48 IFVAFKGNIDDkdFKWKLDTILQSVPGLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820 1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958759471 128 QIEGEGAINLALG---QNRSQDVRMnGPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQGGNA 190
Cdd:pfam13820 79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
|
|
| Med15 super family | cl26621 | ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... | 377-787 | 2.53e-11 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development. The actual alignment was detected with superfamily member pfam09606:
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 68.88 E-value: 2.53e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 377 QQASQAHTNFPQMSNPGQFTAPQMKSLQGGPS---RVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRP 453
Cdd:pfam09606 60 QQQPQGGQGNGGMGGGQQGMPDPINALQNLAGqgtRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGF 139
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 454 PQNNPLSQGFQQPVSSPGRNP--MVQQGNVPPNFMVMQQQPPSQGPPSLHPGLG---GQANPNFMQGQVPSTTAaTPGNS 528
Cdd:pfam09606 140 PSQMSRVGRMQPGGQAGGMMQpsSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQgpmGGQMPPQMGVPGMPGPA-DAGAQ 218
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 529 GALQLQANQSVQhAGGQGAGPPQNQMQvshGPPNMMQPSLMGIHGNINNQQAGSSGVPQVTLGSMQGQPQQGPPSQLMGM 608
Cdd:pfam09606 219 MGQQAQANGGMN-PQQMGGAPNQVAMQ---QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAM 294
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 609 HQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAPhnqmMGPQGQ 688
Cdd:pfam09606 295 PNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANP----MQRGQP 370
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 689 VLLQQNPMIEQIMTNQMQGNKAQFNSQNQSNVMP------GPAQIMRGPTPNmQGNMVQFTGQMSGQMLPQQGPVSNSPS 762
Cdd:pfam09606 371 GMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPqgpgsqPPQSHPGGMIPS-PALIPSPSPQMSQQPAQQRTIGQDSPG 449
|
410 420
....*....|....*....|....*
gi 1958759471 763 QVMGIQGQVLRPPGPSPHMAQQHTD 787
Cdd:pfam09606 450 GSLNTPGQSAVNSPLNPQEEQLYRE 474
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 166-474 | 1.29e-05 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 1.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 166 PMASGPGLIRMTSPATVMMPQGGNASSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQSHPSGSLPPAHHP--- 242
Cdd:PHA03247 2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvg 2693
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 243 -----MQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGI----RPQFTAPTQVPVPPG 313
Cdd:PHA03247 2694 sltslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParpaRPPTTAGPPAPAPPA 2773
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 314 WNQLPSGALQPPPAQGSLGPMTTNQGWKKAPLPSPMqAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP- 392
Cdd:PHA03247 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA-AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPl 2852
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 393 GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGP-RPPQNNPLSQGFQQPVSSPG 471
Cdd:PHA03247 2853 GGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPeRPPQPQAPPPPQPQPQPPPP 2926
|
...
gi 1958759471 472 RNP 474
Cdd:PHA03247 2927 PQP 2929
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 1040-1260 | 6.67e-04 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.93 E-value: 6.67e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1040 PPRGPLNPDSQRVPMQQSgnVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNSSSP-LGEMPSLPEA 1118
Cdd:PHA03247 2744 VPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPA 2821
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1119 ---GGSEVPSVSGGPSNMPSHLVVSQNQLMMTG--------PKPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1187
Cdd:PHA03247 2822 aspAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvapggdvRRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958759471 1188 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNP 1260
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP 2960
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 1658-1925 | 1.47e-03 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1658 PSTIPATPLTTNSGLMPPSVAVVGPlhiPQNIKFSSAPVTPNVPSSSPAPniQTGRPLVLSSRATPVPLPSPPCTS---S 1734
Cdd:PHA03247 2735 LPAAPAPPAVPAGPATPGGPARPAR---PPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPAdppA 2809
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1735 PVVAPNPSVQQVKELNPDEASPQTNTSAdQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKG------KVDKIGQ 1808
Cdd:PHA03247 2810 AVLAPAAALPPAASPAGPLPPPTSAQPT-APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPaaparpPVRRLAR 2888
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1809 ILLTKACKKVtgSLEKGEEQYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTPSAPTllKMTSSPMGPSSTSTGPIL 1888
Cdd:PHA03247 2889 PAVSRSTESF--ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT--TDPAGAGEPSGAVPQPWL 2964
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 1958759471 1889 pgGALPTSVRSVVTTLVPSELIS---TAPTTKGNHGGITS 1925
Cdd:PHA03247 2965 --GALVPGRVAVPRFRVPQPAPSreaPASSTPPLTGHSLS 3002
|
|
| Herpes_BLLF1 super family | cl37540 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 1482-1801 | 2.63e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.98 E-value: 2.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1482 SGAPNVTIKPPGLTDLEVTPPAVSGEDLKK-------ASVIPTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNI 1554
Cdd:pfam05109 446 TGLPSSTHVPTNLTAPASTGPTVSTADVTSptpagttSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTP 525
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1555 APSIP-PVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNTSAALPTHLQSALMSTVVT-MPNVGNKVMvseGQSAAQS 1632
Cdd:pfam05109 526 AVTTPtPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTpTPNATSPTV---GETSPQA 602
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1633 NARPQFITpvfiNSSSIIQVMKGSQPSTIPATPLTTNSGLMPPSVAVVGPLHIPQNIKFSSAPVT----PNVPSSSP--A 1706
Cdd:pfam05109 603 NTTNHTLG----GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNStshmPLLTSAHPtgG 678
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1707 PNIQTGRPLVLSSRATPVPLPSP-PCTSSPVVAPNPSVQQVK--ELNPDEASPQTNTSADQStlPSSQPTTVVSpllANS 1783
Cdd:pfam05109 679 ENITQVTPASTSTHHVSTSSPAPrPGTTSQASGPGNSSTSTKpgEVNVTKGTPPKNATSPQA--PSGQKTAVPT---VTS 753
|
330
....*....|....*...
gi 1958759471 1784 PGSSANRRSPVSSSKGKG 1801
Cdd:pfam05109 754 TGGKANSTTGGKHTTGHG 771
|
|
|
| Name | Accession | Description | Interval | E-value |
| Nucleic_acid_bd | pfam13820 | Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ... | 48-190 | 2.67e-60 |
|
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.
Pssm-ID: 463988 Cd Length: 143 Bit Score: 203.43 E-value: 2.67e-60
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 48 IFVAFKGNIDDkdFKWKLDTILQSVPGLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820 1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958759471 128 QIEGEGAINLALG---QNRSQDVRMnGPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQGGNA 190
Cdd:pfam13820 79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
|
|
| Med15 | pfam09606 | ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... | 377-787 | 2.53e-11 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 68.88 E-value: 2.53e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 377 QQASQAHTNFPQMSNPGQFTAPQMKSLQGGPS---RVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRP 453
Cdd:pfam09606 60 QQQPQGGQGNGGMGGGQQGMPDPINALQNLAGqgtRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGF 139
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 454 PQNNPLSQGFQQPVSSPGRNP--MVQQGNVPPNFMVMQQQPPSQGPPSLHPGLG---GQANPNFMQGQVPSTTAaTPGNS 528
Cdd:pfam09606 140 PSQMSRVGRMQPGGQAGGMMQpsSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQgpmGGQMPPQMGVPGMPGPA-DAGAQ 218
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 529 GALQLQANQSVQhAGGQGAGPPQNQMQvshGPPNMMQPSLMGIHGNINNQQAGSSGVPQVTLGSMQGQPQQGPPSQLMGM 608
Cdd:pfam09606 219 MGQQAQANGGMN-PQQMGGAPNQVAMQ---QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAM 294
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 609 HQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAPhnqmMGPQGQ 688
Cdd:pfam09606 295 PNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANP----MQRGQP 370
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 689 VLLQQNPMIEQIMTNQMQGNKAQFNSQNQSNVMP------GPAQIMRGPTPNmQGNMVQFTGQMSGQMLPQQGPVSNSPS 762
Cdd:pfam09606 371 GMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPqgpgsqPPQSHPGGMIPS-PALIPSPSPQMSQQPAQQRTIGQDSPG 449
|
410 420
....*....|....*....|....*
gi 1958759471 763 QVMGIQGQVLRPPGPSPHMAQQHTD 787
Cdd:pfam09606 450 GSLNTPGQSAVNSPLNPQEEQLYRE 474
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 166-474 | 1.29e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 1.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 166 PMASGPGLIRMTSPATVMMPQGGNASSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQSHPSGSLPPAHHP--- 242
Cdd:PHA03247 2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvg 2693
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 243 -----MQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGI----RPQFTAPTQVPVPPG 313
Cdd:PHA03247 2694 sltslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParpaRPPTTAGPPAPAPPA 2773
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 314 WNQLPSGALQPPPAQGSLGPMTTNQGWKKAPLPSPMqAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP- 392
Cdd:PHA03247 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA-AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPl 2852
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 393 GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGP-RPPQNNPLSQGFQQPVSSPG 471
Cdd:PHA03247 2853 GGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPeRPPQPQAPPPPQPQPQPPPP 2926
|
...
gi 1958759471 472 RNP 474
Cdd:PHA03247 2927 PQP 2929
|
|
| PABP-1234 | TIGR01628 | polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... | 603-774 | 1.60e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 49.81 E-value: 1.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 603 SQLMGMHQQIVPsqgqMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMtPPKQMLPQQGPQMMAPhnqm 682
Cdd:TIGR01628 369 AHLQDQFMQLQP----RMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP-TPMGPGGPLRPNGLAP---- 439
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 683 MGPQGQVLLQQNPMIEQimtNQMQGNKAQFNSQNQSNVMPGPAQimrGPTPNMQGNMVQFTGQMSgqmlpqqgpvSNSPS 762
Cdd:TIGR01628 440 MNAVRAPSRNAQNAAQK---PPMQPVMYPPNYQSLPLSQDLPQP---QSTASQGGQNKKLAQVLA----------SATPQ 503
|
170
....*....|..
gi 1958759471 763 QVMGIQGQVLRP 774
Cdd:TIGR01628 504 MQKQVLGERLFP 515
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 1040-1260 | 6.67e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.93 E-value: 6.67e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1040 PPRGPLNPDSQRVPMQQSgnVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNSSSP-LGEMPSLPEA 1118
Cdd:PHA03247 2744 VPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPA 2821
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1119 ---GGSEVPSVSGGPSNMPSHLVVSQNQLMMTG--------PKPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1187
Cdd:PHA03247 2822 aspAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvapggdvRRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958759471 1188 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNP 1260
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP 2960
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 199-678 | 1.16e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.16 E-value: 1.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 199 PNPELQPRTPRPASQSDAMDPllsGLHIQQ--------------QSHPSGSLPPAHHPMQPVPVNRQMNPANFPQLQQQQ 264
Cdd:PHA03247 2569 PPPRPAPRPSEPAVTSRARRP---DAPPQSarprapvddrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 265 QQQQQQQQQQQQQQQQQLQTRplqqhqqQQPQGIRPQFTAPTQVPVPPGwnqlpsgalqPPPAQGSLgpmtTNQGWKKAP 344
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRR-------ARRLGRAAQASSPPQRPRRRA----------ARPTVGSL----TSLADPPPP 2704
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 345 LPSPmqaqlQARPSLATVQTPSHPPPPYPFGSQQASQAhtnfpqmsNPGQFTAPQMKSLQGGPSRVPTPlqqphltnKSP 424
Cdd:PHA03247 2705 PPTP-----EPAPHALVSATPLPPGPAAARQASPALPA--------APAPPAVPAGPATPGGPARPARP--------PTT 2763
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 425 ASSPSSFQQGSPASSPTVNQTQQQMGPRPPQNNPL---SQGFQQPVSSPGRNPMVQQGNVPpnfmvmqqqppsqgppslh 501
Cdd:PHA03247 2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLpspWDPADPPAAVLAPAAALPPAASP------------------- 2824
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 502 pglggqanpnfmQGQVPSTTAATPGNSGALQLQANQSVQHAGGQGAGPPqnqmqVSHGPPNMMQPSLMGIHGNINNQQAG 581
Cdd:PHA03247 2825 ------------AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGD-----VRRRPPSRSPAAKPAAPARPPVRRLA 2887
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 582 SSGVPQVTLGSMQGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPmiLSRAQLMPQGQMMVNAQNQNLGPSPQ-- 659
Cdd:PHA03247 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP--RPQPPLAPTTDPAGAGEPSGAVPQPWlg 2965
|
490 500
....*....|....*....|....
gi 1958759471 660 -----RMTPPKQMLPQQGPQMMAP 678
Cdd:PHA03247 2966 alvpgRVAVPRFRVPQPAPSREAP 2989
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 1658-1925 | 1.47e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1658 PSTIPATPLTTNSGLMPPSVAVVGPlhiPQNIKFSSAPVTPNVPSSSPAPniQTGRPLVLSSRATPVPLPSPPCTS---S 1734
Cdd:PHA03247 2735 LPAAPAPPAVPAGPATPGGPARPAR---PPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPAdppA 2809
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1735 PVVAPNPSVQQVKELNPDEASPQTNTSAdQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKG------KVDKIGQ 1808
Cdd:PHA03247 2810 AVLAPAAALPPAASPAGPLPPPTSAQPT-APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPaaparpPVRRLAR 2888
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1809 ILLTKACKKVtgSLEKGEEQYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTPSAPTllKMTSSPMGPSSTSTGPIL 1888
Cdd:PHA03247 2889 PAVSRSTESF--ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT--TDPAGAGEPSGAVPQPWL 2964
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 1958759471 1889 pgGALPTSVRSVVTTLVPSELIS---TAPTTKGNHGGITS 1925
Cdd:PHA03247 2965 --GALVPGRVAVPRFRVPQPAPSreaPASSTPPLTGHSLS 3002
|
|
| Herpes_BLLF1 | pfam05109 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 1482-1801 | 2.63e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.98 E-value: 2.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1482 SGAPNVTIKPPGLTDLEVTPPAVSGEDLKK-------ASVIPTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNI 1554
Cdd:pfam05109 446 TGLPSSTHVPTNLTAPASTGPTVSTADVTSptpagttSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTP 525
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1555 APSIP-PVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNTSAALPTHLQSALMSTVVT-MPNVGNKVMvseGQSAAQS 1632
Cdd:pfam05109 526 AVTTPtPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTpTPNATSPTV---GETSPQA 602
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1633 NARPQFITpvfiNSSSIIQVMKGSQPSTIPATPLTTNSGLMPPSVAVVGPLHIPQNIKFSSAPVT----PNVPSSSP--A 1706
Cdd:pfam05109 603 NTTNHTLG----GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNStshmPLLTSAHPtgG 678
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1707 PNIQTGRPLVLSSRATPVPLPSP-PCTSSPVVAPNPSVQQVK--ELNPDEASPQTNTSADQStlPSSQPTTVVSpllANS 1783
Cdd:pfam05109 679 ENITQVTPASTSTHHVSTSSPAPrPGTTSQASGPGNSSTSTKpgEVNVTKGTPPKNATSPQA--PSGQKTAVPT---VTS 753
|
330
....*....|....*...
gi 1958759471 1784 PGSSANRRSPVSSSKGKG 1801
Cdd:pfam05109 754 TGGKANSTTGGKHTTGHG 771
|
|
| Herpes_BLLF1 | pfam05109 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 1091-1397 | 3.63e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.60 E-value: 3.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1091 SNSRKMVYQESPQNSSSPlgemPSLPEAGGSEVPSVSGGPSN--MPSHLVV--SQNQLMMTGPKPGPSPLSATQGATPQQ 1166
Cdd:pfam05109 414 TTTHKVIFSKAPESTTTS----PTLNTTGFAAPNTTTGLPSSthVPTNLTApaSTGPTVSTADVTSPTPAGTTSGASPVT 489
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1167 PPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSEISLSPErlnasiaglfpPQINIP 1242
Cdd:pfam05109 490 PSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSAVTTPT-----------PNATSP 558
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1243 LPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVvsSGKQSNPGTT---KRASPSNSRRSSPG 1319
Cdd:pfam05109 559 TPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTL--GGTSSTPVVTsppKNATSAVTTGQHNI 636
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958759471 1320 SSRKTTPSPGRQNSKAPklTLASQTSTTLLQNMELprnvLVGPTPLANPPLSGSFPNNNGLNSQNPTVPAPAVGTVVE 1397
Cdd:pfam05109 637 TSSSTSSMSLRPSSISE--TLSPSTSDNSTSHMPL----LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQ 708
|
|
| Herpes_BLLF1 | pfam05109 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 1516-1886 | 6.67e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.44 E-value: 6.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1516 PTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNIAPSIPPVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNT 1595
Cdd:pfam05109 432 PTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTS 511
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1596 SAALPTHLQSALMSTVVT-MPNVGNKVMVSEGQSAAQSNARPQFITPVFINSSSIIQVMKGSQPSTIPATPLTTNS-GLM 1673
Cdd:pfam05109 512 AVTTPTPNATSPTPAVTTpTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNAT 591
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1674 PPSVAVVGPLHIPQNIKFSSAPVTPNV--PSSSPAPNIQTGRPLVLSSRATPVPLpsPPCTSSPVVAPNPSVQQVKELN- 1750
Cdd:pfam05109 592 SPTVGETSPQANTTNHTLGGTSSTPVVtsPPKNATSAVTTGQHNITSSSTSSMSL--RPSSISETLSPSTSDNSTSHMPl 669
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1751 PDEASPQTNTSADQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKgkvdKIGQILLTKACKKVTGSLEKGEE--- 1827
Cdd:pfam05109 670 LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTST----KPGEVNVTKGTPPKNATSPQAPSgqk 745
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958759471 1828 -------QYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTP-----SAPTLLKMTSSPMGPSSTSTGP 1886
Cdd:pfam05109 746 tavptvtSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGDSTTPrtrynATTYLPPSTSSKLRPRWTFTSP 816
|
|
|
| Name | Accession | Description | Interval | E-value |
| Nucleic_acid_bd | pfam13820 | Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ... | 48-190 | 2.67e-60 |
|
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.
Pssm-ID: 463988 Cd Length: 143 Bit Score: 203.43 E-value: 2.67e-60
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 48 IFVAFKGNIDDkdFKWKLDTILQSVPGLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820 1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958759471 128 QIEGEGAINLALG---QNRSQDVRMnGPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQGGNA 190
Cdd:pfam13820 79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
|
|
| Med15 | pfam09606 | ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... | 377-787 | 2.53e-11 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 68.88 E-value: 2.53e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 377 QQASQAHTNFPQMSNPGQFTAPQMKSLQGGPS---RVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRP 453
Cdd:pfam09606 60 QQQPQGGQGNGGMGGGQQGMPDPINALQNLAGqgtRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGF 139
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 454 PQNNPLSQGFQQPVSSPGRNP--MVQQGNVPPNFMVMQQQPPSQGPPSLHPGLG---GQANPNFMQGQVPSTTAaTPGNS 528
Cdd:pfam09606 140 PSQMSRVGRMQPGGQAGGMMQpsSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQgpmGGQMPPQMGVPGMPGPA-DAGAQ 218
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 529 GALQLQANQSVQhAGGQGAGPPQNQMQvshGPPNMMQPSLMGIHGNINNQQAGSSGVPQVTLGSMQGQPQQGPPSQLMGM 608
Cdd:pfam09606 219 MGQQAQANGGMN-PQQMGGAPNQVAMQ---QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAM 294
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 609 HQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAPhnqmMGPQGQ 688
Cdd:pfam09606 295 PNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANP----MQRGQP 370
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 689 VLLQQNPMIEQIMTNQMQGNKAQFNSQNQSNVMP------GPAQIMRGPTPNmQGNMVQFTGQMSGQMLPQQGPVSNSPS 762
Cdd:pfam09606 371 GMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPqgpgsqPPQSHPGGMIPS-PALIPSPSPQMSQQPAQQRTIGQDSPG 449
|
410 420
....*....|....*....|....*
gi 1958759471 763 QVMGIQGQVLRPPGPSPHMAQQHTD 787
Cdd:pfam09606 450 GSLNTPGQSAVNSPLNPQEEQLYRE 474
|
|
| Med15 | pfam09606 | ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... | 268-732 | 7.54e-08 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 57.71 E-value: 7.54e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 268 QQQQQQQQQQQQQQLQTRPLQQHQQQQPQGIRPQFTAptqvPVPPGwNQLPSGALQPPPAQGSLGPMTTNQGWKK---AP 344
Cdd:pfam09606 64 QGGQGNGGMGGGQQGMPDPINALQNLAGQGTRPQMMG----PMGPG-PGGPMGQQMGGPGTASNLLASLGRPQMPmggAG 138
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 345 LPSPMQAQLQARPSLATvqtpsHPPPPYPFGSQQASQAHTNFPQMSnPGQFTAPQMKSLQGGPSRVPTPLQQPHLTNKSP 424
Cdd:pfam09606 139 FPSQMSRVGRMQPGGQA-----GGMMQPSSGQPGSGTPNQMGPNGG-PGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGP 212
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 425 ASSPSSFQQGSPASSPtVNQTQQQMGP--RPPQNNPLSQGFQQPVSSPGRNPMVQQGNVPPNfmvmqqqppsqGPPSLHP 502
Cdd:pfam09606 213 ADAGAQMGQQAQANGG-MNPQQMGGAPnqVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGG-----------GAGQGGP 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 503 GLGGQANPNFMQGQVPSTTAATPGNsgALQLQANQSVQHAGGQGAGPPQNQMQVSHGPPNMMQPSLMGIH------GNIN 576
Cdd:pfam09606 281 GQPMGPPGQQPGAMPNVMSIGDQNN--YQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHletwnpGNFG 358
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 577 NQQAGSSGVPQVTLGSM------QGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQ 650
Cdd:pfam09606 359 GLGANPMQRGQPGMMSSpspvpgQQVRQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPA 438
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 651 NQNLGPSPQRMTPPKQMLPQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKAQFNSQNQsnvMPGPAQIMRG 730
Cdd:pfam09606 439 QQRTIGQDSPGGSLNTPGQSAVNSPLNPQEEQLYREKYRQLTKYIEPLKRMIAKMENDPGDIDKMNK---MKRLLEILSN 515
|
..
gi 1958759471 731 PT 732
Cdd:pfam09606 516 PS 517
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
430-697 |
5.13e-07 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 55.04 E-value: 5.13e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 430 SFQQGSPASSPTVNQTQQQMGPRPPQNNPLSQGFQQPVSSpGRNPMVQQGNVPpnfmvmqqqppsqgppSLHP-----GL 504
Cdd:pfam09770 103 NRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRT-GYEKYKEPEPIP----------------DLQVdaslwGV 165
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 505 GGQANPNFMQGQVPSTTAATPGNSG-----------ALQLQANQSVQHAGGQGAGPPQNQMQVSHGPPNMMQPSlmgihg 573
Cdd:pfam09770 166 APKKAAAPAPAPQPAAQPASLPAPSrkmmsleeveaAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQ------ 239
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 574 ninnqqagssgvpqvtlgSMQGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQN 653
Cdd:pfam09770 240 ------------------IQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQ 301
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1958759471 654 LGPSPQRMTPPKQMLPQQGPQMMAP-HNQMMGPQGQVLLQQNPMI 697
Cdd:pfam09770 302 ILQNPNRLSAARVGYPQNPQPGVQPaPAHQAHRQQGSFGRQAPII 346
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
166-474 |
1.29e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 1.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 166 PMASGPGLIRMTSPATVMMPQGGNASSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQSHPSGSLPPAHHP--- 242
Cdd:PHA03247 2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvg 2693
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 243 -----MQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGI----RPQFTAPTQVPVPPG 313
Cdd:PHA03247 2694 sltslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParpaRPPTTAGPPAPAPPA 2773
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 314 WNQLPSGALQPPPAQGSLGPMTTNQGWKKAPLPSPMqAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP- 392
Cdd:PHA03247 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA-AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPl 2852
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 393 GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGP-RPPQNNPLSQGFQQPVSSPG 471
Cdd:PHA03247 2853 GGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPeRPPQPQAPPPPQPQPQPPPP 2926
|
...
gi 1958759471 472 RNP 474
Cdd:PHA03247 2927 PQP 2929
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
603-774 |
1.60e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 49.81 E-value: 1.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 603 SQLMGMHQQIVPsqgqMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMtPPKQMLPQQGPQMMAPhnqm 682
Cdd:TIGR01628 369 AHLQDQFMQLQP----RMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP-TPMGPGGPLRPNGLAP---- 439
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 683 MGPQGQVLLQQNPMIEQimtNQMQGNKAQFNSQNQSNVMPGPAQimrGPTPNMQGNMVQFTGQMSgqmlpqqgpvSNSPS 762
Cdd:TIGR01628 440 MNAVRAPSRNAQNAAQK---PPMQPVMYPPNYQSLPLSQDLPQP---QSTASQGGQNKKLAQVLA----------SATPQ 503
|
170
....*....|..
gi 1958759471 763 QVMGIQGQVLRP 774
Cdd:TIGR01628 504 MQKQVLGERLFP 515
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
596-739 |
2.13e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 49.42 E-value: 2.13e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 596 QPQQGPPSQ-LMGMHQQIVPSQGQMAQQQGTLNPQNPMilsraqLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQ 674
Cdd:TIGR01628 384 QLPMGSPMGgAMGQPPYYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP 457
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958759471 675 MMAPHNQMMGPQGQVLLQQNPmieqimtnQMQGNKAQFNSqnqsnvMPGPAQIMRGPTPNMQGNM 739
Cdd:TIGR01628 458 PMQPVMYPPNYQSLPLSQDLP--------QPQSTASQGGQ------NKKLAQVLASATPQMQKQV 508
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
592-793 |
3.34e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 49.26 E-value: 3.34e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 592 SMQGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPmilsrAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQq 671
Cdd:pfam09770 202 AMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQ-----QQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPD- 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 672 gpqmmaPHNQMMGPQGQVLLQQNPmieqimtnqmqgnkaqfnsqnqsNVMPGPAQIMrgPTPNMQGNMVQftgQMSGQML 751
Cdd:pfam09770 276 ------PAQPSIQPQAQQFHQQPP-----------------------PVPVQPTQIL--QNPNRLSAARV---GYPQNPQ 321
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1958759471 752 PQQGPVSNSPSQvmgiqgqvlRPPGPSPHMAQQHTDPATTAN 793
Cdd:pfam09770 322 PGVQPAPAHQAH---------RQQGSFGRQAPIITHPQQLAQ 354
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1040-1260 |
6.67e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.93 E-value: 6.67e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1040 PPRGPLNPDSQRVPMQQSgnVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNSSSP-LGEMPSLPEA 1118
Cdd:PHA03247 2744 VPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPA 2821
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1119 ---GGSEVPSVSGGPSNMPSHLVVSQNQLMMTG--------PKPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1187
Cdd:PHA03247 2822 aspAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvapggdvRRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958759471 1188 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNP 1260
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP 2960
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1006-1396 |
8.66e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 8.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1006 SQPQSQQQQQQMMMMLMMQQDPKSIRLPVSQNVHPPR----GPLNPDSQRVPMQQSGNVPVMVSLQG--PASVPPSPDKQ 1079
Cdd:PHA03247 2574 APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGpappSPLPPDTHAPDPPPPSPSPAANEPDPhpPPTVPPPERPR 2653
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1080 RMPMPvntPLGSNSRKMVYQESPQNSSSPLG--EMPSLPEAGGS-----EVPSVSGGPSNMPSHLVVSQNQLMMTGPKPG 1152
Cdd:PHA03247 2654 DDPAP---GRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSltslaDPPPPPPTPEPAPHALVSATPLPPGPAAARQ 2730
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1153 PSPLSATQGATPQQPPVNSLPsshghhfpnvAAPTQTSRPKTPNrASPRPYYPQTPNNRPP--STEPSEISLSPERLNAS 1230
Cdd:PHA03247 2731 ASPALPAAPAPPAVPAGPATP----------GGPARPARPPTTA-GPPAPAPPAAPAAGPPrrLTRPAVASLSESRESLP 2799
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1231 IAGLFPPQINIPLPPRPNLNRGFDQQGLNPTTLKAIGQAPSnltvsnPPNFAAPQAHKLDSVVVSSGKQSNPGTTKRASP 1310
Cdd:PHA03247 2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP------PPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAA 2873
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1311 SNSRRSSPGSSRKTTPSPGRQNS---------KAPKLTLASQTSTTLLQNMELPRNVLVGPTPLANPPLSGSFPNNNGLN 1381
Cdd:PHA03247 2874 KPAAPARPPVRRLARPAVSRSTEsfalppdqpERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953
|
410
....*....|....*
gi 1958759471 1382 SQNPTVPAPAVGTVV 1396
Cdd:PHA03247 2954 EPSGAVPQPWLGALV 2968
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1069-1552 |
8.73e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 8.73e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1069 PASVPPSPDK-----QRMPMPVNTPLGSNSRKMVYQESPQNSSSPLGEMPSLP-EAGGSEVPSVSGGPSNMPSHLVVSQN 1142
Cdd:PHA03247 2557 PAAPPAAPDRsvpppRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRgPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1143 QLMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHHfPNVAAPTQTSRPktpnRASPRPYYPQTPNNRPPSTEPSEISL 1222
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRA-AQASSPPQRPRR----RAARPTVGSLTSLADPPPPPPTPEPA 2711
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1223 SPERLNAsiaglfppqinIPLPPRPNLNRgfdqQGLNPTTLKAIGQAPSNLTVSnPPNFAAPQAHKLDSVVVSSGKQSNP 1302
Cdd:PHA03247 2712 PHALVSA-----------TPLPPGPAAAR----QASPALPAAPAPPAVPAGPAT-PGGPARPARPPTTAGPPAPAPPAAP 2775
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1303 GTTKRASPSNSRRSSPGSSRKTTPSPgRQNSKAPKLTLASQTSTTLLQNMELPRNVLVGPTPLANPPLSGSFPNnnglns 1382
Cdd:PHA03247 2776 AAGPPRRLTRPAVASLSESRESLPSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP------ 2848
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1383 qnptvPAPAVGTVVEDNKESLNVPQDSDCQNSQGRKEQVNTELKAVPIQEAKMVVPEdqskkdgQPLDPNKLPSVEENKT 1462
Cdd:PHA03247 2849 -----SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL-------PPDQPERPPQPQAPPP 2916
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1463 LMSPAMREAPTSLSQLLDNSGAPNVTIKPPGLTDLEVTPPAVSGEDLKKASVIPTLQDPSSKEPSNSLNLPHSNEPCSTL 1542
Cdd:PHA03247 2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPL 2996
|
490
....*....|
gi 1958759471 1543 AHPELSEVSS 1552
Cdd:PHA03247 2997 TGHSLSRVSS 3006
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
199-678 |
1.16e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.16 E-value: 1.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 199 PNPELQPRTPRPASQSDAMDPllsGLHIQQ--------------QSHPSGSLPPAHHPMQPVPVNRQMNPANFPQLQQQQ 264
Cdd:PHA03247 2569 PPPRPAPRPSEPAVTSRARRP---DAPPQSarprapvddrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 265 QQQQQQQQQQQQQQQQQLQTRplqqhqqQQPQGIRPQFTAPTQVPVPPGwnqlpsgalqPPPAQGSLgpmtTNQGWKKAP 344
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRR-------ARRLGRAAQASSPPQRPRRRA----------ARPTVGSL----TSLADPPPP 2704
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 345 LPSPmqaqlQARPSLATVQTPSHPPPPYPFGSQQASQAhtnfpqmsNPGQFTAPQMKSLQGGPSRVPTPlqqphltnKSP 424
Cdd:PHA03247 2705 PPTP-----EPAPHALVSATPLPPGPAAARQASPALPA--------APAPPAVPAGPATPGGPARPARP--------PTT 2763
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 425 ASSPSSFQQGSPASSPTVNQTQQQMGPRPPQNNPL---SQGFQQPVSSPGRNPMVQQGNVPpnfmvmqqqppsqgppslh 501
Cdd:PHA03247 2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLpspWDPADPPAAVLAPAAALPPAASP------------------- 2824
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 502 pglggqanpnfmQGQVPSTTAATPGNSGALQLQANQSVQHAGGQGAGPPqnqmqVSHGPPNMMQPSLMGIHGNINNQQAG 581
Cdd:PHA03247 2825 ------------AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGD-----VRRRPPSRSPAAKPAAPARPPVRRLA 2887
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 582 SSGVPQVTLGSMQGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPmiLSRAQLMPQGQMMVNAQNQNLGPSPQ-- 659
Cdd:PHA03247 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP--RPQPPLAPTTDPAGAGEPSGAVPQPWlg 2965
|
490 500
....*....|....*....|....
gi 1958759471 660 -----RMTPPKQMLPQQGPQMMAP 678
Cdd:PHA03247 2966 alvpgRVAVPRFRVPQPAPSREAP 2989
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1658-1925 |
1.47e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1658 PSTIPATPLTTNSGLMPPSVAVVGPlhiPQNIKFSSAPVTPNVPSSSPAPniQTGRPLVLSSRATPVPLPSPPCTS---S 1734
Cdd:PHA03247 2735 LPAAPAPPAVPAGPATPGGPARPAR---PPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPAdppA 2809
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1735 PVVAPNPSVQQVKELNPDEASPQTNTSAdQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKG------KVDKIGQ 1808
Cdd:PHA03247 2810 AVLAPAAALPPAASPAGPLPPPTSAQPT-APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPaaparpPVRRLAR 2888
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1809 ILLTKACKKVtgSLEKGEEQYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTPSAPTllKMTSSPMGPSSTSTGPIL 1888
Cdd:PHA03247 2889 PAVSRSTESF--ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT--TDPAGAGEPSGAVPQPWL 2964
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 1958759471 1889 pgGALPTSVRSVVTTLVPSELIS---TAPTTKGNHGGITS 1925
Cdd:PHA03247 2965 --GALVPGRVAVPRFRVPQPAPSreaPASSTPPLTGHSLS 3002
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
376-483 |
2.30e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.10 E-value: 2.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 376 SQQASQAHTNFPQMSNPGQFTAPQMKSLQGGPSRVPTPLQQPHLTNKSPASSPSSFQ-----QGSPASSPTVNQTQQQMG 450
Cdd:pfam09770 211 AQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQrpqspQPDPAQPSIQPQAQQFHQ 290
|
90 100 110
....*....|....*....|....*....|....
gi 1958759471 451 PRPPQNNPLSQGFQQP-VSSPGRNPMVQQGNVPP 483
Cdd:pfam09770 291 QPPPVPVQPTQILQNPnRLSAARVGYPQNPQPGV 324
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1482-1801 |
2.63e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.98 E-value: 2.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1482 SGAPNVTIKPPGLTDLEVTPPAVSGEDLKK-------ASVIPTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNI 1554
Cdd:pfam05109 446 TGLPSSTHVPTNLTAPASTGPTVSTADVTSptpagttSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTP 525
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1555 APSIP-PVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNTSAALPTHLQSALMSTVVT-MPNVGNKVMvseGQSAAQS 1632
Cdd:pfam05109 526 AVTTPtPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTpTPNATSPTV---GETSPQA 602
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1633 NARPQFITpvfiNSSSIIQVMKGSQPSTIPATPLTTNSGLMPPSVAVVGPLHIPQNIKFSSAPVT----PNVPSSSP--A 1706
Cdd:pfam05109 603 NTTNHTLG----GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNStshmPLLTSAHPtgG 678
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1707 PNIQTGRPLVLSSRATPVPLPSP-PCTSSPVVAPNPSVQQVK--ELNPDEASPQTNTSADQStlPSSQPTTVVSpllANS 1783
Cdd:pfam05109 679 ENITQVTPASTSTHHVSTSSPAPrPGTTSQASGPGNSSTSTKpgEVNVTKGTPPKNATSPQA--PSGQKTAVPT---VTS 753
|
330
....*....|....*...
gi 1958759471 1784 PGSSANRRSPVSSSKGKG 1801
Cdd:pfam05109 754 TGGKANSTTGGKHTTGHG 771
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
631-760 |
3.26e-03 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 42.49 E-value: 3.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 631 PMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMM---APHnQMMGPQGQVLLQQNPMIEQIMTnqMQG 707
Cdd:TIGR01628 355 PLYVALAQRKEQRRAHLQDQFMQLQPRMRQLPMGSPMGGAMGQPPYygqGPQ-QQFNGQPLGWPRMSMMPTPMGP--GGP 431
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 1958759471 708 NKAQFNSQNQSNVMPGPAQIMRGPTPNMQGnmVQFTGQMSGQMLPQQGPVSNS 760
Cdd:TIGR01628 432 LRPNGLAPMNAVRAPSRNAQNAAQKPPMQP--VMYPPNYQSLPLSQDLPQPQS 482
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1091-1397 |
3.63e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.60 E-value: 3.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1091 SNSRKMVYQESPQNSSSPlgemPSLPEAGGSEVPSVSGGPSN--MPSHLVV--SQNQLMMTGPKPGPSPLSATQGATPQQ 1166
Cdd:pfam05109 414 TTTHKVIFSKAPESTTTS----PTLNTTGFAAPNTTTGLPSSthVPTNLTApaSTGPTVSTADVTSPTPAGTTSGASPVT 489
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1167 PPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSEISLSPErlnasiaglfpPQINIP 1242
Cdd:pfam05109 490 PSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSAVTTPT-----------PNATSP 558
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1243 LPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVvsSGKQSNPGTT---KRASPSNSRRSSPG 1319
Cdd:pfam05109 559 TPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTL--GGTSSTPVVTsppKNATSAVTTGQHNI 636
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958759471 1320 SSRKTTPSPGRQNSKAPklTLASQTSTTLLQNMELprnvLVGPTPLANPPLSGSFPNNNGLNSQNPTVPAPAVGTVVE 1397
Cdd:pfam05109 637 TSSSTSSMSLRPSSISE--TLSPSTSDNSTSHMPL----LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQ 708
|
|
| KAR9 |
pfam08580 |
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal ... |
1147-1304 |
4.47e-03 |
|
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules. KAR9 localizes at the shmoo tip in mating cells and at the tip of the growing bud in anaphase.
Pssm-ID: 430088 [Multi-domain] Cd Length: 684 Bit Score: 42.12 E-value: 4.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1147 TGPKPGPSPlSATQGATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRP---------YYPQTPNNRPPSTEP 1217
Cdd:pfam08580 511 TATSETPTP-ALRPPSRPQPPPPGNRPRWNASTNTNDLDVGHNFKPLTLTTPSPTPsrssrssstLPPVSPLSRDKSRSP 589
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1218 SEISLSPERLNASIAGLFPPQINIPLPPRPNLNrgfdqqglNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVVSSG 1297
Cdd:pfam08580 590 APTCRSVSRASRRRASRKPTRIGSPNSRTSLLD--------EPPYPKLTLSKGLPRTPRNRQSYAGTSPSRSVSVSSGLG 661
|
....*..
gi 1958759471 1298 KQSNPGT 1304
Cdd:pfam08580 662 PQTRPGT 668
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
699-811 |
5.72e-03 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 41.72 E-value: 5.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 699 QIMTNQMQGNKAQFNSQNQSNVMPG--PAQIMRGPTPNMQGNMVQFTGQMSGQMLPQQGPvsNSPSQVMGIQ--GQVLRP 774
Cdd:TIGR01628 369 AHLQDQFMQLQPRMRQLPMGSPMGGamGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGP--GGPLRPNGLApmNAVRAP 446
|
90 100 110
....*....|....*....|....*....|....*..
gi 1958759471 775 PGPSPHMAQQHTDPATTANNDVNLSQMMPDVSMQQTS 811
Cdd:TIGR01628 447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQST 483
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
659-825 |
5.97e-03 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 41.72 E-value: 5.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 659 QRMTPPKQMLPQQGPQMMAPH--NQMMGPQGQVLL-QQNPmieqimtnQMQGNKAQFNSQNQSNvMPGPAQIMRGPTPNM 735
Cdd:TIGR01628 366 QRRAHLQDQFMQLQPRMRQLPmgSPMGGAMGQPPYyGQGP--------QQQFNGQPLGWPRMSM-MPTPMGPGGPLRPNG 436
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 736 QGNMVQFTGQMSGQmlpQQGPVSNSPsqvMGIQGQVLRPPGPSPHMAQQHTDPATTANNDVNLSQMMPDVSMQ-QTSMVP 814
Cdd:TIGR01628 437 LAPMNAVRAPSRNA---QNAAQKPPM---QPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQmQKQVLG 510
|
170
....*....|....*
gi 1958759471 815 ----PHVQSMQGNSA 825
Cdd:TIGR01628 511 erlfPLVEAIEPALA 525
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1516-1886 |
6.67e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.44 E-value: 6.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1516 PTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNIAPSIPPVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNT 1595
Cdd:pfam05109 432 PTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTS 511
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1596 SAALPTHLQSALMSTVVT-MPNVGNKVMVSEGQSAAQSNARPQFITPVFINSSSIIQVMKGSQPSTIPATPLTTNS-GLM 1673
Cdd:pfam05109 512 AVTTPTPNATSPTPAVTTpTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNAT 591
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1674 PPSVAVVGPLHIPQNIKFSSAPVTPNV--PSSSPAPNIQTGRPLVLSSRATPVPLpsPPCTSSPVVAPNPSVQQVKELN- 1750
Cdd:pfam05109 592 SPTVGETSPQANTTNHTLGGTSSTPVVtsPPKNATSAVTTGQHNITSSSTSSMSL--RPSSISETLSPSTSDNSTSHMPl 669
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1751 PDEASPQTNTSADQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKgkvdKIGQILLTKACKKVTGSLEKGEE--- 1827
Cdd:pfam05109 670 LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTST----KPGEVNVTKGTPPKNATSPQAPSgqk 745
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958759471 1828 -------QYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTP-----SAPTLLKMTSSPMGPSSTSTGP 1886
Cdd:pfam05109 746 tavptvtSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGDSTTPrtrynATTYLPPSTSSKLRPRWTFTSP 816
|
|
|