| Name | Accession | Description | Interval | E-value |
| Nucleic_acid_bd | pfam13820 | Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ... | 48-195 | 1.86e-59 |
|
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.
Pssm-ID: 463988 Cd Length: 143 Bit Score: 201.12 E-value: 1.86e-59
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 48 IFVAFKGNIDDkdFKWKLDAILKNVPNLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820 1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034624624 128 QIEGEGAINLALA---QNRSQDVRMnGPMGAGNSVRMEAGFPMASGPGIFflgiiRMNNPATVMIPPGGNV 195
Cdd:pfam13820 79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLI-----SDALPLHLRLAESGEY 143
|
|
| Med15 super family | cl26621 | ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... | 537-862 | 3.22e-09 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development. The actual alignment was detected with superfamily member pfam09606:
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 61.95 E-value: 3.22e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 537 IFSLACSKSGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVqhAGGQGAGPPQNQMQVSHGPPnmmQPSLMGIHGNMN 616
Cdd:pfam09606 47 ILHVRDMSKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNL--AGQGTRPQMMGPMGPGPGGP---MGQQMGGPGTAS 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 617 NQQAGTsGVPQVNL------SNMQGQPQQGPPSQLMGMHQQIVPSQGQ-MVQQQGTLNPQNPMILSRAQLMPQGQMMVNP 689
Cdd:pfam09606 122 NLLASL-GRPQMPMggagfpSQMSRVGRMQPGGQAGGMMQPSSGQPGSgTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQM 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 690 PSQNLGPS-PQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQSNVMPGPAQIM 768
Cdd:pfam09606 201 PPQMGVPGmPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGP 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 769 RGPTPNMQgnmvQFTGQMSGQMLPQQGPVNNSPSQVMGIQGQVLRPPGPSPhmaQQHGDPATTANNDVSLSQMMPDVSIQ 848
Cdd:pfam09606 281 GQPMGPPG----QQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQ---QQMNQSVGQGGQVVALGGLNHLETWN 353
|
330
....*....|....
gi 1034624624 849 QTNMVPPHVQAMQG 862
Cdd:pfam09606 354 PGNFGGLGANPMQR 367
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 1080-1326 | 2.94e-06 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 2.94e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1080 PPRGPLNPDSQRMPmqQSGSVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNPSSSPLAEMASLPEA 1159
Cdd:PHA03247 2744 VPAGPATPGGPARP--ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1160 SG---------SEAPSVPGGPNNMPSHVVLPQNQLMMTGP--KPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1228
Cdd:PHA03247 2822 ASpagplppptSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1229 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNPttlkaig 1308
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP------- 2960
|
250
....*....|....*...
gi 1034624624 1309 qAPSNLTMNPSNFATPQT 1326
Cdd:PHA03247 2961 -QPWLGALVPGRVAVPRF 2977
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 190-491 | 7.98e-05 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.01 E-value: 7.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 190 PPGGNVSSSMMAPGPNPELQPRtPRPASQSDamdPLLSGLHIQQQSHPSGSLAP-----PHHPMQPVSVNRQMNPANFPQ 264
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPTPEPA-PHALVSAT---PLPPGPAAARQASPALPAAPappavPAGPATPGGPARPARPPTTAG 2765
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 265 LQQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHqqqqpqgirPQFTAPTQVPVPPGWNQLPSGALQPPPaqgsLGTMT 344
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS---------PWDPADPPAAVLAPAAALPPAASPAGP----LPPPT 2832
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 345 ANQGWKKAPLPGPMQQQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNPGQFTAPQMKSL-QGGPSRVPTPL 423
Cdd:PHA03247 2833 SAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALpPDQPERPPQPQ 2912
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034624624 424 QQPHLTNKSPASSPssfqqgsPASSPTVNQTQQQMGPRPPQNNPLPQGFQQPVSSPGRNPMVQQGNVP 491
Cdd:PHA03247 2913 APPPPQPQPQPPPP-------PQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA 2973
|
|
|
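Each hit in these listings carries a statistics line (Pssm-ID, Cd Length, Bit Score, E-value) followed by paired gi/Cdd alignment rows. As a minimal sketch of how such lines might be read programmatically, the Python below parses one statistics line and counts identical residues in one pair of aligned rows. The names `parse_hit_stats` and `aligned_identity` are hypothetical helpers introduced here for illustration; they are not part of any NCBI tool, and the regular expression assumes the exact field order shown in this listing.

```python
import re

# Assumed layout, taken from the listing:
# "Pssm-ID: <int> [Multi-domain] Cd Length: <int> Bit Score: <float> E-value: <float>"
STAT_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_stats(line):
    """Return the fields of one statistics line as a dict, or None if no match."""
    m = STAT_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def aligned_identity(query_row, subject_row):
    """Count identical residues over columns where neither row has a gap ('-').

    Returns (matches, compared_columns). Lowercase residues in the listing sit
    opposite gaps, so skipping gap columns also skips them.
    """
    pairs = [(q, s) for q, s in zip(query_row, subject_row)
             if q != "-" and s != "-"]
    matches = sum(q.upper() == s.upper() for q, s in pairs)
    return matches, len(pairs)


if __name__ == "__main__":
    stats = parse_hit_stats(
        "Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 61.95 E-value: 3.22e-09"
    )
    print(stats)
```

The identity helper works on the residue strings only, so the leading label, start coordinate, and trailing end coordinate of each gi/Cdd row would need to be split off before calling it.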
| Name | Accession | Description | Interval | E-value |
| Nucleic_acid_bd | pfam13820 | Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ... | 48-195 | 1.86e-59 |
|
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.
Pssm-ID: 463988 Cd Length: 143 Bit Score: 201.12 E-value: 1.86e-59
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 48 IFVAFKGNIDDkdFKWKLDAILKNVPNLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820 1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034624624 128 QIEGEGAINLALA---QNRSQDVRMnGPMGAGNSVRMEAGFPMASGPGIFflgiiRMNNPATVMIPPGGNV 195
Cdd:pfam13820 79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLI-----SDALPLHLRLAESGEY 143
|
|
| Med15 | pfam09606 | ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... | 537-862 | 3.22e-09 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 61.95 E-value: 3.22e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 537 IFSLACSKSGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVqhAGGQGAGPPQNQMQVSHGPPnmmQPSLMGIHGNMN 616
Cdd:pfam09606 47 ILHVRDMSKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNL--AGQGTRPQMMGPMGPGPGGP---MGQQMGGPGTAS 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 617 NQQAGTsGVPQVNL------SNMQGQPQQGPPSQLMGMHQQIVPSQGQ-MVQQQGTLNPQNPMILSRAQLMPQGQMMVNP 689
Cdd:pfam09606 122 NLLASL-GRPQMPMggagfpSQMSRVGRMQPGGQAGGMMQPSSGQPGSgTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQM 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 690 PSQNLGPS-PQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQSNVMPGPAQIM 768
Cdd:pfam09606 201 PPQMGVPGmPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGP 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 769 RGPTPNMQgnmvQFTGQMSGQMLPQQGPVNNSPSQVMGIQGQVLRPPGPSPhmaQQHGDPATTANNDVSLSQMMPDVSIQ 848
Cdd:pfam09606 281 GQPMGPPG----QQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQ---QQMNQSVGQGGQVVALGGLNHLETWN 353
|
330
....*....|....
gi 1034624624 849 QTNMVPPHVQAMQG 862
Cdd:pfam09606 354 PGNFGGLGANPMQR 367
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 1080-1326 | 2.94e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 2.94e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1080 PPRGPLNPDSQRMPmqQSGSVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNPSSSPLAEMASLPEA 1159
Cdd:PHA03247 2744 VPAGPATPGGPARP--ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1160 SG---------SEAPSVPGGPNNMPSHVVLPQNQLMMTGP--KPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1228
Cdd:PHA03247 2822 ASpagplppptSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1229 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNPttlkaig 1308
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP------- 2960
|
250
....*....|....*...
gi 1034624624 1309 qAPSNLTMNPSNFATPQT 1326
Cdd:PHA03247 2961 -QPWLGALVPGRVAVPRF 2977
|
|
| PABP-1234 | TIGR01628 | polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... | 636-779 | 5.82e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.88 E-value: 5.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 636 QPQQGPPSQ-LMGMHQQIVPSQGQMVQQQGTLNPQNPMilsraqLMPQGQMMVNPPSQNLGPSPQRMTPPKQMLSQQGPQ 714
Cdd:TIGR01628 384 QLPMGSPMGgAMGQPPYYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP 457
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034624624 715 MMAPHNQMMGPQGQVLLQQNPmieqimtnqmqgnkQQFNTQNQSNVMPGPAQIMRGPTPNMQGNM 779
Cdd:TIGR01628 458 PMQPVMYPPNYQSLPLSQDLP--------------QPQSTASQGGQNKKLAQVLASATPQMQKQV 508
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 190-491 | 7.98e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.01 E-value: 7.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 190 PPGGNVSSSMMAPGPNPELQPRtPRPASQSDamdPLLSGLHIQQQSHPSGSLAP-----PHHPMQPVSVNRQMNPANFPQ 264
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPTPEPA-PHALVSAT---PLPPGPAAARQASPALPAAPappavPAGPATPGGPARPARPPTTAG 2765
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 265 LQQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHqqqqpqgirPQFTAPTQVPVPPGWNQLPSGALQPPPaqgsLGTMT 344
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS---------PWDPADPPAAVLAPAAALPPAASPAGP----LPPPT 2832
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 345 ANQGWKKAPLPGPMQQQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNPGQFTAPQMKSL-QGGPSRVPTPL 423
Cdd:PHA03247 2833 SAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALpPDQPERPPQPQ 2912
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034624624 424 QQPHLTNKSPASSPssfqqgsPASSPTVNQTQQQMGPRPPQNNPLPQGFQQPVSSPGRNPMVQQGNVP 491
Cdd:PHA03247 2913 APPPPQPQPQPPPP-------PQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA 2973
|
|
| Herpes_BLLF1 | pfam05109 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 1131-1426 | 7.72e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.52 E-value: 7.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1131 SNSRKMVYQESPQNPSSSPLAEmaslpeASGSEAPSVPGGpnnMPSHVVLPQNQLM--MTGPK------PGPSPLSATQG 1202
Cdd:pfam05109 414 TTTHKVIFSKAPESTTTSPTLN------TTGFAAPNTTTG---LPSSTHVPTNLTApaSTGPTvstadvTSPTPAGTTSG 484
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1203 ATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSE--ISLSPERLNASIAGLF 1276
Cdd:pfam05109 485 ASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSavTTPTPNATSPTPAVTT 564
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1277 P-PQINIPLPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGATKRASPSNSRR 1355
Cdd:pfam05109 565 PtPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSM 644
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034624624 1356 SSPGSSRKTTPSPGRQN---SKAPKLTLASQTNAALLQNVELPRNVLVSPTPLANPPVPGSFPNNSGlnPQNST 1426
Cdd:pfam05109 645 SLRPSSISETLSPSTSDnstSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASG--PGNSS 716
|
|
| PAT1 | pfam09770 | Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... | 385-523 | 8.54e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 44.64 E-value: 8.54e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 385 SQQASQAHTNFPQMSNPGQFTAPQMKSLQGGPSRVPTPLQQPHLTNKSPASSPSSFQ----QGSPASSPTVNQTQQQMGP 460
Cdd:pfam09770 211 AQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQrpqsPQPDPAQPSIQPQAQQFHQ 290
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034624624 461 RPPQNNPLP-QGFQQP-VSSPGRNPMVQQGnvPPNFMVMQQQPPNQGPQSLHPGLGEKSEPSNLA 523
Cdd:pfam09770 291 QPPPVPVQPtQILQNPnRLSAARVGYPQNP--QPGVQPAPAHQAHRQQGSFGRQAPIITHPQQLA 353
|
|
|
| Name | Accession | Description | Interval | E-value |
| Nucleic_acid_bd | pfam13820 | Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ... | 48-195 | 1.86e-59 |
|
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.
Pssm-ID: 463988 Cd Length: 143 Bit Score: 201.12 E-value: 1.86e-59
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 48 IFVAFKGNIDDkdFKWKLDAILKNVPNLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820 1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034624624 128 QIEGEGAINLALA---QNRSQDVRMnGPMGAGNSVRMEAGFPMASGPGIFflgiiRMNNPATVMIPPGGNV 195
Cdd:pfam13820 79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLI-----SDALPLHLRLAESGEY 143
|
|
| Med15 | pfam09606 | ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... | 537-862 | 3.22e-09 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 61.95 E-value: 3.22e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 537 IFSLACSKSGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVqhAGGQGAGPPQNQMQVSHGPPnmmQPSLMGIHGNMN 616
Cdd:pfam09606 47 ILHVRDMSKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNL--AGQGTRPQMMGPMGPGPGGP---MGQQMGGPGTAS 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 617 NQQAGTsGVPQVNL------SNMQGQPQQGPPSQLMGMHQQIVPSQGQ-MVQQQGTLNPQNPMILSRAQLMPQGQMMVNP 689
Cdd:pfam09606 122 NLLASL-GRPQMPMggagfpSQMSRVGRMQPGGQAGGMMQPSSGQPGSgTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQM 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 690 PSQNLGPS-PQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQSNVMPGPAQIM 768
Cdd:pfam09606 201 PPQMGVPGmPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGP 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 769 RGPTPNMQgnmvQFTGQMSGQMLPQQGPVNNSPSQVMGIQGQVLRPPGPSPhmaQQHGDPATTANNDVSLSQMMPDVSIQ 848
Cdd:pfam09606 281 GQPMGPPG----QQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQ---QQMNQSVGQGGQVVALGGLNHLETWN 353
|
330
....*....|....
gi 1034624624 849 QTNMVPPHVQAMQG 862
Cdd:pfam09606 354 PGNFGGLGANPMQR 367
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 1080-1326 | 2.94e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 2.94e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1080 PPRGPLNPDSQRMPmqQSGSVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNPSSSPLAEMASLPEA 1159
Cdd:PHA03247 2744 VPAGPATPGGPARP--ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1160 SG---------SEAPSVPGGPNNMPSHVVLPQNQLMMTGP--KPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1228
Cdd:PHA03247 2822 ASpagplppptSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1229 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNPttlkaig 1308
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP------- 2960
|
250
....*....|....*...
gi 1034624624 1309 qAPSNLTMNPSNFATPQT 1326
Cdd:PHA03247 2961 -QPWLGALVPGRVAVPRF 2977
|
|
| PAT1 | pfam09770 | Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... | 636-806 | 3.28e-06 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 52.35 E-value: 3.28e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 636 QPQQGPPSQLMGMHQQIVPSQGQMVQQQGTLNPQnpmilsraQLMPQGQMMVNPPSQNLGPSPQRmtppkqmLSQQGPQM 715
Cdd:pfam09770 209 KPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQ--------QQQPQQQPQQPQQHPGQGHPVTI-------LQRPQSPQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 716 MAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQsnvMPGPAQIMRGPTPNMQgnmvqftGQMSGQMLPQQG 795
Cdd:pfam09770 274 PDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAARVG---YPQNPQPGVQPAPAHQ-------AHRQQGSFGRQA 343
|
170
....*....|.
gi 1034624624 796 PVNNSPSQVMG 806
Cdd:pfam09770 344 PIITHPQQLAQ 354
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 1109-1583 | 3.53e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.17 E-value: 3.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1109 PASVPPSPDkQRMPMPVNTPlgsnsrkmvyqeSPQNPSSSPLAEMASLPEASGSeaPSVPGGPNNMPSHVV----LPQNQ 1184
Cdd:PHA03247 2557 PAAPPAAPD-RSVPPPRPAP------------RPSEPAVTSRARRPDAPPQSAR--PRAPVDDRGDPRGPAppspLPPDT 2621
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1185 LMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLS 1264
Cdd:PHA03247 2622 HAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP 2701
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1265 PERLNASIAGLFPPQINIPLPPRPNLNRgfdqQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGA 1344
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAAR----QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA 2777
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1345 TKRASPSNSRRSSPGSSRKTTPS---PGRQNSKAPKLTLASQTNAALLQNVELPRNVLVSPTPLANPPVPGSFPNNSGLN 1421
Cdd:PHA03247 2778 GPPRRLTRPAVASLSESRESLPSpwdPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1422 PQNSTVSVAAVGGVVEDNKESLNVPQDSdcqnsqsrkeqvnIELKAVPAQEVKMVVPEDQSKKDGQPSDPNK--LPSVEE 1499
Cdd:PHA03247 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRR-------------LARPAVSRSTESFALPPDQPERPPQPQAPPPpqPQPQPP 2924
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1500 NKNLVSPAMREAPTSLSQL---LDNSGAPNVTIKPPGLTDLEVTPPVVSGEDLKKASVIPTLQDLSSSKEPSNSLNLPHS 1576
Cdd:PHA03247 2925 PPPQPQPPPPPPPRPQPPLaptTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004
|
....*..
gi 1034624624 1577 NELCSSL 1583
Cdd:PHA03247 3005 SSWASSL 3011
|
|
| PABP-1234 | TIGR01628 | polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... | 636-779 | 5.82e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.88 E-value: 5.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 636 QPQQGPPSQ-LMGMHQQIVPSQGQMVQQQGTLNPQNPMilsraqLMPQGQMMVNPPSQNLGPSPQRMTPPKQMLSQQGPQ 714
Cdd:TIGR01628 384 QLPMGSPMGgAMGQPPYYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP 457
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034624624 715 MMAPHNQMMGPQGQVLLQQNPmieqimtnqmqgnkQQFNTQNQSNVMPGPAQIMRGPTPNMQGNM 779
Cdd:TIGR01628 458 PMQPVMYPPNYQSLPLSQDLP--------------QPQSTASQGGQNKKLAQVLASATPQMQKQV 508
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 190-491 | 7.98e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.01 E-value: 7.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 190 PPGGNVSSSMMAPGPNPELQPRtPRPASQSDamdPLLSGLHIQQQSHPSGSLAP-----PHHPMQPVSVNRQMNPANFPQ 264
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPTPEPA-PHALVSAT---PLPPGPAAARQASPALPAAPappavPAGPATPGGPARPARPPTTAG 2765
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 265 LQQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHqqqqpqgirPQFTAPTQVPVPPGWNQLPSGALQPPPaqgsLGTMT 344
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS---------PWDPADPPAAVLAPAAALPPAASPAGP----LPPPT 2832
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 345 ANQGWKKAPLPGPMQQQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNPGQFTAPQMKSL-QGGPSRVPTPL 423
Cdd:PHA03247 2833 SAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALpPDQPERPPQPQ 2912
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034624624 424 QQPHLTNKSPASSPssfqqgsPASSPTVNQTQQQMGPRPPQNNPLPQGFQQPVSSPGRNPMVQQGNVP 491
Cdd:PHA03247 2913 APPPPQPQPQPPPP-------PQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA 2973
|
|
| PABP-1234 | TIGR01628 | polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... | 643-794 | 1.58e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 46.72 E-value: 1.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 643 SQLMGMHQQIVPSQGQMVQqqgtLNPQNPMILSRAQLMPQGQMMVNPPSQNLGPSPQRMtPPKQMLSQQGPQMMAPhnqm 722
Cdd:TIGR01628 369 AHLQDQFMQLQPRMRQLPM----GSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP-TPMGPGGPLRPNGLAP---- 439
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034624624 723 MGPQGQVLLQQNPMIEQimtNQMQGNKQQFNTQNQSNVMPGPAQimrGPTPNMQGNMvQFTGQMSGQMLPQQ 794
Cdd:TIGR01628 440 MNAVRAPSRNAQNAAQK---PPMQPVMYPPNYQSLPLSQDLPQP---QSTASQGGQN-KKLAQVLASATPQM 504
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 953-1514 | 2.11e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.86 E-value: 2.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 953 DLNTPDTRPAgleeadqpPLPGEQGINLDNSGPKLPEFSNRP--PGYPS-QPVEQRPLQQMPPQLMQHVAPPPQPPQQQP 1029
Cdd:PHA03247 2565 DRSVPPPRPA--------PRPSEPAVTSRARRPDAPPQSARPraPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1030 QPQLPQQQQPPPPSQPQSQQQqqqqqqmmmmlmmqqdPKSVRLP--VSQNVHPPRGPLNPDSQRMPMQQSGSVPVmVSLQ 1107
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPA----------------PGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSL-TSLA 2699
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1108 GPASVPPSPDKQRMPMPVNTPL--GSNSRKMVYQESPQNPSSSPLAEMASLPEASGSEA-PSVPGGPNNmPSHVVLPQNQ 1184
Cdd:PHA03247 2700 DPPPPPPTPEPAPHALVSATPLppGPAAARQASPALPAAPAPPAVPAGPATPGGPARPArPPTTAGPPA-PAPPAAPAAG 2778
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1185 LMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHHFPNVaAPTQTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLS 1264
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL-PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1265 PERLNASIAGLFPPQINIPLPPRPNLNRgfdqqglnpTTLKAIGQAPSNLTMNPSNFATPQTHKLDsvvvnSGKQSNSGA 1344
Cdd:PHA03247 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRR---------LARPAVSRSTESFALPPDQPERPPQPQAP-----PPPQPQPQP 2923
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1345 TKRASPSNSRRSSPGSSRKTTPSPGRQNSKAPKLTLASQTNAALLQ-NVELPRNVLVSPTPLANPPVPGSFPNNSGLNPQ 1423
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR 3003
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1424 NSTVSVAAVGGVVEDN-----KESLNVPQDSDCQNSQSRKEQVNIELkavpaqevkmvvpeDQSKKDGQPSDPNKLPSVE 1498
Cdd:PHA03247 3004 VSSWASSLALHEETDPppvslKQTLWPPDDTEDSDADSLFDSDSERS--------------DLEALDPLPPEPHDPFAHE 3069
|
570
....*....|....*.
gi 1034624624 1499 ENKNLVSPAMREAPTS 1514
Cdd:PHA03247 3070 PDPATPEAGARESPSS 3085
|
|
| PAT1 | pfam09770 | Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... | 439-678 | 4.38e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.41 E-value: 4.38e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 439 SFQQGSPASSPTVNQTQQQMGPRPPQNNPLPQGFQQPVSS-------PGRNPMVQQ-----GNVPPNFMVMQQQPPNQGP 506
Cdd:pfam09770 103 NRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTgyekykePEPIPDLQVdaslwGVAPKKAAAPAPAPQPAAQ 182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 507 QSLHPGLGEK---------------SEPSNLAVAWPQITFREQIAIFSLACSKSGQANPNFMQGQVPSTTATTPGNSGAP 571
Cdd:pfam09770 183 PASLPAPSRKmmsleeveaamraqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHP 262
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 572 --QLQANQNVQHAGGQGAGPPQNQMQVSHGPPNMMQPslMGIHGNMNNQQAGTSGVPQvnlsNMQGQPQQGPPSQlmgmH 649
Cdd:pfam09770 263 vtILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQP--TQILQNPNRLSAARVGYPQ----NPQPGVQPAPAHQ----A 332
|
250 260
....*....|....*....|....*....
gi 1034624624 650 QQIVPSQGQMVQQQgtLNPQNPMILSRAQ 678
Cdd:pfam09770 333 HRQQGSFGRQAPII--THPQQLAQLSEEE 359
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1131-1426 |
7.72e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.52 E-value: 7.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1131 SNSRKMVYQESPQNPSSSPLAEmaslpeASGSEAPSVPGGpnnMPSHVVLPQNQLM--MTGPK------PGPSPLSATQG 1202
Cdd:pfam05109 414 TTTHKVIFSKAPESTTTSPTLN------TTGFAAPNTTTG---LPSSTHVPTNLTApaSTGPTvstadvTSPTPAGTTSG 484
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1203 ATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSE--ISLSPERLNASIAGLF 1276
Cdd:pfam05109 485 ASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSavTTPTPNATSPTPAVTT 564
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1277 P-PQINIPLPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGATKRASPSNSRR 1355
Cdd:pfam05109 565 PtPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSM 644
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034624624 1356 SSPGSSRKTTPSPGRQN---SKAPKLTLASQTNAALLQNVELPRNVLVSPTPLANPPVPGSFPNNSGlnPQNST 1426
Cdd:pfam05109 645 SLRPSSISETLSPSTSDnstSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASG--PGNSS 716
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
385-523 |
8.54e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 44.64 E-value: 8.54e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 385 SQQASQAHTNFPQMSNPGQFTAPQMKSLQGGPSRVPTPLQQPHLTNKSPASSPSSFQ----QGSPASSPTVNQTQQQMGP 460
Cdd:pfam09770 211 AQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQrpqsPQPDPAQPSIQPQAQQFHQ 290
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034624624 461 RPPQNNPLP-QGFQQP-VSSPGRNPMVQQGnvPPNFMVMQQQPPNQGPQSLHPGLGEKSEPSNLA 523
Cdd:pfam09770 291 QPPPVPVQPtQILQNPnRLSAARVGYPQNP--QPGVQPAPAHQAHRQQGSFGRQAPIITHPQQLA 353
|
|
| SOBP |
pfam15279 |
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ... |
1104-1318 |
3.09e-03 |
|
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteristic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. The family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus, mutations were found in this protein.
Pssm-ID: 464609 [Multi-domain] Cd Length: 325 Bit Score: 42.11 E-value: 3.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1104 VSLQGPASVPPSPDKQRMPMPVNTPLGS--NSRKMVYQESPQNPSSSPLAEMASLPEASGSEAPSVPGGPNNMPSHVVLP 1181
Cdd:pfam15279 91 ESVSPGPSSSASPSSSPTSSNSSKPLISvaSSSKLLAPKPHEPPSLPPPPLPPKKGRRHRPGLHPPLGRPPGSPPMSMTP 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 1182 QNQLMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHhfPNVAAPTQTSRPKTPNRASPRPYYPQT-PNNRPP-----S 1255
Cdd:pfam15279 171 RGLLGKPQQHPPPSPLPAFMEPSSMPPPFLRPPPSIPQ--PNSPLSNPMLPGIGPPPKPPRNLGPPSnPMHRPPfsphhP 248
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034624624 1256 TEPSEISLSPERLNASIAGLFPPQINIPLPPrpnLNRGFDQQGLNPTTLKAIGQAPSNLTMNP 1318
Cdd:pfam15279 249 PPPPTPPGPPPGLPPPPPRGFTPPFGPPFPP---VNMMPNPPEMNFGLPSLAPLVPPVTVLVP 308
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
190-483 |
4.56e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 4.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 190 PPGGNVS----SSMMAPGPNPELQPRTPRPASQSDAMDPLlsglhiqqqshpsGSLAPPHHPMQPVSvnrqmnPANFPQL 265
Cdd:PHA03247 2656 PAPGRVSrprrARRLGRAAQASSPPQRPRRRAARPTVGSL-------------TSLADPPPPPPTPE------PAPHALV 2716
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 266 QQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHQQQQPQGIRPQFTAPTQVPVPPGWNQLPSGALQPPPAQGSLGTMTA 345
Cdd:PHA03247 2717 SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRE 2796
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034624624 346 NQGWKKAPLPGPMQQqLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP-GQFTAPQMKSLQGGPSR--VPTP 422
Cdd:PHA03247 2797 SLPSPWDPADPPAAV-LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlGGSVAPGGDVRRRPPSRspAAKP 2875
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034624624 423 LQQPHLTNKS---PASSPSSFQQGSPASSPT-VNQTQQQMGPRPPQNNPLPQGFQQPVSSPGRNP 483
Cdd:PHA03247 2876 AAPARPPVRRlarPAVSRSTESFALPPDQPErPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
|
|
|