Conserved domains on  [gi|1958759469|ref|XP_038960032|]

nuclear receptor coactivator 6 isoform X3 [Rattus norvegicus]

Protein Classification

SANT/Myb-like DNA-binding domain-containing protein; Nup50 family Ran-binding domain-containing protein (domain architecture ID 13845479)

SANT (SWI3, ADA2, N-CoR and TFIIIB)/Myb-like DNA-binding domain-containing protein binds DNA and may function as a transcription factor; also contains a Med15 domain, a critical transducer of gene activation signals that control early metazoan development.

Nup50 family Ran-binding domain (RanBD)-containing protein similar to RanBD domain region of Homo sapiens nuclear pore complex protein Nup50, a component of the nuclear pore complex that has a direct role in nuclear protein import.


List of domain hits

Name Accession Description Interval E-value
Nucleic_acid_bd pfam13820
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ...
48-190 2.67e-60

Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.



Pssm-ID: 463988  Cd Length: 143  Bit Score: 203.43  E-value: 2.67e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469   48 IFVAFKGNIDDkdFKWKLDTILQSVPGLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820    1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958759469  128 QIEGEGAINLALG---QNRSQDVRMnGPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQGGNA 190
Cdd:pfam13820   79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
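The alignment blocks in this listing pair a query row (`gi ...`) with a domain-model row (`Cdd:...`). A rough percent identity can be computed from such a pair. The sketch below is illustrative only: the function name `aligned_identity` is hypothetical, and it assumes the usual CD-Search display convention that uppercase letters are aligned residues, lowercase letters are unaligned residues, and `-` marks a gap.

```python
def aligned_identity(query: str, subject: str) -> float:
    """Fraction of identical residues over columns aligned in both rows.

    Assumption (not stated on this page): uppercase = aligned residue,
    lowercase = unaligned residue, '-' = gap.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")

    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        # Skip gap columns and lowercase (unaligned) residues on either row.
        if not (q.isupper() and s.isupper()):
            continue
        aligned += 1
        if q == s:
            identical += 1

    if aligned == 0:
        raise ValueError("no aligned columns to compare")
    return identical / aligned


# First 19 columns of the second pfam13820 block (query residues 128 onward).
fraction = aligned_identity("QIEGEGAINLALG---QNR", "QIEGEGPINLTLAtmvQNP")
print(f"{fraction:.2f}")  # 12 identical out of 16 aligned columns -> 0.75
```

Identity computed this way ignores the inserted and unaligned columns, so it describes only the part of the region that the domain model actually covers.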
Med15 super family cl26621
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
377-788 7.95e-12

ARC105 or Med15 subunit of Mediator complex non-fungal; The approximately 70-residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


The actual alignment was detected with superfamily member pfam09606:

Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 70.81  E-value: 7.95e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  377 QQASQAHTNFPQMSNPGQFTAPQMKSLQGGPS---RVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRP 453
Cdd:pfam09606   60 QQQPQGGQGNGGMGGGQQGMPDPINALQNLAGqgtRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGF 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  454 PQNNPLSQGFQQPVSSPGRNP--MVQQGNVPPNFMVMQQQPPSQGPPSLHPGLGAGQANPNFMQGQVPSTTAATPGNSGA 531
Cdd:pfam09606  140 PSQMSRVGRMQPGGQAGGMMQpsSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGPADAGAQM 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  532 LQLQANQSVQHAGGQGAGPPQNQMQvshGPPNMMQPSLMGIHGNINNQQAGSSGVPQVTLGSMQGQPQQGPPSQLMGMHQ 611
Cdd:pfam09606  220 GQQAQANGGMNPQQMGGAPNQVAMQ---QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAMPN 296
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  612 QIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAPhnqmMGPQGQVL 691
Cdd:pfam09606  297 VMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANP----MQRGQPGM 372
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  692 LQQNPMIEQIMTNQMQGNKAQFNSQNQSNVMP------GPAQIMRGPTPNmQGNMVQFTGQMSGQMLPQQGPVSNSPSQV 765
Cdd:pfam09606  373 MSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPqgpgsqPPQSHPGGMIPS-PALIPSPSPQMSQQPAQQRTIGQDSPGGS 451
                          410       420
                   ....*....|....*....|...
gi 1958759469  766 MGIQGQVLRPPGPSPHMAQQHTD 788
Cdd:pfam09606  452 LNTPGQSAVNSPLNPQEEQLYRE 474
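Each hit carries a one-line summary (Pssm-ID, optional `[Multi-domain]` flag, Cd Length, Bit Score, E-value) with a regular layout. A minimal sketch of pulling that line into a record follows; the names `HitSummary` and `parse_hit_summary` are hypothetical and are not part of any NCBI tool.

```python
import re
from dataclasses import dataclass


@dataclass
class HitSummary:
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Matches lines such as:
#   Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 70.81  E-value: 7.95e-12
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>\d+(?:\.\d+)?)\s*"
    r"E-value:\s*(?P<evalue>\d+(?:\.\d+)?(?:[eE][+-]?\d+)?)"
)


def parse_hit_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if the layout differs."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(match["pssm"]),
        multi_domain=match["multi"] is not None,
        cd_length=int(match["length"]),
        bit_score=float(match["bits"]),
        e_value=float(match["evalue"]),
    )


hit = parse_hit_summary(
    "Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 70.81  E-value: 7.95e-12"
)
print(hit)
```

Keeping the E-value as a float makes it easy to sort or filter hits numerically rather than comparing the printed strings.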
PHA03247 super family cl33720
large tegument protein UL36; Provisional
166-474 1.26e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 1.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  166 PMASGPGLIRMTSPATVMMPQGGNASSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQSHPSGSLPPAHHP--- 242
Cdd:PHA03247  2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvg 2693
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  243 -----MQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGI----RPQFTAPTQVPVPPG 313
Cdd:PHA03247  2694 sltslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParpaRPPTTAGPPAPAPPA 2773
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  314 WNQLPSGALQPPPAQGSLGPMTTNQGWKKAPLPSPMqAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP- 392
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA-AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPl 2852
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  393 GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGP-RPPQNNPLSQGFQQPVSSPG 471
Cdd:PHA03247  2853 GGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPeRPPQPQAPPPPQPQPQPPPP 2926

                   ...
gi 1958759469  472 RNP 474
Cdd:PHA03247  2927 PQP 2929
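Every alignment row in these blocks has the shape label, start coordinate, sequence, end coordinate. One simple sanity check on a transcribed row is that the number of residues (letters, ignoring `-` gaps) equals `end - start + 1`. The sketch below illustrates that check; `AlignmentRow`, `parse_alignment_row` and `coordinates_consistent` are hypothetical names chosen for this example.

```python
import re
from typing import NamedTuple


class AlignmentRow(NamedTuple):
    label: str      # e.g. "gi 1958759469" or "Cdd:PHA03247"
    start: int      # coordinate of the first residue on the row
    sequence: str   # residues plus '-' gap characters
    end: int        # coordinate of the last residue on the row


ROW_RE = re.compile(
    r"^(?P<label>gi\s+\d+|Cdd:\S+)\s+(?P<start>\d+)\s+"
    r"(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)\s*$"
)


def parse_alignment_row(line: str) -> AlignmentRow:
    """Split one alignment row into label, start, sequence and end."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return AlignmentRow(
        label=match["label"],
        start=int(match["start"]),
        sequence=match["seq"],
        end=int(match["end"]),
    )


def coordinates_consistent(row: AlignmentRow) -> bool:
    """True if the residue count (gaps excluded) matches the coordinate span."""
    residues = sum(1 for ch in row.sequence if ch.isalpha())
    return row.end - row.start + 1 == residues


row = parse_alignment_row("gi 1958759469  472 RNP 474")
print(row, coordinates_consistent(row))
```

Lowercase residues are counted here because they are still residues of the sequence; only the gap characters are excluded from the span.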
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1041-1261 6.79e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 6.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1041 PPRGPLNPDSQRVPMQQSgnVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNSSSP-LGEMPSLPEA 1119
Cdd:PHA03247  2744 VPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPA 2821
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1120 ---GGSEVPSVSGGPSNMPSHLVVSQNQLMMTG--------PKPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1188
Cdd:PHA03247  2822 aspAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvapggdvRRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958759469 1189 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNP 1261
Cdd:PHA03247  2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP 2960
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1659-1926 1.52e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1659 PSTIPATPLTTNSGLMPPSVAVVGPlhiPQNIKFSSAPVTPNVPSSSPAPniQTGRPLVLSSRATPVPLPSPPCTS---S 1735
Cdd:PHA03247  2735 LPAAPAPPAVPAGPATPGGPARPAR---PPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPAdppA 2809
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1736 PVVAPNPSVQQVKELNPDEASPQTNTSAdQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKG------KVDKIGQ 1809
Cdd:PHA03247  2810 AVLAPAAALPPAASPAGPLPPPTSAQPT-APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPaaparpPVRRLAR 2888
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1810 ILLTKACKKVtgSLEKGEEQYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTPSAPTllKMTSSPMGPSSTSTGPIL 1889
Cdd:PHA03247  2889 PAVSRSTESF--ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT--TDPAGAGEPSGAVPQPWL 2964
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1958759469 1890 pgGALPTSVRSVVTTLVPSELIS---TAPTTKGNHGGITS 1926
Cdd:PHA03247  2965 --GALVPGRVAVPRFRVPQPAPSreaPASSTPPLTGHSLS 3002
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1483-1802 2.57e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 2.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1483 SGAPNVTIKPPGLTDLEVTPPAVSGEDLKK-------ASVIPTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNI 1555
Cdd:pfam05109  446 TGLPSSTHVPTNLTAPASTGPTVSTADVTSptpagttSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTP 525
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1556 APSIP-PVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNTSAALPTHLQSALMSTVVT-MPNVGNKVMvseGQSAAQS 1633
Cdd:pfam05109  526 AVTTPtPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTpTPNATSPTV---GETSPQA 602
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1634 NARPQFITpvfiNSSSIIQVMKGSQPSTIPATPLTTNSGLMPPSVAVVGPLHIPQNIKFSSAPVT----PNVPSSSP--A 1707
Cdd:pfam05109  603 NTTNHTLG----GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNStshmPLLTSAHPtgG 678
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1708 PNIQTGRPLVLSSRATPVPLPSP-PCTSSPVVAPNPSVQQVK--ELNPDEASPQTNTSADQStlPSSQPTTVVSpllANS 1784
Cdd:pfam05109  679 ENITQVTPASTSTHHVSTSSPAPrPGTTSQASGPGNSSTSTKpgEVNVTKGTPPKNATSPQA--PSGQKTAVPT---VTS 753
                          330
                   ....*....|....*...
gi 1958759469 1785 PGSSANRRSPVSSSKGKG 1802
Cdd:pfam05109  754 TGGKANSTTGGKHTTGHG 771
 
Name Accession Description Interval E-value
Nucleic_acid_bd pfam13820
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ...
48-190 2.67e-60

Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.


Pssm-ID: 463988  Cd Length: 143  Bit Score: 203.43  E-value: 2.67e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469   48 IFVAFKGNIDDkdFKWKLDTILQSVPGLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820    1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958759469  128 QIEGEGAINLALG---QNRSQDVRMnGPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQGGNA 190
Cdd:pfam13820   79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
377-788 7.95e-12

ARC105 or Med15 subunit of Mediator complex non-fungal; The approximately 70-residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 70.81  E-value: 7.95e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  377 QQASQAHTNFPQMSNPGQFTAPQMKSLQGGPS---RVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRP 453
Cdd:pfam09606   60 QQQPQGGQGNGGMGGGQQGMPDPINALQNLAGqgtRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGF 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  454 PQNNPLSQGFQQPVSSPGRNP--MVQQGNVPPNFMVMQQQPPSQGPPSLHPGLGAGQANPNFMQGQVPSTTAATPGNSGA 531
Cdd:pfam09606  140 PSQMSRVGRMQPGGQAGGMMQpsSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGPADAGAQM 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  532 LQLQANQSVQHAGGQGAGPPQNQMQvshGPPNMMQPSLMGIHGNINNQQAGSSGVPQVTLGSMQGQPQQGPPSQLMGMHQ 611
Cdd:pfam09606  220 GQQAQANGGMNPQQMGGAPNQVAMQ---QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAMPN 296
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  612 QIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAPhnqmMGPQGQVL 691
Cdd:pfam09606  297 VMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANP----MQRGQPGM 372
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  692 LQQNPMIEQIMTNQMQGNKAQFNSQNQSNVMP------GPAQIMRGPTPNmQGNMVQFTGQMSGQMLPQQGPVSNSPSQV 765
Cdd:pfam09606  373 MSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPqgpgsqPPQSHPGGMIPS-PALIPSPSPQMSQQPAQQRTIGQDSPGGS 451
                          410       420
                   ....*....|....*....|...
gi 1958759469  766 MGIQGQVLRPPGPSPHMAQQHTD 788
Cdd:pfam09606  452 LNTPGQSAVNSPLNPQEEQLYRE 474
PHA03247 PHA03247
large tegument protein UL36; Provisional
166-474 1.26e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 1.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  166 PMASGPGLIRMTSPATVMMPQGGNASSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQSHPSGSLPPAHHP--- 242
Cdd:PHA03247  2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvg 2693
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  243 -----MQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGI----RPQFTAPTQVPVPPG 313
Cdd:PHA03247  2694 sltslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParpaRPPTTAGPPAPAPPA 2773
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  314 WNQLPSGALQPPPAQGSLGPMTTNQGWKKAPLPSPMqAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP- 392
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA-AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPl 2852
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  393 GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGP-RPPQNNPLSQGFQQPVSSPG 471
Cdd:PHA03247  2853 GGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPeRPPQPQAPPPPQPQPQPPPP 2926

                   ...
gi 1958759469  472 RNP 474
Cdd:PHA03247  2927 PQP 2929
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
604-775 1.61e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A tail of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 49.81  E-value: 1.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  604 SQLMGMHQQIVPsqgqMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMtPPKQMLPQQGPQMMAPhnqm 683
Cdd:TIGR01628  369 AHLQDQFMQLQP----RMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP-TPMGPGGPLRPNGLAP---- 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  684 MGPQGQVLLQQNPMIEQimtNQMQGNKAQFNSQNQSNVMPGPAQimrGPTPNMQGNMVQFTGQMSgqmlpqqgpvSNSPS 763
Cdd:TIGR01628  440 MNAVRAPSRNAQNAAQK---PPMQPVMYPPNYQSLPLSQDLPQP---QSTASQGGQNKKLAQVLA----------SATPQ 503
                          170
                   ....*....|..
gi 1958759469  764 QVMGIQGQVLRP 775
Cdd:TIGR01628  504 MQKQVLGERLFP 515
PHA03247 PHA03247
large tegument protein UL36; Provisional
1041-1261 6.79e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 6.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1041 PPRGPLNPDSQRVPMQQSgnVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNSSSP-LGEMPSLPEA 1119
Cdd:PHA03247  2744 VPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPA 2821
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1120 ---GGSEVPSVSGGPSNMPSHLVVSQNQLMMTG--------PKPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1188
Cdd:PHA03247  2822 aspAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvapggdvRRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958759469 1189 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNP 1261
Cdd:PHA03247  2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP 2960
PHA03247 PHA03247
large tegument protein UL36; Provisional
1659-1926 1.52e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1659 PSTIPATPLTTNSGLMPPSVAVVGPlhiPQNIKFSSAPVTPNVPSSSPAPniQTGRPLVLSSRATPVPLPSPPCTS---S 1735
Cdd:PHA03247  2735 LPAAPAPPAVPAGPATPGGPARPAR---PPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPAdppA 2809
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1736 PVVAPNPSVQQVKELNPDEASPQTNTSAdQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKG------KVDKIGQ 1809
Cdd:PHA03247  2810 AVLAPAAALPPAASPAGPLPPPTSAQPT-APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPaaparpPVRRLAR 2888
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1810 ILLTKACKKVtgSLEKGEEQYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTPSAPTllKMTSSPMGPSSTSTGPIL 1889
Cdd:PHA03247  2889 PAVSRSTESF--ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT--TDPAGAGEPSGAVPQPWL 2964
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1958759469 1890 pgGALPTSVRSVVTTLVPSELIS---TAPTTKGNHGGITS 1926
Cdd:PHA03247  2965 --GALVPGRVAVPRFRVPQPAPSreaPASSTPPLTGHSLS 3002
PHA03247 PHA03247
large tegument protein UL36; Provisional
199-679 1.56e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  199 PNPELQPRTPRPASQSDAMDPllsGLHIQQ--------------QSHPSGSLPPAHHPMQPVPVNRQMNPANFPQLQQQQ 264
Cdd:PHA03247  2569 PPPRPAPRPSEPAVTSRARRP---DAPPQSarprapvddrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  265 QQQQQQQQQQQQQQQQQLQTRplqqhqqQQPQGIRPQFTAPTQVPVPPGwnqlpsgalqPPPAQGSLgpmtTNQGWKKAP 344
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRR-------ARRLGRAAQASSPPQRPRRRA----------ARPTVGSL----TSLADPPPP 2704
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  345 LPSPmqaqlQARPSLATVQTPSHPPPPYPFGSQQASQAhtnfpqmsNPGQFTAPQMKSLQGGPSRVPTPlqqphltnKSP 424
Cdd:PHA03247  2705 PPTP-----EPAPHALVSATPLPPGPAAARQASPALPA--------APAPPAVPAGPATPGGPARPARP--------PTT 2763
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  425 ASSPSSFQQGSPASSPTVNQTQQQMGPRPPQNNPL---SQGFQQPVSSPGRNPMVQQGNVPpnfmvmqqqppsqgppslh 501
Cdd:PHA03247  2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLpspWDPADPPAAVLAPAAALPPAASP------------------- 2824
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  502 pglgagqanpnfmQGQVPSTTAATPGNSGALQLQANQSVQHAGGQGAGPPqnqmqVSHGPPNMMQPSLMGIHGNINNQQA 581
Cdd:PHA03247  2825 -------------AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGD-----VRRRPPSRSPAAKPAAPARPPVRRL 2886
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  582 GSSGVPQVTLGSMQGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPmiLSRAQLMPQGQMMVNAQNQNLGPSPQ- 660
Cdd:PHA03247  2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP--RPQPPLAPTTDPAGAGEPSGAVPQPWl 2964
                          490       500
                   ....*....|....*....|....*
gi 1958759469  661 ------RMTPPKQMLPQQGPQMMAP 679
Cdd:PHA03247  2965 galvpgRVAVPRFRVPQPAPSREAP 2989
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1483-1802 2.57e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 2.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1483 SGAPNVTIKPPGLTDLEVTPPAVSGEDLKK-------ASVIPTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNI 1555
Cdd:pfam05109  446 TGLPSSTHVPTNLTAPASTGPTVSTADVTSptpagttSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTP 525
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1556 APSIP-PVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNTSAALPTHLQSALMSTVVT-MPNVGNKVMvseGQSAAQS 1633
Cdd:pfam05109  526 AVTTPtPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTpTPNATSPTV---GETSPQA 602
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1634 NARPQFITpvfiNSSSIIQVMKGSQPSTIPATPLTTNSGLMPPSVAVVGPLHIPQNIKFSSAPVT----PNVPSSSP--A 1707
Cdd:pfam05109  603 NTTNHTLG----GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNStshmPLLTSAHPtgG 678
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1708 PNIQTGRPLVLSSRATPVPLPSP-PCTSSPVVAPNPSVQQVK--ELNPDEASPQTNTSADQStlPSSQPTTVVSpllANS 1784
Cdd:pfam05109  679 ENITQVTPASTSTHHVSTSSPAPrPGTTSQASGPGNSSTSTKpgEVNVTKGTPPKNATSPQA--PSGQKTAVPT---VTS 753
                          330
                   ....*....|....*...
gi 1958759469 1785 PGSSANRRSPVSSSKGKG 1802
Cdd:pfam05109  754 TGGKANSTTGGKHTTGHG 771
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1092-1398 3.63e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.60  E-value: 3.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1092 SNSRKMVYQESPQNSSSPlgemPSLPEAGGSEVPSVSGGPSN--MPSHLVV--SQNQLMMTGPKPGPSPLSATQGATPQQ 1167
Cdd:pfam05109  414 TTTHKVIFSKAPESTTTS----PTLNTTGFAAPNTTTGLPSSthVPTNLTApaSTGPTVSTADVTSPTPAGTTSGASPVT 489
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1168 PPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSEISLSPErlnasiaglfpPQINIP 1243
Cdd:pfam05109  490 PSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSAVTTPT-----------PNATSP 558
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1244 LPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVvsSGKQSNPGTT---KRASPSNSRRSSPG 1320
Cdd:pfam05109  559 TPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTL--GGTSSTPVVTsppKNATSAVTTGQHNI 636
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958759469 1321 SSRKTTPSPGRQNSKAPklTLASQTSTTLLQNMELprnvLVGPTPLANPPLSGSFPNNNGLNSQNPTVPAPAVGTVVE 1398
Cdd:pfam05109  637 TSSSTSSMSLRPSSISE--TLSPSTSDNSTSHMPL----LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQ 708
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1517-1887 6.56e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.44  E-value: 6.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1517 PTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNIAPSIPPVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNT 1596
Cdd:pfam05109  432 PTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTS 511
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1597 SAALPTHLQSALMSTVVT-MPNVGNKVMVSEGQSAAQSNARPQFITPVFINSSSIIQVMKGSQPSTIPATPLTTNS-GLM 1674
Cdd:pfam05109  512 AVTTPTPNATSPTPAVTTpTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNAT 591
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1675 PPSVAVVGPLHIPQNIKFSSAPVTPNV--PSSSPAPNIQTGRPLVLSSRATPVPLpsPPCTSSPVVAPNPSVQQVKELN- 1751
Cdd:pfam05109  592 SPTVGETSPQANTTNHTLGGTSSTPVVtsPPKNATSAVTTGQHNITSSSTSSMSL--RPSSISETLSPSTSDNSTSHMPl 669
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1752 PDEASPQTNTSADQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKgkvdKIGQILLTKACKKVTGSLEKGEE--- 1828
Cdd:pfam05109  670 LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTST----KPGEVNVTKGTPPKNATSPQAPSgqk 745
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958759469 1829 -------QYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTP-----SAPTLLKMTSSPMGPSSTSTGP 1887
Cdd:pfam05109  746 tavptvtSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGDSTTPrtrynATTYLPPSTSSKLRPRWTFTSP 816
 
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1092-1398 3.63e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.60  E-value: 3.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1092 SNSRKMVYQESPQNSSSPlgemPSLPEAGGSEVPSVSGGPSN--MPSHLVV--SQNQLMMTGPKPGPSPLSATQGATPQQ 1167
Cdd:pfam05109  414 TTTHKVIFSKAPESTTTS----PTLNTTGFAAPNTTTGLPSSthVPTNLTApaSTGPTVSTADVTSPTPAGTTSGASPVT 489
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1168 PPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSEISLSPErlnasiaglfpPQINIP 1243
Cdd:pfam05109  490 PSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSAVTTPT-----------PNATSP 558
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1244 LPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVvsSGKQSNPGTT---KRASPSNSRRSSPG 1320
Cdd:pfam05109  559 TPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTL--GGTSSTPVVTsppKNATSAVTTGQHNI 636
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958759469 1321 SSRKTTPSPGRQNSKAPklTLASQTSTTLLQNMELprnvLVGPTPLANPPLSGSFPNNNGLNSQNPTVPAPAVGTVVE 1398
Cdd:pfam05109  637 TSSSTSSMSLRPSSISE--TLSPSTSDNSTSHMPL----LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQ 708
KAR9 pfam08580
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal ...
1148-1305 4.43e-03

Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules. KAR9 localizes at the shmoo tip in mating cells and at the tip of the growing bud in anaphase.


Pssm-ID: 430088 [Multi-domain]  Cd Length: 684  Bit Score: 42.12  E-value: 4.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1148 TGPKPGPSPlSATQGATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRP---------YYPQTPNNRPPSTEP 1218
Cdd:pfam08580  511 TATSETPTP-ALRPPSRPQPPPPGNRPRWNASTNTNDLDVGHNFKPLTLTTPSPTPsrssrssstLPPVSPLSRDKSRSP 589
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1219 SEISLSPERLNASIAGLFPPQINIPLPPRPNLNrgfdqqglNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVVSSG 1298
Cdd:pfam08580  590 APTCRSVSRASRRRASRKPTRIGSPNSRTSLLD--------EPPYPKLTLSKGLPRTPRNRQSYAGTSPSRSVSVSSGLG 661

                   ....*..
gi 1958759469 1299 KQSNPGT 1305
Cdd:pfam08580  662 PQTRPGT 668
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
700-812 5.72e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are variously expressed in testis, in platelets, broadly, or with unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 41.72  E-value: 5.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  700 QIMTNQMQGNKAQFNSQNQSNVMPG--PAQIMRGPTPNMQGNMVQFTGQMSGQMLPQQGPvsNSPSQVMGIQ--GQVLRP 775
Cdd:TIGR01628  369 AHLQDQFMQLQPRMRQLPMGSPMGGamGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGP--GGPLRPNGLApmNAVRAP 446
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1958759469  776 PGPSPHMAQQHTDPATTANNDVNLSQMMPDVSMQQTS 812
Cdd:TIGR01628  447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQST 483
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
660-826 5.97e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are variously expressed in testis, in platelets, broadly, or with unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 41.72  E-value: 5.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  660 QRMTPPKQMLPQQGPQMMAPH--NQMMGPQGQVLL-QQNPmieqimtnQMQGNKAQFNSQNQSNvMPGPAQIMRGPTPNM 736
Cdd:TIGR01628  366 QRRAHLQDQFMQLQPRMRQLPmgSPMGGAMGQPPYyGQGP--------QQQFNGQPLGWPRMSM-MPTPMGPGGPLRPNG 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469  737 QGNMVQFTGQMSGQmlpQQGPVSNSPsqvMGIQGQVLRPPGPSPHMAQQHTDPATTANNDVNLSQMMPDVSMQ-QTSMVP 815
Cdd:TIGR01628  437 LAPMNAVRAPSRNA---QNAAQKPPM---QPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQmQKQVLG 510
                          170
                   ....*....|....*
gi 1958759469  816 ----PHVQSMQGNSA 826
Cdd:TIGR01628  511 erlfPLVEAIEPALA 525
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1517-1887 6.56e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.44  E-value: 6.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1517 PTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNIAPSIPPVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNT 1596
Cdd:pfam05109  432 PTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTS 511
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1597 SAALPTHLQSALMSTVVT-MPNVGNKVMVSEGQSAAQSNARPQFITPVFINSSSIIQVMKGSQPSTIPATPLTTNS-GLM 1674
Cdd:pfam05109  512 AVTTPTPNATSPTPAVTTpTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNAT 591
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1675 PPSVAVVGPLHIPQNIKFSSAPVTPNV--PSSSPAPNIQTGRPLVLSSRATPVPLpsPPCTSSPVVAPNPSVQQVKELN- 1751
Cdd:pfam05109  592 SPTVGETSPQANTTNHTLGGTSSTPVVtsPPKNATSAVTTGQHNITSSSTSSMSL--RPSSISETLSPSTSDNSTSHMPl 669
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759469 1752 PDEASPQTNTSADQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKgkvdKIGQILLTKACKKVTGSLEKGEE--- 1828
Cdd:pfam05109  670 LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTST----KPGEVNVTKGTPPKNATSPQAPSgqk 745
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958759469 1829 -------QYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTP-----SAPTLLKMTSSPMGPSSTSTGP 1887
Cdd:pfam05109  746 tavptvtSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGDSTTPrtrynATTYLPPSTSSKLRPRWTFTSP 816
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
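
Each alignment in the hit list is preceded by a summary line giving the Pssm-ID, Cd Length, Bit Score and E-value, and the search parameters state an E-value threshold of 0.01. As a minimal sketch of how those summary lines could be read and checked against that threshold, the Python below parses two of the lines shown on this page. The names `SUMMARY_RE`, `parse_summary` and `passes_threshold` are hypothetical helpers written for this illustration and are not part of any NCBI tool. Whether the threshold comparison is inclusive is an assumption.

```python
import re

# Hypothetical pattern for the per-hit summary line on this page, e.g.
# "Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.49  E-value: 3.26e-03"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<kind>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("kind") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


# Two summary lines copied from the hit list above.
lines = [
    "Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.49  E-value: 3.26e-03",
    "Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.44  E-value: 6.56e-03",
]
hits = [parse_summary(line) for line in lines]

for hit in hits:
    print(hit["pssm_id"], hit["bit_score"], hit["e_value"], passes_threshold(hit))
```

Both example hits fall below the 0.01 cutoff, which is consistent with their appearing in the list. The optional bracket group in the pattern also lets it match summary lines that carry no `[Multi-domain]` tag.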

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.