Conserved domains on  [gi|1958759471|ref|XP_038960033|]

nuclear receptor coactivator 6 isoform X4 [Rattus norvegicus]

Protein Classification

SANT/Myb-like DNA-binding domain-containing protein; Nup50 family Ran-binding domain-containing protein (domain architecture ID 13845479)

SANT (SWI3, ADA2, N-CoR and TFIIIB)/Myb-like DNA-binding domain-containing protein binds DNA and may function as a transcription factor; also contains a Med15 domain, a critical transducer of gene activation signals that control early metazoan development.

Nup50 family Ran-binding domain (RanBD)-containing protein similar to the RanBD region of Homo sapiens nuclear pore complex protein Nup50, a component of the nuclear pore complex that has a direct role in nuclear protein import.

List of domain hits

Name Accession Description Interval E-value
Nucleic_acid_bd pfam13820
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ...
48-190 2.67e-60

Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.



Pssm-ID: 463988  Cd Length: 143  Bit Score: 203.43  E-value: 2.67e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471   48 IFVAFKGNIDDkdFKWKLDTILQSVPGLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820    1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958759471  128 QIEGEGAINLALG---QNRSQDVRMnGPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQGGNA 190
Cdd:pfam13820   79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
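The paired rows above can be summarized column by column. The following is a minimal Python sketch, not part of the CD-Search output: the function name `column_stats` is hypothetical, and it assumes that `-` marks a gap and that lowercase letters mark residues left unaligned (which matches the rows shown here, where every lowercase residue sits opposite a gap). The two strings are the second block of the pfam13820 alignment, copied verbatim.

```python
def column_stats(query_row: str, model_row: str) -> dict:
    """Count aligned, identical, and unaligned columns for one alignment block."""
    if len(query_row) != len(model_row):
        raise ValueError("rows must have the same number of columns")
    aligned = identical = 0
    for q, m in zip(query_row, model_row):
        # Assumption: a column counts as aligned only when both characters
        # are uppercase residue letters; gaps and lowercase are skipped.
        if q.isalpha() and m.isalpha() and q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return {
        "columns": len(query_row),
        "aligned": aligned,
        "identical": identical,
        "unaligned": len(query_row) - aligned,
    }


# Second block of the pfam13820 alignment (query 128-190, model 79-143).
QUERY_128_190 = "QIEGEGAINLALG---QNRSQDVRMnGPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQGGNA"
MODEL_79_143 = "QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY"

if __name__ == "__main__":
    stats = column_stats(QUERY_128_190, MODEL_79_143)
    print(stats)
    print(f"identity over aligned columns: {stats['identical'] / stats['aligned']:.1%}")
```

Applying the same function to each block of a longer hit and summing the counts gives a per-hit identity figure; this is a rough descriptive measure and does not replace the bit score or E-value reported by the search.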
Med15 super family cl26621
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
377-787 2.53e-11

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


The actual alignment was detected with superfamily member pfam09606:

Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 68.88  E-value: 2.53e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  377 QQASQAHTNFPQMSNPGQFTAPQMKSLQGGPS---RVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRP 453
Cdd:pfam09606   60 QQQPQGGQGNGGMGGGQQGMPDPINALQNLAGqgtRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGF 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  454 PQNNPLSQGFQQPVSSPGRNP--MVQQGNVPPNFMVMQQQPPSQGPPSLHPGLG---GQANPNFMQGQVPSTTAaTPGNS 528
Cdd:pfam09606  140 PSQMSRVGRMQPGGQAGGMMQpsSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQgpmGGQMPPQMGVPGMPGPA-DAGAQ 218
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  529 GALQLQANQSVQhAGGQGAGPPQNQMQvshGPPNMMQPSLMGIHGNINNQQAGSSGVPQVTLGSMQGQPQQGPPSQLMGM 608
Cdd:pfam09606  219 MGQQAQANGGMN-PQQMGGAPNQVAMQ---QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAM 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  609 HQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAPhnqmMGPQGQ 688
Cdd:pfam09606  295 PNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANP----MQRGQP 370
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  689 VLLQQNPMIEQIMTNQMQGNKAQFNSQNQSNVMP------GPAQIMRGPTPNmQGNMVQFTGQMSGQMLPQQGPVSNSPS 762
Cdd:pfam09606  371 GMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPqgpgsqPPQSHPGGMIPS-PALIPSPSPQMSQQPAQQRTIGQDSPG 449
                          410       420
                   ....*....|....*....|....*
gi 1958759471  763 QVMGIQGQVLRPPGPSPHMAQQHTD 787
Cdd:pfam09606  450 GSLNTPGQSAVNSPLNPQEEQLYRE 474
PHA03247 super family cl33720
large tegument protein UL36; Provisional
166-474 1.29e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 1.29e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  166 PMASGPGLIRMTSPATVMMPQGGNASSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQSHPSGSLPPAHHP--- 242
Cdd:PHA03247  2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvg 2693
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  243 -----MQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGI----RPQFTAPTQVPVPPG 313
Cdd:PHA03247  2694 sltslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParpaRPPTTAGPPAPAPPA 2773
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  314 WNQLPSGALQPPPAQGSLGPMTTNQGWKKAPLPSPMqAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP- 392
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA-AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPl 2852
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  393 GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGP-RPPQNNPLSQGFQQPVSSPG 471
Cdd:PHA03247  2853 GGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPeRPPQPQAPPPPQPQPQPPPP 2926

                   ...
gi 1958759471  472 RNP 474
Cdd:PHA03247  2927 PQP 2929
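Each alignment row carries a start coordinate, a run of residues and gaps, and an end coordinate, so the numbering can be sanity-checked after copying rows out of the page. The sketch below is illustrative only: `AlignmentRow`, `parse_row`, and `numbering_is_consistent` are hypothetical names, and it assumes rows are whitespace-separated and that letters of either case count as residues while `-` does not.

```python
from typing import NamedTuple


class AlignmentRow(NamedTuple):
    label: str
    start: int
    residues: str
    end: int


def parse_row(line: str) -> AlignmentRow:
    """Split a row such as 'Cdd:PHA03247  2927 PQP 2929' into its fields."""
    tokens = line.split()
    if len(tokens) < 4:
        raise ValueError(f"not an alignment row: {line!r}")
    # The last three tokens are start, sequence, end; the rest is the label.
    return AlignmentRow(" ".join(tokens[:-3]), int(tokens[-3]), tokens[-2], int(tokens[-1]))


def numbering_is_consistent(row: AlignmentRow) -> bool:
    """True when start + (number of residue letters) - 1 equals end."""
    residue_count = sum(ch.isalpha() for ch in row.residues)
    return row.start + residue_count - 1 == row.end


if __name__ == "__main__":
    for text in ("gi 1958759471  472 RNP 474", "Cdd:PHA03247  2927 PQP 2929"):
        row = parse_row(text)
        print(row.label, row.start, row.end, numbering_is_consistent(row))
```

A row that fails this check usually points to a truncated or mis-copied line rather than to a problem with the alignment itself.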
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1040-1260 6.67e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 6.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1040 PPRGPLNPDSQRVPMQQSgnVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNSSSP-LGEMPSLPEA 1118
Cdd:PHA03247  2744 VPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPA 2821
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1119 ---GGSEVPSVSGGPSNMPSHLVVSQNQLMMTG--------PKPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1187
Cdd:PHA03247  2822 aspAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvapggdvRRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958759471 1188 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNP 1260
Cdd:PHA03247  2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP 2960
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1658-1925 1.47e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1658 PSTIPATPLTTNSGLMPPSVAVVGPlhiPQNIKFSSAPVTPNVPSSSPAPniQTGRPLVLSSRATPVPLPSPPCTS---S 1734
Cdd:PHA03247  2735 LPAAPAPPAVPAGPATPGGPARPAR---PPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPAdppA 2809
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1735 PVVAPNPSVQQVKELNPDEASPQTNTSAdQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKG------KVDKIGQ 1808
Cdd:PHA03247  2810 AVLAPAAALPPAASPAGPLPPPTSAQPT-APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPaaparpPVRRLAR 2888
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1809 ILLTKACKKVtgSLEKGEEQYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTPSAPTllKMTSSPMGPSSTSTGPIL 1888
Cdd:PHA03247  2889 PAVSRSTESF--ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT--TDPAGAGEPSGAVPQPWL 2964
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1958759471 1889 pgGALPTSVRSVVTTLVPSELIS---TAPTTKGNHGGITS 1925
Cdd:PHA03247  2965 --GALVPGRVAVPRFRVPQPAPSreaPASSTPPLTGHSLS 3002
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1482-1801 2.63e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 2.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1482 SGAPNVTIKPPGLTDLEVTPPAVSGEDLKK-------ASVIPTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNI 1554
Cdd:pfam05109  446 TGLPSSTHVPTNLTAPASTGPTVSTADVTSptpagttSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTP 525
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1555 APSIP-PVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNTSAALPTHLQSALMSTVVT-MPNVGNKVMvseGQSAAQS 1632
Cdd:pfam05109  526 AVTTPtPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTpTPNATSPTV---GETSPQA 602
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1633 NARPQFITpvfiNSSSIIQVMKGSQPSTIPATPLTTNSGLMPPSVAVVGPLHIPQNIKFSSAPVT----PNVPSSSP--A 1706
Cdd:pfam05109  603 NTTNHTLG----GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNStshmPLLTSAHPtgG 678
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1707 PNIQTGRPLVLSSRATPVPLPSP-PCTSSPVVAPNPSVQQVK--ELNPDEASPQTNTSADQStlPSSQPTTVVSpllANS 1783
Cdd:pfam05109  679 ENITQVTPASTSTHHVSTSSPAPrPGTTSQASGPGNSSTSTKpgEVNVTKGTPPKNATSPQA--PSGQKTAVPT---VTS 753
                          330
                   ....*....|....*...
gi 1958759471 1784 PGSSANRRSPVSSSKGKG 1801
Cdd:pfam05109  754 TGGKANSTTGGKHTTGHG 771
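The per-hit statistics lines and the "Interval E-value" lines in the list above follow a regular layout, so they can be pulled into records for sorting or filtering. This is a hedged sketch written against the lines as they appear in this listing; `parse_stats` and `parse_interval` are hypothetical helper names, not part of any NCBI tool, and the regular expressions assume the field order shown here.

```python
import re

# Matches e.g. "Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 68.88  E-value: 2.53e-11"
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)
# Matches e.g. "48-190 2.67e-60"
INTERVAL_RE = re.compile(r"^(?P<start>\d+)-(?P<end>\d+)\s+(?P<e_value>[\d.eE+-]+)$")


def parse_stats(line: str) -> dict:
    """Parse one 'Pssm-ID: ...' statistics line into typed fields."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def parse_interval(line: str) -> tuple:
    """Parse an 'Interval E-value' line into (start, end, e_value)."""
    m = INTERVAL_RE.match(line.strip())
    if m is None:
        raise ValueError(f"not an interval line: {line!r}")
    return int(m["start"]), int(m["end"]), float(m["e_value"])


if __name__ == "__main__":
    stats = parse_stats("Pssm-ID: 463988  Cd Length: 143  Bit Score: 203.43  E-value: 2.67e-60")
    start, end, e_value = parse_interval("48-190 2.67e-60")
    # Query interval length compared with the model length for the same hit.
    print(end - start + 1, stats["cd_length"], e_value == stats["e_value"])
```

For the pfam13820 hit the query interval 48-190 spans 143 residues, the same number as the reported Cd Length, although the alignment itself contains gaps on both sides.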
 
Name Accession Description Interval E-value
Nucleic_acid_bd pfam13820
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ...
48-190 2.67e-60

Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.


Pssm-ID: 463988  Cd Length: 143  Bit Score: 203.43  E-value: 2.67e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471   48 IFVAFKGNIDDkdFKWKLDTILQSVPGLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820    1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958759471  128 QIEGEGAINLALG---QNRSQDVRMnGPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQGGNA 190
Cdd:pfam13820   79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
377-787 2.53e-11

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 68.88  E-value: 2.53e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  377 QQASQAHTNFPQMSNPGQFTAPQMKSLQGGPS---RVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRP 453
Cdd:pfam09606   60 QQQPQGGQGNGGMGGGQQGMPDPINALQNLAGqgtRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGF 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  454 PQNNPLSQGFQQPVSSPGRNP--MVQQGNVPPNFMVMQQQPPSQGPPSLHPGLG---GQANPNFMQGQVPSTTAaTPGNS 528
Cdd:pfam09606  140 PSQMSRVGRMQPGGQAGGMMQpsSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQgpmGGQMPPQMGVPGMPGPA-DAGAQ 218
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  529 GALQLQANQSVQhAGGQGAGPPQNQMQvshGPPNMMQPSLMGIHGNINNQQAGSSGVPQVTLGSMQGQPQQGPPSQLMGM 608
Cdd:pfam09606  219 MGQQAQANGGMN-PQQMGGAPNQVAMQ---QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAM 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  609 HQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAPhnqmMGPQGQ 688
Cdd:pfam09606  295 PNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANP----MQRGQP 370
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  689 VLLQQNPMIEQIMTNQMQGNKAQFNSQNQSNVMP------GPAQIMRGPTPNmQGNMVQFTGQMSGQMLPQQGPVSNSPS 762
Cdd:pfam09606  371 GMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPqgpgsqPPQSHPGGMIPS-PALIPSPSPQMSQQPAQQRTIGQDSPG 449
                          410       420
                   ....*....|....*....|....*
gi 1958759471  763 QVMGIQGQVLRPPGPSPHMAQQHTD 787
Cdd:pfam09606  450 GSLNTPGQSAVNSPLNPQEEQLYRE 474
PHA03247 PHA03247
large tegument protein UL36; Provisional
166-474 1.29e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 1.29e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  166 PMASGPGLIRMTSPATVMMPQGGNASSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQSHPSGSLPPAHHP--- 242
Cdd:PHA03247  2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvg 2693
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  243 -----MQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGI----RPQFTAPTQVPVPPG 313
Cdd:PHA03247  2694 sltslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParpaRPPTTAGPPAPAPPA 2773
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  314 WNQLPSGALQPPPAQGSLGPMTTNQGWKKAPLPSPMqAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP- 392
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA-AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPl 2852
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  393 GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGP-RPPQNNPLSQGFQQPVSSPG 471
Cdd:PHA03247  2853 GGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPeRPPQPQAPPPPQPQPQPPPP 2926

                   ...
gi 1958759471  472 RNP 474
Cdd:PHA03247  2927 PQP 2929
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
603-774 1.60e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 49.81  E-value: 1.60e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  603 SQLMGMHQQIVPsqgqMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMtPPKQMLPQQGPQMMAPhnqm 682
Cdd:TIGR01628  369 AHLQDQFMQLQP----RMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP-TPMGPGGPLRPNGLAP---- 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  683 MGPQGQVLLQQNPMIEQimtNQMQGNKAQFNSQNQSNVMPGPAQimrGPTPNMQGNMVQFTGQMSgqmlpqqgpvSNSPS 762
Cdd:TIGR01628  440 MNAVRAPSRNAQNAAQK---PPMQPVMYPPNYQSLPLSQDLPQP---QSTASQGGQNKKLAQVLA----------SATPQ 503
                          170
                   ....*....|..
gi 1958759471  763 QVMGIQGQVLRP 774
Cdd:TIGR01628  504 MQKQVLGERLFP 515
PHA03247 PHA03247
large tegument protein UL36; Provisional
1040-1260 6.67e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 6.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1040 PPRGPLNPDSQRVPMQQSgnVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNSSSP-LGEMPSLPEA 1118
Cdd:PHA03247  2744 VPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPA 2821
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1119 ---GGSEVPSVSGGPSNMPSHLVVSQNQLMMTG--------PKPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1187
Cdd:PHA03247  2822 aspAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvapggdvRRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958759471 1188 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNP 1260
Cdd:PHA03247  2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP 2960
PHA03247 PHA03247
large tegument protein UL36; Provisional
199-678 1.16e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  199 PNPELQPRTPRPASQSDAMDPllsGLHIQQ--------------QSHPSGSLPPAHHPMQPVPVNRQMNPANFPQLQQQQ 264
Cdd:PHA03247  2569 PPPRPAPRPSEPAVTSRARRP---DAPPQSarprapvddrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  265 QQQQQQQQQQQQQQQQQLQTRplqqhqqQQPQGIRPQFTAPTQVPVPPGwnqlpsgalqPPPAQGSLgpmtTNQGWKKAP 344
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRR-------ARRLGRAAQASSPPQRPRRRA----------ARPTVGSL----TSLADPPPP 2704
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  345 LPSPmqaqlQARPSLATVQTPSHPPPPYPFGSQQASQAhtnfpqmsNPGQFTAPQMKSLQGGPSRVPTPlqqphltnKSP 424
Cdd:PHA03247  2705 PPTP-----EPAPHALVSATPLPPGPAAARQASPALPA--------APAPPAVPAGPATPGGPARPARP--------PTT 2763
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  425 ASSPSSFQQGSPASSPTVNQTQQQMGPRPPQNNPL---SQGFQQPVSSPGRNPMVQQGNVPpnfmvmqqqppsqgppslh 501
Cdd:PHA03247  2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLpspWDPADPPAAVLAPAAALPPAASP------------------- 2824
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  502 pglggqanpnfmQGQVPSTTAATPGNSGALQLQANQSVQHAGGQGAGPPqnqmqVSHGPPNMMQPSLMGIHGNINNQQAG 581
Cdd:PHA03247  2825 ------------AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGD-----VRRRPPSRSPAAKPAAPARPPVRRLA 2887
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  582 SSGVPQVTLGSMQGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPmiLSRAQLMPQGQMMVNAQNQNLGPSPQ-- 659
Cdd:PHA03247  2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP--RPQPPLAPTTDPAGAGEPSGAVPQPWlg 2965
                          490       500
                   ....*....|....*....|....
gi 1958759471  660 -----RMTPPKQMLPQQGPQMMAP 678
Cdd:PHA03247  2966 alvpgRVAVPRFRVPQPAPSREAP 2989
PHA03247 PHA03247
large tegument protein UL36; Provisional
1658-1925 1.47e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1658 PSTIPATPLTTNSGLMPPSVAVVGPlhiPQNIKFSSAPVTPNVPSSSPAPniQTGRPLVLSSRATPVPLPSPPCTS---S 1734
Cdd:PHA03247  2735 LPAAPAPPAVPAGPATPGGPARPAR---PPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPAdppA 2809
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1735 PVVAPNPSVQQVKELNPDEASPQTNTSAdQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKG------KVDKIGQ 1808
Cdd:PHA03247  2810 AVLAPAAALPPAASPAGPLPPPTSAQPT-APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPaaparpPVRRLAR 2888
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1809 ILLTKACKKVtgSLEKGEEQYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTPSAPTllKMTSSPMGPSSTSTGPIL 1888
Cdd:PHA03247  2889 PAVSRSTESF--ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT--TDPAGAGEPSGAVPQPWL 2964
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1958759471 1889 pgGALPTSVRSVVTTLVPSELIS---TAPTTKGNHGGITS 1925
Cdd:PHA03247  2965 --GALVPGRVAVPRFRVPQPAPSreaPASSTPPLTGHSLS 3002
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1482-1801 2.63e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 2.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1482 SGAPNVTIKPPGLTDLEVTPPAVSGEDLKK-------ASVIPTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNI 1554
Cdd:pfam05109  446 TGLPSSTHVPTNLTAPASTGPTVSTADVTSptpagttSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTP 525
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1555 APSIP-PVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNTSAALPTHLQSALMSTVVT-MPNVGNKVMvseGQSAAQS 1632
Cdd:pfam05109  526 AVTTPtPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTpTPNATSPTV---GETSPQA 602
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1633 NARPQFITpvfiNSSSIIQVMKGSQPSTIPATPLTTNSGLMPPSVAVVGPLHIPQNIKFSSAPVT----PNVPSSSP--A 1706
Cdd:pfam05109  603 NTTNHTLG----GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNStshmPLLTSAHPtgG 678
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1707 PNIQTGRPLVLSSRATPVPLPSP-PCTSSPVVAPNPSVQQVK--ELNPDEASPQTNTSADQStlPSSQPTTVVSpllANS 1783
Cdd:pfam05109  679 ENITQVTPASTSTHHVSTSSPAPrPGTTSQASGPGNSSTSTKpgEVNVTKGTPPKNATSPQA--PSGQKTAVPT---VTS 753
                          330
                   ....*....|....*...
gi 1958759471 1784 PGSSANRRSPVSSSKGKG 1801
Cdd:pfam05109  754 TGGKANSTTGGKHTTGHG 771
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1091-1397 3.63e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.60  E-value: 3.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1091 SNSRKMVYQESPQNSSSPlgemPSLPEAGGSEVPSVSGGPSN--MPSHLVV--SQNQLMMTGPKPGPSPLSATQGATPQQ 1166
Cdd:pfam05109  414 TTTHKVIFSKAPESTTTS----PTLNTTGFAAPNTTTGLPSSthVPTNLTApaSTGPTVSTADVTSPTPAGTTSGASPVT 489
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1167 PPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSEISLSPErlnasiaglfpPQINIP 1242
Cdd:pfam05109  490 PSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSAVTTPT-----------PNATSP 558
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1243 LPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVvsSGKQSNPGTT---KRASPSNSRRSSPG 1319
Cdd:pfam05109  559 TPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTL--GGTSSTPVVTsppKNATSAVTTGQHNI 636
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958759471 1320 SSRKTTPSPGRQNSKAPklTLASQTSTTLLQNMELprnvLVGPTPLANPPLSGSFPNNNGLNSQNPTVPAPAVGTVVE 1397
Cdd:pfam05109  637 TSSSTSSMSLRPSSISE--TLSPSTSDNSTSHMPL----LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQ 708
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1516-1886 6.67e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.44  E-value: 6.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1516 PTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNIAPSIPPVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNT 1595
Cdd:pfam05109  432 PTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTS 511
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1596 SAALPTHLQSALMSTVVT-MPNVGNKVMVSEGQSAAQSNARPQFITPVFINSSSIIQVMKGSQPSTIPATPLTTNS-GLM 1673
Cdd:pfam05109  512 AVTTPTPNATSPTPAVTTpTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNAT 591
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1674 PPSVAVVGPLHIPQNIKFSSAPVTPNV--PSSSPAPNIQTGRPLVLSSRATPVPLpsPPCTSSPVVAPNPSVQQVKELN- 1750
Cdd:pfam05109  592 SPTVGETSPQANTTNHTLGGTSSTPVVtsPPKNATSAVTTGQHNITSSSTSSMSL--RPSSISETLSPSTSDNSTSHMPl 669
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1751 PDEASPQTNTSADQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKgkvdKIGQILLTKACKKVTGSLEKGEE--- 1827
Cdd:pfam05109  670 LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTST----KPGEVNVTKGTPPKNATSPQAPSgqk 745
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958759471 1828 -------QYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTP-----SAPTLLKMTSSPMGPSSTSTGP 1886
Cdd:pfam05109  746 tavptvtSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGDSTTPrtrynATTYLPPSTSSKLRPRWTFTSP 816
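Note on reading the per-hit summary line: each hit above carries one line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", optionally with a bracketed tag such as [Multi-domain]. The following is a minimal Python sketch, assuming only the layout seen on this page, of how such a line could be turned into typed fields. The function name parse_summary and the dictionary keys are hypothetical and are not part of any NCBI tool.

```python
import re

# Hypothetical pattern: it mirrors the labels printed in the summary lines on
# this page and is not an official NCBI format definition.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"      # optional tag, e.g. [Multi-domain]
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),      # None when no bracketed tag is present
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


# Summary lines copied from two hits on this page.
multi = parse_summary(
    "Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.44  E-value: 6.67e-03"
)
plain = parse_summary(
    "Pssm-ID: 463988  Cd Length: 143  Bit Score: 203.43  E-value: 2.67e-60"
)
```

Parsing the E-value as a float keeps it comparable, so hits can be sorted or filtered by a threshold chosen by the reader.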
 
Nucleic_acid_bd pfam13820
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ...
48-190 2.67e-60

Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.


Pssm-ID: 463988  Cd Length: 143  Bit Score: 203.43  E-value: 2.67e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471   48 IFVAFKGNIDDkdFKWKLDTILQSVPGLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820    1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958759471  128 QIEGEGAINLALG---QNRSQDVRMnGPVVSGNSVRMEAGFPMASGPGLIRMTSPATVMMPQGGNA 190
Cdd:pfam13820   79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
377-787 2.53e-11

ARC105 or Med15 subunit of Mediator complex non-fungal; The approximately 70-residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 68.88  E-value: 2.53e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  377 QQASQAHTNFPQMSNPGQFTAPQMKSLQGGPS---RVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRP 453
Cdd:pfam09606   60 QQQPQGGQGNGGMGGGQQGMPDPINALQNLAGqgtRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGF 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  454 PQNNPLSQGFQQPVSSPGRNP--MVQQGNVPPNFMVMQQQPPSQGPPSLHPGLG---GQANPNFMQGQVPSTTAaTPGNS 528
Cdd:pfam09606  140 PSQMSRVGRMQPGGQAGGMMQpsSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQgpmGGQMPPQMGVPGMPGPA-DAGAQ 218
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  529 GALQLQANQSVQhAGGQGAGPPQNQMQvshGPPNMMQPSLMGIHGNINNQQAGSSGVPQVTLGSMQGQPQQGPPSQLMGM 608
Cdd:pfam09606  219 MGQQAQANGGMN-PQQMGGAPNQVAMQ---QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAM 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  609 HQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMMAPhnqmMGPQGQ 688
Cdd:pfam09606  295 PNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANP----MQRGQP 370
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  689 VLLQQNPMIEQIMTNQMQGNKAQFNSQNQSNVMP------GPAQIMRGPTPNmQGNMVQFTGQMSGQMLPQQGPVSNSPS 762
Cdd:pfam09606  371 GMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPqgpgsqPPQSHPGGMIPS-PALIPSPSPQMSQQPAQQRTIGQDSPG 449
                          410       420
                   ....*....|....*....|....*
gi 1958759471  763 QVMGIQGQVLRPPGPSPHMAQQHTD 787
Cdd:pfam09606  450 GSLNTPGQSAVNSPLNPQEEQLYRE 474
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
268-732 7.54e-08

ARC105 or Med15 subunit of Mediator complex non-fungal; The approximately 70-residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 57.71  E-value: 7.54e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  268 QQQQQQQQQQQQQQLQTRPLQQHQQQQPQGIRPQFTAptqvPVPPGwNQLPSGALQPPPAQGSLGPMTTNQGWKK---AP 344
Cdd:pfam09606   64 QGGQGNGGMGGGQQGMPDPINALQNLAGQGTRPQMMG----PMGPG-PGGPMGQQMGGPGTASNLLASLGRPQMPmggAG 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  345 LPSPMQAQLQARPSLATvqtpsHPPPPYPFGSQQASQAHTNFPQMSnPGQFTAPQMKSLQGGPSRVPTPLQQPHLTNKSP 424
Cdd:pfam09606  139 FPSQMSRVGRMQPGGQA-----GGMMQPSSGQPGSGTPNQMGPNGG-PGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGP 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  425 ASSPSSFQQGSPASSPtVNQTQQQMGP--RPPQNNPLSQGFQQPVSSPGRNPMVQQGNVPPNfmvmqqqppsqGPPSLHP 502
Cdd:pfam09606  213 ADAGAQMGQQAQANGG-MNPQQMGGAPnqVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGG-----------GAGQGGP 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  503 GLGGQANPNFMQGQVPSTTAATPGNsgALQLQANQSVQHAGGQGAGPPQNQMQVSHGPPNMMQPSLMGIH------GNIN 576
Cdd:pfam09606  281 GQPMGPPGQQPGAMPNVMSIGDQNN--YQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHletwnpGNFG 358
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  577 NQQAGSSGVPQVTLGSM------QGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQ 650
Cdd:pfam09606  359 GLGANPMQRGQPGMMSSpspvpgQQVRQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPA 438
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  651 NQNLGPSPQRMTPPKQMLPQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKAQFNSQNQsnvMPGPAQIMRG 730
Cdd:pfam09606  439 QQRTIGQDSPGGSLNTPGQSAVNSPLNPQEEQLYREKYRQLTKYIEPLKRMIAKMENDPGDIDKMNK---MKRLLEILSN 515

                   ..
gi 1958759471  731 PT 732
Cdd:pfam09606  516 PS 517
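Note on reading the alignment rows: each block pairs a query row (gi ...) with a subject row (Cdd:...), and each row gives a start coordinate, the aligned residues, and an end coordinate. The following is a minimal Python sketch, assuming that row layout, of how a pair of rows could be split and compared column by column. The helper names are hypothetical, and treating letter case as insignificant is an assumption of this sketch, not something the page states.

```python
def parse_row(row):
    """Split an alignment row into (label, start, residues, end).

    Assumes the last three whitespace-separated tokens are the start
    coordinate, the residue string, and the end coordinate.
    """
    label, start, residues, end = row.rsplit(None, 3)
    return label, int(start), residues, int(end)


def count_identities(query, subject):
    """Return (identical, compared) over columns where neither row has a gap."""
    identical = compared = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # skip gap columns
        compared += 1
        if q.upper() == s.upper():  # case-insensitive comparison (assumption)
            identical += 1
    return identical, compared


# The final two-column block of the Med15 hit above.
q_label, q_start, q_seq, q_end = parse_row("gi 1958759471  731 PT 732")
s_label, s_start, s_seq, s_end = parse_row("Cdd:pfam09606  516 PS 517")
result = count_identities(q_seq, s_seq)
```

For a full hit, the residue strings of successive blocks would be concatenated per row before counting.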
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
430-697 5.13e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 55.04  E-value: 5.13e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  430 SFQQGSPASSPTVNQTQQQMGPRPPQNNPLSQGFQQPVSSpGRNPMVQQGNVPpnfmvmqqqppsqgppSLHP-----GL 504
Cdd:pfam09770  103 NRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRT-GYEKYKEPEPIP----------------DLQVdaslwGV 165
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  505 GGQANPNFMQGQVPSTTAATPGNSG-----------ALQLQANQSVQHAGGQGAGPPQNQMQVSHGPPNMMQPSlmgihg 573
Cdd:pfam09770  166 APKKAAAPAPAPQPAAQPASLPAPSrkmmsleeveaAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQ------ 239
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  574 ninnqqagssgvpqvtlgSMQGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQN 653
Cdd:pfam09770  240 ------------------IQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQ 301
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1958759471  654 LGPSPQRMTPPKQMLPQQGPQMMAP-HNQMMGPQGQVLLQQNPMI 697
Cdd:pfam09770  302 ILQNPNRLSAARVGYPQNPQPGVQPaPAHQAHRQQGSFGRQAPII 346
PHA03247 PHA03247
large tegument protein UL36; Provisional
166-474 1.29e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 1.29e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  166 PMASGPGLIRMTSPATVMMPQGGNASSSMMAPGPNPELQPRTPRPASQSDAMDPLLSGLHIQQQSHPSGSLPPAHHP--- 242
Cdd:PHA03247  2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvg 2693
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  243 -----MQPVPVNRQMNPANFPQLQQQQQQQQQQQQQQQQQQQQQLQTRPLQQHQQQQPQGI----RPQFTAPTQVPVPPG 313
Cdd:PHA03247  2694 sltslADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParpaRPPTTAGPPAPAPPA 2773
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  314 WNQLPSGALQPPPAQGSLGPMTTNQGWKKAPLPSPMqAQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP- 392
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA-AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPl 2852
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  393 GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGP-RPPQNNPLSQGFQQPVSSPG 471
Cdd:PHA03247  2853 GGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPeRPPQPQAPPPPQPQPQPPPP 2926

                   ...
gi 1958759471  472 RNP 474
Cdd:PHA03247  2927 PQP 2929
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
603-774 1.60e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are, respectively, expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 49.81  E-value: 1.60e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  603 SQLMGMHQQIVPsqgqMAQQQGTLNPQNPMILSRAQLMPQGQMMVNAQNQNLGPSPQRMtPPKQMLPQQGPQMMAPhnqm 682
Cdd:TIGR01628  369 AHLQDQFMQLQP----RMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP-TPMGPGGPLRPNGLAP---- 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  683 MGPQGQVLLQQNPMIEQimtNQMQGNKAQFNSQNQSNVMPGPAQimrGPTPNMQGNMVQFTGQMSgqmlpqqgpvSNSPS 762
Cdd:TIGR01628  440 MNAVRAPSRNAQNAAQK---PPMQPVMYPPNYQSLPLSQDLPQP---QSTASQGGQNKKLAQVLA----------SATPQ 503
                          170
                   ....*....|..
gi 1958759471  763 QVMGIQGQVLRP 774
Cdd:TIGR01628  504 MQKQVLGERLFP 515
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
596-739 2.13e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are, respectively, expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 49.42  E-value: 2.13e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  596 QPQQGPPSQ-LMGMHQQIVPSQGQMAQQQGTLNPQNPMilsraqLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQ 674
Cdd:TIGR01628  384 QLPMGSPMGgAMGQPPYYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP 457
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958759471  675 MMAPHNQMMGPQGQVLLQQNPmieqimtnQMQGNKAQFNSqnqsnvMPGPAQIMRGPTPNMQGNM 739
Cdd:TIGR01628  458 PMQPVMYPPNYQSLPLSQDLP--------QPQSTASQGGQ------NKKLAQVLASATPQMQKQV 508
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
592-793 3.34e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 49.26  E-value: 3.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  592 SMQGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPmilsrAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQq 671
Cdd:pfam09770  202 AMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQ-----QQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPD- 275
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  672 gpqmmaPHNQMMGPQGQVLLQQNPmieqimtnqmqgnkaqfnsqnqsNVMPGPAQIMrgPTPNMQGNMVQftgQMSGQML 751
Cdd:pfam09770  276 ------PAQPSIQPQAQQFHQQPP-----------------------PVPVQPTQIL--QNPNRLSAARV---GYPQNPQ 321
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1958759471  752 PQQGPVSNSPSQvmgiqgqvlRPPGPSPHMAQQHTDPATTAN 793
Cdd:pfam09770  322 PGVQPAPAHQAH---------RQQGSFGRQAPIITHPQQLAQ 354
PHA03247 PHA03247
large tegument protein UL36; Provisional
1040-1260 6.67e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 6.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1040 PPRGPLNPDSQRVPMQQSgnVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNSSSP-LGEMPSLPEA 1118
Cdd:PHA03247  2744 VPAGPATPGGPARPARPP--TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAvLAPAAALPPA 2821
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1119 ---GGSEVPSVSGGPSNMPSHLVVSQNQLMMTG--------PKPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1187
Cdd:PHA03247  2822 aspAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvapggdvRRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958759471 1188 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNP 1260
Cdd:PHA03247  2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP 2960
PHA03247 PHA03247
large tegument protein UL36; Provisional
1006-1396 8.66e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 8.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1006 SQPQSQQQQQQMMMMLMMQQDPKSIRLPVSQNVHPPR----GPLNPDSQRVPMQQSGNVPVMVSLQG--PASVPPSPDKQ 1079
Cdd:PHA03247  2574 APRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGpappSPLPPDTHAPDPPPPSPSPAANEPDPhpPPTVPPPERPR 2653
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1080 RMPMPvntPLGSNSRKMVYQESPQNSSSPLG--EMPSLPEAGGS-----EVPSVSGGPSNMPSHLVVSQNQLMMTGPKPG 1152
Cdd:PHA03247  2654 DDPAP---GRVSRPRRARRLGRAAQASSPPQrpRRRAARPTVGSltslaDPPPPPPTPEPAPHALVSATPLPPGPAAARQ 2730
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1153 PSPLSATQGATPQQPPVNSLPsshghhfpnvAAPTQTSRPKTPNrASPRPYYPQTPNNRPP--STEPSEISLSPERLNAS 1230
Cdd:PHA03247  2731 ASPALPAAPAPPAVPAGPATP----------GGPARPARPPTTA-GPPAPAPPAAPAAGPPrrLTRPAVASLSESRESLP 2799
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1231 IAGLFPPQINIPLPPRPNLNRGFDQQGLNPTTLKAIGQAPSnltvsnPPNFAAPQAHKLDSVVVSSGKQSNPGTTKRASP 1310
Cdd:PHA03247  2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP------PPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAA 2873
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1311 SNSRRSSPGSSRKTTPSPGRQNS---------KAPKLTLASQTSTTLLQNMELPRNVLVGPTPLANPPLSGSFPNNNGLN 1381
Cdd:PHA03247  2874 KPAAPARPPVRRLARPAVSRSTEsfalppdqpERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953
                          410
                   ....*....|....*
gi 1958759471 1382 SQNPTVPAPAVGTVV 1396
Cdd:PHA03247  2954 EPSGAVPQPWLGALV 2968
PHA03247 PHA03247
large tegument protein UL36; Provisional
1069-1552 8.73e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 8.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1069 PASVPPSPDK-----QRMPMPVNTPLGSNSRKMVYQESPQNSSSPLGEMPSLP-EAGGSEVPSVSGGPSNMPSHLVVSQN 1142
Cdd:PHA03247  2557 PAAPPAAPDRsvpppRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRgPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1143 QLMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHHfPNVAAPTQTSRPktpnRASPRPYYPQTPNNRPPSTEPSEISL 1222
Cdd:PHA03247  2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRA-AQASSPPQRPRR----RAARPTVGSLTSLADPPPPPPTPEPA 2711
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1223 SPERLNAsiaglfppqinIPLPPRPNLNRgfdqQGLNPTTLKAIGQAPSNLTVSnPPNFAAPQAHKLDSVVVSSGKQSNP 1302
Cdd:PHA03247  2712 PHALVSA-----------TPLPPGPAAAR----QASPALPAAPAPPAVPAGPAT-PGGPARPARPPTTAGPPAPAPPAAP 2775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1303 GTTKRASPSNSRRSSPGSSRKTTPSPgRQNSKAPKLTLASQTSTTLLQNMELPRNVLVGPTPLANPPLSGSFPNnnglns 1382
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESLPSP-WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP------ 2848
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1383 qnptvPAPAVGTVVEDNKESLNVPQDSDCQNSQGRKEQVNTELKAVPIQEAKMVVPEdqskkdgQPLDPNKLPSVEENKT 1462
Cdd:PHA03247  2849 -----SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL-------PPDQPERPPQPQAPPP 2916
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1463 LMSPAMREAPTSLSQLLDNSGAPNVTIKPPGLTDLEVTPPAVSGEDLKKASVIPTLQDPSSKEPSNSLNLPHSNEPCSTL 1542
Cdd:PHA03247  2917 PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPL 2996
                          490
                   ....*....|
gi 1958759471 1543 AHPELSEVSS 1552
Cdd:PHA03247  2997 TGHSLSRVSS 3006
PHA03247 PHA03247
large tegument protein UL36; Provisional
199-678 1.16e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  199 PNPELQPRTPRPASQSDAMDPllsGLHIQQ--------------QSHPSGSLPPAHHPMQPVPVNRQMNPANFPQLQQQQ 264
Cdd:PHA03247  2569 PPPRPAPRPSEPAVTSRARRP---DAPPQSarprapvddrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  265 QQQQQQQQQQQQQQQQQLQTRplqqhqqQQPQGIRPQFTAPTQVPVPPGwnqlpsgalqPPPAQGSLgpmtTNQGWKKAP 344
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRR-------ARRLGRAAQASSPPQRPRRRA----------ARPTVGSL----TSLADPPPP 2704
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  345 LPSPmqaqlQARPSLATVQTPSHPPPPYPFGSQQASQAhtnfpqmsNPGQFTAPQMKSLQGGPSRVPTPlqqphltnKSP 424
Cdd:PHA03247  2705 PPTP-----EPAPHALVSATPLPPGPAAARQASPALPA--------APAPPAVPAGPATPGGPARPARP--------PTT 2763
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  425 ASSPSSFQQGSPASSPTVNQTQQQMGPRPPQNNPL---SQGFQQPVSSPGRNPMVQQGNVPpnfmvmqqqppsqgppslh 501
Cdd:PHA03247  2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLpspWDPADPPAAVLAPAAALPPAASP------------------- 2824
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  502 pglggqanpnfmQGQVPSTTAATPGNSGALQLQANQSVQHAGGQGAGPPqnqmqVSHGPPNMMQPSLMGIHGNINNQQAG 581
Cdd:PHA03247  2825 ------------AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGD-----VRRRPPSRSPAAKPAAPARPPVRRLA 2887
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  582 SSGVPQVTLGSMQGQPQQGPPSQLMGMHQQIVPSQGQMAQQQGTLNPQNPmiLSRAQLMPQGQMMVNAQNQNLGPSPQ-- 659
Cdd:PHA03247  2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP--RPQPPLAPTTDPAGAGEPSGAVPQPWlg 2965
                          490       500
                   ....*....|....*....|....
gi 1958759471  660 -----RMTPPKQMLPQQGPQMMAP 678
Cdd:PHA03247  2966 alvpgRVAVPRFRVPQPAPSREAP 2989
PHA03247 PHA03247
large tegument protein UL36; Provisional
1658-1925 1.47e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1658 PSTIPATPLTTNSGLMPPSVAVVGPlhiPQNIKFSSAPVTPNVPSSSPAPniQTGRPLVLSSRATPVPLPSPPCTS---S 1734
Cdd:PHA03247  2735 LPAAPAPPAVPAGPATPGGPARPAR---PPTTAGPPAPAPPAAPAAGPPR--RLTRPAVASLSESRESLPSPWDPAdppA 2809
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1735 PVVAPNPSVQQVKELNPDEASPQTNTSAdQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKG------KVDKIGQ 1808
Cdd:PHA03247  2810 AVLAPAAALPPAASPAGPLPPPTSAQPT-APPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPaaparpPVRRLAR 2888
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1809 ILLTKACKKVtgSLEKGEEQYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTPSAPTllKMTSSPMGPSSTSTGPIL 1888
Cdd:PHA03247  2889 PAVSRSTESF--ALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT--TDPAGAGEPSGAVPQPWL 2964
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1958759471 1889 pgGALPTSVRSVVTTLVPSELIS---TAPTTKGNHGGITS 1925
Cdd:PHA03247  2965 --GALVPGRVAVPRFRVPQPAPSreaPASSTPPLTGHSLS 3002
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
376-483 2.30e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 43.10  E-value: 2.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  376 SQQASQAHTNFPQMSNPGQFTAPQMKSLQGGPSRVPTPLQQPHLTNKSPASSPSSFQ-----QGSPASSPTVNQTQQQMG 450
Cdd:pfam09770  211 AQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQrpqspQPDPAQPSIQPQAQQFHQ 290
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1958759471  451 PRPPQNNPLSQGFQQP-VSSPGRNPMVQQGNVPP 483
Cdd:pfam09770  291 QPPPVPVQPTQILQNPnRLSAARVGYPQNPQPGV 324
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1482-1801 2.63e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 2.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1482 SGAPNVTIKPPGLTDLEVTPPAVSGEDLKK-------ASVIPTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNI 1554
Cdd:pfam05109  446 TGLPSSTHVPTNLTAPASTGPTVSTADVTSptpagttSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTP 525
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1555 APSIP-PVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNTSAALPTHLQSALMSTVVT-MPNVGNKVMvseGQSAAQS 1632
Cdd:pfam05109  526 AVTTPtPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTpTPNATSPTV---GETSPQA 602
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1633 NARPQFITpvfiNSSSIIQVMKGSQPSTIPATPLTTNSGLMPPSVAVVGPLHIPQNIKFSSAPVT----PNVPSSSP--A 1706
Cdd:pfam05109  603 NTTNHTLG----GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNStshmPLLTSAHPtgG 678
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1707 PNIQTGRPLVLSSRATPVPLPSP-PCTSSPVVAPNPSVQQVK--ELNPDEASPQTNTSADQStlPSSQPTTVVSpllANS 1783
Cdd:pfam05109  679 ENITQVTPASTSTHHVSTSSPAPrPGTTSQASGPGNSSTSTKpgEVNVTKGTPPKNATSPQA--PSGQKTAVPT---VTS 753
                          330
                   ....*....|....*...
gi 1958759471 1784 PGSSANRRSPVSSSKGKG 1801
Cdd:pfam05109  754 TGGKANSTTGGKHTTGHG 771
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
631-760 3.26e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are, respectively, expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.49  E-value: 3.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  631 PMILSRAQLMPQGQMMVNAQNQNLGPSPQRMTPPKQMLPQQGPQMM---APHnQMMGPQGQVLLQQNPMIEQIMTnqMQG 707
Cdd:TIGR01628  355 PLYVALAQRKEQRRAHLQDQFMQLQPRMRQLPMGSPMGGAMGQPPYygqGPQ-QQFNGQPLGWPRMSMMPTPMGP--GGP 431
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1958759471  708 NKAQFNSQNQSNVMPGPAQIMRGPTPNMQGnmVQFTGQMSGQMLPQQGPVSNS 760
Cdd:TIGR01628  432 LRPNGLAPMNAVRAPSRNAQNAAQKPPMQP--VMYPPNYQSLPLSQDLPQPQS 482
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1091-1397 3.63e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.60  E-value: 3.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1091 SNSRKMVYQESPQNSSSPlgemPSLPEAGGSEVPSVSGGPSN--MPSHLVV--SQNQLMMTGPKPGPSPLSATQGATPQQ 1166
Cdd:pfam05109  414 TTTHKVIFSKAPESTTTS----PTLNTTGFAAPNTTTGLPSSthVPTNLTApaSTGPTVSTADVTSPTPAGTTSGASPVT 489
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1167 PPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSEISLSPErlnasiaglfpPQINIP 1242
Cdd:pfam05109  490 PSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSAVTTPT-----------PNATSP 558
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1243 LPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVvsSGKQSNPGTT---KRASPSNSRRSSPG 1319
Cdd:pfam05109  559 TPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTL--GGTSSTPVVTsppKNATSAVTTGQHNI 636
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958759471 1320 SSRKTTPSPGRQNSKAPklTLASQTSTTLLQNMELprnvLVGPTPLANPPLSGSFPNNNGLNSQNPTVPAPAVGTVVE 1397
Cdd:pfam05109  637 TSSSTSSMSLRPSSISE--TLSPSTSDNSTSHMPL----LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQ 708
KAR9 pfam08580
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal ...
1147-1304 4.47e-03

Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules. KAR9 localizes at the shmoo tip in mating cells and at the tip of the growing bud in anaphase.


Pssm-ID: 430088 [Multi-domain]  Cd Length: 684  Bit Score: 42.12  E-value: 4.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1147 TGPKPGPSPlSATQGATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRP---------YYPQTPNNRPPSTEP 1217
Cdd:pfam08580  511 TATSETPTP-ALRPPSRPQPPPPGNRPRWNASTNTNDLDVGHNFKPLTLTTPSPTPsrssrssstLPPVSPLSRDKSRSP 589
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1218 SEISLSPERLNASIAGLFPPQINIPLPPRPNLNrgfdqqglNPTTLKAIGQAPSNLTVSNPPNFAAPQAHKLDSVVVSSG 1297
Cdd:pfam08580  590 APTCRSVSRASRRRASRKPTRIGSPNSRTSLLD--------EPPYPKLTLSKGLPRTPRNRQSYAGTSPSRSVSVSSGLG 661

                   ....*..
gi 1958759471 1298 KQSNPGT 1304
Cdd:pfam08580  662 PQTRPGT 668
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
699-811 5.72e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 41.72  E-value: 5.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  699 QIMTNQMQGNKAQFNSQNQSNVMPG--PAQIMRGPTPNMQGNMVQFTGQMSGQMLPQQGPvsNSPSQVMGIQ--GQVLRP 774
Cdd:TIGR01628  369 AHLQDQFMQLQPRMRQLPMGSPMGGamGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGP--GGPLRPNGLApmNAVRAP 446
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1958759471  775 PGPSPHMAQQHTDPATTANNDVNLSQMMPDVSMQQTS 811
Cdd:TIGR01628  447 SRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQST 483
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
659-825 5.97e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 41.72  E-value: 5.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  659 QRMTPPKQMLPQQGPQMMAPH--NQMMGPQGQVLL-QQNPmieqimtnQMQGNKAQFNSQNQSNvMPGPAQIMRGPTPNM 735
Cdd:TIGR01628  366 QRRAHLQDQFMQLQPRMRQLPmgSPMGGAMGQPPYyGQGP--------QQQFNGQPLGWPRMSM-MPTPMGPGGPLRPNG 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471  736 QGNMVQFTGQMSGQmlpQQGPVSNSPsqvMGIQGQVLRPPGPSPHMAQQHTDPATTANNDVNLSQMMPDVSMQ-QTSMVP 814
Cdd:TIGR01628  437 LAPMNAVRAPSRNA---QNAAQKPPM---QPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQmQKQVLG 510
                          170
                   ....*....|....*
gi 1958759471  815 ----PHVQSMQGNSA 825
Cdd:TIGR01628  511 erlfPLVEAIEPALA 525
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1516-1886 6.67e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.44  E-value: 6.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1516 PTLQDPSSKEPSNSLNLPHSNEPCSTLAHPELSEVSSNIAPSIPPVMSRPVSSSSISTPLPPNQITVFVTSNPITTSSNT 1595
Cdd:pfam05109  432 PTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTS 511
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1596 SAALPTHLQSALMSTVVT-MPNVGNKVMVSEGQSAAQSNARPQFITPVFINSSSIIQVMKGSQPSTIPATPLTTNS-GLM 1673
Cdd:pfam05109  512 AVTTPTPNATSPTPAVTTpTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNAT 591
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1674 PPSVAVVGPLHIPQNIKFSSAPVTPNV--PSSSPAPNIQTGRPLVLSSRATPVPLpsPPCTSSPVVAPNPSVQQVKELN- 1750
Cdd:pfam05109  592 SPTVGETSPQANTTNHTLGGTSSTPVVtsPPKNATSAVTTGQHNITSSSTSSMSL--RPSSISETLSPSTSDNSTSHMPl 669
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958759471 1751 PDEASPQTNTSADQSTLPSSQPTTVVSPLLANSPGSSANRRSPVSSSKGKgkvdKIGQILLTKACKKVTGSLEKGEE--- 1827
Cdd:pfam05109  670 LTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTST----KPGEVNVTKGTPPKNATSPQAPSgqk 745
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958759471 1828 -------QYGADGETEGPGLETTTPGLMGTEQCSTELDSKTPTP-----SAPTLLKMTSSPMGPSSTSTGP 1886
Cdd:pfam05109  746 tavptvtSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGDSTTPrtrynATTYLPPSTSSKLRPRWTFTSP 816
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
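
The per-hit summary lines (Pssm-ID, Cd Length, Bit Score, E-value) and the E-value threshold listed above can be read programmatically. The following is a minimal sketch in Python, assuming the plain-text layout shown on this page. The names SUMMARY_RE, parse_summary, and passes_threshold are hypothetical helpers for illustration, not part of any NCBI tool, and treating the threshold as inclusive is an assumption.

```python
import re

# Pattern for a per-hit summary line as it appears in this page's text,
# e.g. "Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.49  E-value: 3.26e-03".
# The bracketed flag is optional (single-domain hits omit it).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return a dict of the summary fields, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold
```

Applied to the summary lines in the hit list, parse_summary yields one record per hit, and passes_threshold with the default of 0.01 mirrors the E-value threshold stated in the search parameters.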

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.