Conserved domains on  [gi|768016675|ref|XP_011527023|]

nuclear receptor coactivator 6 isoform X4 [Homo sapiens]

Protein Classification

SANT/Myb-like DNA-binding domain-containing protein; auxin response factor family protein (domain architecture ID 12155422)

SANT (SWI3, ADA2, N-CoR and TFIIIB)/Myb-like DNA-binding domain-containing protein binds DNA and may function as a transcription factor; also contains a Med15 domain, a critical transducer of gene activation signals that control early metazoan development.
Auxin response factor family protein containing a B3 DNA-binding domain.

List of domain hits

Name Accession Description Interval E-value
Nucleic_acid_bd pfam13820
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ...
48-195 2.34e-59

Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.



Pssm-ID: 463988  Cd Length: 143  Bit Score: 200.73  E-value: 2.34e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675    48 IFVAFKGNIDDkdFKWKLDAILKNVPNLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820    1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 768016675   128 QIEGEGAINLALA---QNRSQDVRMnGPMGAGNSVRMEAGFPMASGPGIFflgiiRMNNPATVMIPPGGNV 195
Cdd:pfam13820   79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLI-----SDALPLHLRLAESGEY 143
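The statistics line and the paired alignment rows above follow a regular layout, so they can be read programmatically. The following is a minimal Python sketch, not an NCBI tool: the names `parse_stats` and `count_identities` are hypothetical, and the identity count assumes the display convention that uppercase letters mark aligned columns while lowercase residues and `-` mark unaligned positions or gaps.

```python
import re

# Pattern for lines such as:
#   Pssm-ID: 463988  Cd Length: 143  Bit Score: 200.73  E-value: 2.34e-59
# The optional "[Multi-domain]" flag appears on some hits.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Parse one hit statistics line into a dict of typed values."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def count_identities(query, subject):
    """Return (identical, aligned) over columns where both rows are uppercase.

    Assumption: lowercase letters and '-' are treated as unaligned and skipped.
    """
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


if __name__ == "__main__":
    stats = parse_stats(
        "Pssm-ID: 463988  Cd Length: 143  Bit Score: 200.73  E-value: 2.34e-59"
    )
    print(stats)
    # First 16 columns of the pfam13820 alignment shown above.
    print(count_identities("IFVAFKGNIDDkdFKW", "TFLAVKGNLRM--FQE"))
```

To score a whole hit, the sequence segments of each row would first be concatenated across the wrapped blocks, with the row labels and coordinates stripped.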
Med15 super family cl26621
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
525-843 2.28e-09

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


The actual alignment was detected with superfamily member pfam09606:

Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 62.72  E-value: 2.28e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   525 SAGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVqhAGGQGAGPPQNQMQVSHGPPnmmQPSLMGIHGNMNNQQAGTs 604
Cdd:pfam09606   54 SKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNL--AGQGTRPQMMGPMGPGPGGP---MGQQMGGPGTASNLLASL- 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   605 GVPQVNL------SNMQGQPQQGPPSQLMGMHQQIVPSQGQ-MVQQQGTLNPQNPMILSRAQLMPQGQMMVNPPSQNLGP 677
Cdd:pfam09606  128 GRPQMPMggagfpSQMSRVGRMQPGGQAGGMMQPSSGQPGSgTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVP 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   678 S-PQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQSNVMPGPAQIMRGPTPNM 756
Cdd:pfam09606  208 GmPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPP 287
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   757 QgnmvQFTGQMSGQMLPQQGPVNNSPSQVMGIQGQVLRPPGPSPhmaQQHGDPATTANNDVSLSQMMPDVSIQQTNMVPP 836
Cdd:pfam09606  288 G----QQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQ---QQMNQSVGQGGQVVALGGLNHLETWNPGNFGGL 360

                   ....*..
gi 768016675   837 HVQAMQG 843
Cdd:pfam09606  361 GANPMQR 367
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1061-1307 3.24e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 3.24e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1061 PPRGPLNPDSQRMPmqQSGSVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNPSSSPLAEMASLPEA 1140
Cdd:PHA03247 2744 VPAGPATPGGPARP--ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1141 SG---------SEAPSVPGGPNNMPSHVVLPQNQLMMTGP--KPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1209
Cdd:PHA03247 2822 ASpagplppptSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1210 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNPttlkaig 1289
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP------- 2960
                         250
                  ....*....|....*...
gi 768016675 1290 qAPSNLTMNPSNFATPQT 1307
Cdd:PHA03247 2961 -QPWLGALVPGRVAVPRF 2977
PHA03247 super family cl33720
large tegument protein UL36; Provisional
190-549 1.01e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 1.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  190 PPGGNVS----SSMMAPGPNPELQPRTPRPASQSDAMDPLlsglhiqqqshpsGSLAPPHHPMQPVSvnrqmnPANFPQL 265
Cdd:PHA03247 2656 PAPGRVSrprrARRLGRAAQASSPPQRPRRRAARPTVGSL-------------TSLADPPPPPPTPE------PAPHALV 2716
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  266 QQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHQQQQPQGIRPQFTAPTQVPVPPGWNQLPSGALQPPPAQGSLGTMTA 345
Cdd:PHA03247 2717 SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRE 2796
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  346 NQGWKKAPLPGPMQQqLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP-GQFTAPqmkslqGGPSRVPTPLQ 424
Cdd:PHA03247 2797 SLPSPWDPADPPAAV-LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlGGSVAP------GGDVRRRPPSR 2869
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  425 QPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRPPqnnPLPQGFQQPVSSPGRNPMVQQGNVPPnfmvmqqqPPNQ 504
Cdd:PHA03247 2870 SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP---PQPQAPPPPQPQPQPPPPPQPQPPPP--------PPPR 2938
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*
gi 768016675  505 GPQSLHPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTATTPGNS 549
Cdd:PHA03247 2939 PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA 2983
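Several of the hit intervals listed above share residues on the query. As a rough illustration, the sketch below checks which of the four intervals from this list overlap and by how many residues. It assumes intervals are 1-based and inclusive, as displayed; the function names are hypothetical and this is not part of the CDD output.

```python
from itertools import combinations

# (name, start, end) for the four hits listed above, copied from the
# Interval column; coordinates are treated as 1-based and inclusive.
HITS = [
    ("Nucleic_acid_bd", 48, 195),
    ("Med15", 525, 843),
    ("PHA03247", 1061, 1307),
    ("PHA03247", 190, 549),
]


def overlap(a, b):
    """Number of query residues shared by two inclusive intervals."""
    start = max(a[1], b[1])
    end = min(a[2], b[2])
    return max(0, end - start + 1)


def overlapping_pairs(hits):
    """Return (hit_a, hit_b, shared_residues) for every overlapping pair."""
    pairs = []
    for a, b in combinations(hits, 2):
        shared = overlap(a, b)
        if shared > 0:
            pairs.append((a, b, shared))
    return pairs


if __name__ == "__main__":
    for a, b, shared in overlapping_pairs(HITS):
        print(f"{a[0]} {a[1]}-{a[2]} overlaps {b[0]} {b[1]}-{b[2]} by {shared} residues")
```

Small overlaps at interval boundaries, such as these, are common when adjacent hits are reported against low-complexity regions.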
 
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
617-760 6.42e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 47.88  E-value: 6.42e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   617 QPQQGPPSQ-LMGMHQQIVPSQGQMVQQQGTLNPQNPMilsraqLMPQGQMMVNPPSQNLGPSPQRMTPPKQMLSQQGPQ 695
Cdd:TIGR01628  384 QLPMGSPMGgAMGQPPYYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP 457
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 768016675   696 MMAPHNQMMGPQGQVLLQQNPmieqimtnqmqgnkQQFNTQNQSNVMPGPAQIMRGPTPNMQGNM 760
Cdd:TIGR01628  458 PMQPVMYPPNYQSLPLSQDLP--------------QPQSTASQGGQNKKLAQVLASATPQMQKQV 508
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1112-1407 1.07e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.14  E-value: 1.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  1112 SNSRKMVYQESPQNPSSSPLAEmaslpeASGSEAPSVPGGpnnMPSHVVLPQNQLM--MTGPK------PGPSPLSATQG 1183
Cdd:pfam05109  414 TTTHKVIFSKAPESTTTSPTLN------TTGFAAPNTTTG---LPSSTHVPTNLTApaSTGPTvstadvTSPTPAGTTSG 484
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  1184 ATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSE--ISLSPERLNASIAGLF 1257
Cdd:pfam05109  485 ASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSavTTPTPNATSPTPAVTT 564
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  1258 P-PQINIPLPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGATKRASPSNSRR 1336
Cdd:pfam05109  565 PtPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSM 644
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 768016675  1337 SSPGSSRKTTPSPGRQN---SKAPKLTLASQTNAALLQNVELPRNVLVSPTPLANPPVPGSFPNNSGlnPQNST 1407
Cdd:pfam05109  645 SLRPSSISETLSPSTSDnstSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASG--PGNSS 716
 
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
277-753 2.16e-06

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 53.09  E-value: 2.16e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   277 QQQQQQQQQQQQQQLQARPPQQHQQQQPQGIRPQFTAptqvPVPPGwNQLPSGALQPPPAQGSLGTMTANQGWKK---AP 353
Cdd:pfam09606   64 QGGQGNGGMGGGQQGMPDPINALQNLAGQGTRPQMMG----PMGPG-PGGPMGQQMGGPGTASNLLASLGRPQMPmggAG 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   354 LPGPMQQQLQARPSLATvqtpsHPPPPYPFGSQQASQAHTNFPQMSnPGQFTAPQMKSLQGGPSRVPTPLQ--------- 424
Cdd:pfam09606  139 FPSQMSRVGRMQPGGQA-----GGMMQPSSGQPGSGTPNQMGPNGG-PGQGQAGGMNGGQQGPMGGQMPPQmgvpgmpgp 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   425 ---QPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMgpRPPQNNPLPQGFQQPVSSPGRNPMVQQGNVPPNFMvmqqqp 501
Cdd:pfam09606  213 adaGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQ--QQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPM------ 284
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   502 pnqgpqslhPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVQHAGGQGAGPPQNQMqvshGP 581
Cdd:pfam09606  285 ---------GPPGQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHL----ET 351
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   582 PNMMQPSLMGIHGnMNNQQAGTSGVPQVnlSNMQGQPQQGPPSQLMGMHQQIVPSQGQMVQQQGTLNPQNPMILSRAQLM 661
Cdd:pfam09606  352 WNPGNFGGLGANP-MQRGQPGMMSSPSP--VPGQQVRQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPS 428
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   662 PQGQMMVNPPSQNLGPSPQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQsnv 741
Cdd:pfam09606  429 PSPQMSQQPAQQRTIGQDSPGGSLNTPGQSAVNSPLNPQEEQLYREKYRQLTKYIEPLKRMIAKMENDPGDIDKMNK--- 505
                          490
                   ....*....|..
gi 768016675   742 MPGPAQIMRGPT 753
Cdd:pfam09606  506 MKRLLEILSNPS 517
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
617-787 3.13e-06

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 52.35  E-value: 3.13e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   617 QPQQGPPSQLMGMHQQIVPSQGQMVQQQGTLNPQnpmilsraQLMPQGQMMVNPPSQNLGPSPQRmtppkqmLSQQGPQM 696
Cdd:pfam09770  209 KPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQ--------QQQPQQQPQQPQQHPGQGHPVTI-------LQRPQSPQ 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   697 MAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQsnvMPGPAQIMRGPTPNMQgnmvqftGQMSGQMLPQQG 776
Cdd:pfam09770  274 PDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAARVG---YPQNPQPGVQPAPAHQ-------AHRQQGSFGRQA 343
                          170
                   ....*....|.
gi 768016675   777 PVNNSPSQVMG 787
Cdd:pfam09770  344 PIITHPQQLAQ 354
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
460-659 2.88e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 49.26  E-value: 2.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   460 PRPPQNNPLPQGFQQPVSSPGRN---------PMVQQGNVPPNfmvmQQQPPNQGPQSLHPGLGGMPKRLPPGFSAGQAN 530
Cdd:pfam09770  170 AAAPAPAPQPAAQPASLPAPSRKmmsleeveaAMRAQAKKPAQ----QPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQ 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   531 PNFMQGQVPSTtattPGNSGAPQ-LQANQNVQHAGGQGAGPPQNQMQVSHGPPNMMQPslMGIHGNMNNQQAGTSGVPQv 609
Cdd:pfam09770  246 PQQQPQQPQQH----PGQGHPVTiLQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQP--TQILQNPNRLSAARVGYPQ- 318
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 768016675   610 nlsNMQGQPQQGPPSQlmgmHQQIVPSQGQMVQQQgtLNPQNPMILSRAQ 659
Cdd:pfam09770  319 ---NPQPGVQPAPAHQ----AHRQQGSFGRQAPII--THPQQLAQLSEEE 359
PHA03247 PHA03247
large tegument protein UL36; Provisional
1090-1564 3.58e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 3.58e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1090 PASVPPSPDkQRMPMPVNTPlgsnsrkmvyqeSPQNPSSSPLAEMASLPEASGSeaPSVPGGPNNMPSHVV----LPQNQ 1165
Cdd:PHA03247 2557 PAAPPAAPD-RSVPPPRPAP------------RPSEPAVTSRARRPDAPPQSAR--PRAPVDDRGDPRGPAppspLPPDT 2621
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1166 LMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLS 1245
Cdd:PHA03247 2622 HAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1246 PERLNASIAGLFPPQINIPLPPRPNLNRgfdqQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGA 1325
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAAR----QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA 2777
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1326 TKRASPSNSRRSSPGSSRKTTPS---PGRQNSKAPKLTLASQTNAALLQNVELPRNVLVSPTPLANPPVPGSFPNNSGLN 1402
Cdd:PHA03247 2778 GPPRRLTRPAVASLSESRESLPSpwdPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1403 PQNSTVSVAAVGGVVEDNKESLNVPQDSdcqnsqsrkeqvnIELKAVPAQEVKMVVPEDQSKKDGQPSDPNK--LPSVEE 1480
Cdd:PHA03247 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRR-------------LARPAVSRSTESFALPPDQPERPPQPQAPPPpqPQPQPP 2924
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1481 NKNLVSPAMREAPTSLSQL---LDNSGAPNVTIKPPGLTDLEVTPPVVSGEDLKKASVIPTLQDLSSSKEPSNSLNLPHS 1557
Cdd:PHA03247 2925 PPPQPQPPPPPPPRPQPPLaptTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004

                  ....*..
gi 768016675 1558 NELCSSL 1564
Cdd:PHA03247 3005 SSWASSL 3011
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
617-760 6.42e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are respectively expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 47.88  E-value: 6.42e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   617 QPQQGPPSQ-LMGMHQQIVPSQGQMVQQQGTLNPQNPMilsraqLMPQGQMMVNPPSQNLGPSPQRMTPPKQMLSQQGPQ 695
Cdd:TIGR01628  384 QLPMGSPMGgAMGQPPYYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP 457
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 768016675   696 MMAPHNQMMGPQGQVLLQQNPmieqimtnqmqgnkQQFNTQNQSNVMPGPAQIMRGPTPNMQGNM 760
Cdd:TIGR01628  458 PMQPVMYPPNYQSLPLSQDLP--------------QPQSTASQGGQNKKLAQVLASATPQMQKQV 508
PHA03247 PHA03247
large tegument protein UL36; Provisional
190-549 1.01e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 1.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  190 PPGGNVS----SSMMAPGPNPELQPRTPRPASQSDAMDPLlsglhiqqqshpsGSLAPPHHPMQPVSvnrqmnPANFPQL 265
Cdd:PHA03247 2656 PAPGRVSrprrARRLGRAAQASSPPQRPRRRAARPTVGSL-------------TSLADPPPPPPTPE------PAPHALV 2716
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  266 QQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHQQQQPQGIRPQFTAPTQVPVPPGWNQLPSGALQPPPAQGSLGTMTA 345
Cdd:PHA03247 2717 SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRE 2796
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  346 NQGWKKAPLPGPMQQqLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNP-GQFTAPqmkslqGGPSRVPTPLQ 424
Cdd:PHA03247 2797 SLPSPWDPADPPAAV-LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlGGSVAP------GGDVRRRPPSR 2869
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  425 QPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRPPqnnPLPQGFQQPVSSPGRNPMVQQGNVPPnfmvmqqqPPNQ 504
Cdd:PHA03247 2870 SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP---PQPQAPPPPQPQPQPPPPPQPQPPPP--------PPPR 2938
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*
gi 768016675  505 GPQSLHPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTATTPGNS 549
Cdd:PHA03247 2939 PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA 2983
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
624-775 1.77e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens, which are respectively expressed in testis, expressed in platelets, broadly expressed, and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 46.72  E-value: 1.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675   624 SQLMGMHQQIVPSQGQMVQqqgtLNPQNPMILSRAQLMPQGQMMVNPPSQNLGPSPQRMtPPKQMLSQQGPQMMAPhnqm 703
Cdd:TIGR01628  369 AHLQDQFMQLQPRMRQLPM----GSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP-TPMGPGGPLRPNGLAP---- 439
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 768016675   704 MGPQGQVLLQQNPMIEQimtNQMQGNKQQFNTQNQSNVMPGPAQimrGPTPNMQGNMvQFTGQMSGQMLPQQ 775
Cdd:TIGR01628  440 MNAVRAPSRNAQNAAQK---PPMQPVMYPPNYQSLPLSQDLPQP---QSTASQGGQN-KKLAQVLASATPQM 504
PHA03247 PHA03247
large tegument protein UL36; Provisional
934-1495 2.08e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 2.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  934 DLNTPDTRPAgleeadqpPLPGEQGINLDNSGPKLPEFSNRP--PGYPS-QPVEQRPLQQMPPQLMQHVAPPPQPPQQQP 1010
Cdd:PHA03247 2565 DRSVPPPRPA--------PRPSEPAVTSRARRPDAPPQSARPraPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1011 QPQLPQQQQPPPPSQPQSQQQqqqqqqmmmmlmmqqdPKSVRLP--VSQNVHPPRGPLNPDSQRMPMQQSGSVPVmVSLQ 1088
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPA----------------PGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSL-TSLA 2699
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1089 GPASVPPSPDKQRMPMPVNTPL--GSNSRKMVYQESPQNPSSSPLAEMASLPEASGSEA-PSVPGGPNNmPSHVVLPQNQ 1165
Cdd:PHA03247 2700 DPPPPPPTPEPAPHALVSATPLppGPAAARQASPALPAAPAPPAVPAGPATPGGPARPArPPTTAGPPA-PAPPAAPAAG 2778
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1166 LMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHHFPNVaAPTQTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLS 1245
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL-PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1246 PERLNASIAGLFPPQINIPLPPRPNLNRgfdqqglnpTTLKAIGQAPSNLTMNPSNFATPQTHKLDsvvvnSGKQSNSGA 1325
Cdd:PHA03247 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRR---------LARPAVSRSTESFALPPDQPERPPQPQAP-----PPPQPQPQP 2923
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1326 TKRASPSNSRRSSPGSSRKTTPSPGRQNSKAPKLTLASQTNAALLQ-NVELPRNVLVSPTPLANPPVPGSFPNNSGLNPQ 1404
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR 3003
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675 1405 NSTVSVAAVGGVVEDN-----KESLNVPQDSDCQNSQSRKEQVNIELkavpaqevkmvvpeDQSKKDGQPSDPNKLPSVE 1479
Cdd:PHA03247 3004 VSSWASSLALHEETDPppvslKQTLWPPDDTEDSDADSLFDSDSERS--------------DLEALDPLPPEPHDPFAHE 3069
                         570
                  ....*....|....*.
gi 768016675 1480 ENKNLVSPAMREAPTS 1495
Cdd:PHA03247 3070 PDPATPEAGARESPSS 3085
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1112-1407 1.07e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.14  E-value: 1.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  1112 SNSRKMVYQESPQNPSSSPLAEmaslpeASGSEAPSVPGGpnnMPSHVVLPQNQLM--MTGPK------PGPSPLSATQG 1183
Cdd:pfam05109  414 TTTHKVIFSKAPESTTTSPTLN------TTGFAAPNTTTG---LPSSTHVPTNLTApaSTGPTvstadvTSPTPAGTTSG 484
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  1184 ATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSE--ISLSPERLNASIAGLF 1257
Cdd:pfam05109  485 ASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSavTTPTPNATSPTPAVTT 564
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  1258 P-PQINIPLPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGATKRASPSNSRR 1336
Cdd:pfam05109  565 PtPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSM 644
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 768016675  1337 SSPGSSRKTTPSPGRQN---SKAPKLTLASQTNAALLQNVELPRNVLVSPTPLANPPVPGSFPNNSGlnPQNST 1407
Cdd:pfam05109  645 SLRPSSISETLSPSTSDnstSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASG--PGNSS 716
PHA03247 PHA03247
large tegument protein UL36; Provisional
204-575 1.71e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  204 PNPELQPRTPRPASQSDAMDPllsGLHIQQ--------------QSHPSGSLAPPHHPMQPVSVNRQMNPANFPQLQQQQ 269
Cdd:PHA03247 2569 PPPRPAPRPSEPAVTSRARRP---DAPPQSarprapvddrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPT 2645
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  270 QQQQQQQQQQQQQQQQQQQQQLQARPPQQHQQQQPQGIRPQFTAPTQVPV-----PPGWNQLPSGAlqPPPAQGSLGTMT 344
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLtsladPPPPPPTPEPA--PHALVSATPLPP 2723
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  345 ANQGWKKAPLPGPMQQQLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFPQMSNPGQFTAPQMKSLQGGPSRVPTPLQ 424
Cdd:PHA03247 2724 GPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWD 2803
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  425 QPHLTNKSPASSPSSFQQGSPAS----SPTVNQTQQQMGPRPPQNNPLPQGFQQPVSSPGRNPMVQQGNV-------PPN 493
Cdd:PHA03247 2804 PADPPAAVLAPAAALPPAASPAGplppPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkpaaparPPV 2883
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  494 FMVMQQQPPNQGPQSLHPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTATTPGNSGAPqLQANQNVQHAGGQGAGPPQN 573
Cdd:PHA03247 2884 RRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP-LAPTTDPAGAGEPSGAVPQP 2962

                  ..
gi 768016675  574 QM 575
Cdd:PHA03247 2963 WL 2964
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
1085-1299 2.67e-03

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteristic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain, SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. The family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus, mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 42.11  E-value: 2.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  1085 VSLQGPASVPPSPDKQRMPMPVNTPLGS--NSRKMVYQESPQNPSSSPLAEMASLPEASGSEAPSVPGGPNNMPSHVVLP 1162
Cdd:pfam15279   91 ESVSPGPSSSASPSSSPTSSNSSKPLISvaSSSKLLAPKPHEPPSLPPPPLPPKKGRRHRPGLHPPLGRPPGSPPMSMTP 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768016675  1163 QNQLMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHhfPNVAAPTQTSRPKTPNRASPRPYYPQT-PNNRPP-----S 1236
Cdd:pfam15279  171 RGLLGKPQQHPPPSPLPAFMEPSSMPPPFLRPPPSIPQ--PNSPLSNPMLPGIGPPPKPPRNLGPPSnPMHRPPfsphhP 248
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 768016675  1237 TEPSEISLSPERLNASIAGLFPPQINIPLPPrpnLNRGFDQQGLNPTTLKAIGQAPSNLTMNP 1299
Cdd:pfam15279  249 PPPPTPPGPPPGLPPPPPRGFTPPFGPPFPP---VNMMPNPPEMNFGLPSLAPLVPPVTVLVP 308
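
The alignment blocks above pair a query row (gi 768016675) with a model row (Cdd:...), each carrying start and end coordinates. The following is a minimal Python sketch for reading such rows and counting identical columns. It assumes that uppercase letters mark aligned residues and that lowercase letters and dashes mark unaligned positions; that convention is inferred from the display, not stated on this page. The function names are hypothetical.

```python
def parse_alignment_row(line):
    """Split one alignment row into (start, sequence, end).

    Works for both 'gi <id> <start> <seq> <end>' and
    'Cdd:<acc> <start> <seq> <end>' rows, because the last three
    whitespace-separated tokens are always start, sequence, end.
    """
    tokens = line.split()
    return int(tokens[-3]), tokens[-2], int(tokens[-1])


def residue_count(seq):
    """Count residues in a row, ignoring gap characters ('-')."""
    return sum(1 for ch in seq if ch.isalpha())


def aligned_identities(query, subject):
    """Return (identical, aligned) column counts for two equal-length rows.

    Assumption: a column counts as aligned only when both rows show an
    uppercase letter; lowercase letters and dashes are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("rows must have the same number of columns")
    identical = aligned = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned
```

A quick consistency check is that residue_count(seq) equals end - start + 1 for every row; a mismatch usually means the row was damaged during extraction.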
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
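
Each hit in the list above carries a statistics line (Pssm-ID, Cd Length, Bit Score, E-value), and the preset E-value threshold is 0.01. The sketch below shows one way to parse such a line in Python and apply the threshold. The regular expression is modeled only on the lines shown on this page, and the comparison "at or below the threshold" is an assumption; the page does not say whether the cutoff is inclusive. The function names are hypothetical.

```python
import re

# Pattern modeled on lines such as:
#   Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 3.58e-05
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats_line(line):
    """Return the hit statistics as a dict, or None if the line does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumed rule: keep a hit whose E-value is at or below the threshold."""
    return hit["e_value"] <= threshold
```

Under this rule every hit listed on this page passes, since the largest E-value shown is 2.67e-03.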
