NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|929653947|dbj|BAA11498|]
View 

KIAA0181 [Homo sapiens]

Protein Classification

SANT/Myb-like DNA-binding domain-containing protein; auxin response factor family protein( domain architecture ID 12155422)

SANT (SWI3, ADA2, N-CoR and TFIIIB)/Myb-like DNA-binding domain-containing protein binds DNA and may function as a transcription factor; also contains a Med15 domain, a critical transducer of gene activation signals that control early metazoan development.| auxin response factor family protein containing a B3 DNA binding domain

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Nucleic_acid_bd pfam13820
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ...
48-190 2.30e-61

Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.


:

Pssm-ID: 463988  Cd Length: 143  Bit Score: 206.51  E-value: 2.30e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947    48 IFVAFKGNIDDkdFKWKLDAILKNVPNLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820    1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 929653947   128 QIEGEGAINLALA---QNRSQDVRMnGPMGAGNSVRMEAGFPMASGPGIIRMNNPATVMIPPGGNV 190
Cdd:pfam13820   79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
Med15 super family cl26621
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
520-838 2.91e-09

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


The actual alignment was detected with superfamily member pfam09606:

Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 62.33  E-value: 2.91e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   520 SAGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVqhAGGQGAGPPQNQMQVSHGPPnmmQPSLMGIHGNMNNQQAGTs 599
Cdd:pfam09606   54 SKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNL--AGQGTRPQMMGPMGPGPGGP---MGQQMGGPGTASNLLASL- 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   600 GVPQVNL------SNMQGQPQQGPPSQLMGMHQQIVPSQGQ-MVQQQGTLNPQNPMILSRAQLMPQGQMMVNPPSQNLGP 672
Cdd:pfam09606  128 GRPQMPMggagfpSQMSRVGRMQPGGQAGGMMQPSSGQPGSgTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVP 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   673 S-PQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQSNVMPGPAQIMRGPTPNM 751
Cdd:pfam09606  208 GmPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPP 287
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   752 QgnmvQFTGQMSGQMLPQQGPVNNSPSQVMGIQGQVLRPPGPSPhmaQQHGDPATTANNDVSLSQMMPDVSIQQTNMVPP 831
Cdd:pfam09606  288 G----QQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQ---QQMNQSVGQGGQVVALGGLNHLETWNPGNFGGL 360

                   ....*..
gi 929653947   832 HVQAMQG 838
Cdd:pfam09606  361 GANPMQR 367
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1056-1302 3.49e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 3.49e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1056 PPRGPLNPDSQRMPmqQSGSVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNPSSSPLAEMASLPEA 1135
Cdd:PHA03247 2744 VPAGPATPGGPARP--ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1136 SG---------SEAPSVPGGPNNMPSHVVLPQNQLMMTGP--KPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1204
Cdd:PHA03247 2822 ASpagplppptSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1205 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNPttlkaig 1284
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP------- 2960
                         250
                  ....*....|....*...
gi 929653947 1285 qAPSNLTMNPSNFATPQT 1302
Cdd:PHA03247 2961 -QPWLGALVPGRVAVPRF 2977
PHA03247 super family cl33720
large tegument protein UL36; Provisional
152-544 3.57e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 3.57e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  152 PMGAGNSVRMEAGFPMASGPGIIRMNNPATVMIPPGGNVSSSMMAPGPNPElqPRTPRPASQSDAMDPLlsglhiqqqsh 231
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP--PQRPRRRAARPTVGSL----------- 2695
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  232 psGSLAPPHHPMQPVSvnrqmnPANFPQLQQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHQQQQPQGIRPQFTAPTQ 311
Cdd:PHA03247 2696 --TSLADPPPPPPTPE------PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP 2767
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  312 VPVPPGWNQLPSGALQPPPAQGSLGTMTANQGWKKAPLPGPMQQqLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFP 391
Cdd:PHA03247 2768 APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAV-LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP 2846
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  392 QMSNP-GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRPPqnnPLPQGFQQP 470
Cdd:PHA03247 2847 PPSLPlGGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP---PQPQAPPPP 2917
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 929653947  471 VSSPGRNPMVQQGNVPPnfmvmqqqPPNQGPQSLHPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTATTPGNS 544
Cdd:PHA03247 2918 QPQPQPPPPPQPQPPPP--------PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA 2983
 
Name Accession Description Interval E-value
Nucleic_acid_bd pfam13820
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ...
48-190 2.30e-61

Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.


Pssm-ID: 463988  Cd Length: 143  Bit Score: 206.51  E-value: 2.30e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947    48 IFVAFKGNIDDkdFKWKLDAILKNVPNLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820    1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 929653947   128 QIEGEGAINLALA---QNRSQDVRMnGPMGAGNSVRMEAGFPMASGPGIIRMNNPATVMIPPGGNV 190
Cdd:pfam13820   79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
520-838 2.91e-09

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 62.33  E-value: 2.91e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   520 SAGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVqhAGGQGAGPPQNQMQVSHGPPnmmQPSLMGIHGNMNNQQAGTs 599
Cdd:pfam09606   54 SKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNL--AGQGTRPQMMGPMGPGPGGP---MGQQMGGPGTASNLLASL- 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   600 GVPQVNL------SNMQGQPQQGPPSQLMGMHQQIVPSQGQ-MVQQQGTLNPQNPMILSRAQLMPQGQMMVNPPSQNLGP 672
Cdd:pfam09606  128 GRPQMPMggagfpSQMSRVGRMQPGGQAGGMMQPSSGQPGSgTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVP 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   673 S-PQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQSNVMPGPAQIMRGPTPNM 751
Cdd:pfam09606  208 GmPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPP 287
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   752 QgnmvQFTGQMSGQMLPQQGPVNNSPSQVMGIQGQVLRPPGPSPhmaQQHGDPATTANNDVSLSQMMPDVSIQQTNMVPP 831
Cdd:pfam09606  288 G----QQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQ---QQMNQSVGQGGQVVALGGLNHLETWNPGNFGGL 360

                   ....*..
gi 929653947   832 HVQAMQG 838
Cdd:pfam09606  361 GANPMQR 367
PHA03247 PHA03247
large tegument protein UL36; Provisional
1056-1302 3.49e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 3.49e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1056 PPRGPLNPDSQRMPmqQSGSVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNPSSSPLAEMASLPEA 1135
Cdd:PHA03247 2744 VPAGPATPGGPARP--ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1136 SG---------SEAPSVPGGPNNMPSHVVLPQNQLMMTGP--KPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1204
Cdd:PHA03247 2822 ASpagplppptSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1205 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNPttlkaig 1284
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP------- 2960
                         250
                  ....*....|....*...
gi 929653947 1285 qAPSNLTMNPSNFATPQT 1302
Cdd:PHA03247 2961 -QPWLGALVPGRVAVPRF 2977
PHA03247 PHA03247
large tegument protein UL36; Provisional
152-544 3.57e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 3.57e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  152 PMGAGNSVRMEAGFPMASGPGIIRMNNPATVMIPPGGNVSSSMMAPGPNPElqPRTPRPASQSDAMDPLlsglhiqqqsh 231
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP--PQRPRRRAARPTVGSL----------- 2695
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  232 psGSLAPPHHPMQPVSvnrqmnPANFPQLQQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHQQQQPQGIRPQFTAPTQ 311
Cdd:PHA03247 2696 --TSLADPPPPPPTPE------PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP 2767
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  312 VPVPPGWNQLPSGALQPPPAQGSLGTMTANQGWKKAPLPGPMQQqLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFP 391
Cdd:PHA03247 2768 APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAV-LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP 2846
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  392 QMSNP-GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRPPqnnPLPQGFQQP 470
Cdd:PHA03247 2847 PPSLPlGGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP---PQPQAPPPP 2917
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 929653947  471 VSSPGRNPMVQQGNVPPnfmvmqqqPPNQGPQSLHPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTATTPGNS 544
Cdd:PHA03247 2918 QPQPQPPPPPQPQPPPP--------PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA 2983
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
612-755 6.62e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 47.88  E-value: 6.62e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   612 QPQQGPPSQ-LMGMHQQIVPSQGQMVQQQGTLNPQNPMilsraqLMPQGQMMVNPPSQNLGPSPQRMTPPKQMLSQQGPQ 690
Cdd:TIGR01628  384 QLPMGSPMGgAMGQPPYYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP 457
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 929653947   691 MMAPHNQMMGPQGQVLLQQNPmieqimtnqmqgnkQQFNTQNQSNVMPGPAQIMRGPTPNMQGNM 755
Cdd:TIGR01628  458 PMQPVMYPPNYQSLPLSQDLP--------------QPQSTASQGGQNKKLAQVLASATPQMQKQV 508
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1107-1402 1.12e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.14  E-value: 1.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  1107 SNSRKMVYQESPQNPSSSPLAEmaslpeASGSEAPSVPGGpnnMPSHVVLPQNQLM--MTGPK------PGPSPLSATQG 1178
Cdd:pfam05109  414 TTTHKVIFSKAPESTTTSPTLN------TTGFAAPNTTTG---LPSSTHVPTNLTApaSTGPTvstadvTSPTPAGTTSG 484
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  1179 ATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSE--ISLSPERLNASIAGLF 1252
Cdd:pfam05109  485 ASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSavTTPTPNATSPTPAVTT 564
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  1253 P-PQINIPLPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGATKRASPSNSRR 1331
Cdd:pfam05109  565 PtPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSM 644
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 929653947  1332 SSPGSSRKTTPSPGRQN---SKAPKLTLASQTNAALLQNVELPRNVLVSPTPLANPPVPGSFPNNSGlnPQNST 1402
Cdd:pfam05109  645 SLRPSSISETLSPSTSDnstSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASG--PGNSS 716
 
Name Accession Description Interval E-value
Nucleic_acid_bd pfam13820
Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding ...
48-190 2.30e-61

Putative nucleic acid-binding region; This is a family of putative nucleic acid-binding proteins. Several members are annotated as being nuclear receptor coactivator 6 proteins but this could not be confirmed.


Pssm-ID: 463988  Cd Length: 143  Bit Score: 206.51  E-value: 2.30e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947    48 IFVAFKGNIDDkdFKWKLDAILKNVPNLLHMESSKLKVQKVEPWNSVRVTFNIPREAAERLRILAQSNNQQLRDLGILSV 127
Cdd:pfam13820    1 TFLAVKGNLRM--FQEKLDSIRENVAELLRTEKSKLKVRKVEPWNSVRVTFSIPREAALRLRLLAQHNDPRLRDLGILSV 78
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 929653947   128 QIEGEGAINLALA---QNRSQDVRMnGPMGAGNSVRMEAGFPMASGPGIIRMNNPATVMIPPGGNV 190
Cdd:pfam13820   79 QIEGEGPINLTLAtmvQNPADDIIL-GCSTQLNSQRAEQGWTMLSALGLISDALPLHLRLAESGEY 143
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
520-838 2.91e-09

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 62.33  E-value: 2.91e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   520 SAGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVqhAGGQGAGPPQNQMQVSHGPPnmmQPSLMGIHGNMNNQQAGTs 599
Cdd:pfam09606   54 SKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNL--AGQGTRPQMMGPMGPGPGGP---MGQQMGGPGTASNLLASL- 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   600 GVPQVNL------SNMQGQPQQGPPSQLMGMHQQIVPSQGQ-MVQQQGTLNPQNPMILSRAQLMPQGQMMVNPPSQNLGP 672
Cdd:pfam09606  128 GRPQMPMggagfpSQMSRVGRMQPGGQAGGMMQPSSGQPGSgTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVP 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   673 S-PQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQSNVMPGPAQIMRGPTPNM 751
Cdd:pfam09606  208 GmPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPP 287
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   752 QgnmvQFTGQMSGQMLPQQGPVNNSPSQVMGIQGQVLRPPGPSPhmaQQHGDPATTANNDVSLSQMMPDVSIQQTNMVPP 831
Cdd:pfam09606  288 G----QQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQ---QQMNQSVGQGGQVVALGGLNHLETWNPGNFGGL 360

                   ....*..
gi 929653947   832 HVQAMQG 838
Cdd:pfam09606  361 GANPMQR 367
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
272-748 2.69e-06

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 52.70  E-value: 2.69e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   272 QQQQQQQQQQQQQQLQARPPQQHQQQQPQGIRPQFTAPTQVPVPPGWNQLPSGALQPPPAQGSLGTMTANQGwkKAPLPG 351
Cdd:pfam09606   64 QGGQGNGGMGGGQQGMPDPINALQNLAGQGTRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQMPMG--GAGFPS 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   352 PMQQQLQARPSLATvqtpsHPPPPYPFGSQQASQAHTNFPQMSnPGQFTAPQMKSLQGGPSRVPTPLQ------------ 419
Cdd:pfam09606  142 QMSRVGRMQPGGQA-----GGMMQPSSGQPGSGTPNQMGPNGG-PGQGQAGGMNGGQQGPMGGQMPPQmgvpgmpgpada 215
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   420 QPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMgpRPPQNNPLPQGFQQPVSSPGRNPMVQQGNVPPNFMvmqqqppnq 499
Cdd:pfam09606  216 GAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQ--QQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPM--------- 284
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   500 gpqslhPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTATTPGNSGAPQLQANQNVQHAGGQGAGPPQNQMqvshGPPNM 579
Cdd:pfam09606  285 ------GPPGQQPGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVVALGGLNHL----ETWNP 354
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   580 MQPSLMGIHGnMNNQQAGTSGVPQVnlSNMQGQPQQGPPSQLMGMHQQIVPSQGQMVQQQGTLNPQNPMILSRAQLMPQG 659
Cdd:pfam09606  355 GNFGGLGANP-MQRGQPGMMSSPSP--VPGQQVRQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSP 431
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   660 QMMVNPPSQNLGPSPQRMTPPKQMLSQQGPQMMAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQsnvMPG 739
Cdd:pfam09606  432 QMSQQPAQQRTIGQDSPGGSLNTPGQSAVNSPLNPQEEQLYREKYRQLTKYIEPLKRMIAKMENDPGDIDKMNK---MKR 508

                   ....*....
gi 929653947   740 PAQIMRGPT 748
Cdd:pfam09606  509 LLEILSNPS 517
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
612-782 3.26e-06

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 52.35  E-value: 3.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   612 QPQQGPPSQLMGMHQQIVPSQGQMVQQQGTLNPQnpmilsraQLMPQGQMMVNPPSQNLGPSPQRmtppkqmLSQQGPQM 691
Cdd:pfam09770  209 KPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQ--------QQQPQQQPQQPQQHPGQGHPVTI-------LQRPQSPQ 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   692 MAPHNQMMGPQGQVLLQQNPMIEQIMTNQMQGNKQQFNTQNQsnvMPGPAQIMRGPTPNMQgnmvqftGQMSGQMLPQQG 771
Cdd:pfam09770  274 PDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAARVG---YPQNPQPGVQPAPAHQ-------AHRQQGSFGRQA 343
                          170
                   ....*....|.
gi 929653947   772 PVNNSPSQVMG 782
Cdd:pfam09770  344 PIITHPQQLAQ 354
PHA03247 PHA03247
large tegument protein UL36; Provisional
1056-1302 3.49e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 3.49e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1056 PPRGPLNPDSQRMPmqQSGSVPVMVSLQGPASVPPSPDKQRMPMPVNTPLGSNSRKMVYQESPQNPSSSPLAEMASLPEA 1135
Cdd:PHA03247 2744 VPAGPATPGGPARP--ARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1136 SG---------SEAPSVPGGPNNMPSHVVLPQNQLMMTGP--KPGPSPLSATQGATPQQPPVNSLPSshghhfPNVAAPT 1204
Cdd:PHA03247 2822 ASpagplppptSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLAR------PAVSRST 2895
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1205 QTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLSPERLnasiaglfPPQINIPLPPRPNLNRGFDQQGLNPttlkaig 1284
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP--------PPRPQPPLAPTTDPAGAGEPSGAVP------- 2960
                         250
                  ....*....|....*...
gi 929653947 1285 qAPSNLTMNPSNFATPQT 1302
Cdd:PHA03247 2961 -QPWLGALVPGRVAVPRF 2977
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
455-654 2.97e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 49.26  E-value: 2.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   455 PRPPQNNPLPQGFQQPVSSPGRN---------PMVQQGNVPPNfmvmQQQPPNQGPQSLHPGLGGMPKRLPPGFSAGQAN 525
Cdd:pfam09770  170 AAAPAPAPQPAAQPASLPAPSRKmmsleeveaAMRAQAKKPAQ----QPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQ 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   526 PNFMQGQVPSTtattPGNSGAPQ-LQANQNVQHAGGQGAGPPQNQMQVSHGPPNMMQPslMGIHGNMNNQQAGTSGVPQv 604
Cdd:pfam09770  246 PQQQPQQPQQH----PGQGHPVTiLQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQP--TQILQNPNRLSAARVGYPQ- 318
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 929653947   605 nlsNMQGQPQQGPPSQlmgmHQQIVPSQGQMVQQQgtLNPQNPMILSRAQ 654
Cdd:pfam09770  319 ---NPQPGVQPAPAHQ----AHRQQGSFGRQAPII--THPQQLAQLSEEE 359
PHA03247 PHA03247
large tegument protein UL36; Provisional
152-544 3.57e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 3.57e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  152 PMGAGNSVRMEAGFPMASGPGIIRMNNPATVMIPPGGNVSSSMMAPGPNPElqPRTPRPASQSDAMDPLlsglhiqqqsh 231
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP--PQRPRRRAARPTVGSL----------- 2695
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  232 psGSLAPPHHPMQPVSvnrqmnPANFPQLQQQQQQQQQQQQQQQQQQQQQQQQQLQARPPQQHQQQQPQGIRPQFTAPTQ 311
Cdd:PHA03247 2696 --TSLADPPPPPPTPE------PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP 2767
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  312 VPVPPGWNQLPSGALQPPPAQGSLGTMTANQGWKKAPLPGPMQQqLQARPSLATVQTPSHPPPPYPFGSQQASQAHTNFP 391
Cdd:PHA03247 2768 APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAV-LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP 2846
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  392 QMSNP-GQFTAPqmkslqGGPSRVPTPLQQPHLTNKSPASSPSSFQQGSPASSPTVNQTQQQMGPRPPqnnPLPQGFQQP 470
Cdd:PHA03247 2847 PPSLPlGGSVAP------GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP---PQPQAPPPP 2917
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 929653947  471 VSSPGRNPMVQQGNVPPnfmvmqqqPPNQGPQSLHPGLGGMPKRLPPGFSAGQANPNFMQGQVPSTTATTPGNS 544
Cdd:PHA03247 2918 QPQPQPPPPPQPQPPPP--------PPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA 2983
PHA03247 PHA03247
large tegument protein UL36; Provisional
1085-1559 4.05e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 4.05e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1085 PASVPPSPDkQRMPMPVNTPlgsnsrkmvyqeSPQNPSSSPLAEMASLPEASGSeaPSVPGGPNNMPSHVV----LPQNQ 1160
Cdd:PHA03247 2557 PAAPPAAPD-RSVPPPRPAP------------RPSEPAVTSRARRPDAPPQSAR--PRAPVDDRGDPRGPAppspLPPDT 2621
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1161 LMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLS 1240
Cdd:PHA03247 2622 HAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1241 PERLNASIAGLFPPQINIPLPPRPNLNRgfdqQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGA 1320
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAAR----QASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA 2777
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1321 TKRASPSNSRRSSPGSSRKTTPS---PGRQNSKAPKLTLASQTNAALLQNVELPRNVLVSPTPLANPPVPGSFPNNSGLN 1397
Cdd:PHA03247 2778 GPPRRLTRPAVASLSESRESLPSpwdPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1398 PQNSTVSVAAVGGVVEDNKESLNVPQDSdcqnsqsrkeqvnIELKAVPAQEVKMVVPEDQSKKDGQPSDPNK--LPSVEE 1475
Cdd:PHA03247 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRR-------------LARPAVSRSTESFALPPDQPERPPQPQAPPPpqPQPQPP 2924
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1476 NKNLVSPAMREAPTSLSQL---LDNSGAPNVTIKPPGLTDLEVTPPVVSGEDLKKASVIPTLQDLSSSKEPSNSLNLPHS 1552
Cdd:PHA03247 2925 PPPQPQPPPPPPPRPQPPLaptTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004

                  ....*..
gi 929653947 1553 NELCSSL 1559
Cdd:PHA03247 3005 SSWASSL 3011
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
612-755 6.62e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 47.88  E-value: 6.62e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   612 QPQQGPPSQ-LMGMHQQIVPSQGQMVQQQGTLNPQNPMilsraqLMPQGQMMVNPPSQNLGPSPQRMTPPKQMLSQQGPQ 690
Cdd:TIGR01628  384 QLPMGSPMGgAMGQPPYYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKP 457
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 929653947   691 MMAPHNQMMGPQGQVLLQQNPmieqimtnqmqgnkQQFNTQNQSNVMPGPAQIMRGPTPNMQGNM 755
Cdd:TIGR01628  458 PMQPVMYPPNYQSLPLSQDLP--------------QPQSTASQGGQNKKLAQVLASATPQMQKQV 508
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
619-770 1.84e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 46.34  E-value: 1.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947   619 SQLMGMHQQIVPSQGQMVQqqgtLNPQNPMILSRAQLMPQGQMMVNPPSQNLGPSPQRMtPPKQMLSQQGPQMMAPhnqm 698
Cdd:TIGR01628  369 AHLQDQFMQLQPRMRQLPM----GSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMP-TPMGPGGPLRPNGLAP---- 439
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 929653947   699 MGPQGQVLLQQNPMIEQimtNQMQGNKQQFNTQNQSNVMPGPAQimrGPTPNMQGNMvQFTGQMSGQMLPQQ 770
Cdd:TIGR01628  440 MNAVRAPSRNAQNAAQK---PPMQPVMYPPNYQSLPLSQDLPQP---QSTASQGGQN-KKLAQVLASATPQM 504
PHA03247 PHA03247
large tegument protein UL36; Provisional
929-1490 2.10e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 2.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  929 DLNTPDTRPAgleeadqpPLPGEQGISLDNSGPKLPEFSNRP--PGYPS-QPVEQRPLQQMPPQLMQHVAPPPQPPQQQP 1005
Cdd:PHA03247 2565 DRSVPPPRPA--------PRPSEPAVTSRARRPDAPPQSARPraPVDDRgDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN 2636
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1006 QPQLPQQQQPPPPSQPQSQQQqqqqqqmmmmlmmqqdPKSVRLP--VSQNVHPPRGPLNPDSQRMPMQQSGSVPVmVSLQ 1083
Cdd:PHA03247 2637 EPDPHPPPTVPPPERPRDDPA----------------PGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSL-TSLA 2699
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1084 GPASVPPSPDKQRMPMPVNTPL--GSNSRKMVYQESPQNPSSSPLAEMASLPEASGSEA-PSVPGGPNNmPSHVVLPQNQ 1160
Cdd:PHA03247 2700 DPPPPPPTPEPAPHALVSATPLppGPAAARQASPALPAAPAPPAVPAGPATPGGPARPArPPTTAGPPA-PAPPAAPAAG 2778
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1161 LMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHHFPNVaAPTQTSRPKTPNRASPRPYYPQTPNNRPPSTEPSEISLS 1240
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL-PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1241 PERLNASIAGLFPPQINIPLPPRPNLNRgfdqqglnpTTLKAIGQAPSNLTMNPSNFATPQTHKLDsvvvnSGKQSNSGA 1320
Cdd:PHA03247 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRR---------LARPAVSRSTESFALPPDQPERPPQPQAP-----PPPQPQPQP 2923
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1321 TKRASPSNSRRSSPGSSRKTTPSPGRQNSKAPKLTLASQTNAALLQ-NVELPRNVLVSPTPLANPPVPGSFPNNSGLNPQ 1399
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSR 3003
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947 1400 NSTVSVAAVGGVVEDN-----KESLNVPQDSDCQNSQSRKEQVNIELkavpaqevkmvvpeDQSKKDGQPSDPNKLPSVE 1474
Cdd:PHA03247 3004 VSSWASSLALHEETDPppvslKQTLWPPDDTEDSDADSLFDSDSERS--------------DLEALDPLPPEPHDPFAHE 3069
                         570
                  ....*....|....*.
gi 929653947 1475 ENKNLVSPAMREAPTS 1490
Cdd:PHA03247 3070 PDPATPEAGARESPSS 3085
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1107-1402 1.12e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.14  E-value: 1.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  1107 SNSRKMVYQESPQNPSSSPLAEmaslpeASGSEAPSVPGGpnnMPSHVVLPQNQLM--MTGPK------PGPSPLSATQG 1178
Cdd:pfam05109  414 TTTHKVIFSKAPESTTTSPTLN------TTGFAAPNTTTG---LPSSTHVPTNLTApaSTGPTvstadvTSPTPAGTTSG 484
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  1179 ATPQQPPVNSLPSSHGHHFPNVAAPTQTSRPKTPNRASPRPYYPQ-TPNNRPPS---TEPSE--ISLSPERLNASIAGLF 1252
Cdd:pfam05109  485 ASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTpTPNATSPTlgkTSPTSavTTPTPNATSPTPAVTT 564
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  1253 P-PQINIPLPPRPNLNRGFDQQGLNPTTLKAIGQAPSNLTMNPSNFATPQTHKLDSVVVNSGKQSNSGATKRASPSNSRR 1331
Cdd:pfam05109  565 PtPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSM 644
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 929653947  1332 SSPGSSRKTTPSPGRQN---SKAPKLTLASQTNAALLQNVELPRNVLVSPTPLANPPVPGSFPNNSGlnPQNST 1402
Cdd:pfam05109  645 SLRPSSISETLSPSTSDnstSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASG--PGNSS 716
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
1080-1294 2.98e-03

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 42.11  E-value: 2.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  1080 VSLQGPASVPPSPDKQRMPMPVNTPLGS--NSRKMVYQESPQNPSSSPLAEMASLPEASGSEAPSVPGGPNNMPSHVVLP 1157
Cdd:pfam15279   91 ESVSPGPSSSASPSSSPTSSNSSKPLISvaSSSKLLAPKPHEPPSLPPPPLPPKKGRRHRPGLHPPLGRPPGSPPMSMTP 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 929653947  1158 QNQLMMTGPKPGPSPLSATQGATPQQPPVNSLPSSHGHhfPNVAAPTQTSRPKTPNRASPRPYYPQT-PNNRPP-----S 1231
Cdd:pfam15279  171 RGLLGKPQQHPPPSPLPAFMEPSSMPPPFLRPPPSIPQ--PNSPLSNPMLPGIGPPPKPPRNLGPPSnPMHRPPfsphhP 248
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 929653947  1232 TEPSEISLSPERLNASIAGLFPPQINIPLPPrpnLNRGFDQQGLNPTTLKAIGQAPSNLTMNP 1294
Cdd:pfam15279  249 PPPPTPPGPPPGLPPPPPRGFTPPFGPPFPP---VNMMPNPPEMNFGLPSLAPLVPPVTVLVP 308
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH