NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907123471|ref|XP_036016345|]
View 

histone-lysine N-methyltransferase PRDM9 isoform X3 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PR-SET_PRDM7_9 cd19193
PR-SET domain found in PR domain zinc finger protein 7 (PRDM7) and 9 (PRDM9) and similar ...
246-374 7.86e-80

PR-SET domain found in PR domain zinc finger protein 7 (PRDM7) and 9 (PRDM9) and similar proteins; PRDM7 (also termed PR domain-containing protein 7) is a primate-specific histone methyltransferase that is the result of a recent gene duplication of PRDM9. It selectively catalyzes the trimethylation of H3 lysine 4 (H3K4me3). PRDM9 (also termed PR domain-containing protein 9) is a histone methyltransferase that specifically trimethylates 'Lys-4' of histone H3 (H3K4me3) during meiotic prophase and is essential for proper meiotic progression. It also efficiently mono-, di-, and trimethylates H3K36. Aberrant PRDM9 expression is assciated with with genome instability in cancer.


:

Pssm-ID: 380970 [Multi-domain]  Cd Length: 129  Bit Score: 252.92  E-value: 7.86e-80
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 246 LSLPPGLRISPSGIPEAGLGVWNEAsDLPVGLHFGPYEGQITEDEEAANSGYSWLITKGRNCYEYVDGQDESQANWMRYV 325
Cdd:cd19193     2 LTLPPGLSIKRSSIPGAGLGVWAEA-PIPKGMVFGPYEGEIVEDEEAADSGYSWQIYKGGKLSHYIDAKDESKSNWMRYV 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 1907123471 326 NCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDEYGQELGIK 374
Cdd:cd19193    81 NCARNEEEQNLVAFQYRGKIYYRTCKDIAPGTELLVWYGDEYAKELGIK 129
KRAB super family cl42959
krueppel associated box;
32-89 6.60e-13

krueppel associated box;


The actual alignment was detected with superfamily member smart00349:

Pssm-ID: 214630 [Multi-domain]  Cd Length: 61  Bit Score: 64.15  E-value: 6.60e-13
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907123471   32 FKDISIYFSKEEWAEMGEWEKIRYRNV-KRNYKMLISIGLRAPRPAFMCYQRQAMKPQI 89
Cdd:smart00349   3 FEDVAVYFTQEEWEQLDPAQKNLYRDVmLENYSNLVSLGFQVPKPDLISQLEQGEEPWI 61
SSXRD pfam09514
SSXRD motif; SSX1 can repress transcription, and this has been attributed to a putative ...
181-207 2.61e-10

SSXRD motif; SSX1 can repress transcription, and this has been attributed to a putative Kruppel associated box (KRAB) repression domain at the N-terminus. However, from the analysis of these deletion constructs further repression activity was found at the C-terminus of SSX1. Which has been called the SSXRD (SSX Repression Domain). The potent repression exerted by full-length SSX1 appears to localize to this region.


:

Pssm-ID: 430657  Cd Length: 31  Bit Score: 55.85  E-value: 2.61e-10
                          10        20
                  ....*....|....*....|....*..
gi 1907123471 181 NVEVKMYRLRERKGLAYEEVSEPQDDD 207
Cdd:pfam09514   5 EVEVWMYRLRERKGVVYEEISDPQEDD 31
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
541-846 1.82e-09

FOG: Zn-finger [General function prediction only];


:

Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 60.86  E-value: 1.82e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 541 KPYVCRECGRGFTQNSHLIQHQRTHTGEKPYVCRECGRGFTQK--SDLIKHQRTHTGEKPYvcRECGRGFTQKSDLIKHQ 618
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSrpLELSRHLRTHHNNPSD--LNSKSLPLSNSKASSSS 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 619 RTHTGEKPYV---CRECGRGFTQKSVLIKHQRTHTGEKPYVCRECGRGF--TQKSVLIKHQRTHTgekpyvcRECGRGFT 693
Cdd:COG5048   110 LSSSSSNSNDnnlLSSHSLPPSSRDPQLPDLLSISNLRNNPLPGNNSSSvnTPQSNSLHPPLPAN-------SLSKDPSS 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 694 AKSVLIQHQRTHTGEKPYVCRECGRgFTAKSNLIQHQRTHTGEKPYVCRECGRGFTAKSVLIQHQRTHTGEKPYVCRECG 773
Cdd:COG5048   183 NLSLLISSNVSTSIPSSSENSPLSS-SYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESP 261
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 774 RGFTAKSVLIQHQRTHTGE-------KPYVCRECGRGFTQKSNLIKHQRT--HTGE--KPYVCRE--CGWGFTQKSDLIQ 840
Cdd:COG5048   262 RSSLPTASSQSSSPNESDSssekgfsLPIKSKQCNISFSRSSPLTRHLRSvnHSGEslKPFSCPYslCGKLFSRNDALKR 341

                  ....*.
gi 1907123471 841 HQRTHT 846
Cdd:COG5048   342 HILLHT 347
 
Name Accession Description Interval E-value
PR-SET_PRDM7_9 cd19193
PR-SET domain found in PR domain zinc finger protein 7 (PRDM7) and 9 (PRDM9) and similar ...
246-374 7.86e-80

PR-SET domain found in PR domain zinc finger protein 7 (PRDM7) and 9 (PRDM9) and similar proteins; PRDM7 (also termed PR domain-containing protein 7) is a primate-specific histone methyltransferase that is the result of a recent gene duplication of PRDM9. It selectively catalyzes the trimethylation of H3 lysine 4 (H3K4me3). PRDM9 (also termed PR domain-containing protein 9) is a histone methyltransferase that specifically trimethylates 'Lys-4' of histone H3 (H3K4me3) during meiotic prophase and is essential for proper meiotic progression. It also efficiently mono-, di-, and trimethylates H3K36. Aberrant PRDM9 expression is assciated with with genome instability in cancer.


Pssm-ID: 380970 [Multi-domain]  Cd Length: 129  Bit Score: 252.92  E-value: 7.86e-80
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 246 LSLPPGLRISPSGIPEAGLGVWNEAsDLPVGLHFGPYEGQITEDEEAANSGYSWLITKGRNCYEYVDGQDESQANWMRYV 325
Cdd:cd19193     2 LTLPPGLSIKRSSIPGAGLGVWAEA-PIPKGMVFGPYEGEIVEDEEAADSGYSWQIYKGGKLSHYIDAKDESKSNWMRYV 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 1907123471 326 NCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDEYGQELGIK 374
Cdd:cd19193    81 NCARNEEEQNLVAFQYRGKIYYRTCKDIAPGTELLVWYGDEYAKELGIK 129
KRAB smart00349
krueppel associated box;
32-89 6.60e-13

krueppel associated box;


Pssm-ID: 214630 [Multi-domain]  Cd Length: 61  Bit Score: 64.15  E-value: 6.60e-13
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907123471   32 FKDISIYFSKEEWAEMGEWEKIRYRNV-KRNYKMLISIGLRAPRPAFMCYQRQAMKPQI 89
Cdd:smart00349   3 FEDVAVYFTQEEWEQLDPAQKNLYRDVmLENYSNLVSLGFQVPKPDLISQLEQGEEPWI 61
SSXRD pfam09514
SSXRD motif; SSX1 can repress transcription, and this has been attributed to a putative ...
181-207 2.61e-10

SSXRD motif; SSX1 can repress transcription, and this has been attributed to a putative Kruppel associated box (KRAB) repression domain at the N-terminus. However, from the analysis of these deletion constructs further repression activity was found at the C-terminus of SSX1. Which has been called the SSXRD (SSX Repression Domain). The potent repression exerted by full-length SSX1 appears to localize to this region.


Pssm-ID: 430657  Cd Length: 31  Bit Score: 55.85  E-value: 2.61e-10
                          10        20
                  ....*....|....*....|....*..
gi 1907123471 181 NVEVKMYRLRERKGLAYEEVSEPQDDD 207
Cdd:pfam09514   5 EVEVWMYRLRERKGVVYEEISDPQEDD 31
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
541-846 1.82e-09

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 60.86  E-value: 1.82e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 541 KPYVCRECGRGFTQNSHLIQHQRTHTGEKPYVCRECGRGFTQK--SDLIKHQRTHTGEKPYvcRECGRGFTQKSDLIKHQ 618
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSrpLELSRHLRTHHNNPSD--LNSKSLPLSNSKASSSS 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 619 RTHTGEKPYV---CRECGRGFTQKSVLIKHQRTHTGEKPYVCRECGRGF--TQKSVLIKHQRTHTgekpyvcRECGRGFT 693
Cdd:COG5048   110 LSSSSSNSNDnnlLSSHSLPPSSRDPQLPDLLSISNLRNNPLPGNNSSSvnTPQSNSLHPPLPAN-------SLSKDPSS 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 694 AKSVLIQHQRTHTGEKPYVCRECGRgFTAKSNLIQHQRTHTGEKPYVCRECGRGFTAKSVLIQHQRTHTGEKPYVCRECG 773
Cdd:COG5048   183 NLSLLISSNVSTSIPSSSENSPLSS-SYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESP 261
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 774 RGFTAKSVLIQHQRTHTGE-------KPYVCRECGRGFTQKSNLIKHQRT--HTGE--KPYVCRE--CGWGFTQKSDLIQ 840
Cdd:COG5048   262 RSSLPTASSQSSSPNESDSssekgfsLPIKSKQCNISFSRSSPLTRHLRSvnHSGEslKPFSCPYslCGKLFSRNDALKR 341

                  ....*.
gi 1907123471 841 HQRTHT 846
Cdd:COG5048   342 HILLHT 347
KRAB_A-box cd07765
KRAB (Kruppel-associated box) domain -A box; The KRAB domain is a transcription repression ...
32-68 1.06e-08

KRAB (Kruppel-associated box) domain -A box; The KRAB domain is a transcription repression module, found in a subgroup of the zinc finger proteins (ZFPs) of the C2H2 family, KRAB-ZFPs. KRAB-ZFPs comprise the largest group of transcriptional regulators in mammals, and are only found in tetrapods. These proteins have been shown to play important roles in cell differentiation and organ development, and in regulating viral replication and transcription. A KRAB domain may consist of an A-box, or of an A-box plus either a B-box, a divergent B-box (b), or a C-box. Only the A-box is included in this model. The A-box is needed for repression, the B- and C- boxes are not. KRAB-ZFPs have one or two KRAB domains at their amino-terminal end, and multiple C2H2 zinc finger motifs at their C-termini. Some KRAB-ZFPs also contain a SCAN domain which mediates homo- and hetero-oligomerization. The KRAB domain is a protein-protein interaction module which represses transcription through recruiting corepressors. A key mechanism appears to be the following: KRAB-AFPs tethered to DNA recruit, via their KRAB domain, the repressor KAP1 (KRAB-associated protein-1, also known as transcription intermediary factor 1 beta , KRAB-A interacting protein , and tripartite motif protein 28). The KAP1/ KRAB-AFP complex in turn recruits the heterochromatin protein 1 (HP1) family, and other chromatin modulating proteins, leading to transcriptional repression through heterochromatin formation.


Pssm-ID: 143639  Cd Length: 40  Bit Score: 51.40  E-value: 1.06e-08
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1907123471  32 FKDISIYFSKEEWAEMGEWEKIRYRNVKR-NYKMLISI 68
Cdd:cd07765     3 FEDVAVYFSQEEWELLDPAQRDLYRDVMLeNYENLVSL 40
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
252-370 1.99e-08

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 53.49  E-value: 1.99e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471  252 LRISPSgiPEAGLGVWneAS-DLPVGLHFGPYEGQITEDEEAANSG--YSWLITKGRNCYE-----YVDGQdeSQANWMR 323
Cdd:smart00317   3 LEVFKS--PGKGWGVR--ATeDIPKGEFIGEYVGEIITSEEAEERPkaYDTDGAKAFYLFDidsdlCIDAR--RKGNLAR 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1907123471  324 YVNCARDDEEQNLVAFQ-YHRKIFYRTCRVIRPGCELLVWYGDEYGQE 370
Cdd:smart00317  77 FINHSCEPNCELLFVEVnGDDRIVIFALRDIKPGEELTIDYGSDYANE 124
KRAB pfam01352
KRAB box; The KRAB domain (or Kruppel-associated box) is present in about a third of zinc ...
32-69 1.68e-07

KRAB box; The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.


Pssm-ID: 460171  Cd Length: 42  Bit Score: 48.24  E-value: 1.68e-07
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1907123471  32 FKDISIYFSKEEWAEMGEWEKIRYRNV-KRNYKMLISIG 69
Cdd:pfam01352   4 FEDVAVDFTQEEWALLDPAQRNLYRDVmLENYRNLVSLG 42
zf-H2C2_2 pfam13465
Zinc-finger double domain;
557-582 1.61e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 39.28  E-value: 1.61e-04
                          10        20
                  ....*....|....*....|....*.
gi 1907123471 557 HLIQHQRTHTGEKPYVCRECGRGFTQ 582
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
304-364 1.73e-04

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 41.74  E-value: 1.73e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907123471 304 GRNCYEYVDGQDESQANWMRYVNcaRDDEEQNLVAFQY---HRKIFYRTCRVIRPGCELLVWYG 364
Cdd:pfam00856  54 DEDSEYCIDARALYYGNWARFIN--HSCDPNCEVRVVYvngGPRIVIFALRDIKPGEELTIDYG 115
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
765-813 1.01e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 38.69  E-value: 1.01e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1907123471 765 KPYvCRECGRGFTAKSVLIQHQRTHTgekpYVCRECGRGFTQKSNLIKH 813
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
 
Name Accession Description Interval E-value
PR-SET_PRDM7_9 cd19193
PR-SET domain found in PR domain zinc finger protein 7 (PRDM7) and 9 (PRDM9) and similar ...
246-374 7.86e-80

PR-SET domain found in PR domain zinc finger protein 7 (PRDM7) and 9 (PRDM9) and similar proteins; PRDM7 (also termed PR domain-containing protein 7) is a primate-specific histone methyltransferase that is the result of a recent gene duplication of PRDM9. It selectively catalyzes the trimethylation of H3 lysine 4 (H3K4me3). PRDM9 (also termed PR domain-containing protein 9) is a histone methyltransferase that specifically trimethylates 'Lys-4' of histone H3 (H3K4me3) during meiotic prophase and is essential for proper meiotic progression. It also efficiently mono-, di-, and trimethylates H3K36. Aberrant PRDM9 expression is assciated with with genome instability in cancer.


Pssm-ID: 380970 [Multi-domain]  Cd Length: 129  Bit Score: 252.92  E-value: 7.86e-80
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 246 LSLPPGLRISPSGIPEAGLGVWNEAsDLPVGLHFGPYEGQITEDEEAANSGYSWLITKGRNCYEYVDGQDESQANWMRYV 325
Cdd:cd19193     2 LTLPPGLSIKRSSIPGAGLGVWAEA-PIPKGMVFGPYEGEIVEDEEAADSGYSWQIYKGGKLSHYIDAKDESKSNWMRYV 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 1907123471 326 NCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDEYGQELGIK 374
Cdd:cd19193    81 NCARNEEEQNLVAFQYRGKIYYRTCKDIAPGTELLVWYGDEYAKELGIK 129
PR-SET_PRDM-like cd10534
PR-SET domain found in PRDM (PRDI-BF1 and RIZ homology domain) family of proteins; PRDM family ...
248-364 5.21e-33

PR-SET domain found in PRDM (PRDI-BF1 and RIZ homology domain) family of proteins; PRDM family of proteins is defined based on the conserved N-terminal PR domain, which is closely related to the Su(var)3-9, enhancer of zeste, and trithorax (SET) domains of histone methyltransferases, and is specifically called PR-SET domain. The family consists of 17 members in primates. PRDMs play diverse roles in cell-cycle regulation, differentiation, and meiotic recombination. The family also contains zinc finger protein ZFPM1 and ZFPM2. ZFPM1 (also termed friend of GATA protein 1, FOG-1, friend of GATA 1, zinc finger protein 89A, or zinc finger protein multitype 1) functions as a transcription regulator that plays an essential role in erythroid and megakaryocytic cell differentiation. ZFPM2 (also termed friend of GATA protein 2, FOG-2, friend of GATA 2, zinc finger protein 89B, or zinc finger protein multitype 2) functions as a transcription regulator that plays a central role in heart morphogenesis and development of coronary vessels from epicardium, by regulating genes that are essential during cardiogenesis.


Pssm-ID: 380932  Cd Length: 83  Bit Score: 121.92  E-value: 5.21e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 248 LPPGLRISPSGIPEAGLGVWNEAsDLPVGLHFGPYEGQItedeeaansgyswlitkgrncyeyvdgqdesqaNWMRYVNC 327
Cdd:cd10534     1 LPAGLELVLSSIPEGGLGVFARR-TIPAGTRFGPLEGVV---------------------------------NWMRFVRP 46
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1907123471 328 ARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYG 364
Cdd:cd10534    47 ARNEEEQNLVAYQHGGQIYFRTTRDIPPGEELLVWYS 83
PR-SET_PRDM1 cd19187
PR-SET domain found in PR domain zinc finger protein 1 (PRDM1) and similar proteins; PRDM1 ...
247-371 9.01e-33

PR-SET domain found in PR domain zinc finger protein 1 (PRDM1) and similar proteins; PRDM1 (also termed BLIMP-1, beta-interferon gene positive regulatory domain I-binding factor, PR domain-containing protein 1, positive regulatory domain I-binding factor 1, PRDI-BF1, or PRDI-binding factor 1) acts as a transcription factor that mediates a transcriptional program in various innate and adaptive immune tissue-resident lymphocyte T cell types such as tissue-resident memory T (Trm), natural killer (trNK) and natural killer T (NKT) cells and negatively regulates gene expression of proteins that promote the egress of tissue-resident T-cell populations from non-lymphoid organs.


Pssm-ID: 380964 [Multi-domain]  Cd Length: 128  Bit Score: 123.21  E-value: 9.01e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 247 SLPPGLRISPSGIPEAGLGVWneASD-LPVGLHFGPYEGQI-TEDE--EAANSGYSWLITKGRNCYEYVDGQDESQANWM 322
Cdd:cd19187     2 SLPRNLTLKYSSVGREVLGVW--SSDyIPRGTRFGPLVGEIyTNDPvpKGANRKYFWRIYSNGEFYHYIDGFDPSKSNWM 79
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 1907123471 323 RYVNCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDEYGQEL 371
Cdd:cd19187    80 RYVNPAHSLQEQNLVACQIGMNIYFYTVKPIPPNQELLVWYCREFARRL 128
PR-SET_PRDM11 cd19195
PR-SET domain found in PR domain zinc finger protein 11 (PRDM11) and similar proteins; PRDM11 ...
246-371 1.67e-32

PR-SET domain found in PR domain zinc finger protein 11 (PRDM11) and similar proteins; PRDM11 (also termed PR domain-containing protein 11) may be involved in transcription regulation.


Pssm-ID: 380972  Cd Length: 127  Bit Score: 122.27  E-value: 1.67e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 246 LSLPPGLRISPSGIPEAGLGVWNEAsdLPVGLHFGPYEGQITEDEEAANSgYSWLITKGRNCYEYVDGQDESQANWMRYV 325
Cdd:cd19195     3 LTAPQGIEVVKDTSGESDVRCVDEV--IPKGHIFGPYEGQICTQDKSSGF-FSWLIVDKNNRYKSIDGSDETKANWMRYV 79
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 1907123471 326 NCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDEYGQEL 371
Cdd:cd19195    80 VISREEREQNLLAFQHSEQIYFRACRDIRPGEKLRVWYSEDYMKRL 125
PR-SET_PRDM12 cd19196
PR-SET domain found in PR domain zinc finger protein 12 (PRDM12) and similar proteins; PRDM12 ...
248-373 1.99e-31

PR-SET domain found in PR domain zinc finger protein 12 (PRDM12) and similar proteins; PRDM12 (also termed PR domain-containing protein 12) acts as a transcription factor that is involved in the positive regulation of histone H3-K9 dimethylation.


Pssm-ID: 380973 [Multi-domain]  Cd Length: 130  Bit Score: 119.38  E-value: 1.99e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 248 LPPGLRISPSGIPEAGLGVWneASD-LPVGLHFGPYEGQI---TEDEEAANSGYSW--LITKGRNCYeYVDGQDESQANW 321
Cdd:cd19196     1 LPSQVIIAQSSIPGAGLGVF--SKTwIKEGTEMGPYTGRIvspEDVDPCKNNNLMWevFNEDGTVSH-FIDASQENHRSW 77
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1907123471 322 MRYVNCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDEYGQELGI 373
Cdd:cd19196    78 MTFVNCARNEQEQNLEVVQIGESIYYRAIKDIPPDQELLVWYGNSYNTFLGI 129
PR-SET_PRDM2 cd19188
PR-SET domain found in PR domain zinc finger protein 2 (PRDM2) and similar proteins; PRDM2 ...
248-366 1.50e-29

PR-SET domain found in PR domain zinc finger protein 2 (PRDM2) and similar proteins; PRDM2 (also termed GATA-3-binding protein G3B, lysine N-methyltransferase 8, MTB-or MTE-binding protein, PR domain-containing protein 2, retinoblastoma protein-interacting zinc finger protein, or zinc finger protein RIZ) is S-adenosyl-L-methionine-dependent histone methyltransferase that specifically methylates 'Lys-9' of histone H3. It may function as a DNA-binding transcription factor.


Pssm-ID: 380965  Cd Length: 123  Bit Score: 113.69  E-value: 1.50e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 248 LPPGLRISPSGIPEAGLGVWNEASdLPVGLHFGPYEGQITEDEEAANSGYSWLITKGRNCYEYVDGQDESQANWMRYVNC 327
Cdd:cd19188     4 LPEELELKPSAVDKTRIGVWAKKS-IPKGRKFGPFVGEKKKRSQVKNNVYMWEIYGPKRGWMCVDASDPTKGNWLRYVNW 82
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 1907123471 328 ARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDE 366
Cdd:cd19188    83 ARSGEEQNLFPLQINRAIYYKTLKPIAPGEELLCWYNGE 121
PR-SET_PRDM14 cd19198
PR-SET domain found in PR domain zinc finger protein 14 (PRDM14) and similar proteins; PRDM14 ...
246-373 2.08e-29

PR-SET domain found in PR domain zinc finger protein 14 (PRDM14) and similar proteins; PRDM14 (also termed PR domain-containing protein 14) acts as a transcription factor that has both positive and negative roles on transcription. It acts on regulating epigenetic modifications in the cells, playing a key role in the regulation of cell pluripotency, epigenetic reprogramming, differentiation and development. Aberrant PRDM14 expression is associated with tumorigenesis, cell migration and cell chemotherapeutic drugs resistance.


Pssm-ID: 380975  Cd Length: 133  Bit Score: 113.65  E-value: 2.08e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 246 LSLPPGLRI---SPSGIPEAGLGVwneASDLPVGLHFGPYEG---QITEDEEAANSGYSWLITKGRNCYEYVDGQDESqA 319
Cdd:cd19198     1 LDLPEGLRVlqtSFGGTPHYGVFC---KKTIPKGTRFGPFRGrvvNTSEIKTYDDNSFMWEIFEDGKLSHFIDGRGST-G 76
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1907123471 320 NWMRYVNCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDEYGQELGI 373
Cdd:cd19198    77 NWMSYVNCARYAEEQNLIAIQCQGQIFYESCKEILQGQELLVWYGDCYLQFMGI 130
PR-SET_PRDM6 cd19191
PR-SET domain found in PR domain zinc finger protein 6 (PRDM6) and similar proteins; PRDM6 ...
248-369 5.35e-28

PR-SET domain found in PR domain zinc finger protein 6 (PRDM6) and similar proteins; PRDM6 (also termed PR domain-containing protein 6) is a putative histone-lysine N-methyltransferase that acts as a transcriptional repressor of smooth muscle gene expression. It may specifically methylate 'Lys-20' of histone H4 when associated with other proteins and in vitro.


Pssm-ID: 380968  Cd Length: 128  Bit Score: 109.49  E-value: 5.35e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 248 LPPGLRISPSGIPEAGLGVWnEASDLPVGLHFGPYEGQITEDEE----AANSGYSWLI--TKGRNCYeYVDGQDESQANW 321
Cdd:cd19191     1 LPDEVCLCTSSIPGLGYGIC-AAQRIPQGTWIGPFEGVLVSPEKqigaVRNTQHLWEIydQEGTLQH-FIDGGDPSKSSW 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 1907123471 322 MRYVNCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDEYGQ 369
Cdd:cd19191    79 MRYIRCARHCGEQNLTVVQYRGCIFYRACRDIPRGTELLVWYDDSYTS 126
PR-SET_PRDM16_PRDM3 cd19200
PR-SET domain found in PR domain zinc finger protein 16 (PRDM16), MDS1 and EVI1 complex locus ...
245-370 2.06e-23

PR-SET domain found in PR domain zinc finger protein 16 (PRDM16), MDS1 and EVI1 complex locus protein and similar proteins; PRDM16 (also termed PR domain-containing protein 16, transcription factor MEL1, or MDS1/EVI1-like gene 1) functions as a transcriptional regulator. PRDM16 is preferentially expressed by hematopoietic and neuronal stem cells. It is closely related to paralog of PRDM3 (also termed MDS1 and EVI1 complex locus protein, ecotropic virus integration site 1 protein, EVI-1, myelodysplasia syndrome 1 protein, myelodysplasia syndrome-associated protein 1, or MECOM) which is a nuclear transcription factor essential for the proliferation/maintenance of hematopoietic stem cells (HSCs). PRDM3 and PRDM16 are both directly linked to various aspects of oncogenic transformation.


Pssm-ID: 380977  Cd Length: 135  Bit Score: 96.67  E-value: 2.06e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 245 VLSLPPGLRISPSGIPEAGLGVWNEAsDLPVGLHFGPYEGqiTEDEEAANSGYSW-LITKGRNCYEYVDGQDESQANWMR 323
Cdd:cd19200     7 DIPIPPDFELRESAAVGAGLGVWTKV-RIEVGEKFGPFVG--VQRSSVKDPTYAWeIVDEFGKVKFWIDASEPGTGNWMK 83
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 1907123471 324 YVNCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDE-YGQE 370
Cdd:cd19200    84 YIRSAPSCEQQNLMACQIDEQIYYKVVRDIQPGEELLLYMKAAvYPHE 131
PR-SET_PRDM4 cd19189
PR-SET domain found in PR domain zinc finger protein 4 (PRDM4) and similar proteins; PRDM4 ...
246-372 4.22e-21

PR-SET domain found in PR domain zinc finger protein 4 (PRDM4) and similar proteins; PRDM4 (also termed PR domain-containing protein 4, or PFM1) may function as a transcription factor involved in cell differentiation.


Pssm-ID: 380966  Cd Length: 133  Bit Score: 89.83  E-value: 4.22e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 246 LSLPPGLRISPSgIPEAGLGVWNEASdLPVGLHFGPYEGQIT-----EDEEAANSGYSWLITKGRNCYEYVDGQDESQAN 320
Cdd:cd19189     4 LSLPRQLYLRQS-ETGAEVGVWTKET-IPVRTCFGPLIGQQShsaevADWTDKAAPHIWKIYHNDVLEFCIITTDENECN 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1907123471 321 WMRYVNCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDEYGQELG 372
Cdd:cd19189    82 WMMFVRKARTREEQNLVAYPHDGKIYFCTSRDIPPDQELLFYYSRDYARQLG 133
PR-SET_PRDM10 cd19194
PR-SET domain found in PR domain zinc finger protein 10 (PRDM10) and similar proteins; PRDM10 ...
247-372 5.54e-20

PR-SET domain found in PR domain zinc finger protein 10 (PRDM10) and similar proteins; PRDM10 (also termed PR domain-containing protein 10, or tristanin) may be involved in transcriptional regulation.


Pssm-ID: 380971  Cd Length: 128  Bit Score: 86.64  E-value: 5.54e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 247 SLPPGLRISPSGipEAGLGVWNEASdLPVGLHFGPYEGQITEDEEAANSGYSWLITKGRNCYE-YVDGQDESQANWMRYV 325
Cdd:cd19194     5 SLPLILQIFRFG--ETLGGVFAKRR-IPKRTQFGPLEGPLVKKSELKDNKIHPLELEEDDGEDlYFDLSDENKCNWMMFV 81
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 1907123471 326 NCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDEYGQELG 372
Cdd:cd19194    82 RPAQNHLEQNLVAYQYGQEIYFTTIKNIEPKQELKVWYAASYAEFLG 128
PR-SET_PRDM15 cd19199
PR-SET domain found in PR domain zinc finger protein 15 (PRDM15) and similar proteins; PRDM15 ...
246-367 1.18e-19

PR-SET domain found in PR domain zinc finger protein 15 (PRDM15) and similar proteins; PRDM15 (also termed PR domain-containing protein 15, or zinc finger protein 298 (ZNF298)) may be involved in transcriptional regulation. It plays an essential role as a chromatin factor that modulates the transcription of upstream regulators of WNT and MAPK-ERK signaling to safeguard naive pluripotency.


Pssm-ID: 380976  Cd Length: 126  Bit Score: 85.54  E-value: 1.18e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 246 LSLPPGLRISPsgIPEAGLGVWnEASDLPVGLHFGPYEGQITE--DEEAANSGYswLITKGRNCYeYVDGQDESQANWMR 323
Cdd:cd19199     5 SSLPDNLEIRQ--LEDGSEGVF-ALVPLVKRTQFGPFEAKRVArlDGFAVFPLK--VFEKDGSVV-YLDTSNEDDCNWMM 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 1907123471 324 YVNCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDEY 367
Cdd:cd19199    79 FVRPATDVEHQNLTAYQQGEDIYFTTSRDIQPGAELRVWYAAFY 122
PR-SET_PRDM17 cd10520
PR-SET domain found in PR domain zinc finger protein 17 (PRDM17) and similar proteins; PRDM17 ...
247-366 1.74e-18

PR-SET domain found in PR domain zinc finger protein 17 (PRDM17) and similar proteins; PRDM17 (also termed zinc finger protein 408 (ZNF408)) may be involved in transcriptional regulation.


Pssm-ID: 380918  Cd Length: 121  Bit Score: 82.08  E-value: 1.74e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 247 SLPPGLRISPSGIPEAGLGVWNEASDLPVGLHFGPYEGqitedeEAANSGyswlITKGRNCYEYVDGQD------ESQAN 320
Cdd:cd10520     4 SLPPGLALGPSLAQEERLGVWCVGDALQKGTFLGPLEE------ELESHD----LTEGGSPRQEESGQSgdvlacEQSSK 73
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 1907123471 321 WMRYVNCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGDE 366
Cdd:cd10520    74 WMRFACRARSEEESNVAVVRLSGRLHLRVCKDIEPGSELLLWPEEN 119
PR-SET_PRDM8 cd19192
PR-SET domain found in PR domain zinc finger protein 8 (PRDM8) and similar proteins; PRDM8 ...
254-366 1.73e-16

PR-SET domain found in PR domain zinc finger protein 8 (PRDM8) and similar proteins; PRDM8 (also termed PR domain-containing protein 8) may function as histone methyltransferase, preferentially acting on 'Lys-9' of histone H3.


Pssm-ID: 380969  Cd Length: 131  Bit Score: 76.70  E-value: 1.73e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 254 ISPSGIPEAGL---------GVWNeASDLPVGLHFGPYEGQITEDEEAANSGyswLITKGRNCYEYVDGQDESQAN---- 320
Cdd:cd19192     1 MSHRSLWRGGSksvltdiftSVVT-TTDIPAGTIFGPCVLSFTLGYDIADIA---LKTTDKRVVPYIFRVDTGACNgsse 76
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1907123471 321 ---WMRYVNCARDDEEQNLVAFQY-HRKIFYRTCRVIRPGCELLVWYGDE 366
Cdd:cd19192    77 psdWLRLVQPARDRHEQNLEAFRKnEGQVYFRTLRRIRKGEELLVWYSDE 126
PR-SET_ZFPM cd19201
PR-SET domain found in zinc finger protein ZFPM1, ZFPM2 and similar proteins; ZFPM1 (also ...
246-363 2.67e-15

PR-SET domain found in zinc finger protein ZFPM1, ZFPM2 and similar proteins; ZFPM1 (also termed friend of GATA protein 1, FOG-1, friend of GATA 1, zinc finger protein 89A, or zinc finger protein multitype 1) functions as a transcription regulator that plays an essential role in erythroid and megakaryocytic cell differentiation. ZFPM2 (also termed friend of GATA protein 2, FOG-2, friend of GATA 2, zinc finger protein 89B, or zinc finger protein multitype 2) functions as a transcription regulator that plays a central role in heart morphogenesis and development of coronary vessels from epicardium, by regulating genes that are essential during cardiogenesis.


Pssm-ID: 380978  Cd Length: 122  Bit Score: 72.76  E-value: 2.67e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 246 LSLPPGLR-ISPSGIPEAGLGVWNEAsDLPVGLHFGPYEGQITEDEEaaNSGYSWLITK-GRNCYEYVDGQDESQANWMR 323
Cdd:cd19201     1 LSLPGELElRKPSQDAGRSGGVWAKQ-PLPEGTRFGPYPGKLVKEPL--DPSYEWKVEAqGSKGGEGLLLLTEDSGTWLK 77
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 1907123471 324 YVNCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWY 363
Cdd:cd19201    78 LVRSADDEDEANLILYFKGGQIWCEVTKDIPPGEELILVL 117
PR-SET_PRDM16 cd19213
PR-SET domain found in PR domain zinc finger protein 16 (PRDM16) and similar proteins; PRDM16, ...
248-365 3.62e-15

PR-SET domain found in PR domain zinc finger protein 16 (PRDM16) and similar proteins; PRDM16, also termed PR domain-containing protein 16, or transcription factor MEL1, or MDS1/EVI1-like gene 1, functions as a transcriptional regulator. PRDM16 is preferentially expressed by hematopoietic and neuronal stem cells and is closely related to paralog of PRDM3, both of which are directly linked to various aspects of oncogenic transformation.


Pssm-ID: 380990  Cd Length: 162  Bit Score: 73.76  E-value: 3.62e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 248 LPPGLRISPSGIPEAGLGVWNEaSDLPVGLHFGPYEG--------------QITED-EEAANSGYSWLITKGRNCYEY-V 311
Cdd:cd19213    20 IPSDFELRESSIPGAGLGVWAK-RKIEAGERFGPYTGvqrstlkdtnfgweQILNDvEVSSQEGCITKIVDDLGNEKFcV 98
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1907123471 312 DGQDESQANWMRYVNCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWYGD 365
Cdd:cd19213    99 DAGQAGAGSWLKYIRVACSCDEQNLTACQINEQIYYKVIKDIEPGEELLVYVKD 152
PR-SET_PRDM3 cd19214
PR-SET domain found in MDS1 and EVI1 complex locus protein and similar proteins; PRDM3 (also ...
246-370 7.63e-14

PR-SET domain found in MDS1 and EVI1 complex locus protein and similar proteins; PRDM3 (also termed MDS1 and EVI1 complex locus protein, ecotropic virus integration site 1 protein, EVI-1, myelodysplasia syndrome 1 protein, myelodysplasia syndrome-associated protein 1, or MECOM) is a nuclear transcription factor, which is essential for the proliferation/maintenance of hematopoietic stem cells (HSCs). It is closely related to paralog PRDM16, both o fwhich are directly linked to various aspects of oncogenic transformation.


Pssm-ID: 380991  Cd Length: 158  Bit Score: 69.97  E-value: 7.63e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 246 LSLPPGLRISPSGIPEAGLGVWNEASdLPVGLHFGPYEGQITEDEEAANSGYSWLITKGrNCYEYVDGQDESQANWMRYV 325
Cdd:cd19214    28 IPIPSEFELRESNIPGTGLGIWTKRK-IEVGEKFGPYVGEQRSNLKDPSYGWEVLDEFG-NVKFCIDASQPDVGSWLKYI 105
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 1907123471 326 NCARDDEEQNLVAFQYHRKIFYRTCRVIRPGCELLVWY-GDEYGQE 370
Cdd:cd19214   106 RFAGCYDQHNLVACQINDQIFYRAVADIDPGEELLLFMkSEDYSHE 151
KRAB smart00349
krueppel associated box;
32-89 6.60e-13

krueppel associated box;


Pssm-ID: 214630 [Multi-domain]  Cd Length: 61  Bit Score: 64.15  E-value: 6.60e-13
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907123471   32 FKDISIYFSKEEWAEMGEWEKIRYRNV-KRNYKMLISIGLRAPRPAFMCYQRQAMKPQI 89
Cdd:smart00349   3 FEDVAVYFTQEEWEQLDPAQKNLYRDVmLENYSNLVSLGFQVPKPDLISQLEQGEEPWI 61
PR-SET_PRDM13 cd19197
PR-SET domain found in PR domain zinc finger protein 13 (PRDM13) and similar proteins; PRDM13 ...
311-364 2.72e-12

PR-SET domain found in PR domain zinc finger protein 13 (PRDM13) and similar proteins; PRDM13 (also termed PR domain-containing protein 13) may be involved in transcriptional regulation. It mediates the balance of inhibitory and excitatory neurons in somatosensory circuits.


Pssm-ID: 380974  Cd Length: 103  Bit Score: 63.68  E-value: 2.72e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907123471 311 VDGQDESQANWMRYVNCARDDEEQNLVAFQYHR--KIFYRTCRVIRPGCELLVWYG 364
Cdd:cd19197    41 VDESGSPATEWIGLVRAARNNQEQNLEAIADLPggQIFYRALRDIQPGEELTVWYS 96
SSXRD pfam09514
SSXRD motif; SSX1 can repress transcription, and this has been attributed to a putative ...
181-207 2.61e-10

SSXRD motif; SSX1 can repress transcription, and this has been attributed to a putative Kruppel associated box (KRAB) repression domain at the N-terminus. However, from the analysis of these deletion constructs further repression activity was found at the C-terminus of SSX1. Which has been called the SSXRD (SSX Repression Domain). The potent repression exerted by full-length SSX1 appears to localize to this region.


Pssm-ID: 430657  Cd Length: 31  Bit Score: 55.85  E-value: 2.61e-10
                          10        20
                  ....*....|....*....|....*..
gi 1907123471 181 NVEVKMYRLRERKGLAYEEVSEPQDDD 207
Cdd:pfam09514   5 EVEVWMYRLRERKGVVYEEISDPQEDD 31
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
541-846 1.82e-09

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 60.86  E-value: 1.82e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 541 KPYVCRECGRGFTQNSHLIQHQRTHTGEKPYVCRECGRGFTQK--SDLIKHQRTHTGEKPYvcRECGRGFTQKSDLIKHQ 618
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSrpLELSRHLRTHHNNPSD--LNSKSLPLSNSKASSSS 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 619 RTHTGEKPYV---CRECGRGFTQKSVLIKHQRTHTGEKPYVCRECGRGF--TQKSVLIKHQRTHTgekpyvcRECGRGFT 693
Cdd:COG5048   110 LSSSSSNSNDnnlLSSHSLPPSSRDPQLPDLLSISNLRNNPLPGNNSSSvnTPQSNSLHPPLPAN-------SLSKDPSS 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 694 AKSVLIQHQRTHTGEKPYVCRECGRgFTAKSNLIQHQRTHTGEKPYVCRECGRGFTAKSVLIQHQRTHTGEKPYVCRECG 773
Cdd:COG5048   183 NLSLLISSNVSTSIPSSSENSPLSS-SYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESP 261
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 774 RGFTAKSVLIQHQRTHTGE-------KPYVCRECGRGFTQKSNLIKHQRT--HTGE--KPYVCRE--CGWGFTQKSDLIQ 840
Cdd:COG5048   262 RSSLPTASSQSSSPNESDSssekgfsLPIKSKQCNISFSRSSPLTRHLRSvnHSGEslKPFSCPYslCGKLFSRNDALKR 341

                  ....*.
gi 1907123471 841 HQRTHT 846
Cdd:COG5048   342 HILLHT 347
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
391-774 3.37e-09

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 60.09  E-value: 3.37e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 391 TEIHPCLLCSLAFSSQKFLTQHM-----EwNHRTEIFPGTSA-RINPKPGDPCSDQLQEQHVDSQNKNDKASNEVKRKSK 464
Cdd:COG5048    31 PRPDSCPNCTDSFSRLEHLTRHIrshtgE-KPSQCSYSGCDKsFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSKASSSS 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 465 PRQRISTTFPSTLKEQM--RSEESKRTVEELRT----------GQTTNTEDTVKSFIASEISSIERQCGQYFSDKSNVNE 532
Cdd:COG5048   110 LSSSSSNSNDNNLLSSHslPPSSRDPQLPDLLSisnlrnnplpGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLIS 189
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 533 HQKTHTGEKPYVCRECGRgFTQNSHLIQHQRTHTGEKPYVCRECGRGFTQKSDLIKHQRTHTGEKPYVCRECGRGFTQKS 612
Cdd:COG5048   190 SNVSTSIPSSSENSPLSS-SYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTA 268
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 613 DLIKHQRTHTGE-------KPYVCRECGRGFTQKSVLIKHQRT--HTGE--KPYVCRE--CGRGFTQKSVLIKHQRTHTG 679
Cdd:COG5048   269 SSQSSSPNESDSssekgfsLPIKSKQCNISFSRSSPLTRHLRSvnHSGEslKPFSCPYslCGKLFSRNDALKRHILLHTS 348
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 680 EKPYVCRECGRGFT-------AKSVLIQHQRTHTGEKPYVC--RECGRGFTAKSNLIQHQRTHTGEKPYVCR--ECGRGF 748
Cdd:COG5048   349 ISPAKEKLLNSSSKfspllnnEPPQSLQQYKDLKNDKKSETlsNSCIRNFKRDSNLSLHIITHLSFRPYNCKnpPCSKSF 428
                         410       420
                  ....*....|....*....|....*.
gi 1907123471 749 TAKSVLIQHQRTHTGEKPYVCRECGR 774
Cdd:COG5048   429 NRHYNLIPHKKIHTNHAPLLCSILKS 454
SET_LegAS4-like cd10522
SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and ...
253-367 4.76e-09

SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and similar proteins; LegAS4 is a type IV secretion system effector of Legionella pneumophila. It contains a SET domain that is involved in the modification of Lys4 of histone H3 (H3K4) in the nucleolus of the host cell, thereby enhancing heterochromatic rDNA transcription. It also contains an ankyrin repeat domain of unknown function at its C-terminal region.


Pssm-ID: 380920 [Multi-domain]  Cd Length: 122  Bit Score: 55.04  E-value: 4.76e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 253 RISPSGIP---EAGLGVWnEASDLPVGLHFGPYEG-QITEDEEAANS----GYSWLITKGRNCYeYVDGqdESQANWMRY 324
Cdd:cd10522     1 KVDISMIPnlsHNGLGLF-AAETIAKGEFVGEYTGeVLDRWEEDRDSvyhyDPLYPFDLNGDIL-VIDA--GKKGNLTRF 76
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 1907123471 325 VNCARDDeeqNLVAFQYHRK----IFYRTCRVIRPGCELLVWYGDEY 367
Cdd:cd10522    77 INHSDQP---NLELIVRTLKgeqhIGFVAIRDIKPGEELFISYGPKY 120
KRAB_A-box cd07765
KRAB (Kruppel-associated box) domain -A box; The KRAB domain is a transcription repression ...
32-68 1.06e-08

KRAB (Kruppel-associated box) domain -A box; The KRAB domain is a transcription repression module, found in a subgroup of the zinc finger proteins (ZFPs) of the C2H2 family, KRAB-ZFPs. KRAB-ZFPs comprise the largest group of transcriptional regulators in mammals, and are only found in tetrapods. These proteins have been shown to play important roles in cell differentiation and organ development, and in regulating viral replication and transcription. A KRAB domain may consist of an A-box, or of an A-box plus either a B-box, a divergent B-box (b), or a C-box. Only the A-box is included in this model. The A-box is needed for repression, the B- and C- boxes are not. KRAB-ZFPs have one or two KRAB domains at their amino-terminal end, and multiple C2H2 zinc finger motifs at their C-termini. Some KRAB-ZFPs also contain a SCAN domain which mediates homo- and hetero-oligomerization. The KRAB domain is a protein-protein interaction module which represses transcription through recruiting corepressors. A key mechanism appears to be the following: KRAB-AFPs tethered to DNA recruit, via their KRAB domain, the repressor KAP1 (KRAB-associated protein-1, also known as transcription intermediary factor 1 beta , KRAB-A interacting protein , and tripartite motif protein 28). The KAP1/ KRAB-AFP complex in turn recruits the heterochromatin protein 1 (HP1) family, and other chromatin modulating proteins, leading to transcriptional repression through heterochromatin formation.


Pssm-ID: 143639  Cd Length: 40  Bit Score: 51.40  E-value: 1.06e-08
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 1907123471  32 FKDISIYFSKEEWAEMGEWEKIRYRNVKR-NYKMLISI 68
Cdd:cd07765     3 FEDVAVYFSQEEWELLDPAQRDLYRDVMLeNYENLVSL 40
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
252-370 1.99e-08

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 53.49  E-value: 1.99e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471  252 LRISPSgiPEAGLGVWneAS-DLPVGLHFGPYEGQITEDEEAANSG--YSWLITKGRNCYE-----YVDGQdeSQANWMR 323
Cdd:smart00317   3 LEVFKS--PGKGWGVR--ATeDIPKGEFIGEYVGEIITSEEAEERPkaYDTDGAKAFYLFDidsdlCIDAR--RKGNLAR 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1907123471  324 YVNCARDDEEQNLVAFQ-YHRKIFYRTCRVIRPGCELLVWYGDEYGQE 370
Cdd:smart00317  77 FINHSCEPNCELLFVEVnGDDRIVIFALRDIKPGEELTIDYGSDYANE 124
PR-SET_PRDM5 cd19190
PR-SET domain found in PR domain zinc finger protein 5 (PRDM5) and similar proteins; PRDM5 ...
263-365 9.47e-08

PR-SET domain found in PR domain zinc finger protein 5 (PRDM5) and similar proteins; PRDM5 (also termed PR domain-containing protein 5) is a sequence-specific DNA-binding transcription factor that represses transcription at least in part by recruitment of the histone methyltransferase EHMT2/G9A and histone deacetylases such as HDAC1.


Pssm-ID: 380967  Cd Length: 127  Bit Score: 51.52  E-value: 9.47e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 263 GLGVWNeASDLPVGLHFGPYEGQ---ITEDEEAANSGYSWLI--TKGRNCYeYVDGQDESQANWMRYVNCARDDEEQNLV 337
Cdd:cd19190    19 GMGLYT-ARRVKKGEKFGPFAGEkrmPNELDESMDPRLMWEVrgSKGEVLY-ILDASNPRHSNWLRFVHEAPSQEQKNLA 96
                          90       100
                  ....*....|....*....|....*...
gi 1907123471 338 AFQYHRKIFYRTCRVIRPGCELLVWYGD 365
Cdd:cd19190    97 AIQEGENIFYLAVDDIETDTELLIGYLD 124
KRAB pfam01352
KRAB box; The KRAB domain (or Kruppel-associated box) is present in about a third of zinc ...
32-69 1.68e-07

KRAB box; The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.


Pssm-ID: 460171  Cd Length: 42  Bit Score: 48.24  E-value: 1.68e-07
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1907123471  32 FKDISIYFSKEEWAEMGEWEKIRYRNV-KRNYKMLISIG 69
Cdd:pfam01352   4 FEDVAVDFTQEEWALLDPAQRNLYRDVmLENYRNLVSLG 42
zf-H2C2_2 pfam13465
Zinc-finger double domain;
557-582 1.61e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 39.28  E-value: 1.61e-04
                          10        20
                  ....*....|....*....|....*.
gi 1907123471 557 HLIQHQRTHTGEKPYVCRECGRGFTQ 582
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
304-364 1.73e-04

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 41.74  E-value: 1.73e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907123471 304 GRNCYEYVDGQDESQANWMRYVNcaRDDEEQNLVAFQY---HRKIFYRTCRVIRPGCELLVWYG 364
Cdd:pfam00856  54 DEDSEYCIDARALYYGNWARFIN--HSCDPNCEVRVVYvngGPRIVIFALRDIKPGEELTIDYG 115
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
795-817 1.99e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 39.21  E-value: 1.99e-04
                          10        20
                  ....*....|....*....|...
gi 1907123471 795 YVCRECGRGFTQKSNLIKHQRTH 817
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-H2C2_2 pfam13465
Zinc-finger double domain;
725-749 2.52e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.89  E-value: 2.52e-04
                          10        20
                  ....*....|....*....|....*
gi 1907123471 725 NLIQHQRTHTGEKPYVCRECGRGFT 749
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFK 25
zf-H2C2_2 pfam13465
Zinc-finger double domain;
782-806 3.70e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.51  E-value: 3.70e-04
                          10        20
                  ....*....|....*....|....*
gi 1907123471 782 LIQHQRTHTGEKPYVCRECGRGFTQ 806
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
613-638 3.92e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.12  E-value: 3.92e-04
                          10        20
                  ....*....|....*....|....*.
gi 1907123471 613 DLIKHQRTHTGEKPYVCRECGRGFTQ 638
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
585-610 3.92e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.12  E-value: 3.92e-04
                          10        20
                  ....*....|....*....|....*.
gi 1907123471 585 DLIKHQRTHTGEKPYVCRECGRGFTQ 610
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
599-621 4.19e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 38.05  E-value: 4.19e-04
                          10        20
                  ....*....|....*....|...
gi 1907123471 599 YVCRECGRGFTQKSDLIKHQRTH 621
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
571-593 4.19e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 38.05  E-value: 4.19e-04
                          10        20
                  ....*....|....*....|...
gi 1907123471 571 YVCRECGRGFTQKSDLIKHQRTH 593
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-H2C2_2 pfam13465
Zinc-finger double domain;
642-666 5.01e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.12  E-value: 5.01e-04
                          10        20
                  ....*....|....*....|....*
gi 1907123471 642 LIKHQRTHTGEKPYVCRECGRGFTQ 666
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
529-554 5.86e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 37.74  E-value: 5.86e-04
                          10        20
                  ....*....|....*....|....*.
gi 1907123471 529 NVNEHQKTHTGEKPYVCRECGRGFTQ 554
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
543-565 6.57e-04

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 37.66  E-value: 6.57e-04
                          10        20
                  ....*....|....*....|...
gi 1907123471 543 YVCRECGRGFTQNSHLIQHQRTH 565
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-H2C2_2 pfam13465
Zinc-finger double domain;
698-721 8.11e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 37.35  E-value: 8.11e-04
                          10        20
                  ....*....|....*....|....
gi 1907123471 698 LIQHQRTHTGEKPYVCRECGRGFT 721
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSFK 25
zf-H2C2_2 pfam13465
Zinc-finger double domain;
754-777 8.11e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 37.35  E-value: 8.11e-04
                          10        20
                  ....*....|....*....|....
gi 1907123471 754 LIQHQRTHTGEKPYVCRECGRGFT 777
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSFK 25
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
765-813 1.01e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 38.69  E-value: 1.01e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1907123471 765 KPYvCRECGRGFTAKSVLIQHQRTHTgekpYVCRECGRGFTQKSNLIKH 813
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
zf-H2C2_2 pfam13465
Zinc-finger double domain;
670-693 1.10e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 36.97  E-value: 1.10e-03
                          10        20
                  ....*....|....*....|....
gi 1907123471 670 LIKHQRTHTGEKPYVCRECGRGFT 693
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSFK 25
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
823-845 1.65e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.51  E-value: 1.65e-03
                          10        20
                  ....*....|....*....|...
gi 1907123471 823 YVCRECGWGFTQKSDLIQHQRTH 845
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
PR-SET_ZFPM1 cd19215
PR-SET domain found in zinc finger protein ZFPM1 and similar proteins; ZFPM1 (also termed ...
272-361 1.73e-03

PR-SET domain found in zinc finger protein ZFPM1 and similar proteins; ZFPM1 (also termed friend of GATA protein 1, FOG-1, friend of GATA 1, zinc finger protein 89A, or zinc finger protein multitype 1) functions as a transcription regulator that plays an essential role in erythroid and megakaryocytic cell differentiation.


Pssm-ID: 380992  Cd Length: 110  Bit Score: 38.99  E-value: 1.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 272 DLPVGLHFGPYEGQIT----EDEEAANSGYSWLITKGRNCyeyvdgqdesqanWMRYVNCARDDEEQNLVAFQYHRKIFY 347
Cdd:cd19215    23 SLSEGLSWGPYHGSIQssasSPGQAEESPAVTLLLVDEDC-------------WLRRLPLVSTEAEANCTIYRKGDAIWC 89
                          90
                  ....*....|....
gi 1907123471 348 RTCRVIRPGCELLV 361
Cdd:cd19215    90 KTTKPVPEGELLSA 103
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
681-729 1.76e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 37.92  E-value: 1.76e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1907123471 681 KPYvCRECGRGFTAKSVLIQHQRTHTgekpYVCRECGRGFTAKSNLIQH 729
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
zf-H2C2_2 pfam13465
Zinc-finger double domain;
809-834 2.21e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 36.20  E-value: 2.21e-03
                          10        20
                  ....*....|....*....|....*.
gi 1907123471 809 NLIKHQRTHTGEKPYVCRECGWGFTQ 834
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
711-733 3.10e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.74  E-value: 3.10e-03
                          10        20
                  ....*....|....*....|...
gi 1907123471 711 YVCRECGRGFTAKSNLIQHQRTH 733
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
627-649 3.38e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.74  E-value: 3.38e-03
                          10        20
                  ....*....|....*....|...
gi 1907123471 627 YVCRECGRGFTQKSVLIKHQRTH 649
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
655-677 3.38e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.74  E-value: 3.38e-03
                          10        20
                  ....*....|....*....|...
gi 1907123471 655 YVCRECGRGFTQKSVLIKHQRTH 677
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
737-785 6.31e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 36.38  E-value: 6.31e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1907123471 737 KPYvCRECGRGFTAKSVLIQHQRTHTgekpYVCRECGRGFTAKSVLIQH 785
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVH 44
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
637-710 6.49e-03

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 40.06  E-value: 6.49e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907123471 637 TQKSVLIKHQRTHTGE----KPYVCRECGRGFTQKSVLIKHQRTHTGEKPYVCRECGRG--FTAKSVLIQHQRTHTGEKP 710
Cdd:COG5048    12 NNSVLSSTPKSTLKSLsnapRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDksFSRPLELSRHLRTHHNNPS 91
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
782-846 8.69e-03

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 39.68  E-value: 8.69e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907123471 782 LIQHQRTHTGE----KPYVCRECGRGFTQKSNLIKHQRTHTGEKPYVCRECGWGFTQK--SDLIQHQRTHT 846
Cdd:COG5048    17 SSTPKSTLKSLsnapRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSrpLELSRHLRTHH 87
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH