NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2217320701|ref|XP_047294605|]
View 

zinc finger protein 841 isoform X1 [Homo sapiens]

Protein Classification

KRAB domain-containing zinc finger protein( domain architecture ID 12204268)

KRAB (Kruppel-associated box) domain-containing zinc finger protein (KRAB-ZFP) plays important roles in cell differentiation and organ development, and in regulating viral replication and transcription

CATH:  3.30.160.60
Gene Ontology:  GO:0003700|GO:0046872
PubMed:  22803940
SCOP:  4003583

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
KRAB smart00349
krueppel associated box;
8-69 7.95e-30

krueppel associated box;


:

Pssm-ID: 214630 [Multi-domain]  Cd Length: 61  Bit Score: 112.30  E-value: 7.95e-30
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217320701    8 LTFRDVAVEFSQEEWKCLDPVQKALYRDVMLENYRNLGFLaGLCLPDLNIISMLEQGKEPWT 69
Cdd:smart00349   1 VTFEDVAVYFTQEEWEQLDPAQKNLYRDVMLENYSNLVSL-GFQVPKPDLISQLEQGEEPWI 61
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
484-886 1.86e-12

FOG: Zn-finger [General function prediction only];


:

Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 70.49  E-value: 1.86e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 484 KPYKCNECGKVFSQHSHLAVHQRVHTGEKPYKCNECGKAFNWGSL--LTVHQRIHTGEKPYKCNVCGKVFNYGGYLSVHM 561
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPleLSRHLRTHHNNPSDLNSKSLPLSNSKASSSSLS 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 562 RCHTgekplHCNKCGMVFTYYSCLARHQRMHtgEKPYKCNVCGKVFIDSGNLSIHRRSHTGEKPFQCNECGKVFSYYSCL 641
Cdd:COG5048   112 SSSS-----NSNDNNLLSSHSLPPSSRDPQL--PDLLSISNLRNNPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNL 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 642 ARH---RKIHTGEKPYKCNDCGKAYTQRSSLTKHLVIHTgENPYHCNEFGEAFIQSSKLARYHRNPTGEKPHKCSECGRT 718
Cdd:COG5048   185 SLLissNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENS-SSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRS 263
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 719 -FSHKTSLVYHQRRHTGEM------PYKCIECGKVFNSTTTLARHRR--IHTGE--KPYKCNE--CGKVFRYRSGLARHW 785
Cdd:COG5048   264 sLPTASSQSSSPNESDSSSekgfslPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslCGKLFSRNDALKRHI 343
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 786 SIHTGEKPYKC--NECGKAFRVRSILLNHQ--MMHTGEKPYKCNE-----CGKAFIERSNLVYHQRNHTGEKPYKCM--E 854
Cdd:COG5048   344 LLHTSISPAKEklLNSSSKFSPLLNNEPPQslQQYKDLKNDKKSEtlsnsCIRNFKRDSNLSLHIITHLSFRPYNCKnpP 423
                         410       420       430
                  ....*....|....*....|....*....|..
gi 2217320701 855 CGKAFGRRSCLTKHQRIHSSEKPYKCNECGKS 886
Cdd:COG5048   424 CSKSFNRHYNLIPHKKIHTNHAPLLCSILKSF 455
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
260-636 1.79e-09

FOG: Zn-finger [General function prediction only];


:

Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 61.25  E-value: 1.79e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 260 KPYIGNECGKAFRVSSSLINHQMIHTTEKPYRCNESG--KAFHRGSLLTVHQIVHTRGKPYQCDVCGRIFRQNSDLVNHR 337
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGcdKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSKASSSSLS 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 338 RSHTGD-KPYICNECGKSFS--KSSHLAVHQRIHTGEKPYK-CNRCGKCFSQSSSLA-------------------THQT 394
Cdd:COG5048   112 SSSSNSnDNNLLSSHSLPPSsrDPQLPDLLSISNLRNNPLPgNNSSSVNTPQSNSLHpplpanslskdpssnlsllISSN 191
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 395 VHTGDKPYKCNECGKTFKRNSSLTAHHIIHAGKKPYTCDVCGKVFYQNSQLVRHQIIHTGETPYKCNECGKVFFQRSRLA 474
Cdd:COG5048   192 VSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTASSQ 271
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 475 GHRRIHTGE-------KPYKCNECGKVFSQHSHLAVHQR--VHTGE--KPYKCNE--CGKAFNWGSLLTVHQRIHTGEKP 541
Cdd:COG5048   272 SSSPNESDSssekgfsLPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslCGKLFSRNDALKRHILLHTSISP 351
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 542 YKC--NVCGKVF----NYGGYLSVHMRCHTGEKPLHC---NKCGMVFTYYSCLARHQRMHTGEKP--YKCNVCGKVFIDS 610
Cdd:COG5048   352 AKEklLNSSSKFspllNNEPPQSLQQYKDLKNDKKSEtlsNSCIRNFKRDSNLSLHIITHLSFRPynCKNPPCSKSFNRH 431
                         410       420
                  ....*....|....*....|....*.
gi 2217320701 611 GNLSIHRRSHTGEKPFQCNECGKVFS 636
Cdd:COG5048   432 YNLIPHKKIHTNHAPLLCSILKSFRR 457
 
Name Accession Description Interval E-value
KRAB smart00349
krueppel associated box;
8-69 7.95e-30

krueppel associated box;


Pssm-ID: 214630 [Multi-domain]  Cd Length: 61  Bit Score: 112.30  E-value: 7.95e-30
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217320701    8 LTFRDVAVEFSQEEWKCLDPVQKALYRDVMLENYRNLGFLaGLCLPDLNIISMLEQGKEPWT 69
Cdd:smart00349   1 VTFEDVAVYFTQEEWEQLDPAQKNLYRDVMLENYSNLVSL-GFQVPKPDLISQLEQGEEPWI 61
KRAB pfam01352
KRAB box; The KRAB domain (or Kruppel-associated box) is present in about a third of zinc ...
7-47 6.41e-23

KRAB box; The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.


Pssm-ID: 460171  Cd Length: 42  Bit Score: 92.15  E-value: 6.41e-23
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 2217320701   7 SLTFRDVAVEFSQEEWKCLDPVQKALYRDVMLENYRNLGFL 47
Cdd:pfam01352   1 SVTFEDVAVDFTQEEWALLDPAQRNLYRDVMLENYRNLVSL 41
KRAB_A-box cd07765
KRAB (Kruppel-associated box) domain -A box; The KRAB domain is a transcription repression ...
8-47 5.16e-20

KRAB (Kruppel-associated box) domain -A box; The KRAB domain is a transcription repression module, found in a subgroup of the zinc finger proteins (ZFPs) of the C2H2 family, KRAB-ZFPs. KRAB-ZFPs comprise the largest group of transcriptional regulators in mammals, and are only found in tetrapods. These proteins have been shown to play important roles in cell differentiation and organ development, and in regulating viral replication and transcription. A KRAB domain may consist of an A-box, or of an A-box plus either a B-box, a divergent B-box (b), or a C-box. Only the A-box is included in this model. The A-box is needed for repression, the B- and C- boxes are not. KRAB-ZFPs have one or two KRAB domains at their amino-terminal end, and multiple C2H2 zinc finger motifs at their C-termini. Some KRAB-ZFPs also contain a SCAN domain which mediates homo- and hetero-oligomerization. The KRAB domain is a protein-protein interaction module which represses transcription through recruiting corepressors. A key mechanism appears to be the following: KRAB-AFPs tethered to DNA recruit, via their KRAB domain, the repressor KAP1 (KRAB-associated protein-1, also known as transcription intermediary factor 1 beta , KRAB-A interacting protein , and tripartite motif protein 28). The KAP1/ KRAB-AFP complex in turn recruits the heterochromatin protein 1 (HP1) family, and other chromatin modulating proteins, leading to transcriptional repression through heterochromatin formation.


Pssm-ID: 143639  Cd Length: 40  Bit Score: 83.75  E-value: 5.16e-20
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 2217320701   8 LTFRDVAVEFSQEEWKCLDPVQKALYRDVMLENYRNLGFL 47
Cdd:cd07765     1 VTFEDVAVYFSQEEWELLDPAQRDLYRDVMLENYENLVSL 40
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
484-886 1.86e-12

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 70.49  E-value: 1.86e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 484 KPYKCNECGKVFSQHSHLAVHQRVHTGEKPYKCNECGKAFNWGSL--LTVHQRIHTGEKPYKCNVCGKVFNYGGYLSVHM 561
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPleLSRHLRTHHNNPSDLNSKSLPLSNSKASSSSLS 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 562 RCHTgekplHCNKCGMVFTYYSCLARHQRMHtgEKPYKCNVCGKVFIDSGNLSIHRRSHTGEKPFQCNECGKVFSYYSCL 641
Cdd:COG5048   112 SSSS-----NSNDNNLLSSHSLPPSSRDPQL--PDLLSISNLRNNPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNL 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 642 ARH---RKIHTGEKPYKCNDCGKAYTQRSSLTKHLVIHTgENPYHCNEFGEAFIQSSKLARYHRNPTGEKPHKCSECGRT 718
Cdd:COG5048   185 SLLissNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENS-SSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRS 263
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 719 -FSHKTSLVYHQRRHTGEM------PYKCIECGKVFNSTTTLARHRR--IHTGE--KPYKCNE--CGKVFRYRSGLARHW 785
Cdd:COG5048   264 sLPTASSQSSSPNESDSSSekgfslPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslCGKLFSRNDALKRHI 343
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 786 SIHTGEKPYKC--NECGKAFRVRSILLNHQ--MMHTGEKPYKCNE-----CGKAFIERSNLVYHQRNHTGEKPYKCM--E 854
Cdd:COG5048   344 LLHTSISPAKEklLNSSSKFSPLLNNEPPQslQQYKDLKNDKKSEtlsnsCIRNFKRDSNLSLHIITHLSFRPYNCKnpP 423
                         410       420       430
                  ....*....|....*....|....*....|..
gi 2217320701 855 CGKAFGRRSCLTKHQRIHSSEKPYKCNECGKS 886
Cdd:COG5048   424 CSKSFNRHYNLIPHKKIHTNHAPLLCSILKSF 455
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
260-636 1.79e-09

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 61.25  E-value: 1.79e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 260 KPYIGNECGKAFRVSSSLINHQMIHTTEKPYRCNESG--KAFHRGSLLTVHQIVHTRGKPYQCDVCGRIFRQNSDLVNHR 337
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGcdKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSKASSSSLS 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 338 RSHTGD-KPYICNECGKSFS--KSSHLAVHQRIHTGEKPYK-CNRCGKCFSQSSSLA-------------------THQT 394
Cdd:COG5048   112 SSSSNSnDNNLLSSHSLPPSsrDPQLPDLLSISNLRNNPLPgNNSSSVNTPQSNSLHpplpanslskdpssnlsllISSN 191
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 395 VHTGDKPYKCNECGKTFKRNSSLTAHHIIHAGKKPYTCDVCGKVFYQNSQLVRHQIIHTGETPYKCNECGKVFFQRSRLA 474
Cdd:COG5048   192 VSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTASSQ 271
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 475 GHRRIHTGE-------KPYKCNECGKVFSQHSHLAVHQR--VHTGE--KPYKCNE--CGKAFNWGSLLTVHQRIHTGEKP 541
Cdd:COG5048   272 SSSPNESDSssekgfsLPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslCGKLFSRNDALKRHILLHTSISP 351
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 542 YKC--NVCGKVF----NYGGYLSVHMRCHTGEKPLHC---NKCGMVFTYYSCLARHQRMHTGEKP--YKCNVCGKVFIDS 610
Cdd:COG5048   352 AKEklLNSSSKFspllNNEPPQSLQQYKDLKNDKKSEtlsNSCIRNFKRDSNLSLHIITHLSFRPynCKNPPCSKSFNRH 431
                         410       420
                  ....*....|....*....|....*.
gi 2217320701 611 GNLSIHRRSHTGEKPFQCNECGKVFS 636
Cdd:COG5048   432 YNLIPHKKIHTNHAPLLCSILKSFRR 457
zf-H2C2_2 pfam13465
Zinc-finger double domain;
753-777 2.28e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.89  E-value: 2.28e-04
                          10        20
                  ....*....|....*....|....*
gi 2217320701 753 LARHRRIHTGEKPYKCNECGKVFRY 777
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
360-385 1.02e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 37.35  E-value: 1.02e-03
                          10        20
                  ....*....|....*....|....*.
gi 2217320701 360 HLAVHQRIHTGEKPYKCNRCGKCFSQ 385
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
344-396 1.27e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 38.69  E-value: 1.27e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 2217320701 344 KPYiCNECGKSFSKSSHLAVHQRIHTgekpYKCNRCGKCFSQSSSLATH-QTVH 396
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHcLQVH 49
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
512-561 4.44e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 37.15  E-value: 4.44e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 2217320701 512 KPYkCNECGKAFNWGSLLTVHQRIHTgekpYKCNVCGKVFNYGGYLSVHM 561
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
 
Name Accession Description Interval E-value
KRAB smart00349
krueppel associated box;
8-69 7.95e-30

krueppel associated box;


Pssm-ID: 214630 [Multi-domain]  Cd Length: 61  Bit Score: 112.30  E-value: 7.95e-30
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217320701    8 LTFRDVAVEFSQEEWKCLDPVQKALYRDVMLENYRNLGFLaGLCLPDLNIISMLEQGKEPWT 69
Cdd:smart00349   1 VTFEDVAVYFTQEEWEQLDPAQKNLYRDVMLENYSNLVSL-GFQVPKPDLISQLEQGEEPWI 61
KRAB pfam01352
KRAB box; The KRAB domain (or Kruppel-associated box) is present in about a third of zinc ...
7-47 6.41e-23

KRAB box; The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.


Pssm-ID: 460171  Cd Length: 42  Bit Score: 92.15  E-value: 6.41e-23
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 2217320701   7 SLTFRDVAVEFSQEEWKCLDPVQKALYRDVMLENYRNLGFL 47
Cdd:pfam01352   1 SVTFEDVAVDFTQEEWALLDPAQRNLYRDVMLENYRNLVSL 41
KRAB_A-box cd07765
KRAB (Kruppel-associated box) domain -A box; The KRAB domain is a transcription repression ...
8-47 5.16e-20

KRAB (Kruppel-associated box) domain -A box; The KRAB domain is a transcription repression module, found in a subgroup of the zinc finger proteins (ZFPs) of the C2H2 family, KRAB-ZFPs. KRAB-ZFPs comprise the largest group of transcriptional regulators in mammals, and are only found in tetrapods. These proteins have been shown to play important roles in cell differentiation and organ development, and in regulating viral replication and transcription. A KRAB domain may consist of an A-box, or of an A-box plus either a B-box, a divergent B-box (b), or a C-box. Only the A-box is included in this model. The A-box is needed for repression, the B- and C- boxes are not. KRAB-ZFPs have one or two KRAB domains at their amino-terminal end, and multiple C2H2 zinc finger motifs at their C-termini. Some KRAB-ZFPs also contain a SCAN domain which mediates homo- and hetero-oligomerization. The KRAB domain is a protein-protein interaction module which represses transcription through recruiting corepressors. A key mechanism appears to be the following: KRAB-AFPs tethered to DNA recruit, via their KRAB domain, the repressor KAP1 (KRAB-associated protein-1, also known as transcription intermediary factor 1 beta , KRAB-A interacting protein , and tripartite motif protein 28). The KAP1/ KRAB-AFP complex in turn recruits the heterochromatin protein 1 (HP1) family, and other chromatin modulating proteins, leading to transcriptional repression through heterochromatin formation.


Pssm-ID: 143639  Cd Length: 40  Bit Score: 83.75  E-value: 5.16e-20
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 2217320701   8 LTFRDVAVEFSQEEWKCLDPVQKALYRDVMLENYRNLGFL 47
Cdd:cd07765     1 VTFEDVAVYFSQEEWELLDPAQRDLYRDVMLENYENLVSL 40
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
484-886 1.86e-12

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 70.49  E-value: 1.86e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 484 KPYKCNECGKVFSQHSHLAVHQRVHTGEKPYKCNECGKAFNWGSL--LTVHQRIHTGEKPYKCNVCGKVFNYGGYLSVHM 561
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPleLSRHLRTHHNNPSDLNSKSLPLSNSKASSSSLS 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 562 RCHTgekplHCNKCGMVFTYYSCLARHQRMHtgEKPYKCNVCGKVFIDSGNLSIHRRSHTGEKPFQCNECGKVFSYYSCL 641
Cdd:COG5048   112 SSSS-----NSNDNNLLSSHSLPPSSRDPQL--PDLLSISNLRNNPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNL 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 642 ARH---RKIHTGEKPYKCNDCGKAYTQRSSLTKHLVIHTgENPYHCNEFGEAFIQSSKLARYHRNPTGEKPHKCSECGRT 718
Cdd:COG5048   185 SLLissNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENS-SSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRS 263
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 719 -FSHKTSLVYHQRRHTGEM------PYKCIECGKVFNSTTTLARHRR--IHTGE--KPYKCNE--CGKVFRYRSGLARHW 785
Cdd:COG5048   264 sLPTASSQSSSPNESDSSSekgfslPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslCGKLFSRNDALKRHI 343
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 786 SIHTGEKPYKC--NECGKAFRVRSILLNHQ--MMHTGEKPYKCNE-----CGKAFIERSNLVYHQRNHTGEKPYKCM--E 854
Cdd:COG5048   344 LLHTSISPAKEklLNSSSKFSPLLNNEPPQslQQYKDLKNDKKSEtlsnsCIRNFKRDSNLSLHIITHLSFRPYNCKnpP 423
                         410       420       430
                  ....*....|....*....|....*....|..
gi 2217320701 855 CGKAFGRRSCLTKHQRIHSSEKPYKCNECGKS 886
Cdd:COG5048   424 CSKSFNRHYNLIPHKKIHTNHAPLLCSILKSF 455
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
343-756 3.20e-12

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 69.72  E-value: 3.20e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 343 DKPYICNECGKSFSKSSHLAVHQRIHTGEKPYKCNR--CGKCFSQSSSLATHQTVHTGDKPYKCNecgKTFKRNSSLTAH 420
Cdd:COG5048    31 PRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYsgCDKSFSRPLELSRHLRTHHNNPSDLNS---KSLPLSNSKASS 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 421 HIIHAGKKpYTCDVCGKVFYQNSQLVRHQIIHTGETPYK--------CNECGKVFFQRSRL-AGHRRIHTGEKPYKcnec 491
Cdd:COG5048   108 SSLSSSSS-NSNDNNLLSSHSLPPSSRDPQLPDLLSISNlrnnplpgNNSSSVNTPQSNSLhPPLPANSLSKDPSS---- 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 492 gkvfsqHSHLAVHQRVHTGEKPYKCNECGKAFNWGSLLTVHQRIHTGEKPYKCNVCGKVF------NYGGYLSVHMRCHT 565
Cdd:COG5048   183 ------NLSLLISSNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSpksllsQSPSSLSSSDSSSS 256
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 566 GEKPLHCNKCGMVFTYYSCLARHQRMHTG-EKPYKCNVCGKVFIDSGNLSIHRRS--HTGE--KPFQCNE--CGKVFSYY 638
Cdd:COG5048   257 ASESPRSSLPTASSQSSSPNESDSSSEKGfSLPIKSKQCNISFSRSSPLTRHLRSvnHSGEslKPFSCPYslCGKLFSRN 336
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 639 SCLARHRKIHTGEKPYKC--NDCGKAYTQRS--SLTKHLVIHTGENPYHCNE----FGEAFIQSSKLARYHRNPTGEKPH 710
Cdd:COG5048   337 DALKRHILLHTSISPAKEklLNSSSKFSPLLnnEPPQSLQQYKDLKNDKKSEtlsnSCIRNFKRDSNLSLHIITHLSFRP 416
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*....
gi 2217320701 711 ---KCSECGRTFSHKTSLVYHQRRHTGEMPYkCIECGKVFNSTTTLARH 756
Cdd:COG5048   417 yncKNPPCSKSFNRHYNLIPHKKIHTNHAPL-LCSILKSFRRDLDLSNH 464
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
568-895 1.99e-10

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 64.33  E-value: 1.99e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 568 KPLHCNKCGMVFTYYSCLARHQRMHTGEKPYKCNV--CGKVFIDSGNLSIHRRSHTGEKPFQCNECGKVFSYYSC-LARH 644
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYsgCDKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSKASsSSLS 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 645 RKIHTGEKPYKCNDCGKAYTQRSSLTKHLVIHT--GENPYH-CNEFGEAFIQSSKL-ARYHRNPTGEKPHKCSecgrtfs 720
Cdd:COG5048   112 SSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSISnlRNNPLPgNNSSSVNTPQSNSLhPPLPANSLSKDPSSNL------- 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 721 hktSLVYHQRRHTGEMPYKCIECGKVFNSTTTLARHRRIHTGEKPYKCNECGKVFRYRSGLARHWSIHTGEKPYKCNECG 800
Cdd:COG5048   185 ---SLLISSNVSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESP 261
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 801 KAFR--VRSILLNHQMMHTGE-----KPYKCNECGKAFIERSNLVYHQR--NHTGE--KPYKCME--CGKAFGRRSCLTK 867
Cdd:COG5048   262 RSSLptASSQSSSPNESDSSSekgfsLPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslCGKLFSRNDALKR 341
                         330       340
                  ....*....|....*....|....*...
gi 2217320701 868 HQRIHSSEKPYKCNECGKSYISRSGLTK 895
Cdd:COG5048   342 HILLHTSISPAKEKLLNSSSKFSPLLNN 369
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
260-636 1.79e-09

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 61.25  E-value: 1.79e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 260 KPYIGNECGKAFRVSSSLINHQMIHTTEKPYRCNESG--KAFHRGSLLTVHQIVHTRGKPYQCDVCGRIFRQNSDLVNHR 337
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGcdKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSKASSSSLS 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 338 RSHTGD-KPYICNECGKSFS--KSSHLAVHQRIHTGEKPYK-CNRCGKCFSQSSSLA-------------------THQT 394
Cdd:COG5048   112 SSSSNSnDNNLLSSHSLPPSsrDPQLPDLLSISNLRNNPLPgNNSSSVNTPQSNSLHpplpanslskdpssnlsllISSN 191
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 395 VHTGDKPYKCNECGKTFKRNSSLTAHHIIHAGKKPYTCDVCGKVFYQNSQLVRHQIIHTGETPYKCNECGKVFFQRSRLA 474
Cdd:COG5048   192 VSTSIPSSSENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSSSDSSSSASESPRSSLPTASSQ 271
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 475 GHRRIHTGE-------KPYKCNECGKVFSQHSHLAVHQR--VHTGE--KPYKCNE--CGKAFNWGSLLTVHQRIHTGEKP 541
Cdd:COG5048   272 SSSPNESDSssekgfsLPIKSKQCNISFSRSSPLTRHLRsvNHSGEslKPFSCPYslCGKLFSRNDALKRHILLHTSISP 351
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 542 YKC--NVCGKVF----NYGGYLSVHMRCHTGEKPLHC---NKCGMVFTYYSCLARHQRMHTGEKP--YKCNVCGKVFIDS 610
Cdd:COG5048   352 AKEklLNSSSKFspllNNEPPQSLQQYKDLKNDKKSEtlsNSCIRNFKRDSNLSLHIITHLSFRPynCKNPPCSKSFNRH 431
                         410       420
                  ....*....|....*....|....*.
gi 2217320701 611 GNLSIHRRSHTGEKPFQCNECGKVFS 636
Cdd:COG5048   432 YNLIPHKKIHTNHAPLLCSILKSFRR 457
SFP1 COG5189
Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division ...
818-900 1.44e-04

Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning];


Pssm-ID: 227516 [Multi-domain]  Cd Length: 423  Bit Score: 45.48  E-value: 1.44e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 818 GEKPYKCN--ECGKAFIERSNLVYHQRNHtgekpykcmECGKAFGRRSCLTKHQRIHSSEKPYKCNECGKSYISRSGLtK 895
Cdd:COG5189   346 DGKPYKCPveGCNKKYKNQNGLKYHMLHG---------HQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGL-K 415

                  ....*
gi 2217320701 896 HQIKH 900
Cdd:COG5189   416 YHRKH 420
zf-H2C2_2 pfam13465
Zinc-finger double domain;
753-777 2.28e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.89  E-value: 2.28e-04
                          10        20
                  ....*....|....*....|....*
gi 2217320701 753 LARHRRIHTGEKPYKCNECGKVFRY 777
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
836-859 2.61e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.89  E-value: 2.61e-04
                          10        20
                  ....*....|....*....|....
gi 2217320701 836 NLVYHQRNHTGEKPYKCMECGKAF 859
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSF 24
zf-H2C2_2 pfam13465
Zinc-finger double domain;
585-607 3.44e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.51  E-value: 3.44e-04
                          10        20
                  ....*....|....*....|...
gi 2217320701 585 LARHQRMHTGEKPYKCNVCGKVF 607
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSF 24
zf-H2C2_2 pfam13465
Zinc-finger double domain;
500-524 4.27e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.12  E-value: 4.27e-04
                          10        20
                  ....*....|....*....|....*
gi 2217320701 500 HLAVHQRVHTGEKPYKCNECGKAFN 524
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFK 25
zf-H2C2_2 pfam13465
Zinc-finger double domain;
641-665 7.11e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 37.74  E-value: 7.11e-04
                          10        20
                  ....*....|....*....|....*
gi 2217320701 641 LARHRKIHTGEKPYKCNDCGKAYTQ 665
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
529-553 8.07e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 37.35  E-value: 8.07e-04
                          10        20
                  ....*....|....*....|....*
gi 2217320701 529 LTVHQRIHTGEKPYKCNVCGKVFNY 553
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSFKS 26
SFP1 COG5189
Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division ...
370-448 8.79e-04

Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning];


Pssm-ID: 227516 [Multi-domain]  Cd Length: 423  Bit Score: 42.78  E-value: 8.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 370 GEKPYKCN--RCGKCFSQSSSLATHQtvhtgdkpyKCNECGKTFKRNSSLTAHHIIHAGKKPYTCDVCGKVFYQNSQLVR 447
Cdd:COG5189   346 DGKPYKCPveGCNKKYKNQNGLKYHM---------LHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKY 416

                  .
gi 2217320701 448 H 448
Cdd:COG5189   417 H 417
zf-H2C2_2 pfam13465
Zinc-finger double domain;
360-385 1.02e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 37.35  E-value: 1.02e-03
                          10        20
                  ....*....|....*....|....*.
gi 2217320701 360 HLAVHQRIHTGEKPYKCNRCGKCFSQ 385
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
612-637 1.12e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 36.97  E-value: 1.12e-03
                          10        20
                  ....*....|....*....|....*.
gi 2217320701 612 NLSIHRRSHTGEKPFQCNECGKVFSY 637
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
344-396 1.27e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 38.69  E-value: 1.27e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 2217320701 344 KPYiCNECGKSFSKSSHLAVHQRIHTgekpYKCNRCGKCFSQSSSLATH-QTVH 396
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHcLQVH 49
SFP1 COG5189
Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division ...
538-621 1.36e-03

Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning];


Pssm-ID: 227516 [Multi-domain]  Cd Length: 423  Bit Score: 42.01  E-value: 1.36e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 538 GEKPYKCNV--CGKVFNYGGYLSVHMrchtgeKPLHCNKcgmVFTYYSCLARHQRMHTGEKPYKCNVCGKVFIDSGNLSI 615
Cdd:COG5189   346 DGKPYKCPVegCNKKYKNQNGLKYHM------LHGHQNQ---KLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKY 416

                  ....*.
gi 2217320701 616 HRRSHT 621
Cdd:COG5189   417 HRKHSH 422
zf-H2C2_2 pfam13465
Zinc-finger double domain;
724-749 1.51e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 36.58  E-value: 1.51e-03
                          10        20
                  ....*....|....*....|....*.
gi 2217320701 724 SLVYHQRRHTGEMPYKCIECGKVFNS 749
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
476-497 1.82e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 36.58  E-value: 1.82e-03
                          10        20
                  ....*....|....*....|..
gi 2217320701 476 HRRIHTGEKPYKCNECGKVFSQ 497
Cdd:pfam13465   5 HMRTHTGEKPYKCPECGKSFKS 26
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
486-508 1.98e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.51  E-value: 1.98e-03
                          10        20
                  ....*....|....*....|...
gi 2217320701 486 YKCNECGKVFSQHSHLAVHQRVH 508
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
SFP1 COG5189
Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division ...
342-424 2.19e-03

Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning];


Pssm-ID: 227516 [Multi-domain]  Cd Length: 423  Bit Score: 41.63  E-value: 2.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320701 342 GDKPYICN--ECGKSFSKSSHLAVHqRIHtgekpykcNRCGKCFSQSSSLATHQTVHTGDKPYKCNECGKTFKRNSSLTa 419
Cdd:COG5189   346 DGKPYKCPveGCNKKYKNQNGLKYH-MLH--------GHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLK- 415

                  ....*
gi 2217320701 420 HHIIH 424
Cdd:COG5189   416 YHRKH 420
zf-H2C2_2 pfam13465
Zinc-finger double domain;
388-413 2.92e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 35.81  E-value: 2.92e-03
                          10        20
                  ....*....|....*....|....*.
gi 2217320701 388 SLATHQTVHTGDKPYKCNECGKTFKR 413
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
809-831 3.59e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 35.81  E-value: 3.59e-03
                          10        20
                  ....*....|....*....|...
gi 2217320701 809 LLNHQMMHTGEKPYKCNECGKAF 831
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSF 24
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
402-424 4.31e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.35  E-value: 4.31e-03
                          10        20
                  ....*....|....*....|...
gi 2217320701 402 YKCNECGKTFKRNSSLTAHHIIH 424
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
512-561 4.44e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K9) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 37.15  E-value: 4.44e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 2217320701 512 KPYkCNECGKAFNWGSLLTVHQRIHTgekpYKCNVCGKVFNYGGYLSVHM 561
Cdd:cd20908     1 KPW-CYYCDREFDDEKILIQHQKAKH----FKCHICHKKLYTAGGLAVHC 45
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
346-368 4.61e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.35  E-value: 4.61e-03
                          10        20
                  ....*....|....*....|...
gi 2217320701 346 YICNECGKSFSKSSHLAVHQRIH 368
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
zf-H2C2_2 pfam13465
Zinc-finger double domain;
445-469 6.23e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 35.04  E-value: 6.23e-03
                          10        20
                  ....*....|....*....|....*
gi 2217320701 445 LVRHQIIHTGETPYKCNECGKVFFQ 469
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
865-889 7.08e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 35.04  E-value: 7.08e-03
                          10        20
                  ....*....|....*....|....*
gi 2217320701 865 LTKHQRIHSSEKPYKCNECGKSYIS 889
Cdd:pfam13465   2 LKRHMRTHTGEKPYKCPECGKSFKS 26
zf-H2C2_2 pfam13465
Zinc-finger double domain;
556-581 7.08e-03

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 35.04  E-value: 7.08e-03
                          10        20
                  ....*....|....*....|....*.
gi 2217320701 556 YLSVHMRCHTGEKPLHCNKCGMVFTY 581
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFKS 26
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH