NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|397522430|ref|XP_003831270|]
View 

histone-lysine N-methyltransferase SETMAR isoform X1 [Pan paniscus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SET_SETMAR cd10544
SET domain (including pre-SET and post-SET domains) found in SET domain and mariner ...
51-303 6.81e-160

SET domain (including pre-SET and post-SET domains) found in SET domain and mariner transposase fusion protein (SETMAR) and similar proteins; SETMAR (also termed metnase) is a DNA-binding protein that is indirectly recruited to sites of DNA damage through protein-protein interactions. It has a sequence-specific DNA-binding activity recognizing the 19-mer core of the 5'-terminal inverted repeats (TIRs) of the Hsmar1 element and displays a DNA nicking and end joining activity. SETMAR also acts as a histone-lysine N-methyltransferase that methylates 'Lys-4' and 'Lys-36' of histone H3. It specifically mediates dimethylation of H3 'Lys-36' at sites of DNA double-strand break and may recruit proteins required for efficient DSB repair through non-homologous end-joining.


:

Pssm-ID: 380942 [Multi-domain]  Cd Length: 254  Bit Score: 460.23  E-value: 6.81e-160
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  51 FQYTPDHVVGPGADIDPTQITFPGCICVKTPCLPGTCSCLRH-GENYDDNSCLRDIGSggKYAEPVFECNVLCRCSDHCR 129
Cdd:cd10544    3 FQYTPENVPGPGADTDPNEITFPGCDCKTSSCEPETCSCLRKyGPNYDDDGCLLDFDG--KYSGPVFECNSMCKCSESCQ 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 130 NRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAIREHVYNGQVMETF 209
Cdd:cd10544   81 NRVVQNGLQFKLQVFKTPKKGWGLRTLEFIPKGRFVCEYAGEVIGFEEARRRTKSQTKGDMNYIIVLREHLSSGKVLETF 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 210 VDPTYIGNIGRFLNHSCEPNLLMIPVRIDSMVPKLALFAAKDIVPEEELSYDYSGRYLNLTVSEDKERLDHGKLRKPCYC 289
Cdd:cd10544  161 VDPTYIGNIGRFLNHSCEPNLFMVPVRVDSMVPKLALFAARDIVAGEELSFDYSGEFSNSVESVTLARQDESKSRKPCLC 240
                        250
                 ....*....|....
gi 397522430 290 GAKSCTAFLPFDSS 303
Cdd:cd10544  241 GAENCRGFLPFDES 254
Transposase_1 pfam01359
Transposase (partial DDE domain); This family includes the mariner transposase.
501-580 7.81e-33

Transposase (partial DDE domain); This family includes the mariner transposase.


:

Pssm-ID: 460174  Cd Length: 80  Bit Score: 121.26  E-value: 7.81e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  501 LYDNRRRSAQWLDQEEAPKHFPKPILHPKKVMVTIWWSAAGLIHYSFLNPGETITSEKYAQEIDEMNQKLQRLQLALVNR 580
Cdd:pfam01359   1 LYENKRRSAQWVDAGEPPKKVPKPELHAKKVMLSVWWDAKGVLYVELLPPGKTINAEYYASQLRKLRQALKEKRPGKVRV 80
transpos_IS630 super family cl41314
IS630 family transposase;
383-655 5.64e-16

IS630 family transposase;


The actual alignment was detected with superfamily member NF033545:

Pssm-ID: 468076 [Multi-domain]  Cd Length: 298  Bit Score: 79.22  E-value: 5.64e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 383 ERTVQWWFKKFCK-GDESLEDEERSGRPSEVDNDQ----LRAIIEADPLT----TTREVAEELN------VNHSTVVRHL 447
Cdd:NF033545  26 RSTVYRWLKRFNEgGLEGLLDKPRPGRPRKLLSEQqaelLALLLEEPPEGaghwTLRELAALLEeefgveYSRSTVRRLL 105
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 448 KQIG-KVKKLDKWvPHELTENQKNRRFEVssslILRNHNEPFLDRIVTCDEKWI-LYDNRRRSaqWldqeeAPKHFPKPI 525
Cdd:NF033545 106 KRLGlSPKKPRPR-APKQDPEFVEKFKEV----LGLYRAPPDPAEVVFIDESGIqLLDTRGRG--W-----APKGQRRPR 173
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 526 LHPKKVmvtiwWSAAGLihYSFLNP--GETITSEKYAQEIDEMNQKLQRLqLALVNRKGPILLHDNARPHVAQPTLQKLN 603
Cdd:NF033545 174 VHVYGR-----RGTLNL--FGALDPltGKVFVLFTGRINSEDFIEFLEEL-LAAYPGKKIHLILDNASTHKSKKVREWLE 245
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|....
gi 397522430 604 ELGY-EVLPHPPYSPDLLPTNYhVFKHLN-NFLQGKRFHNQQDAENAFREFVES 655
Cdd:NF033545 246 EHGRiELHYLPPYSPWLNPIER-VWAVLKrRLLRNRAFRSVDELREAIDAFLNR 298
HTH_48 pfam17906
HTH domain in Mos1 transposase; The N-terminal domain of the Mos1 Mariner transposase ...
348-397 1.74e-14

HTH domain in Mos1 transposase; The N-terminal domain of the Mos1 Mariner transposase comprises two HTH domains. This HTH domain binds in the DNA major groove to the transposons inverted repeats.


:

Pssm-ID: 465558 [Multi-domain]  Cd Length: 50  Bit Score: 67.93  E-value: 1.74e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 397522430  348 KKQIRAIFLFEFKMGRKAAETTRNINNAFGPGTANERTVQWWFKKFCKGD 397
Cdd:pfam17906   1 KIHLRHCILYEFLQGKSAAEAYRNICEVYGEDAVSERTFQRWFARFRSGD 50
 
Name Accession Description Interval E-value
SET_SETMAR cd10544
SET domain (including pre-SET and post-SET domains) found in SET domain and mariner ...
51-303 6.81e-160

SET domain (including pre-SET and post-SET domains) found in SET domain and mariner transposase fusion protein (SETMAR) and similar proteins; SETMAR (also termed metnase) is a DNA-binding protein that is indirectly recruited to sites of DNA damage through protein-protein interactions. It has a sequence-specific DNA-binding activity recognizing the 19-mer core of the 5'-terminal inverted repeats (TIRs) of the Hsmar1 element and displays a DNA nicking and end joining activity. SETMAR also acts as a histone-lysine N-methyltransferase that methylates 'Lys-4' and 'Lys-36' of histone H3. It specifically mediates dimethylation of H3 'Lys-36' at sites of DNA double-strand break and may recruit proteins required for efficient DSB repair through non-homologous end-joining.


Pssm-ID: 380942 [Multi-domain]  Cd Length: 254  Bit Score: 460.23  E-value: 6.81e-160
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  51 FQYTPDHVVGPGADIDPTQITFPGCICVKTPCLPGTCSCLRH-GENYDDNSCLRDIGSggKYAEPVFECNVLCRCSDHCR 129
Cdd:cd10544    3 FQYTPENVPGPGADTDPNEITFPGCDCKTSSCEPETCSCLRKyGPNYDDDGCLLDFDG--KYSGPVFECNSMCKCSESCQ 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 130 NRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAIREHVYNGQVMETF 209
Cdd:cd10544   81 NRVVQNGLQFKLQVFKTPKKGWGLRTLEFIPKGRFVCEYAGEVIGFEEARRRTKSQTKGDMNYIIVLREHLSSGKVLETF 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 210 VDPTYIGNIGRFLNHSCEPNLLMIPVRIDSMVPKLALFAAKDIVPEEELSYDYSGRYLNLTVSEDKERLDHGKLRKPCYC 289
Cdd:cd10544  161 VDPTYIGNIGRFLNHSCEPNLFMVPVRVDSMVPKLALFAARDIVAGEELSFDYSGEFSNSVESVTLARQDESKSRKPCLC 240
                        250
                 ....*....|....
gi 397522430 290 GAKSCTAFLPFDSS 303
Cdd:cd10544  241 GAENCRGFLPFDES 254
Transposase_1 pfam01359
Transposase (partial DDE domain); This family includes the mariner transposase.
501-580 7.81e-33

Transposase (partial DDE domain); This family includes the mariner transposase.


Pssm-ID: 460174  Cd Length: 80  Bit Score: 121.26  E-value: 7.81e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  501 LYDNRRRSAQWLDQEEAPKHFPKPILHPKKVMVTIWWSAAGLIHYSFLNPGETITSEKYAQEIDEMNQKLQRLQLALVNR 580
Cdd:pfam01359   1 LYENKRRSAQWVDAGEPPKKVPKPELHAKKVMLSVWWDAKGVLYVELLPPGKTINAEYYASQLRKLRQALKEKRPGKVRV 80
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
139-268 1.06e-32

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 122.44  E-value: 1.06e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430   139 FHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHL--QTKSDSNYIIAIREHVYngqvmetfVDPTYIG 216
Cdd:smart00317   1 NKLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERPKAydTDGAKAFYLFDIDSDLC--------IDARRKG 72
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 397522430   217 NIGRFLNHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDYSGRYLN 268
Cdd:smart00317  73 NLARFINHSCEPNCELLFVEVNGD-DRIVIFALRDIKPGEELTIDYGSDYAN 123
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
150-262 1.00e-23

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 96.44  E-value: 1.00e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  150 GWGLRTLEFIPKGRFVCEYAGEVLgFSEVQRRIHLQTKSDSNYIIAIREHVYN-GQVMETFVDPT--YIGNIGRFLNHSC 226
Cdd:pfam00856   1 GRGLFATEDIPKGEFIGEYVEVLL-ITKEEADKRELLYYDKLELRLWGPYLFTlDEDSEYCIDARalYYGNWARFINHSC 79
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 397522430  227 EPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDY 262
Cdd:pfam00856  80 DPNCEVRVVYVNGG-PRIVIFALRDIKPGEELTIDY 114
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
139-295 6.92e-21

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 88.87  E-value: 6.92e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 139 FHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDsNYIIaireHVYNGQVmetfVDPTYIGNI 218
Cdd:COG2940    6 PRIEVRPSPIHGRGVFATRDIPKGTLIGEYPGEVITWAEAERREPHKEPLH-TYLF----ELDDDGV----IDGALGGNP 76
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 397522430 219 GRFLNHSCEPNLLMIPVRidsmvPKLALFAAKDIVPEEELSYDYSgrylnltvsedkerLDHGKLRKPCYCgaKSCT 295
Cdd:COG2940   77 ARFINHSCDPNCEADEED-----GRIFIVALRDIAAGEELTYDYG--------------LDYDEEEYPCRC--PNCR 132
transpos_IS630 NF033545
IS630 family transposase;
383-655 5.64e-16

IS630 family transposase;


Pssm-ID: 468076 [Multi-domain]  Cd Length: 298  Bit Score: 79.22  E-value: 5.64e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 383 ERTVQWWFKKFCK-GDESLEDEERSGRPSEVDNDQ----LRAIIEADPLT----TTREVAEELN------VNHSTVVRHL 447
Cdd:NF033545  26 RSTVYRWLKRFNEgGLEGLLDKPRPGRPRKLLSEQqaelLALLLEEPPEGaghwTLRELAALLEeefgveYSRSTVRRLL 105
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 448 KQIG-KVKKLDKWvPHELTENQKNRRFEVssslILRNHNEPFLDRIVTCDEKWI-LYDNRRRSaqWldqeeAPKHFPKPI 525
Cdd:NF033545 106 KRLGlSPKKPRPR-APKQDPEFVEKFKEV----LGLYRAPPDPAEVVFIDESGIqLLDTRGRG--W-----APKGQRRPR 173
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 526 LHPKKVmvtiwWSAAGLihYSFLNP--GETITSEKYAQEIDEMNQKLQRLqLALVNRKGPILLHDNARPHVAQPTLQKLN 603
Cdd:NF033545 174 VHVYGR-----RGTLNL--FGALDPltGKVFVLFTGRINSEDFIEFLEEL-LAAYPGKKIHLILDNASTHKSKKVREWLE 245
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|....
gi 397522430 604 ELGY-EVLPHPPYSPDLLPTNYhVFKHLN-NFLQGKRFHNQQDAENAFREFVES 655
Cdd:NF033545 246 EHGRiELHYLPPYSPWLNPIER-VWAVLKrRLLRNRAFRSVDELREAIDAFLNR 298
HTH_48 pfam17906
HTH domain in Mos1 transposase; The N-terminal domain of the Mos1 Mariner transposase ...
348-397 1.74e-14

HTH domain in Mos1 transposase; The N-terminal domain of the Mos1 Mariner transposase comprises two HTH domains. This HTH domain binds in the DNA major groove to the transposons inverted repeats.


Pssm-ID: 465558 [Multi-domain]  Cd Length: 50  Bit Score: 67.93  E-value: 1.74e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 397522430  348 KKQIRAIFLFEFKMGRKAAETTRNINNAFGPGTANERTVQWWFKKFCKGD 397
Cdd:pfam17906   1 KIHLRHCILYEFLQGKSAAEAYRNICEVYGEDAVSERTFQRWFARFRSGD 50
Csa3 COG3415
CRISPR-associated protein Csa3, CARF domain [Defense mechanisms]; CRISPR-associated protein ...
382-650 8.32e-08

CRISPR-associated protein Csa3, CARF domain [Defense mechanisms]; CRISPR-associated protein Csa3, CARF domain is part of the Pathway/BioSystem: CRISPR-Cas system


Pssm-ID: 442641 [Multi-domain]  Cd Length: 325  Bit Score: 54.47  E-value: 8.32e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 382 NERTVQWWFKKFCK-GDESLEDEERSGRPSEVDNDQLRAIIE--------ADPLTTTREVAEEL------NVNHSTVVRH 446
Cdd:COG3415   51 SRSTVYRWLKRYREgGLAGLKDRPRGGRPSKLSDEQRERLLEllrekspdQGSRWTLAELAELLeeefgvEVSPSTVRRL 130
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 447 LKQIGKVKKLDKWVPHEltenqKNRRFEVSSSLILRNHNEPFLDRIVTCDEKWILYDNRRRSAQWLDQEEAPKHFPKPIL 526
Cdd:COG3415  131 LKRLGLSYKKPRPRAPK-----QDPEAVEKFKKELEALLAKPAKAEVAGVDEGDEIGALRRAAGGRLLAPGGRVRRTGAS 205
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 527 HPKkvmvtiwwSAAGLIHYSFLNPGETITSEKYAQEIDEMNQKLQRLQLALVNRKGPILLHDNARPHVAQPTLQKLNELG 606
Cdd:COG3415  206 VRR--------GRALLFLALGLALGLRIGLTIDALLAGALLLFLRLLAERALALLLLLLLLLLLSPSKAVARAAKLLKLE 277
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*
gi 397522430 607 YEVLPHPPYSP-DLLPTNYHVFKHLNNFLQGKRFHNQQDAENAFR 650
Cdd:COG3415  278 LLRLLFLPPSAsLLLALEALLRELRERNLLRLAHASLLLLFAAAR 322
HTH_32 pfam13565
Homeodomain-like domain;
385-447 1.62e-07

Homeodomain-like domain;


Pssm-ID: 463923 [Multi-domain]  Cd Length: 73  Bit Score: 48.83  E-value: 1.62e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 397522430  385 TVQWWFKKFCK-GDESLEDEERSGRPSEVDNDQLRAIIEA---DPLTTTREVAEEL------NVNHSTVVRHL 447
Cdd:pfam13565   1 TVYRWRKRYNEeGLEGLEDRPRRGRPRKLTDEQEARLLALlceEPRWSPRLLAERLeeefgvKVSRSTVRRIL 73
 
Name Accession Description Interval E-value
SET_SETMAR cd10544
SET domain (including pre-SET and post-SET domains) found in SET domain and mariner ...
51-303 6.81e-160

SET domain (including pre-SET and post-SET domains) found in SET domain and mariner transposase fusion protein (SETMAR) and similar proteins; SETMAR (also termed metnase) is a DNA-binding protein that is indirectly recruited to sites of DNA damage through protein-protein interactions. It has a sequence-specific DNA-binding activity recognizing the 19-mer core of the 5'-terminal inverted repeats (TIRs) of the Hsmar1 element and displays a DNA nicking and end joining activity. SETMAR also acts as a histone-lysine N-methyltransferase that methylates 'Lys-4' and 'Lys-36' of histone H3. It specifically mediates dimethylation of H3 'Lys-36' at sites of DNA double-strand break and may recruit proteins required for efficient DSB repair through non-homologous end-joining.


Pssm-ID: 380942 [Multi-domain]  Cd Length: 254  Bit Score: 460.23  E-value: 6.81e-160
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  51 FQYTPDHVVGPGADIDPTQITFPGCICVKTPCLPGTCSCLRH-GENYDDNSCLRDIGSggKYAEPVFECNVLCRCSDHCR 129
Cdd:cd10544    3 FQYTPENVPGPGADTDPNEITFPGCDCKTSSCEPETCSCLRKyGPNYDDDGCLLDFDG--KYSGPVFECNSMCKCSESCQ 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 130 NRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAIREHVYNGQVMETF 209
Cdd:cd10544   81 NRVVQNGLQFKLQVFKTPKKGWGLRTLEFIPKGRFVCEYAGEVIGFEEARRRTKSQTKGDMNYIIVLREHLSSGKVLETF 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 210 VDPTYIGNIGRFLNHSCEPNLLMIPVRIDSMVPKLALFAAKDIVPEEELSYDYSGRYLNLTVSEDKERLDHGKLRKPCYC 289
Cdd:cd10544  161 VDPTYIGNIGRFLNHSCEPNLFMVPVRVDSMVPKLALFAARDIVAGEELSFDYSGEFSNSVESVTLARQDESKSRKPCLC 240
                        250
                 ....*....|....
gi 397522430 290 GAKSCTAFLPFDSS 303
Cdd:cd10544  241 GAENCRGFLPFDES 254
SET_SETDB-like cd10538
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) ...
51-263 9.42e-100

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) and 2 (SETDB2), suppressor of variegation 3-9 homologs, SUV39H1 and SUV39H2, euchromatic histone-lysine N-methyltransferase EHMT1 and EHMT2, and similar proteins; The family includes SET domain bifurcated 1 (SETDB1) and 2 (SETDB2), suppressor of variegation 3-9 homologs, SUV39H1 and SUV39H2, euchromatic histone-lysine N-methyltransferase EHMT1 and EHMT2. SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes. SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis. SUV39H1 (also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A, KMT1A, position-effect variegation 3-9 homolog, SUV39H, or Su(var)3-9 homolog 1) and SUV39H2 (also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B, KMT1B, or Su(var)3-9 homolog 2), both act as histone-lysine N-methyltransferases that specifically trimethylate 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. They mainly function in heterochromatin regions, thereby playing central roles in the establishment of constitutive heterochromatin at pericentric and telomere regions. EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, lysine N-methyltransferase 1D, or KMT1D) and EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C, KMT1C, or protein G9a), both act as histone-lysine N-methyltransferases that specifically mono- and dimethylate 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin. This family also includes the pre-SET domain, which is found in a number of histone methyltransferases (HMTase), N-terminal to the SET domain. Pre-SET domain is a zinc binding motif which contains 9 conserved cysteines that coordinate three zinc ions. It is thought that this region plays a structural role in stabilizing SET domains. Most family members, except for Arabidopsis thaliana SUVH9, contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380936 [Multi-domain]  Cd Length: 217  Bit Score: 304.68  E-value: 9.42e-100
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  51 FQYTPDHVVGPGADIDPTQITFPGCICvKTPCLPGTCSCLRH---GENYDDNSCLRDIgsggKYAEPVFECNVLCRCSDH 127
Cdd:cd10538    3 FTYIKDNIVGKNVQPFSNIIDSVGCKC-KDDCLDSKCACAAEsdgIFAYTKNGLLRLN----NSPPPIFECNSKCSCDDD 77
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 128 CRNRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAIREHVYN-GQVM 206
Cdd:cd10538   78 CKNRVVQRGLQARLQVFRTSKKGWGVRSLEFIPKGSFVCEYVGEVITTSEADRRGKIYDKSGGSYLFDLDEFSDSdGDGE 157
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 207 ETFVDPTYIGNIGRFLNHSCEPNLLMIPVRI---DSMVPKLALFAAKDIVPEEELSYDYS 263
Cdd:cd10538  158 ELCVDATFCGNVSRFINHSCDPNLFPFNVVIdhdDLRYPRIALFATRDILPGEELTFDYG 217
SET_SUV39H cd10542
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
51-298 5.59e-55

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homologs, SUV39H1, SUV39H2 and similar proteins; This family includes SUV39H1 (also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A, KMT1A, position-effect variegation 3-9 homolog, SUV39H, or Su(var)3-9 homolog 1) and SUV39H2 (also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B, KMT1B, or Su(var)3-9 homolog 2), both act as histone-lysine N-methyltransferases that specifically trimethylate 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. They mainly function in heterochromatin regions, thereby playing central roles in the establishment of constitutive heterochromatin at pericentric and telomere regions. Also included are Schizosaccharomyces pombe H3K9 methyltransferase Clr4 (SUV39H homolog) and Neurospora crassa DIM-5, both of which also methylate 'Lys-9' of histone H3.


Pssm-ID: 380940 [Multi-domain]  Cd Length: 245  Bit Score: 188.27  E-value: 5.59e-55
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  51 FQYTPDHVVGPGADIDPTQITfpGCICVkTPCLPGTCSC----LRHGENYDDNSCLRdIGSGGkyaePVFECNVLCRCSD 126
Cdd:cd10542    3 FQYINDYIPGDGVKIPEDFLV--GCECT-EDCHNNNPTCcpaeSGVKFAYDKQGRLR-LPPGT----PIYECNSRCKCGP 74
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 127 HCRNRVVQKGLQFHFQVFKTH-KKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAIRehvYNGQV 205
Cdd:cd10542   75 DCPNRVVQRGRKVPLCIFRTSnGRGWGVKTLEDIKKGTFVMEYVGEIITSEEAERRGKIYDANGRTYLFDLD---YNDDD 151
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 206 METFVDPTYIGNIGRFLNHSCEPNLLMIPVRI---DSMVPKLALFAAKDIVPEEELSYDYSGrYLNLTVSEDKERLDhGK 282
Cdd:cd10542  152 CEYTVDAAYYGNISHFINHSCDPNLAVYAVWInhlDPRLPRIAFFAKRDIKAGEELTFDYLM-TGTGGSSESTIPKP-KD 229
                        250
                 ....*....|....*.
gi 397522430 283 LRKPCYCGAKSCTAFL 298
Cdd:cd10542  230 VRVPCLCGSKNCRKYL 245
SET_EHMT cd10543
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
51-266 5.41e-51

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase EHMT1, EHMT2 and similar proteins; This family includes EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, lysine N-methyltransferase 1D, or KMT1D) and EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C, KMT1C, or protein G9a), both act as histone-lysine N-methyltransferases that specifically mono- and dimethylate 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380941 [Multi-domain]  Cd Length: 231  Bit Score: 176.76  E-value: 5.41e-51
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  51 FQYTPDHVVGPGADIDPTQITFPGCICVKTpCLPGTCSCLRHGEN--YDDNSCLrdIGSGGKYAEPV-FECNVLCRCSDH 127
Cdd:cd10543    3 FLYVTENCETSPLNIDRNITSLQTCSCRDD-CSSDNCVCGRLSVRcwYDKEGRL--LPDFNKLDPPLiFECNRACSCWRN 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 128 CRNRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRihlqtkSDSNYI--IAIREHvyngqv 205
Cdd:cd10543   80 CRNRVVQNGIRYRLQLFRTRGMGWGVRALQDIPKGTFVCEYIGELISDSEADSR------EDDSYLfdLDNKDG------ 147
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 397522430 206 mETF-VDPTYIGNIGRFLNHSCEPNllMIPVRI-----DSMVPKLALFAAKDIVPEEELSYDYSGRY 266
Cdd:cd10543  148 -ETYcIDARRYGNISRFINHLCEPN--LIPVRVfvehqDLRFPRIAFFASRDIKAGEELGFDYGEKF 211
SET_AtSUVH-like cd10545
SET domain found in Arabidopsis thaliana histone H3-K9 methyltransferases (SUVHs) and similar ...
74-262 6.44e-47

SET domain found in Arabidopsis thaliana histone H3-K9 methyltransferases (SUVHs) and similar proteins; Arabidopsis thaliana SUVH protein (also termed suppressor of variegation 3-9 homolog protein) is a histone-lysine N-methyltransferase that methylates 'Lys-9' of histone H3. H3 'Lys-9' methylation represents a specific tag for epigenetic transcriptional repression. Some family members contain a post-SET domain which binds a Zn2+ ion. Most family members, except for Arabidopsis thaliana SUVH9, contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380943 [Multi-domain]  Cd Length: 232  Bit Score: 165.65  E-value: 6.44e-47
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  74 GCICvKTPCLPGT--CSC-LRHGENYDDNSclrdigsGGKYAEP---VFECNVLCRCSDHCRNRVVQKGLQFHFQVFKTH 147
Cdd:cd10545   23 GCDC-KNRCTDGAsdCACvKKNGGEIPYNF-------NGRLIRAkpaIYECGPLCKCPPSCYNRVTQKGLRYRLEVFKTA 94
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 148 KKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIhlqtkSDSNYIIAIREHvYNGQV---------------------- 205
Cdd:cd10545   95 ERGWGVRSWDSIPAGSFICEYVGELLDTSEADTRS-----GNDDYLFDIDNR-QTNRGwdggqrldvgmsdgerssaede 168
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 397522430 206 --METFVDPTYIGNIGRFLNHSCEPNLLMIPVRI---DSMVPKLALFAAKDIVPEEELSYDY 262
Cdd:cd10545  169 esSEFTIDAGSFGNVARFINHSCSPNLFVQCVLYdhnDLRLPRVMLFAADNIPPLQELTYDY 230
SET_SETDB1 cd10517
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) ...
29-295 7.90e-43

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) and similar proteins; SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes.


Pssm-ID: 380915 [Multi-domain]  Cd Length: 288  Bit Score: 156.30  E-value: 7.90e-43
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  29 DVACGQENLPVGAWPP--GAAPAPFQYTPDHVVGPGADIDPTQITFPGCICvKTPCLPGT-CSClrHGENYDDNSCL--- 102
Cdd:cd10517    8 DISYGKEGVPIPCVNEidNSSPPYVEYSKERIPGKGVNINLDPDFLVGCDC-TDGCRDKSkCAC--QQLTIEATAATpgg 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 103 RDIGSGG--------KYAEPVFECNVLCRCSDHCRNRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLG 174
Cdd:cd10517   85 QINPSAGyqyrrlmeKLPTGVYECNSRCKCDKRCYNRVVQNGLQVRLQVFKTEKKGWGIRCLDDIPKGSFVCIYAGQILT 164
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 175 FSEVQRRihlQTKSDSNYI-----IAIREHV---YNGQVMET--FVDPTYIGNIGRFLNHSCEPNLLMIPVRIDSM---V 241
Cdd:cd10517  165 EDEANEE---GLQYGDEYFaeldyIEVVEKLkegYESDVEEHcyIIDAKSEGNLGRYLNHSCSPNLFVQNVFVDTHdlrF 241
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 397522430 242 PKLALFAAKDIVPEEELSYDYsgrylNLTVS--EDKERLdhgklrkpCYCGAKSCT 295
Cdd:cd10517  242 PWVAFFASRYIRAGTELTWDY-----NYEVGsvPGKVLY--------CYCGSSNCR 284
SET_SUV39H_Clr4-like cd20073
SET domain (including pre-SET and post-SET domains) found in of Schizosaccharomyces pombe H3K9 ...
84-298 1.57e-41

SET domain (including pre-SET and post-SET domains) found in of Schizosaccharomyces pombe H3K9 methyltransferase Clr4, and similar proteins; This subfamily contains fission yeast Schizosaccharomyces pombe H3K9 methyltransferase Clr4 (also known as Suv39h), the sole homolog of the mammalian SUV39H1 and SUV39H2 enzymes, that has a critical role in preventing aberrant heterochromatin formation. It is known to di- and tri-methylate Lys-9 of histone H3, a central heterochromatic histone modification, with its specificity profile most similar to that of the human SUV39H2 homolog.


Pssm-ID: 380999 [Multi-domain]  Cd Length: 259  Bit Score: 151.96  E-value: 1.57e-41
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  84 PGTCSCLRHGEN----YDDNSCLRDIGSGgkyaePVFECNVLCRCSDHCRNRVVQKGLQFHFQVFKTHKKGWGLRTLEFI 159
Cdd:cd20073   39 PGSCQCLEDSNEksfaYDEYGRVRANTGS-----IIYECNENCDCGINCPNRVVQRGRKLPLEIFKTKHKGWGLRCPRFI 113
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 160 PKGRFVCEYAGEVLGFSEVQRRihlQTKSDSNYIIAIRE-HVYNGQVMETF-VDPTYIGNIGRFLNHSCEPNLLMIPVRI 237
Cdd:cd20073  114 KAGTFIGVYLGEVITQSEAEIR---GKKYDNVGVTYLFDlDLFEDQVDEYYtVDAQYCGDVTRFINHSCDPNLAIYSVLR 190
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 397522430 238 D---SMVPKLALFAAKDIVPEEELSYDYSGRYLNLTVSEDKERLDHG----KLRKPCYCGAKSCTAFL 298
Cdd:cd20073  191 DksdSKIYDLAFFAIKDIPALEELTFDYSGRNNFDQLGFIGNRSNSKyinlKNKRPCYCGSANCRGWL 258
SET_SUV39H1 cd10525
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
51-298 2.74e-41

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homolog 1 (SUV39H1) and similar proteins; SUV39H1 (EC 2.1.1.43; also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A (KMT1A), position-effect variegation 3-9 homolog (SUV39H), or Su(var)3-9 homolog 1) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. It mainly functions in heterochromatin regions, thereby playing a central role in the establishment of constitutive heterochromatin at pericentric and telomere regions.


Pssm-ID: 380923 [Multi-domain]  Cd Length: 255  Bit Score: 151.20  E-value: 2.74e-41
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  51 FQYTPDHVVGPGADIDPTQItfpGCIC---VKTP---CLPGTcscLRHGENYDDNSCLRDigsggKYAEPVFECNVLCRC 124
Cdd:cd10525    3 FVYINEYKVGEGVTLNQVAV---GCECqdcLSQPvggCCPGA---SKHRFAYNEQGQVKV-----RPGLPIYECNSRCRC 71
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 125 SDHCRNRVVQKGLQFHFQVFKTHK-KGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAIR--EHVY 201
Cdd:cd10525   72 GPDCPNRVVQKGIQYDLCIFRTDNgRGWGVRTLEKIRKNSFVMEYVGEIITSEEAERRGQIYDRQGATYLFDLDyvEDVY 151
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 202 NgqvmetfVDPTYIGNIGRFLNHSCEPNLLMIPVRIDSM---VPKLALFAAKDIVPEEELSYDYSGRYLNLTVSEDKERL 278
Cdd:cd10525  152 T-------VDAAYYGNISHFVNHSCDPNLQVYNVFIDNLderLPRIALFATRTIRAGEELTFDYNMQVDPVDAESTKMDS 224
                        250       260       270
                 ....*....|....*....|....*....|
gi 397522430 279 DHG----------KLRKPCYCGAKSCTAFL 298
Cdd:cd10525  225 NFGlaglpgspkkRVRIECKCGVRSCRKYL 254
SET_SUV39H_DIM5-like cd19473
SET domain (including pre-SET domain) found in Neurospora crassa (DIM-5) and similar proteins; ...
56-298 2.13e-39

SET domain (including pre-SET domain) found in Neurospora crassa (DIM-5) and similar proteins; This subfamily contains Neurospora crassa DIM-5 (also termed H3-K9-HMTase dim-5, or HKMT) which functions as histone-lysine N-methyltransferase that specifically trimethylates histone H3 to form H3K9me3.


Pssm-ID: 380996 [Multi-domain]  Cd Length: 274  Bit Score: 146.31  E-value: 2.13e-39
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  56 DHVVgPGADIDPTQITF-PGCICVKTP-CLPGTCSCLRHGENYDD----------------NSCLRDIGSGGKYaePVFE 117
Cdd:cd19473    7 EKSI-LGEGVELADEEFrSGCECTDDEdCMYSGCLCLQDVDPDDDrdpgkkknayhssgakKGCLRGHMLNSRL--PIYE 83
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 118 CNVLCRCSDHCRNRVVQKGLQFHFQVFKT-HKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHlqtksdsNYIIAI 196
Cdd:cd19473   84 CHEGCACSDDCPNRVVERGRKVPLQIFRTsDGRGWGVRSTVDIKRGQFVDCYVGEIITPEEAQRRRD-------AATIAQ 156
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 197 REHVY-----------------NGQVMEtfVDPTYIGNIGRFLNHSCEPNLlmipvRI--------DSMVPKLALFAAKD 251
Cdd:cd19473  157 RKDVYlfaldkfsdpdsldprlRGDPYE--IDGEFMSGPTRFINHSCDPNL-----RIfarvgdhaDKHIHDLAFFAIKD 229
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*....
gi 397522430 252 IVPEEELSYDysgrYLNLTVSEDKERLD--HGKLRKPCYCGAKSCTAFL 298
Cdd:cd19473  230 IPRGTELTFD----YVDGVTGLDDDAGDeeKEKEMTKCLCGSPKCRGYL 274
SET_EHMT1 cd10535
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
51-294 8.76e-39

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 1 (EHMT1) and similar proteins; EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, or lysine N-methyltransferase 1D (KMT1D)) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380933 [Multi-domain]  Cd Length: 231  Bit Score: 143.15  E-value: 8.76e-39
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  51 FQYTPDHVVGPGADIDPTQITFPGCICVKTpCLPGTCSC--LRHGENYDDNSCLRdigSGGKYAEP--VFECNVLCRCSD 126
Cdd:cd10535    3 YKYVSQNCVTSPMNIDRNITHLQYCVCIDD-CSSSNCMCgqLSMRCWYDKDGRLL---PEFNMAEPplIFECNHACSCWR 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 127 HCRNRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRihlqtkSDSNYIIAIREHvyNGQVM 206
Cdd:cd10535   79 NCRNRVVQNGLRARLQLYRTRDMGWGVRSLQDIPPGTFVCEYVGELISDSEADVR------EEDSYLFDLDNK--DGEVY 150
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 207 etFVDPTYIGNIGRFLNHSCEPNLlmIPVRI-----DSMVPKLALFAAKDIVPEEELSYDYSGRYLNLTvsedkerldhG 281
Cdd:cd10535  151 --CIDARFYGNVSRFINHHCEPNL--VPVRVfmahqDLRFPRIAFFSTRLIEAGEQLGFDYGERFWDIK----------G 216
                        250
                 ....*....|...
gi 397522430 282 KLRKpCYCGAKSC 294
Cdd:cd10535  217 KLFS-CRCGSPKC 228
SET_SUV39H2 cd10532
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
51-298 1.77e-38

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homolog 2 (SUV39H2) and similar proteins; SUV39H2 (EC 2.1.1.43; also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B (KMT1B), or Su(var)3-9 homolog 2) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. It mainly functions in heterochromatin regions, thereby playing a central role in the establishment of constitutive heterochromatin at pericentric and telomere regions.


Pssm-ID: 380930 [Multi-domain]  Cd Length: 243  Bit Score: 142.72  E-value: 1.77e-38
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  51 FQYTPDHVVGPGADIDpTQITFpGCICvkTPCLPGTCSCLRHGEN--YDDNSCLRdIGSGgkyaEPVFECNVLCRCSDHC 128
Cdd:cd10532    3 FYYINEYKPAPGINLD-NEATV-GCDC--SDCFFGKCCPAEAGVLfaYNEHGQLK-IPPG----TPIYECNSRCKCGPDC 73
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 129 RNRVVQKGLQFHFQVFKTHK-KGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAIrehvyNGQVME 207
Cdd:cd10532   74 PNRVVQKGTQYSLCIFRTSNgRGWGVKTLQKIKKNSFVMEYVGEVITSEEAERRGQFYDSKGITYLFDL-----DYESDE 148
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 208 TFVDPTYIGNIGRFLNHSCEPNLLMIPVRIDSM---VPKLALFAAKDIVPEEELSYDYSGR-YLNLTVSEDKERLDHGKL 283
Cdd:cd10532  149 FTVDAARYGNVSHFVNHSCDPNLQVFNVFIDNLdtrLPRIALFSTRTIKAGEELTFDYQMKgSGDLSSDSIDNSPAKKRV 228
                        250
                 ....*....|....*
gi 397522430 284 RKPCYCGAKSCTAFL 298
Cdd:cd10532  229 RTVCKCGAVTCRGYL 243
SET_EHMT2 cd10533
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
113-294 6.34e-36

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 2 (EHMT2) and similar proteins; EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C (KMT1C), or protein G9a) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380931 [Multi-domain]  Cd Length: 239  Bit Score: 135.53  E-value: 6.34e-36
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 113 EP--VFECNVLCRCSDHCRNRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRihlqtkSDS 190
Cdd:cd10533   63 EPplIFECNQACSCWRNCKNRVVQSGIKVRLQLYRTAKMGWGVRALQTIPQGTFICEYVGELISDAEADVR------EDD 136
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 191 NYIIAIREHvyNGQVMetFVDPTYIGNIGRFLNHSCEPNLlmIPVRI-----DSMVPKLALFAAKDIVPEEELSYDYSGR 265
Cdd:cd10533  137 SYLFDLDNK--DGEVY--CIDARYYGNISRFINHLCDPNI--IPVRVfmlhqDLRFPRIAFFSSRDIRTGEELGFDYGDR 210
                        170       180
                 ....*....|....*....|....*....
gi 397522430 266 YLNLTVSedkerldhgklRKPCYCGAKSC 294
Cdd:cd10533  211 FWDIKSK-----------YFTCQCGSEKC 228
SET_SETDB2 cd10523
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 2 (SETDB2) ...
86-294 3.14e-35

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 2 (SETDB2) and similar proteins; SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis.


Pssm-ID: 380921 [Multi-domain]  Cd Length: 266  Bit Score: 134.19  E-value: 3.14e-35
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  86 TCSCLR--HGENYDDNSCLRDIGSGGKYA---EPV----FECNVLCRCSDH-CRNRVVQKGLQFHFQVFKTHKKGWGLRT 155
Cdd:cd10523   45 KCACLQltARAFSKSESSPSKGGRGYKYKrlqEPIpsglYECNVSCKCNRMlCQNRVVQHGLQVRLQVFKTEKKGWGVRC 124
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 156 LEFIPKGRFVCEYAGEVLgfsevqRRIHLQTKSDSNYIIAIREHVYNGQVMET-------------FVDPTYIGNIGRFL 222
Cdd:cd10523  125 LDDIDKGTFVCIYAGRVL------SRARSPTEPLPPKLELPSENEVEVVTSWLilskkrklrenvcFLDASKEGNVGRFL 198
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 397522430 223 NHSCEPNLLMIPVRIDSM---VPKLALFAAKDIVPEEELSYDYSgrYLNLTVSEdKErldhgklrKPCYCGAKSC 294
Cdd:cd10523  199 NHSCCPNLFVQNVFVDTHdknFPWVAFFTNRVVKAGTELTWDYS--YDAGTSPE-QE--------IPCLCGVNKC 262
SET_SETDB cd10541
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1), ...
115-295 2.69e-33

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1), SET domain bifurcated 2 (SETDB2), and similar proteins; SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes. SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis.


Pssm-ID: 380939 [Multi-domain]  Cd Length: 236  Bit Score: 128.05  E-value: 2.69e-33
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 115 VFECNVLCRCSDH-CRNRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGfsevqrrihlQTKSDSNYI 193
Cdd:cd10541   67 VYECNKLCKCDPNmCQNRLVQHGLQVRLQLFKTQNKGWGIRCLDDIAKGTFVCIYAGKILT----------DDFADKEGL 136
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 194 IAIREHVYNGQVMET---FVDPTYIGNIGRFLNHSCEPNLLMIPVRIDSM---VPKLALFAAKDIVPEEELSYDYSgrYL 267
Cdd:cd10541  137 EMGDEYFANLDHIEEscyIIDAKLEGNLGRYLNHSCSPNLFVQNVFVDTHdlrFPWVAFFASKRIKAGTELTWDYN--YE 214
                        170       180
                 ....*....|....*....|....*...
gi 397522430 268 NLTVsEDKERLdhgklrkpCYCGAKSCT 295
Cdd:cd10541  215 VGSV-EGKELL--------CCCGSNECR 233
Transposase_1 pfam01359
Transposase (partial DDE domain); This family includes the mariner transposase.
501-580 7.81e-33

Transposase (partial DDE domain); This family includes the mariner transposase.


Pssm-ID: 460174  Cd Length: 80  Bit Score: 121.26  E-value: 7.81e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  501 LYDNRRRSAQWLDQEEAPKHFPKPILHPKKVMVTIWWSAAGLIHYSFLNPGETITSEKYAQEIDEMNQKLQRLQLALVNR 580
Cdd:pfam01359   1 LYENKRRSAQWVDAGEPPKKVPKPELHAKKVMLSVWWDAKGVLYVELLPPGKTINAEYYASQLRKLRQALKEKRPGKVRV 80
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
139-268 1.06e-32

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 122.44  E-value: 1.06e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430   139 FHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHL--QTKSDSNYIIAIREHVYngqvmetfVDPTYIG 216
Cdd:smart00317   1 NKLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERPKAydTDGAKAFYLFDIDSDLC--------IDARRKG 72
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 397522430   217 NIGRFLNHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDYSGRYLN 268
Cdd:smart00317  73 NLARFINHSCEPNCELLFVEVNGD-DRIVIFALRDIKPGEELTIDYGSDYAN 123
SET_SETD2-like cd10531
SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2), ...
143-295 1.31e-30

SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2), nuclear SETD2 (NSD2), ASH1-like protein (ASH1L) and similar proteins; This family includes SET domain-containing protein 2 (SETD2), nuclear SETD2 (NSD2) and ASH1-like protein (ASH1L), which function as histone-lysine N-methyltransferases. SETD2 specifically trimethylates 'Lys-36' of histone H3 (H3K36me3) using demethylated 'Lys-36' (H3K36me2) as substrate. NSD2 shows histone H3 'Lys-27' (H3K27me) methyltransferase activity. ASH1L specifically methylates 'Lys-36' of histone H3 (H3K36me). The family also includes Arabidopsis thaliana ASH1-related protein 3 (ASHR3) and similar proteins.


Pssm-ID: 380929  Cd Length: 136  Bit Score: 116.97  E-value: 1.31e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 143 VFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSN--YIIAIREHVyngqvmetFVDPTYIGNIGR 220
Cdd:cd10531    4 LFRTEKKGWGVKAKEDIQKGEFIIEYVGEVIDKKEFKERLDEYEELGKSnfYILSLSDDV--------VIDATRKGNLSR 75
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 397522430 221 FLNHSCEPNLLMIPVRIDSmVPKLALFAAKDIVPEEELSYDY-SGRYLNLTVsedkerldhgklrkPCYCGAKSCT 295
Cdd:cd10531   76 FINHSCEPNCETQKWIVNG-EYRIGIFALRDIPAGEELTFDYnFVNYNEAKQ--------------VCLCGAQNCR 136
SET_ASH1L cd19174
SET domain (including post-SET domain) found in ASH1-like protein (ASH1L) and similar proteins; ...
140-298 9.83e-30

SET domain (including post-SET domain) found in ASH1-like protein (ASH1L) and similar proteins; ASH1L (EC 2.1.1.43; also termed absent small and homeotic disks protein 1 homolog, KMT2H, or lysine N-methyltransferase 2H) acts as histone-lysine N-methyltransferase that specifically methylates 'Lys-36' of histone H3 (H3K36me). It plays important roles in development; heterozygous mutation of ASH1L is associated with severe intellectual disability (ID) and multiple congenital anomaly (MCA).


Pssm-ID: 380951 [Multi-domain]  Cd Length: 141  Bit Score: 114.70  E-value: 9.83e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 140 HFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAireHVYNGQVmetfVDPTYIGNIG 219
Cdd:cd19174    1 GLERFRTEDKGWGVRTKEPIKAGQFIIEYVGEVVSEQEFRRRMIEQYHNHSHHYCL---NLDSGMV----IDGYRMGNEA 73
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 397522430 220 RFLNHSCEPNLLMIPVRIDSmVPKLALFAAKDIVPEEELSYDYsgRYLNLTVSEdkerldhgklRKPCYCGAKSCTAFL 298
Cdd:cd19174   74 RFVNHSCDPNCEMQKWSVNG-VYRIGLFALKDIPAGEELTYDY--NFHSFNVEK----------QQPCKCGSPNCRGVI 139
SET_SETD2 cd19172
SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2) and ...
143-298 2.36e-29

SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2) and similar proteins; SETD2 (also termed HIF-1, huntingtin yeast partner B, huntingtin-interacting protein 1 (HIP-1), huntingtin-interacting protein B, lysine N-methyltransferase 3A or protein-lysine N-methyltransferase SETD2) acts as histone-lysine N-methyltransferase that specifically trimethylates 'Lys-36' of histone H3 (H3K36me3) using demethylated 'Lys-36' (H3K36me2) as substrate. It has been shown that methylation is a posttranslational modification of dynamic microtubules and that SETD2 methylates alpha-tubulin at lysine 40, the same lysine that is marked by acetylation on microtubules. Methylation of microtubules occurs during mitosis and cytokinesis and can be ablated by SETD2 deletion, which causes mitotic spindle and cytokinesis defects, micronuclei, and polyploidy.


Pssm-ID: 380949 [Multi-domain]  Cd Length: 142  Bit Score: 113.44  E-value: 2.36e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 143 VFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLqtksdsnYIIAIREHVYngqVM----ETFVDPTYIGNI 218
Cdd:cd19172    6 VFRTEKKGWGLRAAEDLPKGTFVIEYVGEVLDEKEFKRRMKE-------YAREGNRHYY---FMalksDEIIDATKKGNL 75
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 219 GRFLNHSCEPNLLMIPVRIDSMVpKLALFAAKDIVPEEELSYDYS-GRYlnltvsedkerldhGKLRKPCYCGAKSCTAF 297
Cdd:cd19172   76 SRFINHSCEPNCETQKWTVNGEL-RVGFFAKRDIPAGEELTFDYQfERY--------------GKEAQKCYCGSPNCRGY 140

                 .
gi 397522430 298 L 298
Cdd:cd19172  141 I 141
SET_NSD cd19173
SET domain (including post-SET domain) found in nuclear SET domain-containing proteins, NSD1, ...
142-298 6.54e-26

SET domain (including post-SET domain) found in nuclear SET domain-containing proteins, NSD1, NSD2, NSD3 and similar proteins; The nuclear receptor-binding SET Domain (NSD) family of histone H3 lysine 36 methyltransferases is comprised of NSD1, NSD2, and NSD3, which are primarily known to be involved in chromatin integrity and gene expression through mono-, di-, or tri-methylating lysine 36 of histone H3 (H3K36), respectively. NSD1 (EC 2.1.1.43; also termed histone-lysine N-methyltransferase H3 lysine-36 and H4 lysine-20 specific, androgen receptor coactivator 267 kDa protein (ARA267), androgen receptor-associated protein of 267 kDa, H3-K36-HMTase, H4-K20-HMTase, lysine N-methyltransferase 3B (KMT3B) or NR-binding SET domain-containing protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-36' of histone H3 and 'Lys-20' of histone H4. NSD2 (EC 2.1.1.43; also termed multiple myeloma SET domain-containing protein (MMSET), protein trithorax-5 (TRX5), or wolf-Hirschhorn syndrome candidate 1 protein (WHSC1)) acts as histone-lysine N-methyltransferase with histone H3 'Lys-27' (H3K27me) methyltransferase activity. NSD3 (EC 2.1.1.43; also termed protein whistle, WHSC1-like 1 isoform 9 with methyltransferase activity to lysine, Wolf-Hirschhorn syndrome candidate 1-like protein 1 (WHSC1L1), or WHSC1-like protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-4' and 'Lys-27' of histone H3.


Pssm-ID: 380950 [Multi-domain]  Cd Length: 142  Bit Score: 103.55  E-value: 6.54e-26
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 142 QVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRI-HLQTKSDSN-YIIAIREHVYngqvmetfVDPTYIGNIG 219
Cdd:cd19173    5 EPFKTGDRGWGLRTKRDIKKGDFVIEYVGELIDEEECRRRLkKAHENNITNfYMLTLDKDRI--------IDAGPKGNLS 76
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 220 RFLNHSCEPNL--LMIPVRIDsmvPKLALFAAKDIVPEEELSYDYsgrylNLTvsedkerlDHGKLRKPCYCGAKSCTAF 297
Cdd:cd19173   77 RFMNHSCQPNCetQKWTVNGD---TRVGLFAVRDIPAGEELTFNY-----NLD--------CLGNEKKVCRCGAPNCSGF 140

                 .
gi 397522430 298 L 298
Cdd:cd19173  141 L 141
SET_SETD1-like cd10518
SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), ...
126-294 8.56e-26

SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), histone-lysine N-methyltransferases (KMT2A/KMT2B/KMT2C/KMT2D) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A), 1B (SETD1B), as well as histone-lysine N-methyltransferase 2A (KMT2A), 2B (KMT2B), 2C (KMT2C), 2D (KMT2D). These proteins are histone-lysine N-methyltransferases (EC 2.1.1.43) that specifically methylate 'Lys-4' of histone H3 (H3K4me).


Pssm-ID: 380916  Cd Length: 150  Bit Score: 103.45  E-value: 8.56e-26
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 126 DHCRNRVVQKgLQFHfqvfKTHKKGWGLRTLEFIPKGRFVCEYAGEVLG--FSEVQRRIHLQTKSDSNYIIAIREHVYng 203
Cdd:cd10518    6 RQLRSRLKER-LRVG----KSGIHGWGLFAKRPIAAGEMVIEYVGEVIRpiVADKREKRYDEEGGGGTYMFRIDEDLV-- 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 204 qvmetfVDPTYIGNIGRFLNHSCEPNLLMIPVRIDSmVPKLALFAAKDIVPEEELSYDYSgryLNltvSEDKERLdhgkl 283
Cdd:cd10518   79 ------IDATKKGNIARFINHSCDPNCYAKIITVDG-EKHIVIFAKRDIAPGEELTYDYK---FP---IEDEEKI----- 140
                        170
                 ....*....|.
gi 397522430 284 rkPCYCGAKSC 294
Cdd:cd10518  141 --PCLCGAPNC 149
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
150-262 1.00e-23

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 96.44  E-value: 1.00e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430  150 GWGLRTLEFIPKGRFVCEYAGEVLgFSEVQRRIHLQTKSDSNYIIAIREHVYN-GQVMETFVDPT--YIGNIGRFLNHSC 226
Cdd:pfam00856   1 GRGLFATEDIPKGEFIGEYVEVLL-ITKEEADKRELLYYDKLELRLWGPYLFTlDEDSEYCIDARalYYGNWARFINHSC 79
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 397522430  227 EPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDY 262
Cdd:pfam00856  80 DPNCEVRVVYVNGG-PRIVIFALRDIKPGEELTIDY 114
SET_ASHR3-like cd19175
SET domain (including post-SET domain) found in Arabidopsis thaliana ASH1-related protein 3 ...
142-298 3.44e-23

SET domain (including post-SET domain) found in Arabidopsis thaliana ASH1-related protein 3 (ASHR3) and similar proteins; This family includes Arabidopsis thaliana ASH1-related protein 3 (ASHR3, also termed protein SET DOMAIN GROUP 4 or protein stamen loss), ASH1 homolog 3 (ASHH3, also termed protein SET DOMAIN GROUP 7) and homolog 4 (ASHH4, also termed protein SET DOMAIN GROUP 24). They all function as histone-lysine N-methyltransferases (EC 2.1.1.43).


Pssm-ID: 380952 [Multi-domain]  Cd Length: 139  Bit Score: 95.95  E-value: 3.44e-23
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 142 QVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRI-HLQTKSDSNYiiairehvYNGQVMETFV-DPTYIGNIG 219
Cdd:cd19175    3 KLVKTEKCGWGLVADEDINAGEFIIEYVGEVIDDKTCEERLwDMKHKGEKNF--------YMCEIDKDMViDATFKGNLS 74
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 397522430 220 RFLNHSCEPNLLMIPVRIDSMVpKLALFAAKDIVPEEELSYDYsgRYLNLTVSEDkerldhgklrkpCYCGAKSCTAFL 298
Cdd:cd19175   75 RFINHSCDPNCELQKWQVDGET-RIGVFAIRDIKKGEELTYDY--QFVQFGADQD------------CHCGSKNCRGKL 138
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
139-295 6.92e-21

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 88.87  E-value: 6.92e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 139 FHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDsNYIIaireHVYNGQVmetfVDPTYIGNI 218
Cdd:COG2940    6 PRIEVRPSPIHGRGVFATRDIPKGTLIGEYPGEVITWAEAERREPHKEPLH-TYLF----ELDDDGV----IDGALGGNP 76
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 397522430 219 GRFLNHSCEPNLLMIPVRidsmvPKLALFAAKDIVPEEELSYDYSgrylnltvsedkerLDHGKLRKPCYCgaKSCT 295
Cdd:COG2940   77 ARFINHSCDPNCEADEED-----GRIFIVALRDIAAGEELTYDYG--------------LDYDEEEYPCRC--PNCR 132
SET_NSD3 cd19212
SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing ...
142-298 2.39e-19

SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing protein 3 (NSD3) and similar proteins; NSD3 (EC 2.1.1.43; also termed protein whistle, WHSC1-like 1 isoform 9 with methyltransferase activity to lysine, Wolf-Hirschhorn syndrome candidate 1-like protein 1 (WHSC1L1), or WHSC1-like protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-4' and 'Lys-27' of histone H3. NSD3 is amplified and overexpressed in multiple cancer types, including acute myeloid leukemia (AML), breast, lung, pancreatic and bladder cancers, as well as squamous cell carcinoma of the head and neck (SCCHN). NSD3 contributes to tumorigenesis by interacting with bromodomain-containing protein 4 (BRD4), the bromodomain and extraterminal (BET) protein, which is a potential therapeutic target in acute myeloid leukemia (AML). NSD3 is amplified in primary tumors and cell lines from breast carcinoma, and can promote the cell viability of small-cell lung cancer and pancreatic ductal adenocarcinoma. High NSD3 expression is implicated in poor grade and heavy smoking history in SCCHN. Thus, NSD3 may serve as a potential druggable target for selective cancer therapy.


Pssm-ID: 380989 [Multi-domain]  Cd Length: 142  Bit Score: 84.98  E-value: 2.39e-19
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 142 QVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRI-HLQTKSDSN-YIIAIREhvyngqvmETFVDPTYIGNIG 219
Cdd:cd19212    5 EIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLRIkRAHENSVTNfYMLTVTK--------DRIIDAGPKGNYS 76
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 397522430 220 RFLNHSCEPNLLMIPVRIDSMVpKLALFAAKDIVPEEELSYDYSGRYLnltvsedkerldhGKLRKPCYCGAKSCTAFL 298
Cdd:cd19212   77 RFMNHSCNPNCETQKWTVNGDV-RVGLFALCDIPAGMELTFNYNLDCL-------------GNGRTECHCGADNCSGFL 141
PreSET smart00468
N-terminal to some SET domains; A Cys-rich putative Zn2+-binding domain that occurs N-terminal ...
27-123 2.84e-19

N-terminal to some SET domains; A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.


Pssm-ID: 128744 [Multi-domain]  Cd Length: 98  Bit Score: 83.23  E-value: 2.84e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430    27 QLDVACGQENLP---VGAWPPGAAPAPFQYTPDHVVGPGADIDPTQITFPGCICVKtPCLPGT-CSCLRHGENYDDNScl 102
Cdd:smart00468   1 CLDISNGKENVPvplVNEVDEDPPPPDFEYISEYIYGQGVPIDRSPSPLVGCSCSG-DCSSSNkCECARKNGGEFAYE-- 77
                           90       100
                   ....*....|....*....|.
gi 397522430   103 RDIGSGGKYAEPVFECNVLCR 123
Cdd:smart00468  78 LNGGLRLKRKPLIYECNSRCS 98
SET_KMT2A_2B cd19170
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A), ...
129-298 3.11e-19

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A), 2B (KMT2B) and similar proteins; This family includes KMT2A and KMT2B. Both KMT2A (also termed ALL-1 or CXXC7 or MLL or MLL1 or TRX1 or HRX) and KMT2B (also termed MLL4 or TRX2) act as histone methyltransferases that methylate 'Lys-4' of histone H3 (H3K4me).


Pssm-ID: 380947 [Multi-domain]  Cd Length: 152  Bit Score: 84.75  E-value: 3.11e-19
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 129 RNRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLgfsevqrRIHLQTKsdsnyiiaiREHVYNGQVM-- 206
Cdd:cd19170    4 RFRHLRKTAKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGEVI-------RSVLTDK---------REKYYESKGIgc 67
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 207 -------ETFVDPTYIGNIGRFLNHSCEPNLLMIPVRIDSmVPKLALFAAKDIVPEEELSYDYsgrylnltvsedKERLD 279
Cdd:cd19170   68 ymfriddDEVVDATMHGNAARFINHSCEPNCYSRVVNIDG-KKHIVIFALRRILRGEELTYDY------------KFPIE 134
                        170
                 ....*....|....*....
gi 397522430 280 HGKLrkPCYCGAKSCTAFL 298
Cdd:cd19170  135 DVKI--PCTCGSKKCRKYL 151
SET_SET1 cd20072
SET domain (including post-SET domain) found in catalytic component of the Saccharomyces ...
138-294 6.11e-19

SET domain (including post-SET domain) found in catalytic component of the Saccharomyces cerevisiae COMPASS complex and similar proteins; The family contains mostly fungal SET domains, including SET1 found in the catalytic component of the Saccharomyces cerevisiae COMPASS (complex of proteins associated with Set1). SET1 is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me), when part of the SET1 histone methyltransferase (HMT) complex. The activity of this catalytic domain is established through forming a complex with a set of core proteins; it is extensively contacted by Cps60 (Bre2), Cps50 (Swd1), and Cps30 (Swd3).


Pssm-ID: 380998  Cd Length: 148  Bit Score: 84.01  E-value: 6.11e-19
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 138 QFHFQVFKTHKkgWGLRTLEFIPKGRFVCEYAGEVL--GFSEVQRRIHLQTKSDSNYIIAIREhvyngqvmETFVDPTYI 215
Cdd:cd20072   14 QLKFARSAIHN--WGLYAMENISAKDMVIEYVGEVIrqQVADEREKRYLRQGIGSSYLFRIDD--------DTVVDATKK 83
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 397522430 216 GNIGRFLNHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDYSgrylnltVSEDKERLdhgklrkPCYCGAKSC 294
Cdd:cd20072   84 GNIARFINHCCDPNCTAKIIKVEGE-KRIVIYAKRDIAAGEELTYDYK-------FPREEDKI-------PCLCGAPNC 147
SET_NSD1 cd19210
SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing ...
140-298 1.37e-18

SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing protein 1 (NSD1) and similar proteins; NSD1 (EC 2.1.1.43; also termed Histone-lysine N-methyltransferase H3 lysine-36 and H4 lysine-20 specific, androgen receptor coactivator 267 kDa protein (ARA267), androgen receptor-associated protein of 267 kDa, H3-K36-HMTase, H4-K20-HMTase, lysine N-methyltransferase 3B (KMT3B), or NR-binding SET domain-containing protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-36' of histone H3 and 'Lys-20' of histone H4. NSD1 is altered in approximately 10% of head and neck cancer patients with 55% decrease in risk of death in NSD1-mutated versus non-mutated patients; its disruption promotes favorable chemotherapeutic responses linked to hypomethylation.


Pssm-ID: 380987 [Multi-domain]  Cd Length: 142  Bit Score: 82.67  E-value: 1.37e-18
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 140 HFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRI-HLQTKSDSNYIIAIREHvyngqvmETFVDPTYIGNI 218
Cdd:cd19210    3 EVEIFRTLGRGWGLRCKTDIKKGEFVNEYVGELIDEEECRARIrYAQEHDITNFYMLTLDK-------DRIIDAGPKGNY 75
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 219 GRFLNHSCEPNLLMIPVRIDSMVpKLALFAAKDIVPEEELSYDYsgrylNLtvsedkERLDHGKlrKPCYCGAKSCTAFL 298
Cdd:cd19210   76 ARFMNHCCQPNCETQKWTVNGDT-RVGLFALCDIKAGTELTFNY-----NL------ECLGNGK--TVCKCGAPNCSGFL 141
SET_NSD2 cd19211
SET domain (including post-SET domain) found in nuclear SET domain-containing protein 2 (NSD2) ...
142-298 1.66e-18

SET domain (including post-SET domain) found in nuclear SET domain-containing protein 2 (NSD2) and similar proteins; NSD2 (EC 2.1.1.43; also termed multiple myeloma SET domain-containing protein (MMSET), protein trithorax-5 (TRX5), or wolf-Hirschhorn syndrome candidate 1 protein (WHSC1)) acts as histone-lysine N-methyltransferase with histone H3 'Lys-36' (H3K36me) methyltransferase activity. NSD2 has been shown to mediate di- and trimethylation of H3K36 and dimethylation of H4K20 in different systems, and has been characterized as a transcriptional repressor interacting with histone deacetylase HDAC1 and histone demethylase LSD1. NSD2 mediates constitutive NF-kappaB signaling for cancer cell proliferation, survival and tumor growth. It is highly overexpressed in several types of human cancers, including small-cell lung cancers, neuroblastoma, carcinomas of stomach and colon, and bladder cancers, and its overexpression tends to be associated with tumor aggressiveness. WHSC1 is frequently deleted in Wolf-Hirschhorn syndrome (WHS).


Pssm-ID: 380988 [Multi-domain]  Cd Length: 142  Bit Score: 82.35  E-value: 1.66e-18
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 142 QVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSN--YIIAIREhvyngqvmETFVDPTYIGNIG 219
Cdd:cd19211    5 KIIKTEGKGWGLIAKRDIKKGEFVNEYVGELIDEEECMARIKHAHENDIThfYMLTIDK--------DRIIDAGPKGNYS 76
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 397522430 220 RFLNHSCEPNLLMIPVRIDSMVpKLALFAAKDIVPEEELSYDYSGRYLnltvsedkerldhGKLRKPCYCGAKSCTAFL 298
Cdd:cd19211   77 RFMNHSCQPNCETQKWTVNGDT-RVGLFAVCDIPAGTELTFNYNLDCL-------------GNEKTVCRCGAPNCSGFL 141
SET_SETD8 cd10528
SET domain found in SET domain-containing protein 8 (SETD8) and similar proteins; SETD8 (EC 2. ...
131-265 5.27e-18

SET domain found in SET domain-containing protein 8 (SETD8) and similar proteins; SETD8 (EC 2.1.1.43; also termed N-lysine methyltransferase KMT5A, H4-K20-HMTase KMT5A, lysine N-methyltransferase 5A, lysine-specific methylase 5A, PR/SET domain-containing protein 07, PR-Set7 or PR/SET07) is a nucleosomal histone-lysine N-methyltransferase that specifically monomethylates 'Lys-20' of histone H4 (H4K20me1). It plays a central role in the silencing of euchromatic genes.


Pssm-ID: 380926 [Multi-domain]  Cd Length: 141  Bit Score: 81.08  E-value: 5.27e-18
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 131 RVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRR--IHLQTKSDSNYIIAIRehvYNGQVMet 208
Cdd:cd10528    9 ELILSGKEEGLKVIEIDGKGRGVIATRPFEKGDFVVEYHGDLITITEAKKReaLYAKDPSTGCYMYYFQ---YKGKTY-- 83
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 397522430 209 FVDPTYI-GNIGRFLNHSC-EPNLLMIPVRIDSmVPKLALFAAKDIVPEEELSYDYSGR 265
Cdd:cd10528   84 CVDATKEsGRLGRLINHSKkKPNLKTKLLVIDG-VPHLILVAKRDIKPGEELLYDYGDR 141
SET_SETD1 cd19169
SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and ...
138-294 9.64e-18

SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A) and SET domain-containing protein 1B (SETD1B). These proteins are histone-lysine N-methyltransferases that specifically methylate 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated.


Pssm-ID: 380946  Cd Length: 148  Bit Score: 80.46  E-value: 9.64e-18
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 138 QFHFQVFKTHkkGWGLRTLEFIPKGRFVCEYAGEVL--GFSEVQRRIHLQTKSDSNYIIAIREhvyngqvmETFVDPTYI 215
Cdd:cd19169   14 QLKFAKSRIH--DWGLFALEPIAADEMVIEYVGQVIrqSVADEREKRYEAIGIGSSYLFRVDD--------DTIIDATKC 83
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 397522430 216 GNIGRFLNHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDYsgrylNLTVSEDKerldhgklrKPCYCGAKSC 294
Cdd:cd19169   84 GNLARFINHSCNPNCYAKIITVESQ-KKIVIYSKRPIAVNEEITYDY-----KFPIEDEK---------IPCLCGAPQC 147
SET_EZH cd10519
SET domain found in enhancer of zeste homolog 1 (EZH1), zeste homolog 2 (EZH2) and similar ...
150-263 4.66e-17

SET domain found in enhancer of zeste homolog 1 (EZH1), zeste homolog 2 (EZH2) and similar proteins; The family includes EZH1 and EZH2. EZH1 (EC 2.1.1.43; also termed ENX-2, or histone-lysine N-methyltransferase EZH1) is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. EZH2 (EC 2.1.1.43; also termed lysine N-methyltransferase 6, ENX-1, or histone-lysine N-methyltransferase EZH2) is a catalytic subunit of the PRC2/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. Both, EZH1 and EZH2, can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.


Pssm-ID: 380917  Cd Length: 117  Bit Score: 77.67  E-value: 4.66e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 150 GWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAIREhvyngqvmETFVDPTYIGNIGRFLNHSCEPN 229
Cdd:cd10519   12 GWGLFLKEPIKKDEFIGEYTGELISQDEADRRGKIYDKYNSSYLFNLND--------QFVVDATRKGNKIRFANHSSNPN 83
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 397522430 230 LLmipVRIdSMVP---KLALFAAKDIVPEEELSYDYS 263
Cdd:cd10519   84 CY---AKV-MMVNgdhRIGIFAKRDIEAGEELFFDYG 116
transpos_IS630 NF033545
IS630 family transposase;
383-655 5.64e-16

IS630 family transposase;


Pssm-ID: 468076 [Multi-domain]  Cd Length: 298  Bit Score: 79.22  E-value: 5.64e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 383 ERTVQWWFKKFCK-GDESLEDEERSGRPSEVDNDQ----LRAIIEADPLT----TTREVAEELN------VNHSTVVRHL 447
Cdd:NF033545  26 RSTVYRWLKRFNEgGLEGLLDKPRPGRPRKLLSEQqaelLALLLEEPPEGaghwTLRELAALLEeefgveYSRSTVRRLL 105
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 448 KQIG-KVKKLDKWvPHELTENQKNRRFEVssslILRNHNEPFLDRIVTCDEKWI-LYDNRRRSaqWldqeeAPKHFPKPI 525
Cdd:NF033545 106 KRLGlSPKKPRPR-APKQDPEFVEKFKEV----LGLYRAPPDPAEVVFIDESGIqLLDTRGRG--W-----APKGQRRPR 173
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 526 LHPKKVmvtiwWSAAGLihYSFLNP--GETITSEKYAQEIDEMNQKLQRLqLALVNRKGPILLHDNARPHVAQPTLQKLN 603
Cdd:NF033545 174 VHVYGR-----RGTLNL--FGALDPltGKVFVLFTGRINSEDFIEFLEEL-LAAYPGKKIHLILDNASTHKSKKVREWLE 245
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|....
gi 397522430 604 ELGY-EVLPHPPYSPDLLPTNYhVFKHLN-NFLQGKRFHNQQDAENAFREFVES 655
Cdd:NF033545 246 EHGRiELHYLPPYSPWLNPIER-VWAVLKrRLLRNRAFRSVDELREAIDAFLNR 298
SET_SETD5-like cd10529
SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine ...
137-262 2.12e-15

SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine N-methyltransferase 2E (KMT2E) and similar proteins; SETD5 is a probable transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. KMT2E (also termed inactive lysine N-methyltransferase 2E or myeloid/lymphoid or mixed-lineage leukemia protein 5 (MLL5)) associates with chromatin regions downstream of transcriptional start sites of active genes and thus regulates gene transcription. The family also includes Saccharomyces cerevisiae SET domain-containing proteins, SET3 and SET4, and Schizosaccharomyces pombe SET3. Most of these family members contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380927  Cd Length: 127  Bit Score: 73.08  E-value: 2.12e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 137 LQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDsNYIIairehVYNGQVMETF-VDPTYI 215
Cdd:cd10529    3 LTSRTVEKVVGKNRKGLVATEDISPGEPILEYKGEVSLRSEFKEDNGFFKRPS-PFVF-----FYDGFEGLPLcVDARKY 76
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....
gi 397522430 216 GNIGRFLNHSCEPNllmipVRIDSMV-----PKLALFAAKDIVPEEE--LSYDY 262
Cdd:cd10529   77 GNEARFIRRSCRPN-----AELRHVVvsngeLRLFIFALKDIRKGTEitIPFDY 125
SET_LegAS4-like cd10522
SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and ...
147-268 1.14e-14

SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and similar proteins; LegAS4 is a type IV secretion system effector of Legionella pneumophila. It contains a SET domain that is involved in the modification of Lys4 of histone H3 (H3K4) in the nucleolus of the host cell, thereby enhancing heterochromatic rDNA transcription. It also contains an ankyrin repeat domain of unknown function at its C-terminal region.


Pssm-ID: 380920 [Multi-domain]  Cd Length: 122  Bit Score: 70.83  E-value: 1.14e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 147 HKKGWGLRTLEFIPKGRFVCEYAGEVLgfSEVQRRIHLQTKSDSNYIIAI-REHVYngqvmetfVDPTYIGNIGRFLNHS 225
Cdd:cd10522   11 SHNGLGLFAAETIAKGEFVGEYTGEVL--DRWEEDRDSVYHYDPLYPFDLnGDILV--------IDAGKKGNLTRFINHS 80
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|...
gi 397522430 226 CEPNLLMIpVRIDSMVPKLALFAAKDIVPEEELSYDYSGRYLN 268
Cdd:cd10522   81 DQPNLELI-VRTLKGEQHIGFVAIRDIKPGEELFISYGPKYWK 122
SET_KMT2A cd19206
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A) ...
129-298 1.18e-14

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A) and similar proteins; KMT2A (EC2.1.1.43; also termed lysine N-methyltransferase 2A, ALL-1, CXXC-type zinc finger protein 7 (CXXC7), myeloid/lymphoid or mixed-lineage leukemia (MLL), myeloid/lymphoid or mixed-lineage leukemia protein 1 (MLL1), trithorax-like protein (TRX1), or zinc finger protein HRX) acts as a histone methyltransferase that plays an essential role in early development and hematopoiesis. It is a catalytic subunit of the MLL1/MLL complex, a multiprotein complex that mediates both methylation of 'Lys-4' of histone H3 (H3K4me) complex and acetylation of 'Lys-16' of histone H4 (H4K16ac).


Pssm-ID: 380983 [Multi-domain]  Cd Length: 154  Bit Score: 71.98  E-value: 1.18e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 129 RNRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRR-IHLQTKSDSNYIIAIREHvyngqvme 207
Cdd:cd19206    4 RFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKReKYYDSKGIGCYMFRIDDS-------- 75
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 208 TFVDPTYIGNIGRFLNHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDYsgrylNLTVSEDKERLdhgklrkPC 287
Cdd:cd19206   76 EVVDATMHGNAARFINHSCEPNCYSRVINIDGQ-KHIVIFAMRKIYRGEELTYDY-----KFPIEDASNKL-------PC 142
                        170
                 ....*....|.
gi 397522430 288 YCGAKSCTAFL 298
Cdd:cd19206  143 NCGAKKCRKFL 153
HTH_48 pfam17906
HTH domain in Mos1 transposase; The N-terminal domain of the Mos1 Mariner transposase ...
348-397 1.74e-14

HTH domain in Mos1 transposase; The N-terminal domain of the Mos1 Mariner transposase comprises two HTH domains. This HTH domain binds in the DNA major groove to the transposons inverted repeats.


Pssm-ID: 465558 [Multi-domain]  Cd Length: 50  Bit Score: 67.93  E-value: 1.74e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 397522430  348 KKQIRAIFLFEFKMGRKAAETTRNINNAFGPGTANERTVQWWFKKFCKGD 397
Cdd:pfam17906   1 KIHLRHCILYEFLQGKSAAEAYRNICEVYGEDAVSERTFQRWFARFRSGD 50
SET_KMT2C_2D cd19171
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C), ...
149-294 4.93e-14

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C), 2D (KMT2D) and similar proteins; This family includes KMT2C and KMT2D. Both, KMT2C (also termed HALR or MLL3) and KMT2D (also termed ALR or MLL2), act as histone methyltransferases that methylate 'Lys-4' of histone H3 (H3K4me). They are subunits of MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation.


Pssm-ID: 380948 [Multi-domain]  Cd Length: 153  Bit Score: 70.15  E-value: 4.93e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 149 KGWGLRTLEFIPKGRFVCEYAGEVLGfSEVQRRihlqtksdsnyiiaiREHVYNGQ----VM-----ETFVDPTYIGNIG 219
Cdd:cd19171   24 QGLGLYAARDIEKHTMVIEYIGEIIR-NEVANR---------------REKIYESQnrgiYMfridnDWVIDATMTGGPA 87
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 397522430 220 RFLNHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDYSGRYlnltvsEDkerlDHGKLrkPCYCGAKSC 294
Cdd:cd19171   88 RYINHSCNPNCVAEVVTFDKE-KKIIIISNRRIAKGEELTYDYKFDF------ED----DQHKI--PCLCGAPNC 149
Pre-SET pfam05033
Pre-SET motif; This protein motif is a zinc binding motif. It contains 9 conserved cysteines ...
30-131 1.85e-13

Pre-SET motif; This protein motif is a zinc binding motif. It contains 9 conserved cysteines that coordinate three zinc ions. It is thought that this region plays a structural role in stabilising SET domains.


Pssm-ID: 461530 [Multi-domain]  Cd Length: 99  Bit Score: 66.67  E-value: 1.85e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430   30 VACGQENLPVG---AWPPGAAPAPFQYTPDHVVGPGadidPTQITFPGCICvkTPCLPGTCSCLR-----HGENYDDNSC 101
Cdd:pfam05033   1 ISKGKENVPIPvvnEVDDEPPPPDFTYITSYIYPKE----FLLIIPQGCDC--GDCSSEKCSCAQlnggeFRFPYDKDGL 74
                          90       100       110
                  ....*....|....*....|....*....|
gi 397522430  102 LRDigsggKYAEPVFECNVLCRCSDHCRNR 131
Cdd:pfam05033  75 LVP-----ESKPPIYECNPLCGCPPSCPNR 99
SET_SETD1A cd19204
SET domain (including post-SET domain) found in SET domain-containing protein 1A (SETD1A) and ...
145-298 5.89e-13

SET domain (including post-SET domain) found in SET domain-containing protein 1A (SETD1A) and similar proteins; SETD1A (EC2.1.1.43), also termed lysine N-methyltransferase 2F, or Set1/Ash2 histone methyltransferase complex subunit SET1, is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me), when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. Human SET domain containing protein 1A (hSETD1A) expression occurs at a high rate in hepatocellular carcinoma patients and controls tumor metastasis in breast cancer by activating MMP expression.


Pssm-ID: 380981 [Multi-domain]  Cd Length: 153  Bit Score: 66.97  E-value: 5.89e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 145 KTHKKGWGLRTLEFIPKGRFVCEYAGEVLG--FSEVQRRIHLQTKSDSNYIIAIREhvyngqvmETFVDPTYIGNIGRFL 222
Cdd:cd19204   20 RSRIHEWGLFAMEPIAADEMVIEYVGQNIRqvVADMREKRYVQEGIGSSYLFRVDH--------DTIIDATKCGNLARFI 91
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 397522430 223 NHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDYsgrylnltvsedKERLDHGKLrkPCYCGAKSCTAFL 298
Cdd:cd19204   92 NHCCTPNCYAKVITIESQ-KKIVIYSKQPIGVNEEITYDY------------KFPIEDNKI--PCLCGTENCRGTL 152
SET_SpSET3-like cd19183
SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET ...
142-262 7.06e-12

SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET domain-containing protein 3 (SETD3) and similar proteins; Schizosaccharomyces pombe SETD3 functions as a transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. It is required for both, gene activation and repression.


Pssm-ID: 380960  Cd Length: 173  Bit Score: 64.35  E-value: 7.06e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 142 QVFKTHKKGW-GLRTLEFIPKGRFVCEYAGEVLGFSEvqrrihLQTKSDSNY--IIAIREHVYngqvmetFVD--PTYI- 215
Cdd:cd19183    4 SSIGLANASRfGLFADRPIPAGDPIQELLGEIGLQSE------YIADPENQYqiLGAPKPHVF-------FHPqsPLYId 70
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 397522430 216 ----GNIGRFLNHSCEPNLLMIPVRIDS-MVPKLALFAAKDIVPEEELSYDY 262
Cdd:cd19183   71 trrsGSVARFIRRSCRPNAELVTVASDSgSVLKFVLYASRDISPGEEITIGW 122
SET cd08161
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, ...
140-262 1.41e-11

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, Enhancer-of-zeste, Trithorax (SET) domain superfamily corresponds to SET domain-containing lysine methyltransferases, which catalyze site and state-specific methylation of lysine residues in histones that are fundamental in epigenetic regulation of gene activation and silencing in eukaryotic organisms. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains has been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as N-SET and C-SET. C-SET forms an unusual and conserved knot-like structure of probable functional importance. In addition to N-SET and C-SET, an insert region (I-SET) and flanking regions of high structural variability form part of the overall structure. Some family members contain a pre-SET domain, which is found in a number of histone methyltransferases (HMTase), and a post-SET domain, which harbors a zinc-binding site.


Pssm-ID: 380914 [Multi-domain]  Cd Length: 72  Bit Score: 60.34  E-value: 1.41e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 140 HFQVFKTHKKGWGLRTLEFIPKGRFVCeyagevlgfsevqrrihlqtksdsnyiiairehvyngqvmetfvdptyignIG 219
Cdd:cd08161    1 EIRPSTIPGAGFGLFATRDIPKGEVIG---------------------------------------------------LA 29
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|...
gi 397522430 220 RFLNHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDY 262
Cdd:cd08161   30 RFINHSCEPNCEFEEVYVGGK-PRVFIVALRDIKAGEELTVDY 71
SET_EZH-like cd19168
SET domain found in enhancer of zeste homolog 1 (EZH1) and zeste homolog 2 (EZH2) of polycomb ...
150-266 1.48e-11

SET domain found in enhancer of zeste homolog 1 (EZH1) and zeste homolog 2 (EZH2) of polycomb repressive complex 2 (PRC2), and similar proteins; The family includes EZH1 and EZH2. EZH1 (EC 2.1.1.43; also termed ENX-2, or histone-lysine N-methyltransferase EZH1) is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. EZH2 (EC 2.1.1.43; also termed lysine N-methyltransferase 6, ENX-1, or histone-lysine N-methyltransferase EZH2) is a catalytic subunit of the PRC2/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. Both EZH1 and EZH2 can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively. PRC2 is involved in several cancers; EZH2 is overexpressed in breast, liver and prostate cancer, while point mutations in EZH2 alter the substrate preference and product specificity of PRC2 in Non-Hodgkin lymphomas (NHLs). Thus, PRC2 is a popular target for cancer therapeutics.


Pssm-ID: 380945  Cd Length: 124  Bit Score: 62.21  E-value: 1.48e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 150 GWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAIREHVYngqvmetfVDPTYIGNIGRFLNHS---- 225
Cdd:cd19168   13 GLGLFAAEDIKEGEFVIEYTGELISHDEGVRREHRRGDVSYLYLFEEQEGIW--------VDAAIYGNLSRYINHAtdkv 84
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*
gi 397522430 226 ----CEPNLLMIPVRIdsmvpKLALFAAKDIVPEEELSYDYSGRY 266
Cdd:cd19168   85 ktgnCMPKIMYVNHEW-----RIKFTAIKDIKIGEELFFNYGDNF 124
SET_KMT2B cd19207
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2B (KMT2B) ...
129-298 2.65e-11

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2B (KMT2B) and similar proteins; KMT2B (EC2.1.1.43; also termed lysine N-methyltransferase 2B, myeloid/lymphoid or mixed-lineage leukemia protein 4 (MLL2/MLL4), trithorax homolog 2 (TRX2), or WW domain-binding protein 7 (WBP-7)), acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me). It is required during the transcriptionally active period of oocyte growth for the establishment and/or maintenance of bulk H3K4 trimethylation (H3K4me3), global transcriptional silencing that precedes resumption of meiosis, oocyte survival and normal zygotic genome activation.


Pssm-ID: 380984 [Multi-domain]  Cd Length: 154  Bit Score: 62.35  E-value: 2.65e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 129 RNRVVQKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLgfsevqrRIHLQTKsdsnyiiaiREHVYNGQ---- 204
Cdd:cd19207    4 RFRHLKKTSKEAVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVI-------RSVLTDK---------REKFYDSKgigc 67
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 205 ---VMETF--VDPTYIGNIGRFLNHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDYsgrylnltvsedKERLD 279
Cdd:cd19207   68 ymfRIDDFdvVDATMHGNAARFINHSCEPNCYSRVIHVEGQ-KHIVIFALRKIYRGEELTYDY------------KFPIE 134
                        170
                 ....*....|....*....
gi 397522430 280 HGKLRKPCYCGAKSCTAFL 298
Cdd:cd19207  135 DASNKLPCNCGAKRCRRFL 153
SET_EZH1 cd19217
SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43) ...
134-262 2.96e-11

SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43), also termed ENX-2, or histone-lysine N-methyltransferase EZH1, is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.


Pssm-ID: 380994  Cd Length: 136  Bit Score: 61.62  E-value: 2.96e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 134 QKGLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAIREhvyngqvmETFVDPT 213
Cdd:cd19217    1 QRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNN--------DFVVDAT 72
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*....
gi 397522430 214 YIGNIGRFLNHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDY 262
Cdd:cd19217   73 RKGNKIRFANHSVNPNCYAKVVMVNGD-HRIGIFAKRAIQQGEELFFDY 120
SET_SETD1B cd19205
SET domain (including post-SET domain) found in SET domain-containing protein 1B (SETD1B) and ...
145-298 4.70e-11

SET domain (including post-SET domain) found in SET domain-containing protein 1B (SETD1B) and similar proteins; SETD1B (EC2.1.1.43), also termed lysine N-methyltransferase 2G, is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. Loss of SETD1B occurs in up to half the gastric and colorectal cancers, most commonly via SETD1B mutations, while de novo variants in SETD1B are associated with intellectual disability, epilepsy and autism.


Pssm-ID: 380982 [Multi-domain]  Cd Length: 153  Bit Score: 61.61  E-value: 4.70e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 145 KTHKKGWGLRTLEFIPKGRFVCEYAGEVLG--FSEVQRRIHLQTKSDSNYIIAIrEHvyngqvmETFVDPTYIGNIGRFL 222
Cdd:cd19205   20 KSHIHDWGLFAMEPIAADEMVIEYVGQNIRqvIADMREKRYEDEGIGSSYMFRV-DH-------DTIIDATKCGNFARFI 91
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 397522430 223 NHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDYsgrylNLTVSEDKerldhgklrKPCYCGAKSCTAFL 298
Cdd:cd19205   92 NHSCNPNCYAKVITVESQ-KKIVIYSKQHINVNEEITYDY-----KFPIEDVK---------IPCLCGSENCRGTL 152
SET_EZH2 cd19218
SET domain found in enhancer of zeste homolog 2 (EZH2) and similar proteins; EZH2 (EC 2.1.1.43) ...
136-263 1.05e-09

SET domain found in enhancer of zeste homolog 2 (EZH2) and similar proteins; EZH2 (EC 2.1.1.43), also termed lysine N-methyltransferase 6, or ENX-1, or histone-lysine N-methyltransferase EZH2, is a catalytic subunit of the polycomb repressive complex 2 (PRC2)/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively. PRC2 is involved in several cancers; EZH2 is overexpressed in breast, liver and prostate cancer, while point mutations in EZH2 alter the substrate preference and product specificity of PRC2 in Non-Hodgkin lymphomas (NHLs). Thus, PRC2 is a popular target for cancer therapeutics.


Pssm-ID: 380995  Cd Length: 120  Bit Score: 56.46  E-value: 1.05e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 136 GLQFHFQVFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKsdsnYIIAIREHVYNGQVmetfVDPTYI 215
Cdd:cd19218    1 GSKKHLLLAPSDVAGWGIFIKDPVQKNEFISEYCGEIISQDEADRRGKVYDK----YMCSFLFNLNNDFV----VDATRK 72
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
gi 397522430 216 GNIGRFLNHSCEPNLL--MIPVRIDSmvpKLALFAAKDIVPEEELSYDYS 263
Cdd:cd19218   73 GNKIRFANHSVNPNCYakVMMVNGDH---RIGIFAKRAIQTGEELFFDYR 119
SET_KMT2D cd19209
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2D (KMT2D) ...
145-298 2.38e-08

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2D (KMT2D) and similar proteins; KMT2D (EC2.1.1.43; also termed lysine N-methyltransferase 2D, ALL1-related protein (ALR), or myeloid/lymphoid or mixed-lineage leukemia protein 2 (MLL2)), acts as histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me). It is a coactivator for estrogen receptor by being recruited by ESR1, thereby activating transcription. KMT2D is a subunit of MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation.


Pssm-ID: 380986 [Multi-domain]  Cd Length: 155  Bit Score: 53.55  E-value: 2.38e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 145 KTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSN-YIIAIR-EHVyngqvmetfVDPTYIGNIGRFL 222
Cdd:cd19209   22 RSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGiYMFRINnEHV---------IDATLTGGPARYI 92
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 397522430 223 NHSCEPNLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDYsgrylNLTVSEDKERLdhgklrkPCYCGAKSCTAFL 298
Cdd:cd19209   93 NHSCAPNCVAEVVTFDKE-DKIIIISSRRIPKGEELTYDY-----QFDFEDDQHKI-------PCHCGAWNCRKWM 155
Csa3 COG3415
CRISPR-associated protein Csa3, CARF domain [Defense mechanisms]; CRISPR-associated protein ...
382-650 8.32e-08

CRISPR-associated protein Csa3, CARF domain [Defense mechanisms]; CRISPR-associated protein Csa3, CARF domain is part of the Pathway/BioSystem: CRISPR-Cas system


Pssm-ID: 442641 [Multi-domain]  Cd Length: 325  Bit Score: 54.47  E-value: 8.32e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 382 NERTVQWWFKKFCK-GDESLEDEERSGRPSEVDNDQLRAIIE--------ADPLTTTREVAEEL------NVNHSTVVRH 446
Cdd:COG3415   51 SRSTVYRWLKRYREgGLAGLKDRPRGGRPSKLSDEQRERLLEllrekspdQGSRWTLAELAELLeeefgvEVSPSTVRRL 130
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 447 LKQIGKVKKLDKWVPHEltenqKNRRFEVSSSLILRNHNEPFLDRIVTCDEKWILYDNRRRSAQWLDQEEAPKHFPKPIL 526
Cdd:COG3415  131 LKRLGLSYKKPRPRAPK-----QDPEAVEKFKKELEALLAKPAKAEVAGVDEGDEIGALRRAAGGRLLAPGGRVRRTGAS 205
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 527 HPKkvmvtiwwSAAGLIHYSFLNPGETITSEKYAQEIDEMNQKLQRLQLALVNRKGPILLHDNARPHVAQPTLQKLNELG 606
Cdd:COG3415  206 VRR--------GRALLFLALGLALGLRIGLTIDALLAGALLLFLRLLAERALALLLLLLLLLLLSPSKAVARAAKLLKLE 277
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*
gi 397522430 607 YEVLPHPPYSP-DLLPTNYHVFKHLNNFLQGKRFHNQQDAENAFR 650
Cdd:COG3415  278 LLRLLFLPPSAsLLLALEALLRELRERNLLRLAHASLLLLFAAAR 322
HTH_32 pfam13565
Homeodomain-like domain;
385-447 1.62e-07

Homeodomain-like domain;


Pssm-ID: 463923 [Multi-domain]  Cd Length: 73  Bit Score: 48.83  E-value: 1.62e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 397522430  385 TVQWWFKKFCK-GDESLEDEERSGRPSEVDNDQLRAIIEA---DPLTTTREVAEEL------NVNHSTVVRHL 447
Cdd:pfam13565   1 TVYRWRKRYNEeGLEGLEDRPRRGRPRKLTDEQEARLLALlceEPRWSPRLLAERLeeefgvKVSRSTVRRIL 73
SET_KMT2C cd19208
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C) ...
149-298 2.21e-07

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C) and similar proteins; KMT2C (EC2.1.1.43; also termed lysine N-methyltransferase 2C, homologous to ALR protein (HALR) myeloid/lymphoid, or mixed-lineage leukemia protein 3 (MLL3)), acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me) and may be involved in leukemogenesis and developmental disorder. KMT2C is a catalytic subunit of MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation. Overexpression of KMT2C is associated with estrogen receptor-positive breast cancer; KMT2C mediates the estrogen dependence of breast cancer through regulation of estrogen receptor alpha (ERalpha) enhancer function. KMT2C is frequently mutated in certain populations with diffuse-type gastric adenocarcinomas (DGA); its loss promotes epithelial-to-mesenchymal transition (EMT) and is associated with worse overall survival.


Pssm-ID: 380985 [Multi-domain]  Cd Length: 154  Bit Score: 50.78  E-value: 2.21e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 149 KGWGLRTLEFIPKGRFVCEYAGEVLGfSEVQRRIHLQTKSDSNYIIAIRehVYNGQVmetfVDPTYIGNIGRFLNHSCEP 228
Cdd:cd19208   25 QGLGLYAARDIEKHTMVIEYIGTIIR-NEVANRKEKLYESQNRGVYMFR--IDNDHV----IDATLTGGPARYINHSCAP 97
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 229 NLLMIPVRIDSMvPKLALFAAKDIVPEEELSYDYsgrylNLTVSEDKERLdhgklrkPCYCGAKSCTAFL 298
Cdd:cd19208   98 NCVAEVVTFEKG-HKIIISSSRRIQKGEELCYDY-----KFDFEDDQHKI-------PCHCGAVNCRKWM 154
SET_ATXR5_6-like cd10539
SET domain found in fungal protein lysine methyltransferase SET5 and similar protein; The ...
158-264 5.95e-06

SET domain found in fungal protein lysine methyltransferase SET5 and similar protein; The family includes Arabidopsis thaliana ATXR5 and ATXR6. Both ATXR5 (also termed protein SET DOMAIN GROUP 15, or TRX-related protein 5) and ATXR6 (also termed protein SET DOMAIN GROUP 34, or TRX-related protein 6) function as histone methyltransferase that specifically monomethylates 'Lys-37' of histone H3 (H3K27me1). They are required for chromatin structure and gene silencing.


Pssm-ID: 380937  Cd Length: 138  Bit Score: 46.25  E-value: 5.95e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 158 FIPKGRFVCEYAGEVLGFSEvqrRIHLQTKSdsnyiiaIREHVYNGQVMETFVD-PTYIGNIGRFL----NHSCE----P 228
Cdd:cd10539   23 FIKDLTIIAEYTGDVDYIRN---REFDDNDS-------IMTLLLAGDPSKSLVIcPDKRGNIARFIsginNHTKDgkkkQ 92
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 397522430 229 NLLMIPVRIDSMVPKLaLFAAKDIVPEEELSYDYSG 264
Cdd:cd10539   93 NCKCVRYSINGEARVL-LVATRDIAKGERLYYDYNG 127
COG3335 COG3335
Transposase [Mobilome: prophages, transposons];
588-655 7.33e-06

Transposase [Mobilome: prophages, transposons];


Pssm-ID: 442564 [Multi-domain]  Cd Length: 292  Bit Score: 48.32  E-value: 7.33e-06
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 397522430 588 DNARPHVAQPTLQKLNELGYEVLPHPPYSPDLLPTNyHVFKHLNNFL-QGKRFHNQQDAENAFREFVES 655
Cdd:COG3335  206 DNASFHKSKAVREWLEEAGIELLFLPPYSPDLNPIE-RLWSKLKALLrKNRAFRSKDELKAAVRSFLDR 273
SET_SMYD cd20071
SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing ...
221-270 4.85e-04

SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing protein, and similar proteins; The family includes SET and MYND domain-containing proteins, SMYD1-SYMD5. SMYD1 (EC 2.1.1.43; also termed BOP) is a heart and muscle specific SET-MYND domain containing protein, which functions as a histone methyltransferase and regulates downstream gene transcription. It methylates histone H3 at 'Lys-4' (H3K4me), seems able to perform both mono-, di-, and trimethylation. SMYD2 (also termed HSKM-B, or lysine N-methyltransferase 3C (KMT3C)) functions as a histone methyltransferase that methylates both histones and non-histone proteins, including p53/TP53 and RB1. It specifically methylates histone H3 'Lys-4' (H3K4me) and dimethylates histone H3 'Lys-36' (H3K36me2). SMYD3 (also termed zinc finger MYND domain-containing protein 1) functions as a histone methyltransferase that specifically methylates 'Lys-4' of histone H3, inducing di- and tri-methylation, but not monomethylation. It also methylates 'Lys-5' of histone H4. SMYD3 plays an important role in transcriptional activation as a member of an RNA polymerase complex. SMYD4 functions as a potential tumor suppressor that plays a critical role in breast carcinogenesis at least partly through inhibiting the expression of PDGFR-alpha. SMYD5 (also termed protein NN8-4AG, or retinoic acid-induced protein 15) functions as histone lysine methyltransferase that mediates H4K20me3 at heterochromatin regions.


Pssm-ID: 380997 [Multi-domain]  Cd Length: 122  Bit Score: 40.44  E-value: 4.85e-04
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|
gi 397522430 221 FLNHSCEPNLlmipVRIDSMVPKLALFAAKDIVPEEELSYDYSGRYLNLT 270
Cdd:cd20071   58 LLNHSCDPNA----VVVFDGNGTLRVRALRDIKAGEELTISYIDPLLPRT 103
SET_Suv4-20-like cd10524
SET domain (including post-SET domain) found in Drosophila melanogaster suppressor of ...
148-266 1.08e-03

SET domain (including post-SET domain) found in Drosophila melanogaster suppressor of variegation 4-20 (Suv4-20) and similar proteins; Suv4-20 (also termed Su(var)4-20) is a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-20' of histone H4. It acts as a dominant suppressor of position-effect variegation. The family also includes Suv4-20 homologs, lysine N-methyltransferase 5B (KMT5B) and lysine N-methyltransferase 5C (KMT5C). Both KMT5B (also termed lysine-specific methyltransferase 5B, or suppressor of variegation 4-20 homolog 1, or Su(var)4-20 homolog 1, or Suv4-20h1) and KMT5C (also termed lysine-specific methyltransferase 5C, or suppressor of variegation 4-20 homolog 2, or Su(var)4-20 homolog 2, or Suv4-20h2) are histone methyltransferases that specifically trimethylate 'Lys-20' of histone H4 (H4K20me3). They play central roles in the establishment of constitutive heterochromatin in pericentric heterochromatin regions.


Pssm-ID: 380922 [Multi-domain]  Cd Length: 141  Bit Score: 39.96  E-value: 1.08e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 148 KKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSDSNYIIAIREHVyngqvMETFVDPtyigniGRFLNHSCE 227
Cdd:cd10524   17 HYGAKIIATKPIKKGEKIHELCGCIAELSEEEEALLRPGGNDFSVMYSSRKKC-----SQLWLGP------AAFINHDCR 85
                         90       100       110
                 ....*....|....*....|....*....|....*....
gi 397522430 228 PNLLMIPVRIDSMVPKlalfAAKDIVPEEELSYDYSGRY 266
Cdd:cd10524   86 PNCKFVPTGKSTACVK----VLRDIEPGEEITVYYGDNY 120
SET_SETD5 cd19181
SET domain (including post-SET domain) found in SET domain-containing protein 5 (SETD5) and ...
136-263 1.38e-03

SET domain (including post-SET domain) found in SET domain-containing protein 5 (SETD5) and similar proteins; SETD5 is a probable transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. SETD5 loss-of-function mutations are a likely cause of a familial syndromic intellectual disability with variable phenotypic expression.


Pssm-ID: 380958  Cd Length: 150  Bit Score: 39.61  E-value: 1.38e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 136 GLQFHFQ---VFKTHKKGWGLRTLEFIPKGRFVCEYAGEVLGFSEVQRRIHLQTKSdsnYIIAIREHVYNGqvMETFVDP 212
Cdd:cd19181    1 GSQMQLQlgrVTRVQKHRKILRAARDLALDTLIIEYRGKVMLRQQFEVNGHFFKRP---YPFVLFYSKFNG--VEMCVDA 75
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 397522430 213 TYIGNIGRFLNHSCEPNLLMIPVRIDSMVpKLALFAAKDIVPEEELS----YDYS 263
Cdd:cd19181   76 RTFGNDARFIRRSCTPNAEVRHMIADGMI-HLCIYAVAAIAKDAEVTiafdYEYS 129
SET_SpSet7-like cd10540
SET domain found in Schizossacharomyces pombe Set7 and similar proteins; Schizosaccharomyces ...
140-266 5.42e-03

SET domain found in Schizossacharomyces pombe Set7 and similar proteins; Schizosaccharomyces pombe Set7 is a novel histone-lysine N-methyltransferase. The family also includes a viral histone H3 lysine 27 methyltransferase from Paramecium bursaria Chlorella virus 1 (PBCV-1).


Pssm-ID: 380938  Cd Length: 112  Bit Score: 37.23  E-value: 5.42e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 397522430 140 HFQVFKTHKKGWGLRTLEFIPKGRFVcEYAGeVLGFSEvQRRIHLQTKSDSNYIIAIREhvyNGQVMEtfvdptyIGnIG 219
Cdd:cd10540    1 RLEVKPSTLKGRGVFATRPIKKGEVI-EEAP-VIVLPK-EEYQHLCKTVLDHYVFSWGD---GCLALA-------LG-YG 66
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*..
gi 397522430 220 RFLNHSCEPNLLmipVRIDSMVPKLALFAAKDIVPEEELSYDYSGRY 266
Cdd:cd10540   67 SMFNHSYTPNAE---YEIDFENQTIVFYALRDIEAGEELTINYGDDL 110
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH