NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|504508736|ref|WP_014695838|]
View 

GIY-YIG nuclease family protein [Deinococcus gobiensis]

Protein Classification

GIY-YIG nuclease family protein( domain architecture ID 61901)

GIY-YIG nuclease family protein similar to Escherichia virus T4 endonuclease segA that may be involved in the movement of the endonuclease-encoding DNA

CATH:  3.40.1440.10
Gene Ontology:  GO:0004518
PubMed:  16646971
SCOP:  3000597

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
GIY-YIG_SF super family cl15257
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large ...
3-92 2.45e-13

GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions.


The actual alignment was detected with superfamily member cd10437:

Pssm-ID: 472790  Cd Length: 90  Bit Score: 61.13  E-value: 2.45e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 504508736   3 GIYPVT--PTSRLYVGASDNLRRRQQEHERARQGGNWSNKALRADLKTYGPDAFSFHVLQRTGDLGELRRLEAQWaqqLR 80
Cdd:cd10437    2 GIYKITnlENGKIYIGSSKNILKRLSQHKRDLKAGKHHNKKLQRDFNKYGEENFEFTILEEGEEIEKLLELEQEY---ID 78
                         90
                 ....*....|..
gi 504508736  81 AFDPVTGYNTQP 92
Cdd:cd10437   79 KYKSEKGYNIEP 90
 
Name Accession Description Interval E-value
GIY-YIG_HE_I-TevI_like cd10437
N-terminal catalytic domain of GIY-YIG intron endonuclease I-TevI, I-BmoI, I-BanI, I-BthII and ...
3-92 2.45e-13

N-terminal catalytic domain of GIY-YIG intron endonuclease I-TevI, I-BmoI, I-BanI, I-BthII and similar proteins; I-TevI is a site-specific GIY-YIG homing endonuclease encoded within the group I intron of the thymidylate synthase gene (td) from Escherichia coli phage T4. It functions as an endonuclease that catalyzes the first step in intron homing by generating a double-strand break in the intronless td allele within a sequence designated the homing site. I-TevI recognizes its extensive 37 base pair DNA target in a site-specific, but sequence-tolerant manner. The cleavage site is located at 23 (upper strand) and 25 (lower strand) nucleotides upstream of the intron insertion site. A divalent cation, such as Mg2+, is required for the catalysis. I-TevI also acts as a repressor of its own transcription. It binds an operator that is located upstream of the I-TevI coding sequence and overlaps the T4 late promoter, which drives I-TevI expression from within the td intron. I-TevI binds the homing sites and the operator with the same affinity, but cleaves the homing site more efficiently than the operator. I-TevI consists of an N-terminal catalytic domain, containing the GIY-YIG motif, and a C-terminal DNA-binding domain that binds DNA as a monomer, joined by a flexible linker. The C-terminal domain includes three subdomains: a zinc finger, a minor-groove binding alpha-helix (NUMOD3, nuclease-associated modular domain 3), and a helix-turn-helix domain (HTH). The last two are responsible for DNA-binding. The zinc finger is part of the linker and not required for DNA-binding. It is implicated as a distance sensor to constrain the catalytic domain to cleave the homing site at a fixed position. None of other GIY-YIG endonucleases have been found to have the zinc finger motif. This family also includes a reduced activity isoschizomer of I-TevI, I-BmoI, which is encoded within the group I intron of the thymidylate synthase (TS) gene (thyA) from Bacillus mojavensis. I-BmoI catalyzes the first step in intron homing by generating a double-strand break in the intronless td allele within a sequence designated the homing site in the presence of a divalent cation cofactor, such as Mg2+. In the absence of Mg2+, I-Bmol only nicks one of the strands. Both I-BmoI and I-TevI bind a homologous stretch of TS-encoding DNA as monomers, but use different strategies to distinguish intronless from intron-containing substrates. I-TevI recognizes substrates at the level of DNA-binding. However, I-BmoI binds both intron-containing and intronless TS-encoding substrates, but efficiently cleaves only intronless substrate. Afterwards they cleave their respective intronless substrates in the same positions, and both require a critical G-C base pair adjacent to the top strand site for efficient cleavage. The C-terminal domain of I-BmoI has nuclease-associated modular DNA-binding domains (NUMODs), but lacks the zinc finger, which is different from that of I-TevI. Although the zinc finger implicated as a distance determination in I-TevI is absent, I-BmoI still possesses some cleavage distance discrimination. Besides I-TevI and I-BmoI, this family contains a putative GIY-YIG homing endonuclease, I-BanI, encoded within the self-splicing group I intron of nrdE gene from Bacillus anthracis. It contains two major domains, the N-terminal GIY-YIG domain and the C-terminal DNA-binding domain that consists of a minor-groove DNA binding alpha-helix motif and a helix-turn-helix (HTH) motif. I-BanI generates a double-strand break (DSB) in the intronless nrdE gene. The cleavage site is located at 5 and 7 nucleotides upstream of the intron insertion site, with 2-nucleotide 3' extensions. The recognition site is 35 to 40 base pairs and covers the cleavage site with a bias toward the downstream region including the (intervening sequence) IVS insertion site. Moreover, this family contains another putative GIY-YIG homing endonuclease, I-BthII, encoded within the self-splicing group I intron of nrdF gene from Bacillus thuringiensis ssp. pakistani. It contains a GIY-YIG motif that generates a double-strand break (DSB) in the intronless nrdF gene. The cleavage site is located at 7 and 9 nucleotides upstream of the intron insertion site, leaving 2-nucleotide 3' extensions. The recognition site is 27 to 29 base pairs with the DSB cleavage site at the 5'-end of the top strand, and with the intervening sequence (IVS) insertion site approximately in the middle of the recognition site.


Pssm-ID: 198384  Cd Length: 90  Bit Score: 61.13  E-value: 2.45e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 504508736   3 GIYPVT--PTSRLYVGASDNLRRRQQEHERARQGGNWSNKALRADLKTYGPDAFSFHVLQRTGDLGELRRLEAQWaqqLR 80
Cdd:cd10437    2 GIYKITnlENGKIYIGSSKNILKRLSQHKRDLKAGKHHNKKLQRDFNKYGEENFEFTILEEGEEIEKLLELEQEY---ID 78
                         90
                 ....*....|..
gi 504508736  81 AFDPVTGYNTQP 92
Cdd:cd10437   79 KYKSEKGYNIEP 90
grpIintron_endo TIGR01453
group I intron endonuclease; This model represents one subfamily of endonucleases containing ...
1-89 5.19e-08

group I intron endonuclease; This model represents one subfamily of endonucleases containing the endo/excinuclease amino terminal domain, pfam01541 at its amino end. A distinct subfamily includes excinuclease abc subunit c (uvrC). Members of pfam01541 are often termed GIY-YIG endonucleases after conserved motifs near the amino end. This subfamily in this model is found in open reading frames of group I introns in both phage and mitochondria. The closely related endonucleases of phage T4: segA, segB, segC, segD and segE, score below the trusted cutoff for the family.


Pssm-ID: 273636 [Multi-domain]  Cd Length: 214  Bit Score: 49.31  E-value: 5.19e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 504508736    1 MTGIYPVT--PTSRLYVGASDNLRRRQQEHERARQGGNwsNKALRADLKTYGPDAFSFHVLQRTGDLGELRRLEAQWaqq 78
Cdd:TIGR01453   1 KSGIYKITnnINGKIYVGSSVNLEKRLKEHLKLLKKGN--RIKLQKALNKYGWSNFSFEILEYYCNKDDLIERETYY--- 75
                          90
                  ....*....|.
gi 504508736   79 LRAFDPVTGYN 89
Cdd:TIGR01453  76 IKLLNPDAGYN 86
GIYc smart00465
GIY-YIG type nucleases (URI domain);
1-94 3.50e-04

GIY-YIG type nucleases (URI domain);


Pssm-ID: 214677 [Multi-domain]  Cd Length: 84  Bit Score: 37.02  E-value: 3.50e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 504508736     1 MTGIYPVT--PTSRLYVGASDNLRRRQQEHerarQGGNWSNKALRADLKTYGpDAFSFHVLQRTGDlgELRRLEAQWaqq 78
Cdd:smart00465   1 KPGVYYITnkKNGKLYVGKAKNLRNRLKRH----FSGSRKGRLLIDALLKYG-GNFEFIILESFDE--SALELEKYL--- 70
                           90
                   ....*....|....*.
gi 504508736    79 LRAFDPvtGYNTQPIR 94
Cdd:smart00465  71 IKEYKP--KYNLLLKG 84
 
Name Accession Description Interval E-value
GIY-YIG_HE_I-TevI_like cd10437
N-terminal catalytic domain of GIY-YIG intron endonuclease I-TevI, I-BmoI, I-BanI, I-BthII and ...
3-92 2.45e-13

N-terminal catalytic domain of GIY-YIG intron endonuclease I-TevI, I-BmoI, I-BanI, I-BthII and similar proteins; I-TevI is a site-specific GIY-YIG homing endonuclease encoded within the group I intron of the thymidylate synthase gene (td) from Escherichia coli phage T4. It functions as an endonuclease that catalyzes the first step in intron homing by generating a double-strand break in the intronless td allele within a sequence designated the homing site. I-TevI recognizes its extensive 37 base pair DNA target in a site-specific, but sequence-tolerant manner. The cleavage site is located at 23 (upper strand) and 25 (lower strand) nucleotides upstream of the intron insertion site. A divalent cation, such as Mg2+, is required for the catalysis. I-TevI also acts as a repressor of its own transcription. It binds an operator that is located upstream of the I-TevI coding sequence and overlaps the T4 late promoter, which drives I-TevI expression from within the td intron. I-TevI binds the homing sites and the operator with the same affinity, but cleaves the homing site more efficiently than the operator. I-TevI consists of an N-terminal catalytic domain, containing the GIY-YIG motif, and a C-terminal DNA-binding domain that binds DNA as a monomer, joined by a flexible linker. The C-terminal domain includes three subdomains: a zinc finger, a minor-groove binding alpha-helix (NUMOD3, nuclease-associated modular domain 3), and a helix-turn-helix domain (HTH). The last two are responsible for DNA-binding. The zinc finger is part of the linker and not required for DNA-binding. It is implicated as a distance sensor to constrain the catalytic domain to cleave the homing site at a fixed position. None of other GIY-YIG endonucleases have been found to have the zinc finger motif. This family also includes a reduced activity isoschizomer of I-TevI, I-BmoI, which is encoded within the group I intron of the thymidylate synthase (TS) gene (thyA) from Bacillus mojavensis. I-BmoI catalyzes the first step in intron homing by generating a double-strand break in the intronless td allele within a sequence designated the homing site in the presence of a divalent cation cofactor, such as Mg2+. In the absence of Mg2+, I-Bmol only nicks one of the strands. Both I-BmoI and I-TevI bind a homologous stretch of TS-encoding DNA as monomers, but use different strategies to distinguish intronless from intron-containing substrates. I-TevI recognizes substrates at the level of DNA-binding. However, I-BmoI binds both intron-containing and intronless TS-encoding substrates, but efficiently cleaves only intronless substrate. Afterwards they cleave their respective intronless substrates in the same positions, and both require a critical G-C base pair adjacent to the top strand site for efficient cleavage. The C-terminal domain of I-BmoI has nuclease-associated modular DNA-binding domains (NUMODs), but lacks the zinc finger, which is different from that of I-TevI. Although the zinc finger implicated as a distance determination in I-TevI is absent, I-BmoI still possesses some cleavage distance discrimination. Besides I-TevI and I-BmoI, this family contains a putative GIY-YIG homing endonuclease, I-BanI, encoded within the self-splicing group I intron of nrdE gene from Bacillus anthracis. It contains two major domains, the N-terminal GIY-YIG domain and the C-terminal DNA-binding domain that consists of a minor-groove DNA binding alpha-helix motif and a helix-turn-helix (HTH) motif. I-BanI generates a double-strand break (DSB) in the intronless nrdE gene. The cleavage site is located at 5 and 7 nucleotides upstream of the intron insertion site, with 2-nucleotide 3' extensions. The recognition site is 35 to 40 base pairs and covers the cleavage site with a bias toward the downstream region including the (intervening sequence) IVS insertion site. Moreover, this family contains another putative GIY-YIG homing endonuclease, I-BthII, encoded within the self-splicing group I intron of nrdF gene from Bacillus thuringiensis ssp. pakistani. It contains a GIY-YIG motif that generates a double-strand break (DSB) in the intronless nrdF gene. The cleavage site is located at 7 and 9 nucleotides upstream of the intron insertion site, leaving 2-nucleotide 3' extensions. The recognition site is 27 to 29 base pairs with the DSB cleavage site at the 5'-end of the top strand, and with the intervening sequence (IVS) insertion site approximately in the middle of the recognition site.


Pssm-ID: 198384  Cd Length: 90  Bit Score: 61.13  E-value: 2.45e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 504508736   3 GIYPVT--PTSRLYVGASDNLRRRQQEHERARQGGNWSNKALRADLKTYGPDAFSFHVLQRTGDLGELRRLEAQWaqqLR 80
Cdd:cd10437    2 GIYKITnlENGKIYIGSSKNILKRLSQHKRDLKAGKHHNKKLQRDFNKYGEENFEFTILEEGEEIEKLLELEQEY---ID 78
                         90
                 ....*....|..
gi 504508736  81 AFDPVTGYNTQP 92
Cdd:cd10437   79 KYKSEKGYNIEP 90
GIY-YIG_LuxR_like cd10451
GIY-YIG domain of LuxR and ArsR family transcriptional regulators, and uncharacterized ...
3-79 1.58e-09

GIY-YIG domain of LuxR and ArsR family transcriptional regulators, and uncharacterized hypothetical proteins found in bacteria; The family includes some bacterial LuxR and ArsR family transcriptional regulators. The a C-terminal conserved domain shows sequence similarity to the N-terminal catalytic GIY-YIG domains of intron-encoded homing endonucleases. Besides, they have an N-terminally fused transcriptional regulators module, comprising the winged helix-turn-helix (wHTH) domain and uncharacterized domain DUF2087. At this point, they are distinct from GIY-YIG homing endonucleases, which typically contain a variety of C-terminally fused nuclease-associated modular DNA-binding domains (NUMODs). Moreover, some key residues relevant to catalysis in GIY-YIG endonucleases are mutanted or absent in this family, which suggests that members in this family might lose the catalytic function that GIY-YIG endonucleases possess. This family also includes many uncharacterized hypothetical proteins that consist of a standalone GIY-YIG like domain.


Pssm-ID: 198398  Cd Length: 101  Bit Score: 51.40  E-value: 1.58e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 504508736   3 GIYPV--TPTSRLYVGASDNLRRRQQEHERARQGGNWSNKALRADLKTYGPDAFSFHVLQR-------TGDLG-ELRRLE 72
Cdd:cd10451   15 GVYAIrnTATGKVFIGSSPNLKGTLNRLRFQLNTGSHPNKELQADWNEHGAEAFTFEVLEElekdedqTFDYKdELKALL 94

                 ....*..
gi 504508736  73 AQWAQQL 79
Cdd:cd10451   95 ELWLEKL 101
grpIintron_endo TIGR01453
group I intron endonuclease; This model represents one subfamily of endonucleases containing ...
1-89 5.19e-08

group I intron endonuclease; This model represents one subfamily of endonucleases containing the endo/excinuclease amino terminal domain, pfam01541 at its amino end. A distinct subfamily includes excinuclease abc subunit c (uvrC). Members of pfam01541 are often termed GIY-YIG endonucleases after conserved motifs near the amino end. This subfamily in this model is found in open reading frames of group I introns in both phage and mitochondria. The closely related endonucleases of phage T4: segA, segB, segC, segD and segE, score below the trusted cutoff for the family.


Pssm-ID: 273636 [Multi-domain]  Cd Length: 214  Bit Score: 49.31  E-value: 5.19e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 504508736    1 MTGIYPVT--PTSRLYVGASDNLRRRQQEHERARQGGNwsNKALRADLKTYGPDAFSFHVLQRTGDLGELRRLEAQWaqq 78
Cdd:TIGR01453   1 KSGIYKITnnINGKIYVGSSVNLEKRLKEHLKLLKKGN--RIKLQKALNKYGWSNFSFEILEYYCNKDDLIERETYY--- 75
                          90
                  ....*....|.
gi 504508736   79 LRAFDPVTGYN 89
Cdd:TIGR01453  76 IKLLNPDAGYN 86
GIY-YIG_HE_Tlr8p_PBC-V_like cd10443
GIY-YIG domain of uncharacterized hypothetical protein found in phycodnavirus PBCV-1 DNA virus, ...
4-90 1.26e-05

GIY-YIG domain of uncharacterized hypothetical protein found in phycodnavirus PBCV-1 DNA virus, T. thermophila Tlr element eoncoding protein Tlr8p, and similar proteins found in bacteria; The family includes a group of diverse uncharacterized hypothetical proteins with a GIY-YIG domain that shows statistically significant similarity to the N-terminal catalytic domains of GIY-YIG family of intron-encoded homing endonuclease I-TevI. Similar to I-TevI, family members from phycodnavirus PBCV-1 DNA virus have nuclease-associated modular DNA-binding domains (NUMODs) and a helix-turn-helix (HTH) domain C-terminally fused to the GIY-YIG domain, which suggests that these PBCV-1 acquired the I-TevI-like homing endonucleases from phages by horizontal gene transfer. This family also includes proteins that appear to connect homing endonucleases with Penelope elements, such as Tetrahymena thermophila Tlr element encoding protein Tlr8p that possess additional N-terminal and central structural regions, followed by a putative superfamily 1 helicase domain and I-TevI-like GIY-YIG domain, but lacks the NUMOD domains and HTH domain. It is suggested that the Tlr8p element could have acquired its GIY-YIG domain w ithin the nucleus of the ciliate cell infected by the Phycodnavirus. Some family members only contain a standalone GIY-YIG domain and their biological functions are unclear.


Pssm-ID: 198390  Cd Length: 90  Bit Score: 41.13  E-value: 1.26e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 504508736   4 IYPVT--PTSRLYVGAS-DNLRRRQQEHERARQGGNwsNKALRADLKTYGPDAFSFHVLQRTGDLGELRRLEAQWAQQLR 80
Cdd:cd10443    3 IYKITnpINGKVYIGQTtRTLEERLKQHKKAASAGP--KKDFYNAIRKYGWDNFTIEILDEGESKEELNELEIEYIAEYN 80
                         90
                 ....*....|
gi 504508736  81 AFDPVtGYNT 90
Cdd:cd10443   81 SNGPN-GYNL 89
GIYc smart00465
GIY-YIG type nucleases (URI domain);
1-94 3.50e-04

GIY-YIG type nucleases (URI domain);


Pssm-ID: 214677 [Multi-domain]  Cd Length: 84  Bit Score: 37.02  E-value: 3.50e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 504508736     1 MTGIYPVT--PTSRLYVGASDNLRRRQQEHerarQGGNWSNKALRADLKTYGpDAFSFHVLQRTGDlgELRRLEAQWaqq 78
Cdd:smart00465   1 KPGVYYITnkKNGKLYVGKAKNLRNRLKRH----FSGSRKGRLLIDALLKYG-GNFEFIILESFDE--SALELEKYL--- 70
                           90
                   ....*....|....*.
gi 504508736    79 LRAFDPvtGYNTQPIR 94
Cdd:smart00465  71 IKEYKP--KYNLLLKG 84
GIY-YIG_SF cd00719
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large ...
13-79 6.22e-03

GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions.


Pssm-ID: 198380 [Multi-domain]  Cd Length: 69  Bit Score: 33.49  E-value: 6.22e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 504508736  13 LYVGASDNLRRRQQEHERARQGGNwsnkalradlkTYGPDAFSFHVLQRTGDLG-ELRRLEAQWAQQL 79
Cdd:cd00719   13 IYVGQTKNLRNRIKEHLRKQRSDW-----------TKGLKPFEILYLEVAPEAEsELLDLEAALIKKL 69
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH