formylglycine-generating enzyme family protein similar to human sulfatase-modifying factor 1 (SUMF1), which oxidizes a cysteine residue in the substrate sulfatase, to an active site 3-oxoalanine residue, also called C(alpha)-formylglycine
Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for ...
91-347
7.24e-80
Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for post-translational sulfatase modification (SUMF1). These proteins are associated with the rare disorder multiple sulfatase deficiency (MSD). The protein product of the SUMF1 gene is FGE, formylglycine (FGly),-generating enzyme, which is a sulfatase. Sulfatases are enzymes essential for degradation and remodelling of sulfate esters, and formylglycine (FGly), the key catalytic in the active site, is unique to sulfatases. FGE is localized to the endoplasmic reticulum (ER) and interacts with and modifies the unfolded form of newly synthesized sulfatases. FGE is a single-domain monomer with a surprising paucity of secondary structure that adopts a unique fold which is stabilized by two Ca2+ ions. The effect of all mutations found in MSD patients is explained by the FGE structure, providing a molecular basis for MSD. A redox-active disulfide bond is present in the active site of FGE. An oxidized cysteine residue, possibly cysteine sulfenic acid, has been detected that may allow formulation of a structure-based mechanism for FGly formation from cysteine residues in all sulfatases. In Mycobacteria and Treponema denticola this enzyme functions as an iron(II)-dependent oxidoreductase.
:
Pssm-ID: 397722 [Multi-domain] Cd Length: 259 Bit Score: 246.26 E-value: 7.24e-80
Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for ...
91-347
7.24e-80
Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for post-translational sulfatase modification (SUMF1). These proteins are associated with the rare disorder multiple sulfatase deficiency (MSD). The protein product of the SUMF1 gene is FGE, formylglycine (FGly),-generating enzyme, which is a sulfatase. Sulfatases are enzymes essential for degradation and remodelling of sulfate esters, and formylglycine (FGly), the key catalytic in the active site, is unique to sulfatases. FGE is localized to the endoplasmic reticulum (ER) and interacts with and modifies the unfolded form of newly synthesized sulfatases. FGE is a single-domain monomer with a surprising paucity of secondary structure that adopts a unique fold which is stabilized by two Ca2+ ions. The effect of all mutations found in MSD patients is explained by the FGE structure, providing a molecular basis for MSD. A redox-active disulfide bond is present in the active site of FGE. An oxidized cysteine residue, possibly cysteine sulfenic acid, has been detected that may allow formulation of a structure-based mechanism for FGly formation from cysteine residues in all sulfatases. In Mycobacteria and Treponema denticola this enzyme functions as an iron(II)-dependent oxidoreductase.
Pssm-ID: 397722 [Multi-domain] Cd Length: 259 Bit Score: 246.26 E-value: 7.24e-80
gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to ...
79-352
1.49e-36
gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. This model represents a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture than that found in Flavobacterium johnsoniae and related species (represented by (TIGR03525). Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.
Pssm-ID: 274632 [Multi-domain] Cd Length: 344 Bit Score: 136.21 E-value: 1.49e-36
selenoneine synthase SenA; Selenoneine is the selenium analog of ergothioneine, a ...
76-250
5.72e-24
selenoneine synthase SenA; Selenoneine is the selenium analog of ergothioneine, a sulfur-containing derivative of a hercynine, which derives from histidine by trimethylation of its amino group. SenA, a homolog of ergothioneine biosynthesis protein EgtB, completes selenoneine biosynthesis in a pathway that also requires a SelD protein to prepare selenophosphate, EgtD to prepare hercynine, and the selenosugar-producing glycosyltransferase SenB (TIGR04348). SenA catalyzes selenium-carbon bond formation between hercynine and selenosugar to produce selenoneine. Other enzymes that create carbon-selenium bonds include SelA, involved in selenocysteine biosynthesis, SelU, involved in selenouridine biosynthesis, and SbtM, a radical SAM enzyme involved in post-translational modification of the RiPP precursor SbtA (see PMID:34730336).
Pssm-ID: 469092 [Multi-domain] Cd Length: 384 Bit Score: 102.19 E-value: 5.72e-24
Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for ...
91-347
7.24e-80
Sulfatase-modifying factor enzyme 1; This domain is found in eukaryotic proteins required for post-translational sulfatase modification (SUMF1). These proteins are associated with the rare disorder multiple sulfatase deficiency (MSD). The protein product of the SUMF1 gene is FGE, formylglycine (FGly),-generating enzyme, which is a sulfatase. Sulfatases are enzymes essential for degradation and remodelling of sulfate esters, and formylglycine (FGly), the key catalytic in the active site, is unique to sulfatases. FGE is localized to the endoplasmic reticulum (ER) and interacts with and modifies the unfolded form of newly synthesized sulfatases. FGE is a single-domain monomer with a surprising paucity of secondary structure that adopts a unique fold which is stabilized by two Ca2+ ions. The effect of all mutations found in MSD patients is explained by the FGE structure, providing a molecular basis for MSD. A redox-active disulfide bond is present in the active site of FGE. An oxidized cysteine residue, possibly cysteine sulfenic acid, has been detected that may allow formulation of a structure-based mechanism for FGly formation from cysteine residues in all sulfatases. In Mycobacteria and Treponema denticola this enzyme functions as an iron(II)-dependent oxidoreductase.
Pssm-ID: 397722 [Multi-domain] Cd Length: 259 Bit Score: 246.26 E-value: 7.24e-80
gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to ...
79-352
1.49e-36
gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. This model represents a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture than that found in Flavobacterium johnsoniae and related species (represented by (TIGR03525). Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.
Pssm-ID: 274632 [Multi-domain] Cd Length: 344 Bit Score: 136.21 E-value: 1.49e-36
ergothioneine biosynthesis protein EgtB; Members of this family include EgtB, and enzyme of ...
76-307
7.02e-32
ergothioneine biosynthesis protein EgtB; Members of this family include EgtB, and enzyme of the ergothioneine biosynthesis, as found in numerous Actinobacteria. Characterized homologs to this family include a formylglycine-generating enzyme that serves as a maturase for an aerobic sulfatase (cf. the radical SAM enzymes that serve as anaerobic sulfatase maturases). [Biosynthesis of cofactors, prosthetic groups, and carriers, Glutathione and analogs]
Pssm-ID: 274581 [Multi-domain] Cd Length: 406 Bit Score: 124.75 E-value: 7.02e-32
gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to ...
91-344
3.22e-24
gliding motility-associated lipoprotein GldK; Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.
Pssm-ID: 274629 Cd Length: 450 Bit Score: 103.79 E-value: 3.22e-24
selenoneine synthase SenA; Selenoneine is the selenium analog of ergothioneine, a ...
76-250
5.72e-24
selenoneine synthase SenA; Selenoneine is the selenium analog of ergothioneine, a sulfur-containing derivative of a hercynine, which derives from histidine by trimethylation of its amino group. SenA, a homolog of ergothioneine biosynthesis protein EgtB, completes selenoneine biosynthesis in a pathway that also requires a SelD protein to prepare selenophosphate, EgtD to prepare hercynine, and the selenosugar-producing glycosyltransferase SenB (TIGR04348). SenA catalyzes selenium-carbon bond formation between hercynine and selenosugar to produce selenoneine. Other enzymes that create carbon-selenium bonds include SelA, involved in selenocysteine biosynthesis, SelU, involved in selenouridine biosynthesis, and SbtM, a radical SAM enzyme involved in post-translational modification of the RiPP precursor SbtA (see PMID:34730336).
Pssm-ID: 469092 [Multi-domain] Cd Length: 384 Bit Score: 102.19 E-value: 5.72e-24
Database: CDSEARCH/cdd Low complexity filter: no Composition Based Adjustment: yes E-value threshold: 0.01
References:
Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
of the residues that compose this conserved feature have been mapped to the query sequence.
Click on the triangle to view details about the feature, including a multiple sequence alignment
of your query sequence and the protein sequences used to curate the domain model,
where hash marks (#) above the aligned sequences show the location of the conserved feature residues.
The thumbnail image, if present, provides an approximate view of the feature's location in 3 dimensions.
Click on the triangle for interactive 3D structure viewing options.
Functional characterization of the conserved domain architecture found on the query.
Click here to see more details.
This image shows a graphical summary of conserved domains identified on the query sequence.
The Show Concise/Full Display button at the top of the page can be used to select the desired level of detail: only top scoring hits
(labeled illustration) or all hits
(labeled illustration).
Domains are color coded according to superfamilies
to which they have been assigned. Hits with scores that pass a domain-specific threshold
(specific hits) are drawn in bright colors.
Others (non-specific hits) and
superfamily placeholders are drawn in pastel colors.
if a domain or superfamily has been annotated with functional sites (conserved features),
they are mapped to the query sequence and indicated through sets of triangles
with the same color and shade of the domain or superfamily that provides the annotation. Mouse over the colored bars or triangles to see descriptions of the domains and features.
click on the bars or triangles to view your query sequence embedded in a multiple sequence alignment of the proteins used to develop the corresponding domain model.
The table lists conserved domains identified on the query sequence. Click on the plus sign (+) on the left to display full descriptions, alignments, and scores.
Click on the domain model's accession number to view the multiple sequence alignment of the proteins used to develop the corresponding domain model.
To view your query sequence embedded in that multiple sequence alignment, click on the colored bars in the Graphical Summary portion of the search results page,
or click on the triangles, if present, that represent functional sites (conserved features)
mapped to the query sequence.
Concise Display shows only the best scoring domain model, in each hit category listed below except non-specific hits, for each region on the query sequence.
(labeled illustration) Standard Display shows only the best scoring domain model from each source, in each hit category listed below for each region on the query sequence.
(labeled illustration) Full Display shows all domain models, in each hit category below, that meet or exceed the RPS-BLAST threshold for statistical significance.
(labeled illustration) Four types of hits can be shown, as available,
for each region on the query sequence:
specific hits meet or exceed a domain-specific e-value threshold
(illustrated example)
and represent a very high confidence that the query sequence belongs to the same protein family as the sequences use to create the domain model
non-specific hits
meet or exceed the RPS-BLAST threshold for statistical significance (default E-value cutoff of 0.01, or an E-value selected by user via the
advanced search options)
the domain superfamily to which the specific and non-specific hits belong
multi-domain models that were computationally detected and are likely to contain multiple single domains
Retrieve proteins that contain one or more of the domains present in the query sequence, using the Conserved Domain Architecture Retrieval Tool
(CDART).
Modify your query to search against a different database and/or use advanced search options