glycosyltransferase family protein may synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein
glycosyltransferase family 1 and related proteins with GTB topology; Glycosyltransferases ...
4-365
2.99e-54
glycosyltransferase family 1 and related proteins with GTB topology; Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. The structures of the formed glycoconjugates are extremely diverse, reflecting a wide range of biological functions. The members of this family share a common GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility.
The actual alignment was detected with superfamily member TIGR04516:
Pssm-ID: 471961 Cd Length: 418 Bit Score: 184.38 E-value: 2.99e-54
glycosyltransferase, activator-dependent family; Many biosynthesis clusters for secondary ...
4-365
2.99e-54
glycosyltransferase, activator-dependent family; Many biosynthesis clusters for secondary metabolites feature a glycosyltransferase gene next to a P450 homolog, often with the P450 lacking a critical heme-binding Cys. These P540-derived sequences seem to be allosteric activators of glycosyltransferases such as the member of this family. This model describes a set of related glycosyltransferases, many of which can be recognized as activator-dependent from genomic context.
Pssm-ID: 275309 Cd Length: 418 Bit Score: 184.38 E-value: 2.99e-54
UDP-glycosyltransferases and similar proteins; This family includes the Gtfs, a group of ...
4-363
3.72e-48
UDP-glycosyltransferases and similar proteins; This family includes the Gtfs, a group of homologous glycosyltransferases involved in the final stages of the biosynthesis of antibiotics vancomycin and related chloroeremomycin. Gtfs transfer sugar moieties from an activated NDP-sugar donor to the oxidatively cross-linked heptapeptide core of vancomycin group antibiotics. The core structure is important for the bioactivity of the antibiotics.
Pssm-ID: 340817 [Multi-domain] Cd Length: 404 Bit Score: 167.73 E-value: 3.72e-48
Protein of unknown function (DUF1205); This family represents a conserved region of unknown ...
164-259
1.31e-33
Protein of unknown function (DUF1205); This family represents a conserved region of unknown function within bacterial glycosyl transferases. Many family members contain pfam03033.
Pssm-ID: 369050 Cd Length: 95 Bit Score: 120.31 E-value: 1.31e-33
Semialdehyde dehydrogenase, NAD binding domain; The semialdehyde dehydrogenase family is found ...
2-55
2.15e-03
Semialdehyde dehydrogenase, NAD binding domain; The semialdehyde dehydrogenase family is found in N-acetyl-glutamine semialdehyde dehydrogenase (AgrC), which is involved in arginine biosynthesis, and aspartate-semialdehyde dehydrogenase, an enzyme involved in the biosynthesis of various amino acids from aspartate. This family is also found in yeast and fungal Arg5,6 protein, which is cleaved into the enzymes N-acety-gamma-glutamyl-phosphate reductase and acetylglutamate kinase. These are also involved in arginine biosynthesis. All proteins in this entry contain a NAD binding region of semialdehyde dehydrogenase.
Pssm-ID: 214863 [Multi-domain] Cd Length: 123 Bit Score: 37.53 E-value: 2.15e-03
glycosyltransferase, activator-dependent family; Many biosynthesis clusters for secondary ...
4-365
2.99e-54
glycosyltransferase, activator-dependent family; Many biosynthesis clusters for secondary metabolites feature a glycosyltransferase gene next to a P450 homolog, often with the P450 lacking a critical heme-binding Cys. These P540-derived sequences seem to be allosteric activators of glycosyltransferases such as the member of this family. This model describes a set of related glycosyltransferases, many of which can be recognized as activator-dependent from genomic context.
Pssm-ID: 275309 Cd Length: 418 Bit Score: 184.38 E-value: 2.99e-54
UDP-glycosyltransferases and similar proteins; This family includes the Gtfs, a group of ...
4-363
3.72e-48
UDP-glycosyltransferases and similar proteins; This family includes the Gtfs, a group of homologous glycosyltransferases involved in the final stages of the biosynthesis of antibiotics vancomycin and related chloroeremomycin. Gtfs transfer sugar moieties from an activated NDP-sugar donor to the oxidatively cross-linked heptapeptide core of vancomycin group antibiotics. The core structure is important for the bioactivity of the antibiotics.
Pssm-ID: 340817 [Multi-domain] Cd Length: 404 Bit Score: 167.73 E-value: 3.72e-48
Protein of unknown function (DUF1205); This family represents a conserved region of unknown ...
164-259
1.31e-33
Protein of unknown function (DUF1205); This family represents a conserved region of unknown function within bacterial glycosyl transferases. Many family members contain pfam03033.
Pssm-ID: 369050 Cd Length: 95 Bit Score: 120.31 E-value: 1.31e-33
glycosyltransferase, MGT family; This model describes the MGT (macroside glycosyltransferase) ...
11-368
1.55e-27
glycosyltransferase, MGT family; This model describes the MGT (macroside glycosyltransferase) subfamily of the UDP-glucuronosyltransferase family. Members include a number of glucosyl transferases for macrolide antibiotic inactivation, but also include transferases of glucose-related sugars for macrolide antibiotic production. [Cellular processes, Toxin production and resistance]
Pssm-ID: 273616 [Multi-domain] Cd Length: 392 Bit Score: 111.70 E-value: 1.55e-27
Semialdehyde dehydrogenase, NAD binding domain; The semialdehyde dehydrogenase family is found ...
2-55
2.15e-03
Semialdehyde dehydrogenase, NAD binding domain; The semialdehyde dehydrogenase family is found in N-acetyl-glutamine semialdehyde dehydrogenase (AgrC), which is involved in arginine biosynthesis, and aspartate-semialdehyde dehydrogenase, an enzyme involved in the biosynthesis of various amino acids from aspartate. This family is also found in yeast and fungal Arg5,6 protein, which is cleaved into the enzymes N-acety-gamma-glutamyl-phosphate reductase and acetylglutamate kinase. These are also involved in arginine biosynthesis. All proteins in this entry contain a NAD binding region of semialdehyde dehydrogenase.
Pssm-ID: 214863 [Multi-domain] Cd Length: 123 Bit Score: 37.53 E-value: 2.15e-03
Database: CDSEARCH/cdd Low complexity filter: no Composition Based Adjustment: yes E-value threshold: 0.01
References:
Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
of the residues that compose this conserved feature have been mapped to the query sequence.
Click on the triangle to view details about the feature, including a multiple sequence alignment
of your query sequence and the protein sequences used to curate the domain model,
where hash marks (#) above the aligned sequences show the location of the conserved feature residues.
The thumbnail image, if present, provides an approximate view of the feature's location in 3 dimensions.
Click on the triangle for interactive 3D structure viewing options.
Functional characterization of the conserved domain architecture found on the query.
Click here to see more details.
This image shows a graphical summary of conserved domains identified on the query sequence.
The Show Concise/Full Display button at the top of the page can be used to select the desired level of detail: only top scoring hits
(labeled illustration) or all hits
(labeled illustration).
Domains are color coded according to superfamilies
to which they have been assigned. Hits with scores that pass a domain-specific threshold
(specific hits) are drawn in bright colors.
Others (non-specific hits) and
superfamily placeholders are drawn in pastel colors.
if a domain or superfamily has been annotated with functional sites (conserved features),
they are mapped to the query sequence and indicated through sets of triangles
with the same color and shade of the domain or superfamily that provides the annotation. Mouse over the colored bars or triangles to see descriptions of the domains and features.
click on the bars or triangles to view your query sequence embedded in a multiple sequence alignment of the proteins used to develop the corresponding domain model.
The table lists conserved domains identified on the query sequence. Click on the plus sign (+) on the left to display full descriptions, alignments, and scores.
Click on the domain model's accession number to view the multiple sequence alignment of the proteins used to develop the corresponding domain model.
To view your query sequence embedded in that multiple sequence alignment, click on the colored bars in the Graphical Summary portion of the search results page,
or click on the triangles, if present, that represent functional sites (conserved features)
mapped to the query sequence.
Concise Display shows only the best scoring domain model, in each hit category listed below except non-specific hits, for each region on the query sequence.
(labeled illustration) Standard Display shows only the best scoring domain model from each source, in each hit category listed below for each region on the query sequence.
(labeled illustration) Full Display shows all domain models, in each hit category below, that meet or exceed the RPS-BLAST threshold for statistical significance.
(labeled illustration) Four types of hits can be shown, as available,
for each region on the query sequence:
specific hits meet or exceed a domain-specific e-value threshold
(illustrated example)
and represent a very high confidence that the query sequence belongs to the same protein family as the sequences use to create the domain model
non-specific hits
meet or exceed the RPS-BLAST threshold for statistical significance (default E-value cutoff of 0.01, or an E-value selected by user via the
advanced search options)
the domain superfamily to which the specific and non-specific hits belong
multi-domain models that were computationally detected and are likely to contain multiple single domains
Retrieve proteins that contain one or more of the domains present in the query sequence, using the Conserved Domain Architecture Retrieval Tool
(CDART).
Modify your query to search against a different database and/or use advanced search options