U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Links from GEO Profiles

    • Showing Current items.

    WDR33 WD repeat domain 33 [ Homo sapiens (human) ]

    Gene ID: 55339, updated on 10-Dec-2024

    Summary

    Official Symbol
    WDR33provided by HGNC
    Official Full Name
    WD repeat domain 33provided by HGNC
    Primary source
    HGNC:HGNC:25651
    See related
    Ensembl:ENSG00000136709 MIM:618082; AllianceGenome:HGNC:25651
    Gene type
    protein coding
    RefSeq status
    REVIEWED
    Organism
    Homo sapiens
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
    Also known as
    NET14; WDC146
    Summary
    This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This gene is highly expressed in testis and the protein is localized to the nucleus. This gene may play important roles in the mechanisms of cytodifferentiation and/or DNA recombination. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene. [provided by RefSeq, Jul 2008]
    Expression
    Ubiquitous expression in testis (RPKM 5.8), lymph node (RPKM 4.9) and 25 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See WDR33 in Genome Data Viewer
    Location:
    2q14.3
    Exon count:
    24
    Annotation release Status Assembly Chr Location
    RS_2024_08 current GRCh38.p14 (GCF_000001405.40) 2 NC_000002.12 (127701027..127811171, complement)
    RS_2024_08 current T2T-CHM13v2.0 (GCF_009914755.1) 2 NC_060926.1 (128136260..128246425, complement)
    RS_2024_09 previous assembly GRCh37.p13 (GCF_000001405.25) 2 NC_000002.11 (128458601..128568745, complement)

    Chromosome 2 - NC_000002.12Genomic Context describing neighboring genes Neighboring gene myosin VIIB Neighboring gene uncharacterized LOC101927834 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 11934 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 11935 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 16498 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:128393428-128394374 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:128394375-128395321 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:128395322-128396268 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:128396269-128397214 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:128406535-128407262 Neighboring gene Sharpr-MPRA regulatory region 7437 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:128407989-128408715 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:128408716-128409441 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 16499 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 16500 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:128416288-128416788 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:128419363-128420360 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:128420398-128420938 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:128420939-128421478 Neighboring gene LIM zinc finger domain containing 2 Neighboring gene G protein-coupled receptor 17 Neighboring gene BRD4-independent group 4 enhancer GRCh37_chr2:128457593-128458792 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 11936 Neighboring gene CDK7 strongly-dependent group 2 enhancer GRCh37_chr2:128478937-128480136 Neighboring gene SFT2 domain containing 3 Neighboring gene MPRA-validated peak3854 silencer Neighboring gene MPRA-validated peak3855 silencer Neighboring gene MPRA-validated peak3856 silencer Neighboring gene MPRA-validated peak3857 silencer Neighboring gene H3K4me1 hESC enhancer GRCh37_chr2:128566820-128567727 Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr2:128567810-128568750 Neighboring gene uncharacterized LOC124907885 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 16505 Neighboring gene MPRA-validated peak3858 silencer Neighboring gene RNY4 pseudogene 7 Neighboring gene ZFP91 pseudogene 1

    Genomic regions, transcripts, and products

    Expression

    • Project title: Tissue-specific circular RNA induction during human fetal development
    • Description: 35 human fetal samples from 6 tissues (3 - 7 replicates per tissue) collected between 10 and 20 weeks gestational time were sequenced using Illumina TruSeq Stranded Total RNA
    • BioProject: PRJNA270632
    • Publication: PMID 26076956
    • Analysis date: Mon Apr 2 22:54:59 2018

    Bibliography

    GeneRIFs: Gene References Into Functions

    What's a GeneRIF?

    Phenotypes

    EBI GWAS Catalog

    Description
    Gene network analysis in a pediatric cohort identifies novel lung function genes.
    EBI GWAS Catalog
    Genetics of coronary artery calcification among African Americans, a meta-analysis.
    EBI GWAS Catalog

    Pathways from PubChem

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Clone Names

    • FLJ11294

    Gene Ontology Provided by GOA

    Function Evidence Code Pubs
    enables RNA binding HDA PubMed 
    Process Evidence Code Pubs
    involved_in mRNA 3'-end processing IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in postreplication repair NAS
    Non-traceable Author Statement
    more info
    PubMed 
    involved_in spermatogenesis NAS
    Non-traceable Author Statement
    more info
    PubMed 
    Component Evidence Code Pubs
    part_of collagen trimer IEA
    Inferred from Electronic Annotation
    more info
     
    located_in fibrillar center IDA
    Inferred from Direct Assay
    more info
     
    part_of mRNA cleavage and polyadenylation specificity factor complex IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    located_in nucleoplasm IDA
    Inferred from Direct Assay
    more info
     
    located_in nucleoplasm TAS
    Traceable Author Statement
    more info
     
    located_in nucleus IDA
    Inferred from Direct Assay
    more info
    PubMed 

    General protein information

    Preferred Names
    pre-mRNA 3' end processing protein WDR33
    Names
    WD repeat-containing protein 33
    WD repeat-containing protein WDC146
    WD repeat-containing protein of 146 kDa

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001006622.3NP_001006623.1  pre-mRNA 3' end processing protein WDR33 isoform 2

      See identical proteins and their annotated locations for NP_001006623.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (2) represents the shortest transcript. It lacks multiple 3' exons but has an alternate 3' segment, as compared to variant 1. The encoded isoform 2 has a shorter and distinct C-terminus, has only two WD repeats, and lacks the collagen-like and GPR domains, compared to isoform 1.
      Source sequence(s)
      AI039494, AK002156, BC005401, BM673679
      Consensus CDS
      CCDS46407.1
      UniProtKB/Swiss-Prot
      Q9C0J8
      Related
      ENSP00000387186.3, ENST00000409658.7
      Conserved Domains (2) summary
      sd00039
      Location:122158
      7WD40; WD40 repeat [structural motif]
      cl25539
      Location:119205
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    2. NM_001006623.4NP_001006624.1  pre-mRNA 3' end processing protein WDR33 isoform 3

      See identical proteins and their annotated locations for NP_001006624.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (3) lacks multiple 3' exons but has an alternate 3' exon, as compared to variant 1. It encodes the shortest isoform (3), which has a shorter and distinct C-terminus, as compared to isoform 1, has only two WD repeats, and lacks the collagen-like and GPR domains.
      Source sequence(s)
      BC068484, BU597855
      Consensus CDS
      CCDS42746.1
      UniProtKB/Swiss-Prot
      Q9C0J8
      Related
      ENSP00000376730.1, ENST00000393006.5
      Conserved Domains (3) summary
      COG2319
      Location:104251
      WD40; WD40 repeat [General function prediction only]
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]
      cl02567
      Location:119230
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    3. NM_018383.5NP_060853.3  pre-mRNA 3' end processing protein WDR33 isoform 1

      See identical proteins and their annotated locations for NP_060853.3

      Status: REVIEWED

      Description
      Transcript Variant: This variant (1) represents the longest transcript and encodes the longest isoform (1) with eight WD repeats, a collagen-like domain, and a GPR (Gly, Pro and Arg)-rich domain at the N-terminal, central, and C-terminal portion, respectively.
      Source sequence(s)
      AB044749, AC006011, AL834365, BC010283, BQ896760, DA768238
      Consensus CDS
      CCDS2150.1
      UniProtKB/Swiss-Prot
      Q05DP8, Q53FG9, Q587J1, Q69YF7, Q6NUQ0, Q9C0J8, Q9NUL1
      Related
      ENSP00000325377.3, ENST00000322313.9
      Conserved Domains (5) summary
      pfam01391
      Location:730791
      Collagen; Collagen triple helix repeat (20 copies)
      COG2319
      Location:121405
      WD40; WD40 repeat [General function prediction only]
      cd00200
      Location:121402
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
      pfam09606
      Location:594989
      Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]

    RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2024_08

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCh38.p14 Primary Assembly

    Genomic

    1. NC_000002.12 Reference GRCh38.p14 Primary Assembly

      Range
      127701027..127811171 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_005263697.4XP_005263754.1  pre-mRNA 3' end processing protein WDR33 isoform X2

      Conserved Domains (4) summary
      pfam01391
      Location:730791
      Collagen; Collagen triple helix repeat (20 copies)
      COG2319
      Location:121405
      WD40; WD40 repeat [General function prediction only]
      cd00200
      Location:121402
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]
    2. XM_011511436.2XP_011509738.1  pre-mRNA 3' end processing protein WDR33 isoform X1

      See identical proteins and their annotated locations for XP_011509738.1

      UniProtKB/Swiss-Prot
      Q05DP8, Q53FG9, Q587J1, Q69YF7, Q6NUQ0, Q9C0J8, Q9NUL1
      Conserved Domains (5) summary
      pfam01391
      Location:730791
      Collagen; Collagen triple helix repeat (20 copies)
      COG2319
      Location:121405
      WD40; WD40 repeat [General function prediction only]
      cd00200
      Location:121402
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
      pfam09606
      Location:594989
      Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]
    3. XM_017004436.3XP_016859925.1  pre-mRNA 3' end processing protein WDR33 isoform X3

    Alternate T2T-CHM13v2.0

    Genomic

    1. NC_060926.1 Alternate T2T-CHM13v2.0

      Range
      128136260..128246425 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_054342825.1XP_054198800.1  pre-mRNA 3' end processing protein WDR33 isoform X1

      UniProtKB/Swiss-Prot
      Q05DP8, Q53FG9, Q587J1, Q69YF7, Q6NUQ0, Q9C0J8, Q9NUL1
    2. XM_054342826.1XP_054198801.1  pre-mRNA 3' end processing protein WDR33 isoform X2

    3. XM_054342827.1XP_054198802.1  pre-mRNA 3' end processing protein WDR33 isoform X3