    Gm2237 predicted gene 2237 [ Mus musculus (house mouse) ]

    Gene ID: 100039441, updated on 27-Nov-2024

    Summary

    Official Symbol
    Gm2237 (provided by MGI)
    Official Full Name
    predicted gene 2237 (provided by MGI)
    Primary source
    MGI:MGI:3780407
    See related
    Ensembl:ENSMUSG00000093979; AllianceGenome:MGI:3780407
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Expression
    Broad expression in CNS E18 (RPKM 3.3), CNS E14 (RPKM 2.1) and 18 other tissues

    Genomic context

    Location:
    14 A3; 14 10.05 cM
    Exon count:
    9
    Annotation release  Status             Assembly                      Chr  Location
    RS_2024_02          current            GRCm39 (GCF_000001635.27)     14   NC_000080.7 (19613869..19635866, complement)
    108.20200622        previous assembly  GRCm38.p6 (GCF_000001635.26)  14   NC_000080.6 (19563801..19585798, complement)

    Chromosome 14 - NC_000080.7
    Neighboring genes: predicted gene, 48105; ubiquitin specific peptidase 7 pseudogene; predicted gene 2244; predicted gene 5458; predicted gene, 41102

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    General protein information

    Preferred Names
    uncharacterized protein LOC100039441
    Names
    alpha6-takusan

    NCBI Reference Sequences (RefSeq)

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds.

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.
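    The comparison described above can be sketched in a few lines of Python. This is a minimal illustration, not an NCBI tool: the function names split_accession and version_mismatches are hypothetical, and the annotated-genome accessions in the usage lines are made up for the example rather than taken from this record.

```python
def split_accession(accession):
    """Split an accession.version string such as 'NM_001374119.1' into ('NM_001374119', 1)."""
    base, _, version = accession.partition(".")
    if not base or not version.isdigit():
        raise ValueError(f"not an accession.version string: {accession!r}")
    return base, int(version)


def version_mismatches(curated, annotated):
    """Return {accession: (curated_version, annotated_version)} for accessions
    that appear in both lists with different version numbers."""
    annotated_versions = dict(split_accession(a) for a in annotated)
    mismatches = {}
    for accession in curated:
        base, version = split_accession(accession)
        other = annotated_versions.get(base)
        if other is not None and other != version:
            mismatches[base] = (version, other)
    return mismatches


# Curated accessions as listed in this section.
curated = ["NM_001374119.1", "NM_001374120.1"]
# Hypothetical accessions standing in for the annotated-genome section.
annotated = ["NM_001374119.2", "NM_001374120.1"]
print(version_mismatches(curated, annotated))
```

    Accessions found in only one of the two lists are ignored, since a version comparison is only meaningful when the same accession appears in both sections.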

    mRNA and Protein(s)

    1. NM_001374119.1 → NP_001361048.1  uncharacterized protein LOC100039441 isoform a

      Status: VALIDATED

      Description
      Transcript Variant: This variant (1) encodes the longest isoform (a).
      Source sequence(s)
      AC174797
      Conserved Domains (1) summary
      pfam04822
      Location: 68 → 148
      Takusan
    2. NM_001374120.1 → NP_001361049.1  uncharacterized protein LOC100039441 isoform b

      Status: VALIDATED

      Source sequence(s)
      AC174797
      Consensus CDS
      CCDS88584.1
      UniProtKB/TrEMBL
      A6NAS6, E9Q501, L7N2C2
      Related
      ENSMUSP00000133164.2, ENSMUST00000170694.9
      Conserved Domains (1) summary
      pfam04822
      Location: 48 → 128
      Takusan
    3. NM_001374121.1 → NP_001361050.1  uncharacterized protein LOC100039441 isoform c

      Status: VALIDATED

      Description
      Transcript Variant: This variant (3), as well as variant 4, encodes isoform c.
      Source sequence(s)
      AC174797
      Consensus CDS
      CCDS88583.1
      UniProtKB/TrEMBL
      A6NAS0, A6NAU1
      Conserved Domains (1) summary
      pfam04822
      Location: 1 → 73
      Takusan
    4. NM_001374122.1 → NP_001361051.1  uncharacterized protein LOC100039441 isoform c

      Status: VALIDATED

      Description
      Transcript Variant: This variant (4), as well as variant 3, encodes isoform c.
      Source sequence(s)
      AC174797
      Consensus CDS
      CCDS88583.1
      UniProtKB/TrEMBL
      A6NAS0, A6NAU1
      Conserved Domains (1) summary
      pfam04822
      Location: 1 → 73
      Takusan

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build.

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000080.7 Reference GRCm39 C57BL/6J

      Range
      19613869..19635866 complement

    mRNA and Protein(s)

    1. XM_036158298.1 → XP_036014191.1  uncharacterized protein LOC100039441 isoform X2

      UniProtKB/TrEMBL
      A6NAS6, E9Q501, L7N2C2
      Conserved Domains (1) summary
      pfam04822
      Location: 48 → 128
      Takusan
    2. XM_036158299.1 → XP_036014192.1  uncharacterized protein LOC100039441 isoform X3

      UniProtKB/TrEMBL
      A6NAS0, A6NAU1
      Related
      ENSMUSP00000108214.3, ENSMUST00000112595.3
      Conserved Domains (1) summary
      pfam04822
      Location: 1 → 73
      Takusan
    3. XM_036158297.1 → XP_036014190.1  uncharacterized protein LOC100039441 isoform X1

      UniProtKB/TrEMBL
      E9PW72
      Conserved Domains (1) summary
      pfam04822
      Location: 59 → 139
      Takusan
    4. XM_036158300.1 → XP_036014193.1  uncharacterized protein LOC100039441 isoform X4

    5. XM_017316239.3 → XP_017171728.1  uncharacterized protein LOC100039441 isoform X5

    6. XM_017316240.3 → XP_017171729.1  uncharacterized protein LOC100039441 isoform X6

    Suppressed Reference Sequence(s)

    The following Reference Sequences have been suppressed.

    1. NG_008060.3: Suppressed sequence

      Description
      NG_008060.3: This RefSeq was removed because it is now thought that this gene does encode a protein.