U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Links from GEO Profiles

    • Showing Current items.

    Col20a1 collagen, type XX, alpha 1 [ Mus musculus (house mouse) ]

    Gene ID: 73368, updated on 27-Dec-2024

    Summary

    Official Symbol
    Col20a1provided by MGI
    Official Full Name
    collagen, type XX, alpha 1provided by MGI
    Primary source
    MGI:MGI:1920618
    See related
    Ensembl:ENSMUSG00000016356 AllianceGenome:MGI:1920618
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Also known as
    1700051I12Rik
    Summary
    Predicted to be located in extracellular matrix and extracellular region. Predicted to be part of collagen trimer. Is expressed in central nervous system; sensory organ; and skeleton. Orthologous to human COL20A1 (collagen type XX alpha 1 chain). [provided by Alliance of Genome Resources, Dec 2024]
    Expression
    Ubiquitous expression in testis adult (RPKM 8.7), ovary adult (RPKM 3.1) and 26 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See Col20a1 in Genome Data Viewer
    Location:
    2 H4; 2 103.53 cM
    Exon count:
    37
    Annotation release Status Assembly Chr Location
    RS_2024_02 current GRCm39 (GCF_000001635.27) 2 NC_000068.8 (180626629..180659338)
    108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 2 NC_000068.7 (180983323..181017545)

    Chromosome 2 - NC_000068.8Genomic Context describing neighboring genes Neighboring gene Na+/K+ transporting ATPase interacting 4 Neighboring gene STARR-seq mESC enhancer starr_06745 Neighboring gene ARF GTPase activating protein 1 Neighboring gene cholinergic receptor, nicotinic, alpha polypeptide 4 Neighboring gene potassium voltage-gated channel, subfamily Q, member 2

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics  (MGI)

    Pathways from PubChem

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_028518.1NP_082794.1  collagen alpha-1(XX) chain precursor

      See identical proteins and their annotated locations for NP_082794.1

      Status: VALIDATED

      Source sequence(s)
      AL450341, BX649560
      Consensus CDS
      CCDS50844.1
      UniProtKB/Swiss-Prot
      A8WIS2, Q91WC4, Q923P0, Q923P1, Q923P2, Q9D9L7
      UniProtKB/TrEMBL
      F6UFI2
      Related
      ENSMUSP00000153871.2, ENSMUST00000228434.2
      Conserved Domains (4) summary
      pfam01391
      Location:11501221
      Collagen; Collagen triple helix repeat (20 copies)
      cd01482
      Location:176339
      vWA_collagen_alphaI-XII-like; Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different ...
      pfam00041
      Location:750818
      fn3; Fibronectin type III domain
      cl22861
      Location:8401034
      LamG; Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of ...

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000068.8 Reference GRCm39 C57BL/6J

      Range
      180626629..180659338
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_036162614.1XP_036018507.1  collagen alpha-1(XX) chain isoform X1

      UniProtKB/TrEMBL
      A0A2K6EDL8, F6UFI2
      Related
      ENSMUSP00000104484.3, ENSMUST00000108856.9
      Conserved Domains (4) summary
      cd01482
      Location:218381
      vWA_collagen_alphaI-XII-like; Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different ...
      pfam00041
      Location:792860
      fn3; Fibronectin type III domain
      pfam01391
      Location:11951263
      Collagen; Collagen triple helix repeat (20 copies)
      cl22861
      Location:8821076
      LamG; Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of ...
    2. XM_036162615.1XP_036018508.1  collagen alpha-1(XX) chain isoform X2

      UniProtKB/TrEMBL
      F6UFI2
      Conserved Domains (4) summary
      cd01482
      Location:218381
      vWA_collagen_alphaI-XII-like; Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different ...
      pfam00041
      Location:792860
      fn3; Fibronectin type III domain
      pfam01391
      Location:11941262
      Collagen; Collagen triple helix repeat (20 copies)
      cl22861
      Location:8811075
      LamG; Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of ...
    3. XM_017319288.2XP_017174777.1  collagen alpha-1(XX) chain isoform X3

      Conserved Domains (3) summary
      pfam01391
      Location:805876
      Collagen; Collagen triple helix repeat (20 copies)
      pfam00041
      Location:405473
      fn3; Fibronectin type III domain
      cl22861
      Location:495689
      LamG; Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of ...