U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Links from Protein

    • Showing Current items.

    Emid1 EMI domain containing 1 [ Mus musculus (house mouse) ]

    Gene ID: 140703, updated on 9-Dec-2024

    Summary

    Official Symbol
    Emid1provided by MGI
    Official Full Name
    EMI domain containing 1provided by MGI
    Primary source
    MGI:MGI:2155091
    See related
    Ensembl:ENSMUSG00000034164 AllianceGenome:MGI:2155091
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Also known as
    CO-5; Emu1
    Summary
    Predicted to be an extracellular matrix structural constituent. Located in Golgi apparatus; endoplasmic reticulum; and extracellular matrix. Is expressed in brain; branchial groove; genitourinary system; neural tube; and sensory organ. Orthologous to human EMID1 (EMI domain containing 1). [provided by Alliance of Genome Resources, Dec 2024]
    Expression
    Broad expression in genital fat pad adult (RPKM 13.0), bladder adult (RPKM 8.3) and 23 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See Emid1 in Genome Data Viewer
    Location:
    11 A1; 11 3.26 cM
    Exon count:
    21
    Annotation release Status Assembly Chr Location
    RS_2024_02 current GRCm39 (GCF_000001635.27) 11 NC_000077.7 (5056265..5102322, complement)
    108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 11 NC_000077.6 (5106265..5152322, complement)

    Chromosome 11 - NC_000077.7Genomic Context describing neighboring genes Neighboring gene growth arrest-specific 2 like 1 Neighboring gene RAS-like, family 10, member A Neighboring gene Ewing sarcoma breakpoint region 1 Neighboring gene STARR-positive B cell enhancer ABC_E8383 Neighboring gene STARR-positive B cell enhancer ABC_E11468 Neighboring gene rhomboid domain containing 3 Neighboring gene predicted gene, 46266 Neighboring gene predicted gene, 39572 Neighboring gene CapStarr-seq enhancer MGSCv37_chr11:5050795-5051007 Neighboring gene CapStarr-seq enhancer MGSCv37_chr11:5076723-5076924 Neighboring gene STARR-positive B cell enhancer ABC_E11763 Neighboring gene ribosomal protein, large, P0 pseudogene Neighboring gene kringle containing transmembrane protein 1 Neighboring gene CapStarr-seq enhancer MGSCv37_chr11:5115988-5116097 Neighboring gene predicted gene, 25142

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics  (MGI)
    • Endonuclease-mediated (2) 
    • Targeted (2) 

    General protein information

    Preferred Names
    EMI domain-containing protein 1
    Names
    emilin and multimerin domain-containing protein 1

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001371000.1NP_001357929.1  EMI domain-containing protein 1 isoform 2 precursor

      Status: VALIDATED

      Source sequence(s)
      AL645845
      Conserved Domains (2) summary
      pfam01391
      Location:318370
      Collagen; Collagen triple helix repeat (20 copies)
      pfam07546
      Location:35100
      EMI; EMI domain
    2. NM_001371001.1NP_001357930.1  EMI domain-containing protein 1 isoform 3 precursor

      Status: VALIDATED

      Source sequence(s)
      AL645845
      Conserved Domains (2) summary
      pfam01391
      Location:316368
      Collagen; Collagen triple helix repeat (20 copies)
      pfam07546
      Location:3598
      EMI; EMI domain
    3. NM_001371002.1NP_001357931.1  EMI domain-containing protein 1 isoform 4 precursor

      Status: VALIDATED

      Source sequence(s)
      AL645845
      Conserved Domains (2) summary
      pfam01391
      Location:318370
      Collagen; Collagen triple helix repeat (20 copies)
      pfam07546
      Location:35100
      EMI; EMI domain
    4. NM_001371003.1NP_001357932.1  EMI domain-containing protein 1 isoform 5 precursor

      Status: VALIDATED

      Source sequence(s)
      AL645845
      Consensus CDS
      CCDS88125.1
      Related
      ENSMUSP00000131391.2, ENSMUST00000163299.8
      Conserved Domains (2) summary
      pfam01391
      Location:316368
      Collagen; Collagen triple helix repeat (20 copies)
      pfam07546
      Location:3598
      EMI; EMI domain
    5. NM_001371004.1NP_001357933.1  EMI domain-containing protein 1 isoform 6

      Status: VALIDATED

      Source sequence(s)
      AL645845
      Conserved Domains (2) summary
      pfam01391
      Location:253305
      Collagen; Collagen triple helix repeat (20 copies)
      pfam07546
      Location:235
      EMI; EMI domain
    6. NM_001371005.1NP_001357934.1  EMI domain-containing protein 1 isoform 7 precursor

      Status: VALIDATED

      Source sequence(s)
      AL645845
      Conserved Domains (2) summary
      pfam01391
      Location:257309
      Collagen; Collagen triple helix repeat (20 copies)
      pfam07546
      Location:35100
      EMI; EMI domain
    7. NM_080595.3NP_542162.1  EMI domain-containing protein 1 isoform 1 precursor

      See identical proteins and their annotated locations for NP_542162.1

      Status: VALIDATED

      Source sequence(s)
      AL645845
      Consensus CDS
      CCDS24398.1
      UniProtKB/Swiss-Prot
      Q91VF5
      UniProtKB/TrEMBL
      Q5SUT2
      Related
      ENSMUSP00000061704.6, ENSMUST00000062821.13
      Conserved Domains (2) summary
      pfam01391
      Location:318370
      Collagen; Collagen triple helix repeat (20 copies)
      pfam07546
      Location:35100
      EMI; EMI domain

    RNA

    1. NR_152138.2 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AL645845
    2. NR_163830.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AL645845

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000077.7 Reference GRCm39 C57BL/6J

      Range
      5056265..5102322 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_006514521.5XP_006514584.1  EMI domain-containing protein 1 isoform X3

      See identical proteins and their annotated locations for XP_006514584.1

      Conserved Domains (1) summary
      pfam01391
      Location:233285
      Collagen; Collagen triple helix repeat (20 copies)
    2. XM_030245552.2XP_030101412.1  EMI domain-containing protein 1 isoform X4

      Conserved Domains (1) summary
      pfam01391
      Location:233285
      Collagen; Collagen triple helix repeat (20 copies)
    3. XM_030245553.1XP_030101413.1  EMI domain-containing protein 1 isoform X6

      Conserved Domains (1) summary
      pfam01391
      Location:172224
      Collagen; Collagen triple helix repeat (20 copies)
    4. XM_006514522.5XP_006514585.1  EMI domain-containing protein 1 isoform X3

      See identical proteins and their annotated locations for XP_006514585.1

      Conserved Domains (1) summary
      pfam01391
      Location:233285
      Collagen; Collagen triple helix repeat (20 copies)
    5. XM_006514524.2XP_006514587.1  EMI domain-containing protein 1 isoform X5

      Conserved Domains (1) summary
      pfam01391
      Location:198250
      Collagen; Collagen triple helix repeat (20 copies)
    6. XM_036156325.1XP_036012218.1  EMI domain-containing protein 1 isoform X1

      Conserved Domains (2) summary
      pfam01391
      Location:321370
      Collagen; Collagen triple helix repeat (20 copies)
      pfam07546
      Location:35100
      EMI; EMI domain
    7. XM_011243663.3XP_011241965.1  EMI domain-containing protein 1 isoform X2

      See identical proteins and their annotated locations for XP_011241965.1

      Conserved Domains (2) summary
      pfam01391
      Location:318370
      Collagen; Collagen triple helix repeat (20 copies)
      pfam07546
      Location:35100
      EMI; EMI domain

    RNA

    1. XR_004936745.1 RNA Sequence