U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Links from Nucleotide

    • Showing Current items.

    Spen spen family transcription repressor [ Mus musculus (house mouse) ]

    Gene ID: 56381, updated on 27-Nov-2024

    Summary

    Official Symbol
    Spenprovided by MGI
    Official Full Name
    spen family transcription repressorprovided by MGI
    Primary source
    MGI:MGI:1891706
    See related
    Ensembl:ENSMUSG00000040761 AllianceGenome:MGI:1891706
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Also known as
    Mint; mKIAA0929
    Summary
    Enables DNA-binding transcription factor activity; single-stranded DNA binding activity; and transcription corepressor activity. Involved in random inactivation of X chromosome. Acts upstream of or within negative regulation of DNA-templated transcription and positive regulation of DNA-templated transcription. Located in nucleus. Is expressed in central nervous system; genitourinary system; hemolymphoid system; sensory organ; and tooth. Human ortholog(s) of this gene implicated in esophagus squamous cell carcinoma. Orthologous to human SPEN (spen family transcriptional repressor). [provided by Alliance of Genome Resources, Nov 2024]
    Expression
    Ubiquitous expression in thymus adult (RPKM 10.9), adrenal adult (RPKM 8.5) and 28 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See Spen in Genome Data Viewer
    Location:
    4 D3; 4 74.26 cM
    Exon count:
    16
    Annotation release Status Assembly Chr Location
    RS_2024_02 current GRCm39 (GCF_000001635.27) 4 NC_000070.7 (141195199..141265955, complement)
    108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 4 NC_000070.6 (141467885..141538644, complement)

    Chromosome 4 - NC_000070.7Genomic Context describing neighboring genes Neighboring gene steroid receptor associated and regulated protein Neighboring gene predicted gene, 42337 Neighboring gene zinc finger and BTB domain containing 17 Neighboring gene STARR-positive B cell enhancer mm9_chr4:141033664-141033965 Neighboring gene RIKEN cDNA B330016D10 gene Neighboring gene STARR-positive B cell enhancer ABC_E6250 Neighboring gene predicted gene 4123 Neighboring gene STARR-positive B cell enhancer ABC_E6251 Neighboring gene STARR-positive B cell enhancer ABC_E6252 Neighboring gene STARR-seq mESC enhancer starr_11919 Neighboring gene STARR-seq mESC enhancer starr_11920 Neighboring gene STARR-seq mESC enhancer starr_11921 Neighboring gene STARR-positive B cell enhancer ABC_E1289 Neighboring gene STARR-positive B cell enhancer ABC_E1662 Neighboring gene filamin binding LIM protein 1 Neighboring gene STARR-seq mESC enhancer starr_11923

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Bibliography

    GeneRIFs: Gene References Into Functions

    What's a GeneRIF?

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics  (MGI)
    • Endonuclease-mediated (1) 
    • Targeted (5)  1 citation

    Pathways from PubChem

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Gene Ontology Provided by MGI

    Function Evidence Code Pubs
    NOT enables DNA binding IDA
    Inferred from Direct Assay
    more info
    PubMed 
    enables DNA-binding transcription factor activity IDA
    Inferred from Direct Assay
    more info
    PubMed 
    enables RNA binding IEA
    Inferred from Electronic Annotation
    more info
     
    enables RNA polymerase II-specific DNA-binding transcription factor binding ISO
    Inferred from Sequence Orthology
    more info
     
    enables mRNA binding IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    enables protein binding IPI
    Inferred from Physical Interaction
    more info
    PubMed 
    enables single-stranded DNA binding IDA
    Inferred from Direct Assay
    more info
    PubMed 
    enables transcription corepressor activity IDA
    Inferred from Direct Assay
    more info
    PubMed 
    enables transcription corepressor activity ISO
    Inferred from Sequence Orthology
    more info
     
    Component Evidence Code Pubs
    located_in nucleoplasm ISO
    Inferred from Sequence Orthology
    more info
     
    is_active_in nucleus IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    located_in nucleus IDA
    Inferred from Direct Assay
    more info
    PubMed 
    part_of transcription repressor complex ISO
    Inferred from Sequence Orthology
    more info
     

    General protein information

    Preferred Names
    msx2-interacting protein
    Names
    Msx2 interacting nuclear target protein
    SMART/HDAC1-associated repressor protein
    SPEN homolog, transcriptional regulator

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001347235.1NP_001334164.1  msx2-interacting protein isoform 2

      Status: VALIDATED

      Description
      Transcript Variant: This variant (2) lacks an alternate in-frame exon compared to variant 1. The resulting isoform (2) has the same N- and C-termini but is shorter compared to isoform 1.
      Source sequence(s)
      AL670285, AL670446
      Consensus CDS
      CCDS84815.1
      UniProtKB/Swiss-Prot
      Q62504, Q80TN9, Q99PS4, Q9QZW2
      UniProtKB/TrEMBL
      A2ADB1
      Related
      ENSMUSP00000077925.4, ENSMUST00000078886.10
    2. NM_019763.2NP_062737.2  msx2-interacting protein isoform 1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (1) represents the longer transcript and encodes the longer isoform (1).
      Source sequence(s)
      AF156529, AK137647, AL670285, BY726481
      Consensus CDS
      CCDS38940.1
      UniProtKB/Swiss-Prot
      Q62504, Q80TN9, Q99PS4, Q9QZW2
      UniProtKB/TrEMBL
      A2ADB0
      Related
      ENSMUSP00000101412.3, ENSMUST00000105786.3
      Conserved Domains (9) summary
      COG0724
      Location:8145
      RRM; RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis]
      cd12348
      Location:781
      RRM1_SHARP; RNA recognition motif 1 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12349
      Location:338411
      RRM2_SHARP; RNA recognition motif 2 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12350
      Location:438511
      RRM3_SHARP; RNA recognition motif 3 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12351
      Location:512588
      RRM4_SHARP; RNA recognition motif 4 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      TIGR01642
      Location:128596
      U2AF_lg; U2 snRNP auxilliary factor, large subunit, splicing factor
      pfam05466
      Location:16111834
      BASP1; Brain acid soluble protein 1 (BASP1 protein)
      pfam07744
      Location:34883609
      SPOC; SPOC domain
      pfam15984
      Location:26412722
      Collagen_mid; Bacterial collagen, middle region

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000070.7 Reference GRCm39 C57BL/6J

      Range
      141195199..141265955 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_036164267.1XP_036020160.1  msx2-interacting protein isoform X4

      UniProtKB/Swiss-Prot
      Q62504, Q80TN9, Q99PS4, Q9QZW2
      Conserved Domains (8) summary
      PTZ00121
      Location:5801211
      PTZ00121; MAEBL; Provisional
      PHA03247
      Location:15682002
      PHA03247; large tegument protein UL36; Provisional
      pfam07744
      Location:34203583
      SPOC; SPOC domain
      pfam15984
      Location:25802658
      Collagen_mid; Bacterial collagen, middle region
      cd12348
      Location:781
      RRM1_SHARP; RNA recognition motif 1 (RRM1) found in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12349
      Location:338411
      RRM2_SHARP; RNA recognition motif 2 (RRM2) found in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12350
      Location:438511
      RRM3_SHARP; RNA recognition motif 3 (RRM3) found in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cl17169
      Location:512571
      RRM_SF; RNA recognition motif (RRM) superfamily
    2. XM_036164266.1XP_036020159.1  msx2-interacting protein isoform X3

      UniProtKB/Swiss-Prot
      Q62504, Q80TN9, Q99PS4, Q9QZW2
      Conserved Domains (8) summary
      PTZ00121
      Location:5971234
      PTZ00121; MAEBL; Provisional
      PHA03247
      Location:15912025
      PHA03247; large tegument protein UL36; Provisional
      pfam07744
      Location:34433606
      SPOC; SPOC domain
      pfam15984
      Location:26032681
      Collagen_mid; Bacterial collagen, middle region
      cd12348
      Location:781
      RRM1_SHARP; RNA recognition motif 1 (RRM1) found in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12349
      Location:338411
      RRM2_SHARP; RNA recognition motif 2 (RRM2) found in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12350
      Location:438511
      RRM3_SHARP; RNA recognition motif 3 (RRM3) found in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cl17169
      Location:512571
      RRM_SF; RNA recognition motif (RRM) superfamily
    3. XM_006539070.4XP_006539133.1  msx2-interacting protein isoform X2

      UniProtKB/Swiss-Prot
      Q62504, Q80TN9, Q99PS4, Q9QZW2
      Conserved Domains (9) summary
      PTZ00121
      Location:6181249
      PTZ00121; MAEBL; Provisional
      PHA03247
      Location:16062040
      PHA03247; large tegument protein UL36; Provisional
      cd12348
      Location:781
      RRM1_SHARP; RNA recognition motif 1 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12349
      Location:338411
      RRM2_SHARP; RNA recognition motif 2 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12350
      Location:438511
      RRM3_SHARP; RNA recognition motif 3 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12351
      Location:512588
      RRM4_SHARP; RNA recognition motif 4 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      TIGR01642
      Location:128596
      U2AF_lg; U2 snRNP auxilliary factor, large subunit, splicing factor
      pfam07744
      Location:34603619
      SPOC; SPOC domain
      pfam15984
      Location:26182699
      Collagen_mid; Bacterial collagen, middle region
    4. XM_006539073.5XP_006539136.1  msx2-interacting protein isoform X5

      UniProtKB/Swiss-Prot
      Q62504, Q80TN9, Q99PS4, Q9QZW2
      Conserved Domains (8) summary
      PTZ00121
      Location:6351272
      PTZ00121; MAEBL; Provisional
      PHA03247
      Location:16292063
      PHA03247; large tegument protein UL36; Provisional
      cd12348
      Location:781
      RRM1_SHARP; RNA recognition motif 1 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12349
      Location:338411
      RRM2_SHARP; RNA recognition motif 2 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12350
      Location:438511
      RRM3_SHARP; RNA recognition motif 3 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12351
      Location:512588
      RRM4_SHARP; RNA recognition motif 4 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      TIGR01642
      Location:128596
      U2AF_lg; U2 snRNP auxilliary factor, large subunit, splicing factor
      pfam07744
      Location:33603519
      SPOC; SPOC domain
    5. XM_006539069.4XP_006539132.1  msx2-interacting protein isoform X1

      UniProtKB/Swiss-Prot
      Q62504, Q80TN9, Q99PS4, Q9QZW2
      Conserved Domains (9) summary
      PTZ00121
      Location:6351272
      PTZ00121; MAEBL; Provisional
      PHA03247
      Location:16292063
      PHA03247; large tegument protein UL36; Provisional
      cd12348
      Location:781
      RRM1_SHARP; RNA recognition motif 1 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12349
      Location:338411
      RRM2_SHARP; RNA recognition motif 2 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12350
      Location:438511
      RRM3_SHARP; RNA recognition motif 3 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12351
      Location:512588
      RRM4_SHARP; RNA recognition motif 4 in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      TIGR01642
      Location:128596
      U2AF_lg; U2 snRNP auxilliary factor, large subunit, splicing factor
      pfam07744
      Location:34833642
      SPOC; SPOC domain
      pfam15984
      Location:26412722
      Collagen_mid; Bacterial collagen, middle region
    6. XM_036164268.1XP_036020161.1  msx2-interacting protein isoform X6

      UniProtKB/Swiss-Prot
      Q62504, Q80TN9, Q99PS4, Q9QZW2
      Conserved Domains (6) summary
      PTZ00121
      Location:213844
      PTZ00121; MAEBL; Provisional
      PHA03247
      Location:12011635
      PHA03247; large tegument protein UL36; Provisional
      pfam07744
      Location:30533216
      SPOC; SPOC domain
      pfam15984
      Location:22132291
      Collagen_mid; Bacterial collagen, middle region
      cd12350
      Location:33106
      RRM3_SHARP; RNA recognition motif 3 (RRM3) found in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins
      cd12351
      Location:107183
      RRM4_SHARP; RNA recognition motif 4 (RRM4) found in SMART/HDAC1-associated repressor protein (SHARP) and similar proteins