U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Cenpk centromere protein K [ Mus musculus (house mouse) ]

Gene ID: 60411, updated on 27-Nov-2024

Summary

Official Symbol
Cenpkprovided by MGI
Official Full Name
centromere protein Kprovided by MGI
Primary source
MGI:MGI:1926210
See related
Ensembl:ENSMUSG00000021714 AllianceGenome:MGI:1926210
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Mus musculus
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as
Solt; Solzt; Cenp-K; B130045K24Rik; C530004N04Rik
Summary
Acts upstream of or within positive regulation of transcription by RNA polymerase II. Located in nucleus. Is expressed in several structures, including central nervous system and neural retina. Orthologous to human CENPK (centromere protein K). [provided by Alliance of Genome Resources, Nov 2024]
Expression
Biased expression in liver E14 (RPKM 9.5), CNS E11.5 (RPKM 6.6) and 9 other tissues See more
Orthologs
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

See Cenpk in Genome Data Viewer
Location:
13 D1; 13 56.42 cM
Exon count:
12
Annotation release Status Assembly Chr Location
RS_2024_02 current GRCm39 (GCF_000001635.27) 13 NC_000079.7 (104365474..104386130)
108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 13 NC_000079.6 (104228611..104249622)

Chromosome 13 - NC_000079.7Genomic Context describing neighboring genes Neighboring gene STARR-positive B cell enhancer ABC_E385 Neighboring gene STARR-seq mESC enhancer starr_35535 Neighboring gene STARR-positive B cell enhancer ABC_E4090 Neighboring gene tripartite motif-containing 23 Neighboring gene peptidylprolyl isomerase domain and WD repeat containing 1 Neighboring gene STARR-positive B cell enhancer ABC_E9887 Neighboring gene CapStarr-seq enhancer MGSCv37_chr13:105076891-105077195 Neighboring gene ADAM metallopeptidase with thrombospondin type 1 motif 6 Neighboring gene STARR-seq mESC enhancer starr_35536 Neighboring gene STARR-seq mESC enhancer starr_35537 Neighboring gene predicted gene 8680 Neighboring gene STARR-seq mESC enhancer starr_35538 Neighboring gene STARR-seq mESC enhancer starr_35540 Neighboring gene predicted gene, 53810 Neighboring gene CWC27 spliceosome-associated protein

Genomic regions, transcripts, and products

Expression

  • Project title: Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a

Variation

Alleles

Alleles of this type are documented at Mouse Genome Informatics  (MGI)
  • Endonuclease-mediated (2) 

Pathways from PubChem

Interactions

Products Interactant Other Gene Complex Source Pubs Description

General protein information

Preferred Names
centromere protein K
Names
SoxLZ/Sox6 leucine zipper binding protein in testis

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_001377093.1NP_001364022.1  centromere protein K isoform 3

    Status: VALIDATED

    Source sequence(s)
    AC154216
    UniProtKB/Swiss-Prot
    Q3UXA9, Q569Q3, Q8C469, Q9ESN5
    Conserved Domains (1) summary
    pfam11802
    Location:12271
    CENP-K; Centromere-associated protein K
  2. NM_001377094.1NP_001364023.1  centromere protein K isoform 3

    Status: VALIDATED

    Source sequence(s)
    AC154216
    UniProtKB/Swiss-Prot
    Q3UXA9, Q569Q3, Q8C469, Q9ESN5
    Conserved Domains (1) summary
    pfam11802
    Location:12271
    CENP-K; Centromere-associated protein K
  3. NM_001377095.1NP_001364024.1  centromere protein K isoform 3

    Status: VALIDATED

    Source sequence(s)
    AC154216
    UniProtKB/Swiss-Prot
    Q3UXA9, Q569Q3, Q8C469, Q9ESN5
    Conserved Domains (1) summary
    pfam11802
    Location:12271
    CENP-K; Centromere-associated protein K
  4. NM_021790.2NP_068562.1  centromere protein K isoform 1

    See identical proteins and their annotated locations for NP_068562.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (1) uses alternate 5' exon structure, differs in the 5' UTR, and includes an alternate 3' terminal exon compared to variant 2. This transcript also initiates translation at an alternate start codon, resulting in isoform 1, which is longer and has distinct N- and C-termini compared to isoform 2.
    Source sequence(s)
    AB043687, AC154216, AV367397
    Consensus CDS
    CCDS26749.1
    UniProtKB/TrEMBL
    A0A0R4J037
    Related
    ENSMUSP00000022227.7, ENSMUST00000022227.8
    Conserved Domains (1) summary
    pfam11802
    Location:47306
    CENP-K; Centromere-associated protein K
  5. NM_181061.6NP_851406.1  centromere protein K isoform 2

    See identical proteins and their annotated locations for NP_851406.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (2) represents the longest transcript and encodes the shorter isoform (2).
    Source sequence(s)
    AC154216
    Consensus CDS
    CCDS26750.1
    UniProtKB/Swiss-Prot
    Q9ESN5
    Related
    ENSMUSP00000070910.4, ENSMUST00000070761.10
    Conserved Domains (1) summary
    pfam11802
    Location:12220
    CENP-K; Centromere-associated protein K

RNA

  1. NR_075088.2 RNA Sequence

    Status: VALIDATED

    Description
    Transcript Variant: This variant (3) uses an alternate splice site in an internal exon and includes an alternate 3' terminal exon, compared to variant 2. This variant is represented as non-coding due to the presence of an upstream ORF that is predicted to interfere with translation of the longest ORF; translation of the upstream ORF renders the transcript a candidate for nonsense-mediated mRNA decay (NMD).
    Source sequence(s)
    AC154216

RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCm39 C57BL/6J

Genomic

  1. NC_000079.7 Reference GRCm39 C57BL/6J

    Range
    104365474..104386130
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)