U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Links from GEO Profiles

    • Showing Current items.

    Gtf2h2 general transcription factor II H, polypeptide 2 [ Mus musculus (house mouse) ]

    Gene ID: 23894, updated on 27-Nov-2024

    Summary

    Official Symbol
    Gtf2h2provided by MGI
    Official Full Name
    general transcription factor II H, polypeptide 2provided by MGI
    Primary source
    MGI:MGI:1345669
    See related
    Ensembl:ENSMUSG00000021639 AllianceGenome:MGI:1345669
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Also known as
    p44; 44kDa; Btf2p44; BTF2 p44; BTF2-p44
    Summary
    Predicted to enable RNA polymerase II general transcription initiation factor activity. Predicted to be involved in nucleotide-excision repair; regulation of transcription by RNA polymerase II; and transcription by RNA polymerase II. Predicted to be located in nuclear speck. Predicted to be part of core TFIIH complex portion of holo TFIIH complex and transcription factor TFIID complex. Is expressed in several structures, including alimentary system; early conceptus; genitourinary system; integumental system; and nervous system. Orthologous to several human genes including GTF2H2C (GTF2H2 family member C) and GTF2H2 (general transcription factor IIH subunit 2). [provided by Alliance of Genome Resources, Nov 2024]
    Expression
    Ubiquitous expression in CNS E11.5 (RPKM 10.7), limb E14.5 (RPKM 9.8) and 28 other tissues See more
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See Gtf2h2 in Genome Data Viewer
    Location:
    13 D1; 13 53.21 cM
    Exon count:
    19
    Annotation release Status Assembly Chr Location
    RS_2024_02 current GRCm39 (GCF_000001635.27) 13 NC_000079.7 (100596726..100629123, complement)
    108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 13 NC_000079.6 (100460218..100492609, complement)

    Chromosome 13 - NC_000079.7Genomic Context describing neighboring genes Neighboring gene ubiquitin specific peptidase 38 pseudogene Neighboring gene ribosomal protein L31, pseudogene 13 Neighboring gene NLR family, apoptosis inhibitory protein 1 Neighboring gene STARR-positive B cell enhancer ABC_E1467 Neighboring gene occludin Neighboring gene predicted gene, 57598 Neighboring gene eukaryotic translation initiation factor 4, gamma 1 pseudogene

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics  (MGI)
    • Endonuclease-mediated (2) 
    • Targeted (1) 

    Pathways from PubChem

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Gene Ontology Provided by MGI

    Function Evidence Code Pubs
    enables RNA polymerase II general transcription initiation factor activity ISO
    Inferred from Sequence Orthology
    more info
     
    enables zinc ion binding IEA
    Inferred from Electronic Annotation
    more info
     
    Process Evidence Code Pubs
    involved_in nucleotide-excision repair IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    involved_in nucleotide-excision repair IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in regulation of transcription by RNA polymerase II IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    involved_in transcription by RNA polymerase II ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    Component Evidence Code Pubs
    part_of core TFIIH complex portion of holo TFIIH complex ISO
    Inferred from Sequence Orthology
    more info
     
    part_of core TFIIH complex portion of holo TFIIH complex ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    located_in nuclear speck ISO
    Inferred from Sequence Orthology
    more info
     
    located_in nucleus ISO
    Inferred from Sequence Orthology
    more info
     
    located_in nucleus ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    part_of transcription factor TFIID complex ISO
    Inferred from Sequence Orthology
    more info
     
    part_of transcription factor TFIIH core complex ISO
    Inferred from Sequence Orthology
    more info
     
    part_of transcription factor TFIIH holo complex IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    part_of transcription factor TFIIH holo complex ISO
    Inferred from Sequence Orthology
    more info
     
    part_of transcription factor TFIIH holo complex ISS
    Inferred from Sequence or Structural Similarity
    more info
     

    General protein information

    Preferred Names
    general transcription factor IIH subunit 2
    Names
    TFIIH basal transcription factor complex p44 subunit
    basal transcription factor 2, p44 subunit
    basic transcription factor 2 44 kDa subunit
    general transcription factor II H, polypeptide 2 (44 kDa subunit)
    general transcription factor IIH, polypeptide 2 (44 kDa subunit)

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001360706.1NP_001347635.1  general transcription factor IIH subunit 2 isoform 2

      Status: VALIDATED

      Source sequence(s)
      AC158536
      UniProtKB/TrEMBL
      Q7TPV0
      Conserved Domains (2) summary
      cl06838
      Location:289389
      C1_4; TFIIH C1-like domain
      pfam04056
      Location:64255
      Ssl1; Ssl1-like
    2. NM_022011.4NP_071294.3  general transcription factor IIH subunit 2 isoform 1

      See identical proteins and their annotated locations for NP_071294.3

      Status: VALIDATED

      Source sequence(s)
      BC053382, BY277671, BY480509, DV662169
      Consensus CDS
      CCDS26730.1
      UniProtKB/Swiss-Prot
      Q9JIB4
      UniProtKB/TrEMBL
      Q7TPV0
      Related
      ENSMUSP00000065228.8, ENSMUST00000066984.14
      Conserved Domains (2) summary
      pfam04056
      Location:64255
      Ssl1; Ssl1-like
      cl06838
      Location:289389
      C1_4; TFIIH C1-like domain

    RNA

    1. NR_153794.1 RNA Sequence

      Status: VALIDATED

      Source sequence(s)
      AC158536
      Related
      ENSMUST00000145266.8

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000079.7 Reference GRCm39 C57BL/6J

      Range
      100596726..100629123 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_006517662.3XP_006517725.1  general transcription factor IIH subunit 2 isoform X1

      See identical proteins and their annotated locations for XP_006517725.1

      UniProtKB/TrEMBL
      Q7TPV0
      Conserved Domains (2) summary
      cl06838
      Location:289389
      C1_4; TFIIH C1-like domain
      pfam04056
      Location:64255
      Ssl1; Ssl1-like

    RNA

    1. XR_873756.4 RNA Sequence

    2. XR_873759.4 RNA Sequence

    3. XR_004938028.1 RNA Sequence

    4. XR_004938029.1 RNA Sequence