U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Links from Nucleotide

    • Showing Current items.

    Htatsf1 HIV TAT specific factor 1 [ Mus musculus (house mouse) ]

    Gene ID: 72459, updated on 9-Dec-2024

    Summary

    Official Symbol
    Htatsf1provided by MGI
    Official Full Name
    HIV TAT specific factor 1provided by MGI
    Primary source
    MGI:MGI:1919709
    See related
    Ensembl:ENSMUSG00000067873 AllianceGenome:MGI:1919709
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Also known as
    TAT-SF1; 1600023H17Rik; 2600017A12Rik; 2700077B20Rik
    Summary
    Predicted to enable RNA binding activity; chromatin-protein adaptor activity; and poly-ADP-D-ribose modification-dependent protein binding activity. Predicted to be involved in U2-type prespliceosome assembly; double-strand break repair via homologous recombination; and protein localization to site of double-strand break. Predicted to be located in nucleoplasm. Predicted to be part of U2 snRNP and U2-type spliceosomal complex. Predicted to be active in site of double-strand break. Is expressed in several structures, including limb and nervous system. Orthologous to human HTATSF1 (HIV-1 Tat specific factor 1). [provided by Alliance of Genome Resources, Dec 2024]
    Expression
    Ubiquitous expression in CNS E11.5 (RPKM 27.5), placenta adult (RPKM 19.4) and 27 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See Htatsf1 in Genome Data Viewer
    Location:
    X A6; X 30.81 cM
    Exon count:
    10
    Annotation release Status Assembly Chr Location
    RS_2024_02 current GRCm39 (GCF_000001635.27) X NC_000086.8 (56098930..56112543)
    108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) X NC_000086.7 (57053570..57067183)

    Chromosome X - NC_000086.8Genomic Context describing neighboring genes Neighboring gene adhesion G protein-coupled receptor G4 Neighboring gene bombesin-like receptor 3 Neighboring gene vestigial like family member 1 Neighboring gene STARR-seq mESC enhancer starr_47211 Neighboring gene predicted gene 14718

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Bibliography

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics  (MGI)
    • Endonuclease-mediated (2) 
    • Targeted (1)  1 citation

    Pathways from PubChem

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Gene Ontology Provided by MGI

    Function Evidence Code Pubs
    enables RNA binding IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    enables RNA binding IEA
    Inferred from Electronic Annotation
    more info
     
    enables chromatin-protein adaptor activity ISO
    Inferred from Sequence Orthology
    more info
     
    enables chromatin-protein adaptor activity ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    enables poly-ADP-D-ribose modification-dependent protein binding ISO
    Inferred from Sequence Orthology
    more info
     
    enables poly-ADP-D-ribose modification-dependent protein binding ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    Process Evidence Code Pubs
    involved_in U2-type prespliceosome assembly ISO
    Inferred from Sequence Orthology
    more info
     
    involved_in U2-type prespliceosome assembly ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    involved_in chromatin organization IEA
    Inferred from Electronic Annotation
    more info
     
    involved_in double-strand break repair via homologous recombination ISO
    Inferred from Sequence Orthology
    more info
     
    involved_in double-strand break repair via homologous recombination ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    involved_in mRNA splicing, via spliceosome ISO
    Inferred from Sequence Orthology
    more info
     
    involved_in mRNA splicing, via spliceosome ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    involved_in protein localization to site of double-strand break ISO
    Inferred from Sequence Orthology
    more info
     
    involved_in protein localization to site of double-strand break ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    Component Evidence Code Pubs
    part_of U2 snRNP IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    part_of U2-type spliceosomal complex IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    part_of U2-type spliceosomal complex ISO
    Inferred from Sequence Orthology
    more info
     
    part_of U2-type spliceosomal complex ISS
    Inferred from Sequence or Structural Similarity
    more info
     
    located_in nucleoplasm IEA
    Inferred from Electronic Annotation
    more info
     
    located_in nucleoplasm ISO
    Inferred from Sequence Orthology
    more info
     
    is_active_in site of double-strand break ISO
    Inferred from Sequence Orthology
    more info
     
    is_active_in site of double-strand break ISS
    Inferred from Sequence or Structural Similarity
    more info
     

    General protein information

    Preferred Names
    17S U2 SnRNP complex component HTATSF1
    Names
    HIV Tat-specific factor 1 homolog

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_028242.2NP_082518.1  17S U2 SnRNP complex component HTATSF1

      See identical proteins and their annotated locations for NP_082518.1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (1) represents the longer transcript. Variants 1 and 2 encode the same protein.
      Source sequence(s)
      AL672263
      Consensus CDS
      CCDS40983.1
      UniProtKB/Swiss-Prot
      B1AVC7, Q1WWK0, Q8BGC0, Q9CT41, Q9DAU3
      Related
      ENSMUSP00000086027.6, ENSMUST00000088652.6
      Conserved Domains (3) summary
      COG0724
      Location:89219
      RRM; RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis]
      cd12281
      Location:133223
      RRM1_TatSF1_like; RNA recognition motif 1 in HIV Tat-specific factor 1 (Tat-SF1) and similar proteins
      cd12282
      Location:264354
      RRM2_TatSF1_like; RNA recognition motif 2 in HIV Tat-specific factor 1 (Tat-SF1) and similar proteins
    2. NM_029371.1NP_083647.1  17S U2 SnRNP complex component HTATSF1

      See identical proteins and their annotated locations for NP_083647.1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (2) differs in the 5' UTR, compared to variant 1. Variants 1 and 2 encode the same protein.
      Source sequence(s)
      AI849205, AK049495, BG077525, BY134418
      Consensus CDS
      CCDS40983.1
      UniProtKB/Swiss-Prot
      B1AVC7, Q1WWK0, Q8BGC0, Q9CT41, Q9DAU3
      Conserved Domains (3) summary
      COG0724
      Location:89219
      RRM; RNA recognition motif (RRM) domain [Translation, ribosomal structure and biogenesis]
      cd12281
      Location:133223
      RRM1_TatSF1_like; RNA recognition motif 1 in HIV Tat-specific factor 1 (Tat-SF1) and similar proteins
      cd12282
      Location:264354
      RRM2_TatSF1_like; RNA recognition motif 2 in HIV Tat-specific factor 1 (Tat-SF1) and similar proteins

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000086.8 Reference GRCm39 C57BL/6J

      Range
      56098930..56112543
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)