U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Links from OMIM

    • Showing Current items.

    CTSA cathepsin A [ Homo sapiens (human) ]

    Gene ID: 5476, updated on 10-Dec-2024

    Summary

    Official Symbol
    CTSAprovided by HGNC
    Official Full Name
    cathepsin Aprovided by HGNC
    Primary source
    HGNC:HGNC:9251
    See related
    Ensembl:ENSG00000064601 MIM:613111; AllianceGenome:HGNC:9251
    Gene type
    protein coding
    RefSeq status
    REVIEWED
    Organism
    Homo sapiens
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
    Also known as
    GSL; GLB2; NGBE; PPCA; PPGB
    Summary
    This gene encodes a member of the peptidase S10 family of serine carboxypeptidases. Alternative splicing results in multiple transcript variants, at least one of which encodes a preproprotein that is proteolytically processed to generate two chains that comprise the heterodimeric active enzyme. This enzyme possesses deamidase, esterase and carboxypeptidase activities and acts as a scaffold in the lysosomal multienzyme complex. Mutations in this gene are associated with galactosialidosis. [provided by RefSeq, Nov 2015]
    Expression
    Ubiquitous expression in adrenal (RPKM 65.7), kidney (RPKM 65.3) and 25 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See CTSA in Genome Data Viewer
    Location:
    20q13.12
    Exon count:
    15
    Annotation release Status Assembly Chr Location
    RS_2024_08 current GRCh38.p14 (GCF_000001405.40) 20 NC_000020.11 (45891335..45898820)
    RS_2024_08 current T2T-CHM13v2.0 (GCF_009914755.1) 20 NC_060944.1 (47627833..47635315)
    RS_2024_09 previous assembly GRCh37.p13 (GCF_000001405.25) 20 NC_000020.10 (44519974..44527459)

    Chromosome 20 - NC_000020.11Genomic Context describing neighboring genes Neighboring gene ATAC-STARR-seq lymphoblastoid active region 17952 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 17953 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr20:44514972-44515472 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr20:44515473-44515973 Neighboring gene zinc finger SWIM-type containing 1 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr20:44519312-44520021 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 17954 Neighboring gene spermatogenesis associated 25 Neighboring gene neuralized E3 ubiquitin protein ligase 2 Neighboring gene ReSE screen-validated silencer GRCh37_chr20:44531171-44531366 Neighboring gene phospholipid transfer protein Neighboring gene Sharpr-MPRA regulatory region 14031 Neighboring gene Sharpr-MPRA regulatory region 10402 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr20:44561248-44561940 Neighboring gene pre-mRNA-splicing factor cwc22-like Neighboring gene ATAC-STARR-seq lymphoblastoid active region 17955 Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr20:44562632-44563323 Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr20:44563324-44564014 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr20:44567204-44567719 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr20:44567720-44568234 Neighboring gene phosphorylated CTD interacting factor 1 Neighboring gene MED14-independent group 3 enhancer GRCh37_chr20:44572602-44573801 Neighboring gene zinc finger protein 335 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 12968

    Genomic regions, transcripts, and products

    Bibliography

    GeneRIFs: Gene References Into Functions

    What's a GeneRIF?

    HIV-1 interactions

    Protein interactions

    Protein Gene Interaction Pubs
    Vpr vpr A stable-isotope labeling by amino acids in cell culture coupled with mass spectrometry-based proteomics identifies upregulation of cathepsin A (CTSA, lysosomal protective protein) expression by HIV-1 Vpr in Vpr transduced macrophages PubMed

    Go to the HIV-1, Human Interaction Database

    Pathways from PubChem

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Gene Ontology Provided by GOA

    Function Evidence Code Pubs
    enables carboxypeptidase activity TAS
    Traceable Author Statement
    more info
    PubMed 
    enables enzyme activator activity TAS
    Traceable Author Statement
    more info
    PubMed 
    enables serine-type carboxypeptidase activity IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    enables serine-type carboxypeptidase activity IMP
    Inferred from Mutant Phenotype
    more info
    PubMed 

    General protein information

    Preferred Names
    lysosomal protective protein
    Names
    beta-galactosidase 2
    beta-galactosidase protective protein
    carboxypeptidase C
    carboxypeptidase Y-like kininase
    carboxypeptidase-L
    deamidase
    lysosomal carboxypeptidase A
    protective protein cathepsin A
    urinary kininase
    NP_000299.3
    NP_001121167.1
    NP_001161066.2

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    Genomic

    1. NG_008291.1 RefSeqGene

      Range
      5384..12869
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. NM_000308.4NP_000299.3  lysosomal protective protein isoform b preproprotein

      Status: REVIEWED

      Description
      Transcript Variant: This variant (1) encodes the longest isoform (a). This isoform (a) may undergo proteolytic processing similar to isoform (b).
      Source sequence(s)
      BC093009
      Consensus CDS
      CCDS46609.1
      UniProtKB/Swiss-Prot
      B2R798, P10619, Q561W6, Q5JZH1, Q96KJ2, Q9BR08, Q9BW68
      UniProtKB/TrEMBL
      X6R8A1
      Related
      ENSP00000493613.2, ENST00000646241.3
      Conserved Domains (1) summary
      pfam00450
      Location:39476
      Peptidase_S10; Serine carboxypeptidase
    2. NM_001127695.3NP_001121167.1  lysosomal protective protein isoform b preproprotein

      See identical proteins and their annotated locations for NP_001121167.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (2) differs in the 5' UTR, lacks a portion of the 5' coding region, and initiates translation at a downstream start codon compared to variant 1. The encoded isoform (b) has a shorter N-terminus compared to isoform a.
      Source sequence(s)
      AL008726
      Consensus CDS
      CCDS46609.1
      UniProtKB/Swiss-Prot
      B2R798, P10619, Q561W6, Q5JZH1, Q96KJ2, Q9BR08, Q9BW68
      UniProtKB/TrEMBL
      X6R8A1
      Related
      ENSP00000191018.5, ENST00000191018.9
      Conserved Domains (1) summary
      pfam00450
      Location:39476
      Peptidase_S10; Serine carboxypeptidase
    3. NM_001167594.3NP_001161066.2  lysosomal protective protein isoform c precursor

      Status: REVIEWED

      Description
      Transcript Variant: This variant (3) lacks an alternate in-frame exon in the 5' coding region compared to variant 1. The encoded isoform (c) is shorter than isoform a. This isoform may undergo proteolytic processing similar to isoform b.
      Source sequence(s)
      AL008726
      Consensus CDS
      CCDS54467.1
      UniProtKB/TrEMBL
      X6R5C5
      Related
      ENSP00000346952.5, ENST00000354880.9

    RNA

    1. NR_133656.2 RNA Sequence

      Status: REVIEWED

      Description
      Transcript Variant: This variant (4) uses an alternate splice site in an internal exon compared to variant 1. This variant is represented as non-coding because the use of the 5'-most expected translational start codon, as used in variant 1, renders the transcript a candidate for nonsense-mediated mRNA decay (NMD).
      Source sequence(s)
      AL008726
      Related
      ENST00000677525.1

    RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2024_08

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCh38.p14 Primary Assembly

    Genomic

    1. NC_000020.11 Reference GRCh38.p14 Primary Assembly

      Range
      45891335..45898820
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    Alternate T2T-CHM13v2.0

    Genomic

    1. NC_060944.1 Alternate T2T-CHM13v2.0

      Range
      47627833..47635315
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)