Warning: The NCBI web site requires JavaScript to function. more...
An official website of the United States government
The .gov means it's official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you're on a federal government site.
The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.
Download features.
Download gene features.
NCBI Reference Sequence: XM_054326555.1
FASTA Graphics
LOCUS XM_054326555 4446 bp mRNA linear PRI 26-AUG-2024 DEFINITION PREDICTED: Homo sapiens ubiquitin specific peptidase 51 (USP51), transcript variant X3, mRNA. ACCESSION XM_054326555 VERSION XM_054326555.1 DBLINK BioProject: PRJNA807723 KEYWORDS RefSeq. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NC_060947) annotated using gene prediction method: Gnomon, supported by mRNA evidence. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Status :: Updated annotation Annotation Name :: GCF_009914755.1-RS_2024_08 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 10.3 Annotation Method :: Best-placed RefSeq; Gnomon; RefSeqFE; cmsearch; tRNAscan-SE Features Annotated :: Gene; mRNA; CDS; ncRNA Annotation Date :: 08/23/2024 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4446 /organism="Homo sapiens" /mol_type="mRNA" /isolate="CHM13" /db_xref="taxon:9606" /chromosome="X" /sex="female" /cell_line="CHM13htert" /tissue_type="hydatidiform mole" /note="haploid cell line" gene 1..4446 /gene="USP51" /note="ubiquitin specific peptidase 51; Derived by automated computational analysis using gene prediction method: Gnomon. Supporting evidence includes similarity to: 2 mRNAs, 102 long SRA reads, 10 Proteins, and 92% coverage of the annotated genomic feature by RNAseq alignments, including 48 samples with support for all annotated introns" /db_xref="GeneID:158880" /db_xref="HGNC:HGNC:23086" CDS 123..2258 /gene="USP51" /codon_start=1 /product="ubiquitin carboxyl-terminal hydrolase 51 isoform X1" /protein_id="XP_054182530.1" /db_xref="GeneID:158880" /db_xref="HGNC:HGNC:23086" /translation="MAQVRETSLPSGSGVRWISGGGGGASPEEAVEKAGKMEEAAAGA TKASSRREAEEMKLEPLQEREPAPEENLTWSSSGGDEKVLPSIPLRCHSSSSPVCPRR KPRPRPQPRARSRSQPGLSAPPPPPARPPPPPPPPPPPAPRPRAWRGSRRRSRPGSRP QTRRSCSGDLDGSGDPGGLGDWLLEVEFGQGPTGCSHVESFKVGKNWQKNLRLIYQRF VWSGTPETRKRKAKSCICHVCSTHMNRLHSCLSCVFFGCFTEKHIHKHAETKQHHLAV DLYHGVIYCFMCKDYVYDKDIEQIAKETKEKILRLLTSTSTDVSHQQFMTSGFEDKQS TCETKEQEPKLVKPKKKRRKKSVYTVGLRGLINLGNTCFMNCIVQALTHIPLLKDFFL SDKHKCIMTSPSLCLVCEMSSLFHAMYSGSRTPHIPYKLLHLIWIHAEHLAGYRQQDA HEFLIAILDVLHRHSKDDSGGQEANNPNCCNCIIDQIFTGGLQSDVTCQACHSVSTTI DPCWDISLDLPGSCATFDSQNPERADSTVSRDDHIPGIPSLTDCLQWFTRPEHLGSSA KIKCNSCQSYQESTKQLTMKKLPIVACFHLKRFEHVGKQRRKINTFISFPLELDMTPF LASTKESRMKEGQPPTDCVPNENKYSLFAVINHHGTLESGHYTSFIRQQKDQWFSCDD AIITKATIEDLLYSEGYLLFYHKQGLEKD" ORIGIN 1 agagaaggct ggctgaggag cgcctggaca gcagcagttg ctgacgttct atatcacgcg 61 cctggggccc aagtcgcagc tgttttccag acgccttgct cctcaggtcg gggagtgatc 121 tgatggccca ggttcgagaa acttctttgc cctccggctc tggggtccgc tggatctccg 181 gaggtggggg aggagcctct cctgaggagg cggttgagaa ggcggggaaa atggaggagg 241 cggcggcggg ggctacgaag gcgtcttcga gacgtgaagc cgaggagatg aagctggagc 301 cattacaaga gcgtgagccc gcgccggagg agaacttgac gtggagcagc agcggcggcg 361 acgagaaggt gctcccttca atcccccttc gctgtcacag cagctcctcg cccgtttgcc 421 cgcgccgcaa gccccgccct cggccccagc cccgggcccg ctcccgcagc cagcctgggc 481 tctcggcccc acccccgcct ccagcccggc ccccgccccc gccgccaccc ccgcccccac 541 ccgcaccgcg gcccagggcc tggcgtggat cccggcgcag atcccggcct gggtccaggc 601 ctcagacacg gagaagctgc tctggtgacc tagacgggtc gggggatcct ggcggcttag 661 gggactggtt gctggaggtc gagtttggtc agggtcccac aggctgctct catgtggaga 721 gctttaaagt aggtaagaac tggcagaaga acctgaggtt gatctaccag cgtttcgttt 781 ggagtgggac cccagagact aggaaacgta aagcaaagtc atgcatctgt cacgtatgta 841 gtacccatat gaacagactc cactcttgtc tctcctgtgt cttttttggc tgcttcactg 901 agaaacatat tcacaaacat gcagaaacaa agcagcacca tttagctgta gacctttatc 961 atggggtcat atattgcttc atgtgtaagg attatgtata tgacaaagac atagaacaga 1021 ttgccaaaga aacaaaagaa aaaattttga gattattaac ttccacctca acagatgttt 1081 ctcatcaaca gtttatgaca tcagggtttg aagacaagca atcaacctgt gagacaaagg 1141 aacaggagcc aaaattggtg aaacccaaga aaaagagaag aaaaaagtca gtctatactg 1201 taggcctgag agggctaatc aatcttggga acacttgttt tatgaattgt attgtccagg 1261 cacttaccca tattcctcta ctgaaagatt tcttcctctc tgacaagcac aaatgtataa 1321 tgacaagccc cagcttgtgt ctggtctgtg aaatgtcttc gctttttcat gctatgtact 1381 ctgggagccg aactcctcac attccctata agttactgca tctgatatgg atccatgcag 1441 aacatttagc agggtacagg cagcaggatg cccatgagtt ccttattgca atattagacg 1501 tgctacatag acacagcaaa gatgatagtg gtgggcagga ggccaataac cccaactgct 1561 gtaactgcat catagaccaa atctttacag gtggcctgca atcagatgtc acatgtcaag 1621 cctgccatag tgtttctacc accatagacc catgctggga catcagtttg gacttgcctg 1681 gctcttgtgc cacattcgat tcccagaacc cagagagggc tgacagcaca gtgagcaggg 1741 atgaccacat accaggaatc ccctcactta cagactgtct acagtggttt acaaggccag 1801 agcacctagg aagcagtgcc aaaatcaaat gcaatagttg ccaaagctac caggagtcta 1861 ctaaacagct cacaatgaaa aaattaccca ttgtggcttg ttttcatctc aagcggtttg 1921 agcatgtagg caaacagagg cgaaagatta atacctttat ctcctttccc ttggagctgg 1981 acatgactcc gtttttggcc tctactaaag agagcagaat gaaagaaggc cagccaccaa 2041 cagattgtgt gcccaatgag aataagtatt ccttgtttgc agtgattaat caccatggaa 2101 ctttggaaag tggccactat accagcttca tccggcaaca aaaggaccag tggttcagct 2161 gtgatgatgc catcatcacc aaggctacca ttgaggactt actctacagt gaagggtatt 2221 tactgttcta tcacaaacag ggtctagaga aagactagtc ttaccagacc acttactgaa 2281 aaaaaagtaa atgattaggc aaggattttg aagtgacaca cagacctact tggaatggac 2341 aatgacagta acacctatgt gacagctagt atcttgatat aaagaaccta ttttagcatg 2401 gcccatgggt ctgtcggaag aaaaaaatga atactaacca gtgaccattc aaccttaaga 2461 aatggggaga gggagaagag gttgaaaatg gtcacataaa gcataatgaa atgaaaagaa 2521 tgctttaggt ggggacaacg ggagtagaag tgttctgatg ctactctatg tcatttgttt 2581 ttacagaaat atcttgtgaa gtcagggagt attcctttat cagcaaaaac ttcacaattg 2641 gtgttccagc tgtggctgac cagctaaata gtttgaaaga aaaataatat tttaaaataa 2701 agtttaaaga gctttaaaag aaaaacattt aaaaaggaaa aaatcatttt taagatttta 2761 aaagaaaaaa acttttaaat gttgaaaaaa atttaagttg ttatttttaa aagaaatatt 2821 ttaaaagtta aaaataattt tttaatttaa agaagtttca gaattttaaa aattaaaagc 2881 aaagaaaatt aaattcttaa agtttaaaaa tgtaaaataa attaaggaac aaggttaaaa 2941 atgaaagttt accaaaaaaa ggaagaaaat actgttaaaa attaaagtta gaaacaaagg 3001 aacatcttaa aagtttcaaa tgaaggaaaa taatataaat agatatttca aaattaaagc 3061 ataaaatata cgtatttaaa aagtgttaac aaaattacta ctataatgat taagaaataa 3121 attttcaaaa atacagaatg gaatgcaatt cagattttag agaaaagttt taaaagaggc 3181 aagtttagaa taattcaaga caaaaagaca aaatgtgttt aaagacaaaa attgaaaaaa 3241 tacaggaaga aaatagagac ttgtaaaata aaaagaacct tagataagtt caagagattt 3301 aaatgaaaac tttaaatatt taaataaaga tttaaaaatt taagctttta aaaagaaaaa 3361 cagttacata aaaattgacc agtgaaaaaa tgtgaaagat tcaagtagaa aaaattatta 3421 aaattaaaag tttaagaagt tctatttttt tatttaagca tattaaaata cagacgggta 3481 tgaaagaaat aaatggagga gacaggaata acgacagcgg tgtgtgagta tatatatata 3541 tatatttttt tttttttgag ctttatgttg atatatcaat atacgttgga aaaggaaata 3601 gacattttaa tcattatctt accatccaat tgtgaaatgg ctgcaaggcc taaacactaa 3661 ctaacccacc ttcccctctt gagattccat gccccatctc atatcctgaa ggaagtcaaa 3721 ttacttccta ggcagaatca attcatctga gaaaaatgaa aacattaggg actttcccat 3781 ttagaactgc attattcata caaattctta gcttctgaaa aggggctctt gataatttgg 3841 tggcttagag aagttagatc acagctgtga agctctggct ttggtatttg agccccatgg 3901 cctgattgag gaagtggcag ggacccatat ctgaggcctt cccaccttcc tttttccctc 3961 ttccccctcc aaggagaaac aagaccttta atttccacag tagagcaatg ggtttgtgct 4021 tgcagtcctt ctgctcattc tttattccct ctttatttcc tttttttgtc actcatgcct 4081 agttccctga accccttact tacctggcct tacctacatc tctgagcccc agcctgcagg 4141 gctccttctt tgactattca cctggtggct tttactcctc cttcaagacc caactcacag 4201 atgacttctc tggagtcttc cctgaatgct ctaggcaggt ataaatgaga aaattgttcc 4261 tctcttcccc tctttattcc tctcactgtg cctccaaagt actttgtcca tacctcggtg 4321 atagtactta ttctactgtg attttattgt agctttgtga ttgtctgccc ctctgcactg 4381 agcatctaag aacagggggc caagccttat tcatttctgt gttaataaat atttgttaaa 4441 tgtgaa //
Whole sequence Selected region from: to:
All features Gene, RNA, and CDS features only
SNP
Show sequence Show reverse complement Show gap features
Your browsing activity is empty.
Activity recording is turned off.
Turn recording back on