GenBank: QAKA01000003.1
LOCUS       QAKA01000003            6199 bp    DNA     linear   ENV 23-MAY-2018
DEFINITION  MAG: Massilioclostridium sp. isolate CIM:MAG 1013 contig_133008,
            whole genome shotgun sequence.
ACCESSION   QAKA01000003 QAKA01000000
VERSION     QAKA01000003.1
DBLINK      BioProject: PRJNA397219
            BioSample: SAMN08294961
KEYWORDS    WGS; ENV; Metagenome Assembled Genome; MAG.
SOURCE      Massilioclostridium sp. (human gut metagenome)
  ORGANISM  Massilioclostridium sp.
            Bacteria; Bacillati; Bacillota; Clostridia; Eubacteriales;
            Clostridiaceae; Massilioclostridium.
REFERENCE   1  (bases 1 to 6199)
  AUTHORS   Jeraldo,P., Boardman,L., White,B.A., Nelson,H., Goldenfeld,N. and
            Chia,N.
  TITLE     The uncultured portion of the human microbiome is neutrally
            assembled
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 6199)
  AUTHORS   Jeraldo,P. and Chia,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-MAR-2018) Center for Individualized Medicine, Mayo
            Clinic, 200 First Street SW, Rochester, MN 55905, USA
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/

            ##Genome-Assembly-Data-START##
            Assembly Method        :: Ray v. 2.3.2
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 4x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##

            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 04/11/2018 12:16:25
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 1,589
            CDS (total)                       :: 1,566
            Genes (coding)                    :: 1,558
            CDS (coding)                      :: 1,558
            Genes (RNA)                       :: 23
            tRNAs                             :: 20
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 8
            Pseudo Genes (ambiguous residues) :: 0 of 8
            Pseudo Genes (frameshifted)       :: 1 of 8
            Pseudo Genes (incomplete)         :: 4 of 8
            Pseudo Genes (internal stop)      :: 3 of 8
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..6199
                     /organism="Massilioclostridium sp."
                     /mol_type="genomic DNA"
                     /submitter_seqid="contig_133008"
                     /isolate="CIM:MAG 1013"
                     /isolation_source="feces"
                     /host="Homo sapiens"
                     /db_xref="taxon:1935928"
                     /environmental_sample
                     /geo_loc_name="USA:Minnesota, Rochester"
                     /lat_lon="44.0234 N 92.46295 W"
                     /metagenome_source="human gut metagenome"
                     /note="metagenomic"
     gene            580..1098
                     /gene="lepB"
                     /locus_tag="DBX37_00075"
     CDS             580..1098
                     /gene="lepB"
                     /locus_tag="DBX37_00075"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015515356.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="signal peptidase I"
                     /protein_id="PWN00793.1"
                     /translation="MTKETKKNILSWVITIGAAVLAAIILNSFIILNVSIPSSSMEPT
                     ISKGDRLIGFRLAYLTSDPQQGDIVIFKYPDDEKQKFIKRIIGTPGDTVEGIDGVVYV
                     NGEALNPDYTDIVIQEDFGPFEVPEDSYFMMGDNRNDSLDSRYWKNTFVKRDKILGKA
                     EFTFFPKIEWLG"
     gene            1109..1972
                     /locus_tag="DBX37_00080"
     CDS             1109..1972
                     /locus_tag="DBX37_00080"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_006353541.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="16S rRNA
                     (adenine(1518)-N(6)/adenine(1519)-N(6))-
                     dimethyltransferase"
                     /protein_id="PWN00794.1"
                     /translation="MENLSNISTIKAILTKHGFTFSKSLGQNFLVNPSVCPRIAEQGG
                     AKQGVGVIEVGTGIGVLTSELAQRADKVVAVEIDQRLLPVLEETLSDYHNIKIINQDI
                     LKVDLHKLIQEEFAGMQVVVCANLPYYITSPVIMYLLESRLPIDAVTVMVQKEAAIRI
                     CAQPGTRDVGAVSLAVRYFSEPRILFQVSRGSFMPAPNVDSCVIRLDIKKETPQNVLD
                     EKLFFRLVKAAFSQRRKTLVNPVSGSLGVGKPRLKQLMEESGIKSTARAEELTMEQFI
                     QLANGITRLKK"
     gene            2652..3152
                     /locus_tag="DBX37_00085"
     CDS             2652..3152
                     /locus_tag="DBX37_00085"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PWN00795.1"
                     /translation="MKKSLKVGVFVLCCVMVTLCLTGCVFGNVNQTLTLGGEYALLLN
                     NGSWYDEQGNLVLQLDEYRGCTIAGDQEIYRGGFFVDTDFTCYIEDDTVVPETTESTG
                     NGLLVFNLNDGPHVYEWTDELSSDSETWHFEIGSKQNENILYWEGKILHQEVEGTDDN
                     VPTLGI"
     gene            3215..4081
                     /locus_tag="DBX37_00090"
     CDS             3215..4081
                     /locus_tag="DBX37_00090"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PWN00796.1"
                     /translation="MLAVLFELPAFAEEPILKWNESGFLSLMKSQDFVEISALEEDIL
                     EEISNNLLGLEDVEKSDIVPELIDYENAYCCFVMSDPFLLGEFSTEDWIRELRYTTHI
                     WMIPICCKGFRITIQCFPRDYPFLADSEKEKMEKSGRSWVVGEFALNYVRYSSGMDVA
                     EEFQMRSRWYGVDPNQPKLLFYGAEKRTGTLGVTVENNQLSQGILLDGSYPIGSEYVR
                     FGQGSVREYQIGKFYDFSEMAKRFQQVKAEKQDFPHLNSISYVQILLLVLVGIVSILV
                     AVCIIRFKLTCE"
     gene            4107..5048
                     /locus_tag="DBX37_00095"
     CDS             4107..5048
                     /locus_tag="DBX37_00095"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PWN00797.1"
                     /translation="MKKILSCLSCMLLLFSILTACSSDVETRELVWNENGYESLLQAE
                     EFEKIQKLSGSILSETRTIIRSSNLDITVEKENIHFKLAYKSYMVNPLKLEDLEQELP
                     RAEYAWNIPVLTDNGHFIVSCRLNRKLNTNSKGLLTDSSLRIIANQEVRWQTSLSYYY
                     PNASKLPTEILDDALINTNFDQRVHKIFFGDCNKMRGLYSLVLQDKKPVYMVPIKDFS
                     VSGIEKISGDMKSAGDFIDGQRYTYKDFAERINRVKEHKHVFPYGGEPGLITNPTRPK
                     FLEGIKPEGIVLLVIIAAFVLTTGGLLIYQKVKDRVR"
     gene            5430..>6199
                     /locus_tag="DBX37_00100"
     CDS             5430..>6199
                     /locus_tag="DBX37_00100"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_006353543.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="formate acetyltransferase"
                     /protein_id="PWN00798.1"
                     /translation="MDFVTGKWNKEINVRDFIVRNYTPYDGDDSFLVGPTEKTKDLWQ
                     EVTDLMKKEREAGGVLDIDEKTISAIDAYAPGYIDKNLETIVGLQTNEPLKRAIMPYG
                     GIRMVETSLKAYGKEMDPKVARVFKHRKTHNDGVFDVYTDEMRAARHCAIITGLPDAY
                     GRGRIIGDYRRVALYGIDKLIEEKKIAKKNFAFDYMDSPTIRQREELSEQIRALEAMK
                     SMAARYGCDISKPATNTQEAIQAVYFGYLAAVKDQNGAA"
ORIGIN
        1 aattataatg accagcttta tgcgagttga aaagaggctg tatgaaaaag tatagcgaat
       61 gaaaagccag tgcgtgtgat gcaatcatat tatttagcta tccccttata ggggttaatt
      121 agaatagccc tttaggggct gtgtaagaca ctgattcgac tatattgaga aagttatact
      181 ggagaagatg aaaaaagaaa taaaacgaaa ccatataaac ggataaaatg ataacgatac
      241 agaactgaaa ataatgttgg actgttacag gactatggta cgcttacttt ttgctataaa
      301 gcgaaattaa catttaaaca attagtaata tttttgatgt ttatacaaaa taaaactaaa
      361 agataagatt agagaaaatt tctaatgata aattaggatt taaaaagtaa tgtgtatgaa
      421 aaacaattaa tatttggtta atgagtaaag tatccgtatg acattaatag aaaaaggatt
      481 tgaaaataca tcaaaaattt ggtatgatag agttatccaa tgagaaattg taagatctag
      541 tttaaaggtt gatttataat ccagaaaaga gggattcttt tgacaaaaga aaccaaaaaa
      601 aatatattga gttgggtgat taccattggt gcagcggtat tagccgcaat catcttaaat
      661 agttttatta ttttaaatgt ttccatacct tcttcatcta tggaacctac catttcaaaa
      721 ggggatcgtc tgataggctt tcggctggca tacctgactt ctgaccctca acagggggat
      781 attgtgattt tcaaatatcc agatgatgaa aagcaaaagt ttattaaacg gattattggt
      841 acaccaggag atactgtgga agggattgac ggcgttgttt atgtcaatgg ggaagccctc
      901 aatccagatt atacggatat cgtcatacag gaggattttg gaccgtttga agtgccagag
      961 gattcctatt ttatgatggg ggacaatcgg aatgactcct tagactcccg ctattggaaa
     1021 aatacatttg tgaaacggga taaaattctt gggaaagcag aatttacctt tttcccaaaa
     1081 atagagtggc tcggataagg aggaatagat ggaaaatcta tccaatattt caacaattaa
     1141 agcaatatta accaaacatg gttttacatt ctccaaatca ttggggcaaa attttttggt
     1201 aaaccccagt gtgtgcccta ggattgcgga acaaggaggc gctaaacaag gtgtcggcgt
     1261 aattgaggtt ggaacgggga tcggtgtctt aaccagtgaa ttggcgcaac gggctgataa
     1321 agtggttgcg gtggaaattg accagcggct gttaccggtt ttagaggaaa ccttatctga
     1381 ttatcataat atcaaaatca tcaaccagga tattttaaag gtagacctcc acaaattgat
     1441 ccaggaagaa ttcgcaggga tgcaggttgt ggtatgcgct aaccttcctt attatattac
     1501 ctctcctgtt attatgtatt tgttggaaag taggttgcca attgatgctg ttaccgtgat
     1561 ggtgcaaaaa gaggctgcca tccgtatttg tgcccagcct ggaaccaggg atgtaggtgc
     1621 tgttagcttg gcagtgcgtt attttagcga accccgtatt ttgttccagg tatccagagg
     1681 aagttttatg cctgcgccta atgtagatag ctgcgttatc cggctggata ttaaaaaaga
     1741 aacgccacag aatgtactgg atgaaaagct gtttttccgt ttagtaaaag cggcgttttc
     1801 ccagcgaaga aaaactttgg tgaacccagt atccggcagt ttaggggtgg gcaaaccccg
     1861 gttaaaacaa ttgatggagg aatccggaat aaaatctact gcccgtgcgg aagaattgac
     1921 gatggaacag tttatccagc ttgcaaacgg gattacaagg ttgaaaaaat aataaaacga
     1981 ataagcgcaa aaaccctcta tggtaggatc gtactgcccc atattgtgca gacagtacaa
     2041 aaaagacccg tggtgtagga atattaagtt ttagtaggaa tggcacagcg tgagcggcac
     2101 tactcctgtt ttaatttaaa tgcgctcgtg gttgtagaaa tggatatatt ggtcaattat
     2161 ttctctctga ctttttcctg cctgaattat ttcctcgatt tctggtaata tttcctgccc
     2221 gtgcgtgtaa tttccttttt gtacagttaa gctccctgat gtatctattt tcctacatca
     2281 gggggctttt gtctattatc cgtttttgct ggatctattc aatcctacca tagaaggttt
     2341 ttatgatatc aatcaatgta aagatatttt agggtttcta ctgtaaaagg agaaatgtta
     2401 ttttaagaag gacgcaaaac aaggaatagt atttcatttt atgtattaga aaaatcacaa
     2461 tagaaaccag ataatcaata gttcacacaa aaaataaaag gatattcttt ccatagtagc
     2521 acgaatctct tttgaaatat ctagtagtat gttgatttcg caaaaatatt ttggtgattt
     2581 ttgtcgattt actatatagc tatttgttat cccttatgct acaataaaag caggaaaagg
     2641 aggcatttat gatgaaaaag agtttaaaag taggagtatt tgttttatgt tgtgttatgg
     2701 tgaccctttg tctaactgga tgtgtttttg gcaatgttaa tcaaaccctt actctaggcg
     2761 gggaatatgc attgttgctg aataatggtt cttggtatga tgaacaggga aatctggttt
     2821 tacaattgga cgaataccgt ggatgtacga ttgcgggtga ccaggaaatc tatcgaggcg
     2881 gattttttgt ggatactgat tttacctgtt atattgaaga tgatactgtg gtaccagaaa
     2941 cgacagaatc aacaggaaat gggttgttag tgtttaattt aaacgatggg cctcatgttt
     3001 atgaatggac ggacgaatta agttctgatt ccgaaacctg gcattttgaa attggctcaa
     3061 aacaaaatga gaatattttg tattgggaag gaaaaatttt gcatcaggag gtagagggaa
     3121 cagacgataa tgttcctact ttggggatat aacaattgat ttttctaagc tggtaatagt
     3181 tataaaaaga tggatgttgg tatggatatg tgttatgctt gccgttttgt ttgaattacc
     3241 agcatttgcg gaagaaccaa tattaaagtg gaatgaatcc ggatttttat ccttaatgaa
     3301 atctcaggac tttgtggaaa tttctgcatt ggaagaggat attttggaag aaatcagtaa
     3361 taatttgctg gggctggagg atgtggaaaa atccgatatt gttccggaat tgattgatta
     3421 tgaaaacgcc tattgctgtt ttgttatgtc tgacccattt cttttaggag agttttccac
     3481 agaagattgg atacgagaac tgcgttatac cactcatatt tggatgattc caatctgttg
     3541 taaagggttt cggattacaa tccagtgttt tccaagggat tatccttttt tagcggatag
     3601 cgaaaaggaa aagatggaga aatcagggag aagctgggtg gttggtgaat tcgcgttaaa
     3661 ttatgtgcgg tattcatcag gtatggatgt agcggaagaa tttcagatgc gttccagatg
     3721 gtatggcgtt gatccaaacc agccaaaatt attgttttat ggcgcagaaa aacgaactgg
     3781 aacattgggt gtgacagtgg aaaacaatca attatcccaa ggaattttgt tagatggttc
     3841 ttaccctatc gggtcggagt atgtaaggtt tggacaagga tcggtgaggg aatatcaaat
     3901 tggaaaattt tatgattttt cggagatggc aaaaagattt caacaggtaa aagcagagaa
     3961 gcaggatttt ccccatttga atagtatctc gtacgttcaa atccttttgc tcgtgttagt
     4021 tgggattgtt tcgattttgg tagcggtgtg tatcatccgt tttaaattaa catgtgagtg
     4081 acctttatgg acgagaaggt ggtcgtatga aaaagatttt atcctgctta tcttgtatgc
     4141 tactgctgtt ttctatccta acagcgtgtt catccgatgt ggaaacaagg gaactggtat
     4201 ggaatgaaaa tggatatgaa tcattgcttc aagcagaaga atttgagaaa atacaaaagc
     4261 tttcagggag tatcttaagt gaaacccgta ccattatccg ttccagcaat ttagatatca
     4321 ccgttgaaaa agaaaatatt cattttaaat tggcatacaa atcttatatg gtaaatccat
     4381 taaaactaga ggatttggag caggagttgc ctcgtgcgga atatgcttgg aatattccgg
     4441 ttctcaccga taatgggcat tttattgtga gctgtcgttt gaaccgaaaa ttaaacacaa
     4501 atagcaaagg gctgttgacc gattcctccc tccgcataat tgccaatcag gaagtccgct
     4561 ggcagacctc attatcgtat tattatccaa acgcttccaa attaccaacg gaaatactgg
     4621 atgacgcttt gatcaatacg aatttcgatc agagggtaca taaaatattt tttggcgatt
     4681 gtaataagat gagaggattg tatagcctcg ttttacagga taaaaaacct gtttatatgg
     4741 ttccaattaa agattttagt gtaagcggta tagaaaaaat atcaggagat atgaaatcag
     4801 ccggtgattt tatagatgga cagcgctata cctataagga ctttgcggaa cggataaacc
     4861 gcgtaaaaga acataagcac gttttccctt acggaggaga acctggatta attacgaacc
     4921 caaccaggcc gaaattttta gaaggcatta aaccggaagg aattgtgtta ctagttatta
     4981 tagcggcgtt tgttctcaca acaggagggt tattgattta ccagaaggta aaagataggg
     5041 ttcgataata gaagggcgat gttgattcgc ccttattttg ataggaattg atcataatag
     5101 acttattgtt tccaaaaaaa tgaataaaaa caaagatttg gaagcaaaaa ttatttttgt
     5161 ttttttgaga aaaagagaaa aatttatttg ataaagaaat aaattcttgg tttatttatg
     5221 aaaaaacgtt ttattgcaat gttaggaatg attcctatat aatatataca aaataagtat
     5281 agtttgttgc agaaagtttg gcgtacaatt gtttgacatt attcttgtgg tttgttaaga
     5341 tattaatgcg agataaagga tggacggaat agttctttat cgaaggaaag tgaacaaagg
     5401 cagtcataca aaaacaggag ggaaaataca tggattttgt aacaggaaaa tggaacaaag
     5461 aaatcaatgt acgtgacttt attgtaagaa actacacacc atatgatggt gatgatagct
     5521 ttctggttgg accaaccgaa aaaacaaaag acctctggca agaagtaacc gatttaatga
     5581 aaaaagagcg ggaagccggc ggcgtattgg atattgacga aaaaacaatc tccgctattg
     5641 acgcatatgc tccaggctat attgataaaa acctggaaac aattgttggc ttgcagacta
     5701 acgagccttt aaaacgtgcg attatgcctt acggtggaat ccgtatggtg gaaacttcct
     5761 taaaagcata cggcaaagag atggatccaa aggttgcacg tgtatttaaa cacagaaaaa
     5821 cccataacga cggtgtattc gatgtttata ccgatgaaat gagggcagct cgccactgtg
     5881 ctattatcac tggtttacca gacgcatacg gccgtggaag gattattggc gactaccgcc
     5941 gtgttgcttt gtatgggatt gacaagttaa tcgaagaaaa gaaaatcgcg aagaaaaact
     6001 ttgcttttga ctatatggac agcccaacca tccgtcaaag ggaagaactt tctgaacaaa
     6061 tccgtgcttt ggaagcaatg aagagcatgg ctgcaagata tggctgtgat atctccaaac
     6121 ctgcaaccaa cacacaggaa gcaattcagg cggtatactt tggctacttg gcagcagtaa
     6181 aagaccagaa cggcgctgc
//