MAG: Massilioclostridium sp. isolate CIM:MAG 1013 contig_133008, whole genome shotgun sequence

GenBank: QAKA01000003.1

LOCUS       QAKA01000003            6199 bp    DNA     linear   ENV 23-MAY-2018
DEFINITION  MAG: Massilioclostridium sp. isolate CIM:MAG 1013 contig_133008,
            whole genome shotgun sequence.
ACCESSION   QAKA01000003 QAKA01000000
VERSION     QAKA01000003.1
DBLINK      BioProject: PRJNA397219
            BioSample: SAMN08294961
KEYWORDS    WGS; ENV; Metagenome Assembled Genome; MAG.
SOURCE      Massilioclostridium sp. (human gut metagenome)
  ORGANISM  Massilioclostridium sp.
            Bacteria; Bacillati; Bacillota; Clostridia; Eubacteriales;
            Clostridiaceae; Massilioclostridium.
REFERENCE   1  (bases 1 to 6199)
  AUTHORS   Jeraldo,P., Boardman,L., White,B.A., Nelson,H., Goldenfeld,N. and
            Chia,N.
  TITLE     The uncultured portion of the human microbiome is neutrally
            assembled
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 6199)
  AUTHORS   Jeraldo,P. and Chia,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-MAR-2018) Center for Individualized Medicine, Mayo
            Clinic, 200 First Street SW, Rochester, MN 55905, USA
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: Ray v. 2.3.2
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 4x
            Sequencing Technology  :: Illumina HiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 04/11/2018 12:16:25
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.5
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 1,589
            CDS (total)                       :: 1,566
            Genes (coding)                    :: 1,558
            CDS (coding)                      :: 1,558
            Genes (RNA)                       :: 23
            tRNAs                             :: 20
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 8
            Pseudo Genes (ambiguous residues) :: 0 of 8
            Pseudo Genes (frameshifted)       :: 1 of 8
            Pseudo Genes (incomplete)         :: 4 of 8
            Pseudo Genes (internal stop)      :: 3 of 8
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..6199
                     /organism="Massilioclostridium sp."
                     /mol_type="genomic DNA"
                     /submitter_seqid="contig_133008"
                     /isolate="CIM:MAG 1013"
                     /isolation_source="feces"
                     /host="Homo sapiens"
                     /db_xref="taxon:1935928"
                     /environmental_sample
                     /geo_loc_name="USA:Minnesota, Rochester"
                     /lat_lon="44.0234 N 92.46295 W"
                     /metagenome_source="human gut metagenome"
                     /note="metagenomic"
     gene            580..1098
                     /gene="lepB"
                     /locus_tag="DBX37_00075"
     CDS             580..1098
                     /gene="lepB"
                     /locus_tag="DBX37_00075"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_015515356.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="signal peptidase I"
                     /protein_id="PWN00793.1"
                     /translation="MTKETKKNILSWVITIGAAVLAAIILNSFIILNVSIPSSSMEPT
                     ISKGDRLIGFRLAYLTSDPQQGDIVIFKYPDDEKQKFIKRIIGTPGDTVEGIDGVVYV
                     NGEALNPDYTDIVIQEDFGPFEVPEDSYFMMGDNRNDSLDSRYWKNTFVKRDKILGKA
                     EFTFFPKIEWLG"
     gene            1109..1972
                     /locus_tag="DBX37_00080"
     CDS             1109..1972
                     /locus_tag="DBX37_00080"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_006353541.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="16S rRNA
                     (adenine(1518)-N(6)/adenine(1519)-N(6))-
                     dimethyltransferase"
                     /protein_id="PWN00794.1"
                     /translation="MENLSNISTIKAILTKHGFTFSKSLGQNFLVNPSVCPRIAEQGG
                     AKQGVGVIEVGTGIGVLTSELAQRADKVVAVEIDQRLLPVLEETLSDYHNIKIINQDI
                     LKVDLHKLIQEEFAGMQVVVCANLPYYITSPVIMYLLESRLPIDAVTVMVQKEAAIRI
                     CAQPGTRDVGAVSLAVRYFSEPRILFQVSRGSFMPAPNVDSCVIRLDIKKETPQNVLD
                     EKLFFRLVKAAFSQRRKTLVNPVSGSLGVGKPRLKQLMEESGIKSTARAEELTMEQFI
                     QLANGITRLKK"
     gene            2652..3152
                     /locus_tag="DBX37_00085"
     CDS             2652..3152
                     /locus_tag="DBX37_00085"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PWN00795.1"
                     /translation="MKKSLKVGVFVLCCVMVTLCLTGCVFGNVNQTLTLGGEYALLLN
                     NGSWYDEQGNLVLQLDEYRGCTIAGDQEIYRGGFFVDTDFTCYIEDDTVVPETTESTG
                     NGLLVFNLNDGPHVYEWTDELSSDSETWHFEIGSKQNENILYWEGKILHQEVEGTDDN
                     VPTLGI"
     gene            3215..4081
                     /locus_tag="DBX37_00090"
     CDS             3215..4081
                     /locus_tag="DBX37_00090"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PWN00796.1"
                     /translation="MLAVLFELPAFAEEPILKWNESGFLSLMKSQDFVEISALEEDIL
                     EEISNNLLGLEDVEKSDIVPELIDYENAYCCFVMSDPFLLGEFSTEDWIRELRYTTHI
                     WMIPICCKGFRITIQCFPRDYPFLADSEKEKMEKSGRSWVVGEFALNYVRYSSGMDVA
                     EEFQMRSRWYGVDPNQPKLLFYGAEKRTGTLGVTVENNQLSQGILLDGSYPIGSEYVR
                     FGQGSVREYQIGKFYDFSEMAKRFQQVKAEKQDFPHLNSISYVQILLLVLVGIVSILV
                     AVCIIRFKLTCE"
     gene            4107..5048
                     /locus_tag="DBX37_00095"
     CDS             4107..5048
                     /locus_tag="DBX37_00095"
                     /inference="COORDINATES: ab initio prediction:GeneMarkS+"
                     /note="Derived by automated computational analysis using
                     gene prediction method: GeneMarkS+."
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="PWN00797.1"
                     /translation="MKKILSCLSCMLLLFSILTACSSDVETRELVWNENGYESLLQAE
                     EFEKIQKLSGSILSETRTIIRSSNLDITVEKENIHFKLAYKSYMVNPLKLEDLEQELP
                     RAEYAWNIPVLTDNGHFIVSCRLNRKLNTNSKGLLTDSSLRIIANQEVRWQTSLSYYY
                     PNASKLPTEILDDALINTNFDQRVHKIFFGDCNKMRGLYSLVLQDKKPVYMVPIKDFS
                     VSGIEKISGDMKSAGDFIDGQRYTYKDFAERINRVKEHKHVFPYGGEPGLITNPTRPK
                     FLEGIKPEGIVLLVIIAAFVLTTGGLLIYQKVKDRVR"
     gene            5430..>6199
                     /locus_tag="DBX37_00100"
     CDS             5430..>6199
                     /locus_tag="DBX37_00100"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_006353543.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /codon_start=1
                     /transl_table=11
                     /product="formate acetyltransferase"
                     /protein_id="PWN00798.1"
                     /translation="MDFVTGKWNKEINVRDFIVRNYTPYDGDDSFLVGPTEKTKDLWQ
                     EVTDLMKKEREAGGVLDIDEKTISAIDAYAPGYIDKNLETIVGLQTNEPLKRAIMPYG
                     GIRMVETSLKAYGKEMDPKVARVFKHRKTHNDGVFDVYTDEMRAARHCAIITGLPDAY
                     GRGRIIGDYRRVALYGIDKLIEEKKIAKKNFAFDYMDSPTIRQREELSEQIRALEAMK
                     SMAARYGCDISKPATNTQEAIQAVYFGYLAAVKDQNGAA"
ORIGIN      
        1 aattataatg accagcttta tgcgagttga aaagaggctg tatgaaaaag tatagcgaat
       61 gaaaagccag tgcgtgtgat gcaatcatat tatttagcta tccccttata ggggttaatt
      121 agaatagccc tttaggggct gtgtaagaca ctgattcgac tatattgaga aagttatact
      181 ggagaagatg aaaaaagaaa taaaacgaaa ccatataaac ggataaaatg ataacgatac
      241 agaactgaaa ataatgttgg actgttacag gactatggta cgcttacttt ttgctataaa
      301 gcgaaattaa catttaaaca attagtaata tttttgatgt ttatacaaaa taaaactaaa
      361 agataagatt agagaaaatt tctaatgata aattaggatt taaaaagtaa tgtgtatgaa
      421 aaacaattaa tatttggtta atgagtaaag tatccgtatg acattaatag aaaaaggatt
      481 tgaaaataca tcaaaaattt ggtatgatag agttatccaa tgagaaattg taagatctag
      541 tttaaaggtt gatttataat ccagaaaaga gggattcttt tgacaaaaga aaccaaaaaa
      601 aatatattga gttgggtgat taccattggt gcagcggtat tagccgcaat catcttaaat
      661 agttttatta ttttaaatgt ttccatacct tcttcatcta tggaacctac catttcaaaa
      721 ggggatcgtc tgataggctt tcggctggca tacctgactt ctgaccctca acagggggat
      781 attgtgattt tcaaatatcc agatgatgaa aagcaaaagt ttattaaacg gattattggt
      841 acaccaggag atactgtgga agggattgac ggcgttgttt atgtcaatgg ggaagccctc
      901 aatccagatt atacggatat cgtcatacag gaggattttg gaccgtttga agtgccagag
      961 gattcctatt ttatgatggg ggacaatcgg aatgactcct tagactcccg ctattggaaa
     1021 aatacatttg tgaaacggga taaaattctt gggaaagcag aatttacctt tttcccaaaa
     1081 atagagtggc tcggataagg aggaatagat ggaaaatcta tccaatattt caacaattaa
     1141 agcaatatta accaaacatg gttttacatt ctccaaatca ttggggcaaa attttttggt
     1201 aaaccccagt gtgtgcccta ggattgcgga acaaggaggc gctaaacaag gtgtcggcgt
     1261 aattgaggtt ggaacgggga tcggtgtctt aaccagtgaa ttggcgcaac gggctgataa
     1321 agtggttgcg gtggaaattg accagcggct gttaccggtt ttagaggaaa ccttatctga
     1381 ttatcataat atcaaaatca tcaaccagga tattttaaag gtagacctcc acaaattgat
     1441 ccaggaagaa ttcgcaggga tgcaggttgt ggtatgcgct aaccttcctt attatattac
     1501 ctctcctgtt attatgtatt tgttggaaag taggttgcca attgatgctg ttaccgtgat
     1561 ggtgcaaaaa gaggctgcca tccgtatttg tgcccagcct ggaaccaggg atgtaggtgc
     1621 tgttagcttg gcagtgcgtt attttagcga accccgtatt ttgttccagg tatccagagg
     1681 aagttttatg cctgcgccta atgtagatag ctgcgttatc cggctggata ttaaaaaaga
     1741 aacgccacag aatgtactgg atgaaaagct gtttttccgt ttagtaaaag cggcgttttc
     1801 ccagcgaaga aaaactttgg tgaacccagt atccggcagt ttaggggtgg gcaaaccccg
     1861 gttaaaacaa ttgatggagg aatccggaat aaaatctact gcccgtgcgg aagaattgac
     1921 gatggaacag tttatccagc ttgcaaacgg gattacaagg ttgaaaaaat aataaaacga
     1981 ataagcgcaa aaaccctcta tggtaggatc gtactgcccc atattgtgca gacagtacaa
     2041 aaaagacccg tggtgtagga atattaagtt ttagtaggaa tggcacagcg tgagcggcac
     2101 tactcctgtt ttaatttaaa tgcgctcgtg gttgtagaaa tggatatatt ggtcaattat
     2161 ttctctctga ctttttcctg cctgaattat ttcctcgatt tctggtaata tttcctgccc
     2221 gtgcgtgtaa tttccttttt gtacagttaa gctccctgat gtatctattt tcctacatca
     2281 gggggctttt gtctattatc cgtttttgct ggatctattc aatcctacca tagaaggttt
     2341 ttatgatatc aatcaatgta aagatatttt agggtttcta ctgtaaaagg agaaatgtta
     2401 ttttaagaag gacgcaaaac aaggaatagt atttcatttt atgtattaga aaaatcacaa
     2461 tagaaaccag ataatcaata gttcacacaa aaaataaaag gatattcttt ccatagtagc
     2521 acgaatctct tttgaaatat ctagtagtat gttgatttcg caaaaatatt ttggtgattt
     2581 ttgtcgattt actatatagc tatttgttat cccttatgct acaataaaag caggaaaagg
     2641 aggcatttat gatgaaaaag agtttaaaag taggagtatt tgttttatgt tgtgttatgg
     2701 tgaccctttg tctaactgga tgtgtttttg gcaatgttaa tcaaaccctt actctaggcg
     2761 gggaatatgc attgttgctg aataatggtt cttggtatga tgaacaggga aatctggttt
     2821 tacaattgga cgaataccgt ggatgtacga ttgcgggtga ccaggaaatc tatcgaggcg
     2881 gattttttgt ggatactgat tttacctgtt atattgaaga tgatactgtg gtaccagaaa
     2941 cgacagaatc aacaggaaat gggttgttag tgtttaattt aaacgatggg cctcatgttt
     3001 atgaatggac ggacgaatta agttctgatt ccgaaacctg gcattttgaa attggctcaa
     3061 aacaaaatga gaatattttg tattgggaag gaaaaatttt gcatcaggag gtagagggaa
     3121 cagacgataa tgttcctact ttggggatat aacaattgat ttttctaagc tggtaatagt
     3181 tataaaaaga tggatgttgg tatggatatg tgttatgctt gccgttttgt ttgaattacc
     3241 agcatttgcg gaagaaccaa tattaaagtg gaatgaatcc ggatttttat ccttaatgaa
     3301 atctcaggac tttgtggaaa tttctgcatt ggaagaggat attttggaag aaatcagtaa
     3361 taatttgctg gggctggagg atgtggaaaa atccgatatt gttccggaat tgattgatta
     3421 tgaaaacgcc tattgctgtt ttgttatgtc tgacccattt cttttaggag agttttccac
     3481 agaagattgg atacgagaac tgcgttatac cactcatatt tggatgattc caatctgttg
     3541 taaagggttt cggattacaa tccagtgttt tccaagggat tatccttttt tagcggatag
     3601 cgaaaaggaa aagatggaga aatcagggag aagctgggtg gttggtgaat tcgcgttaaa
     3661 ttatgtgcgg tattcatcag gtatggatgt agcggaagaa tttcagatgc gttccagatg
     3721 gtatggcgtt gatccaaacc agccaaaatt attgttttat ggcgcagaaa aacgaactgg
     3781 aacattgggt gtgacagtgg aaaacaatca attatcccaa ggaattttgt tagatggttc
     3841 ttaccctatc gggtcggagt atgtaaggtt tggacaagga tcggtgaggg aatatcaaat
     3901 tggaaaattt tatgattttt cggagatggc aaaaagattt caacaggtaa aagcagagaa
     3961 gcaggatttt ccccatttga atagtatctc gtacgttcaa atccttttgc tcgtgttagt
     4021 tgggattgtt tcgattttgg tagcggtgtg tatcatccgt tttaaattaa catgtgagtg
     4081 acctttatgg acgagaaggt ggtcgtatga aaaagatttt atcctgctta tcttgtatgc
     4141 tactgctgtt ttctatccta acagcgtgtt catccgatgt ggaaacaagg gaactggtat
     4201 ggaatgaaaa tggatatgaa tcattgcttc aagcagaaga atttgagaaa atacaaaagc
     4261 tttcagggag tatcttaagt gaaacccgta ccattatccg ttccagcaat ttagatatca
     4321 ccgttgaaaa agaaaatatt cattttaaat tggcatacaa atcttatatg gtaaatccat
     4381 taaaactaga ggatttggag caggagttgc ctcgtgcgga atatgcttgg aatattccgg
     4441 ttctcaccga taatgggcat tttattgtga gctgtcgttt gaaccgaaaa ttaaacacaa
     4501 atagcaaagg gctgttgacc gattcctccc tccgcataat tgccaatcag gaagtccgct
     4561 ggcagacctc attatcgtat tattatccaa acgcttccaa attaccaacg gaaatactgg
     4621 atgacgcttt gatcaatacg aatttcgatc agagggtaca taaaatattt tttggcgatt
     4681 gtaataagat gagaggattg tatagcctcg ttttacagga taaaaaacct gtttatatgg
     4741 ttccaattaa agattttagt gtaagcggta tagaaaaaat atcaggagat atgaaatcag
     4801 ccggtgattt tatagatgga cagcgctata cctataagga ctttgcggaa cggataaacc
     4861 gcgtaaaaga acataagcac gttttccctt acggaggaga acctggatta attacgaacc
     4921 caaccaggcc gaaattttta gaaggcatta aaccggaagg aattgtgtta ctagttatta
     4981 tagcggcgtt tgttctcaca acaggagggt tattgattta ccagaaggta aaagataggg
     5041 ttcgataata gaagggcgat gttgattcgc ccttattttg ataggaattg atcataatag
     5101 acttattgtt tccaaaaaaa tgaataaaaa caaagatttg gaagcaaaaa ttatttttgt
     5161 ttttttgaga aaaagagaaa aatttatttg ataaagaaat aaattcttgg tttatttatg
     5221 aaaaaacgtt ttattgcaat gttaggaatg attcctatat aatatataca aaataagtat
     5281 agtttgttgc agaaagtttg gcgtacaatt gtttgacatt attcttgtgg tttgttaaga
     5341 tattaatgcg agataaagga tggacggaat agttctttat cgaaggaaag tgaacaaagg
     5401 cagtcataca aaaacaggag ggaaaataca tggattttgt aacaggaaaa tggaacaaag
     5461 aaatcaatgt acgtgacttt attgtaagaa actacacacc atatgatggt gatgatagct
     5521 ttctggttgg accaaccgaa aaaacaaaag acctctggca agaagtaacc gatttaatga
     5581 aaaaagagcg ggaagccggc ggcgtattgg atattgacga aaaaacaatc tccgctattg
     5641 acgcatatgc tccaggctat attgataaaa acctggaaac aattgttggc ttgcagacta
     5701 acgagccttt aaaacgtgcg attatgcctt acggtggaat ccgtatggtg gaaacttcct
     5761 taaaagcata cggcaaagag atggatccaa aggttgcacg tgtatttaaa cacagaaaaa
     5821 cccataacga cggtgtattc gatgtttata ccgatgaaat gagggcagct cgccactgtg
     5881 ctattatcac tggtttacca gacgcatacg gccgtggaag gattattggc gactaccgcc
     5941 gtgttgcttt gtatgggatt gacaagttaa tcgaagaaaa gaaaatcgcg aagaaaaact
     6001 ttgcttttga ctatatggac agcccaacca tccgtcaaag ggaagaactt tctgaacaaa
     6061 tccgtgcttt ggaagcaatg aagagcatgg ctgcaagata tggctgtgat atctccaaac
     6121 ctgcaaccaa cacacaggaa gcaattcagg cggtatactt tggctacttg gcagcagtaa
     6181 aagaccagaa cggcgctgc
//