Sample GSM3738969
Status Public on May 01, 2020
Title Second Replicate RNA-seq sample OD600=4.0
Sample type SRA
 
Source name Escherichia coli grown in LB
Organism Escherichia coli str. K-12 substr. MG1655
Characteristics strain: MG1655
genotype: wildtype
optical density: OD 4.0
Extracted molecule total RNA
Extraction protocol Total RNA was extracted using the guanidinium thiocyanate-phenol method. RNA integrity was assessed with the Prokaryote Total RNA Nano assay on a 2100 Bioanalyzer (Agilent). Genomic DNA was removed by incubating 10 μg of total RNA with 2 U Turbo DNase (Ambion) in a 50 μl final volume for 30 minutes at 37°C in the presence of 10 U SuperaseIn RNase Inhibitor (Ambion). RNA was subsequently phenol-chloroform extracted and purified by ethanol precipitation.
The RNA-seq libraries were generated using the Illumina TruSeq protocol according to the manufacturer's procedures.
 
Library strategy RNA-Seq
Library source transcriptomic
Library selection cDNA
Instrument model Illumina NovaSeq 6000
 
Description RNA-seq sample
Supplementary Table 1.xlsx
Data processing Raw sequencing reads in fastq files were processed using a pipeline developed by Sander Granneman, which uses tools from the pyCRAC package (Webb et al., 2014a). pyCRAC versions 1.3.2 to 1.4.4 were used for the analyses. The entire pipeline is available at https://bitbucket.org/sgrann/. The CRAC_pipeline_PE.py pipeline first demultiplexes the data using pyBarcodeFilter.py and the in-read barcode sequences found in the L5 5’ adapters. Flexbar then trims the reads to remove 3’ adapter sequences and poor-quality nucleotides (Phred score <23). Using the random nucleotide information present in the L5 5’ adapter sequences, the reads were collapsed to remove potential PCR duplicates. The reads were then mapped to the Escherichia coli MG1655 genome with Novoalign (www.novocraft.com). To determine which genes the reads overlapped with, we generated an annotation file in the Gene Transfer Format (GTF). This file contains the start and end positions of each gene on the chromosome as well as the genomic feature class (e.g. sRNA, protein-coding, tRNA) to which it belongs. To generate this file, we used the Rockhopper software (Tjaden, 2015) on E. coli rRNA-depleted total RNA-seq data (generated by Christel Sirocchi) together with a minimal GTF file obtained from ENSEMBL (without UTR information). The resulting GTF file contained information not only on the coding sequences, but also complete 5’ and 3’ UTR coordinates. PyReadCounters then used the Novoalign output file and the GTF file to count the total number of unique cDNAs that mapped to each gene. RNA-seq data were generated by Novogene using the Illumina TruSeq protocol. The data were processed using CRAC_pipeline_PE.py. For all the CLASH and RNA-seq data, we normalized the counts to transcripts per million (TPM). Processed and raw data files for these counts are provided.
Genome_build: Escherichia coli MG1655
Supplementary_files_format_and_content: TPM
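The TPM normalization mentioned in the data processing description can be illustrated with a minimal sketch. It assumes the standard TPM definition (each count divided by gene length, then rescaled so the values sum to one million); the record does not state the exact formula or code used. The function name counts_to_tpm, the gene names, the counts, and the lengths are all hypothetical and are not taken from the pyCRAC pipeline or from this sample.

```python
def counts_to_tpm(counts, lengths_bp):
    """Convert per-gene cDNA counts to TPM (standard definition, assumed here).

    counts     -- dict mapping gene name to number of mapped cDNAs
    lengths_bp -- dict mapping gene name to gene length in base pairs
                  (e.g. end - start + 1 from a GTF annotation)
    """
    # Step 1: length-normalize each count (reads per kilobase).
    rpk = {gene: counts[gene] / (lengths_bp[gene] / 1000.0) for gene in counts}

    # Step 2: rescale so that all values sum to one million.
    total = sum(rpk.values())
    if total == 0:
        return {gene: 0.0 for gene in counts}
    return {gene: value / total * 1e6 for gene, value in rpk.items()}


# Hypothetical example values, for illustration only.
example_counts = {"geneA": 100, "geneB": 300, "geneC": 600}
example_lengths = {"geneA": 1000, "geneB": 1500, "geneC": 3000}
tpm = counts_to_tpm(example_counts, example_lengths)

for gene in sorted(tpm):
    print(gene, round(tpm[gene], 1))
```

Because TPM values always sum to one million within a sample, they can be compared across samples of different sequencing depth, which is presumably why the same normalization was applied to both the CLASH and RNA-seq counts.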
 
Submission date Apr 29, 2019
Last update date May 01, 2020
Contact name Sander Granneman
E-mail(s) [email protected]
Organization name University of Edinburgh
Department Centre for Synthetic and Systems Biology
Lab Granneman lab
Street address Mayfield Road, Kings Buildings, Waddington building, room 3.06
City Edinburgh
ZIP/Postal code EH9 3JD
Country United Kingdom
 
Platform ID GPL26592
Series (2)
GSE123048 Hfq CLASH uncovers sRNA-target interaction networks linked to nutrient availability adaptation [RNA-seq]
GSE123050 Hfq CLASH uncovers sRNA-target interaction networks linked to nutrient availability adaptation
Relations
BioSample SAMN11528403
SRA SRX5766153

Supplementary data files not provided
Raw data are available in SRA
Processed data are available on Series record