NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM4468658 Query DataSets for GSM4468658
Status Public on May 28, 2020
Title WT rep2
Sample type SRA
 
Source name flowers
Organism Arabidopsis thaliana
Characteristics genotype: WT (Col-0)
biological replicate: replicate 2
reference sequence: TAIR10
Growth protocol Arabidopsis plantlets of Col-0 accession were grown on soil with 16 hr light (21 °C) /8h (18 °C) darkness cycles
Extracted molecule total RNA
Extraction protocol Total RNA was extracted from flowers with TRI Reagent® (Molecular Research Center) according to manufacturer’s instructions
TAILseq libraries according to Chang et al., 2014 (https://doi.org/10.1016/j.molcel.2014.02.007) and Zuber et al., 2016 (doi: 10.1016/j.celrep.2016.02.060) with some modifications of the protocol, see related publication.
 
Library strategy OTHER
Library source transcriptomic
Library selection other
Instrument model Illumina MiSeq
 
Description Processed_data_TAILseq.txt
Data processing Library strategy: TAIL-seq
The base calling-based pipeline was adapted from Zuber et al. 2016 (doi: 10.1016/j.celrep.2016.02.060) .
The base calls were retrieved after initial data processing by the MiSeq Control Software v 2.6 and the sequences having completely identical nucleotides in the 1st to 15th cycle in read 2 (degenerate bases in 3’ adapter) were deduplicated.
In order to identify transcripts, read 1 sequences were mapped onto the Arabidopsis thaliana reference genome (TAIR 10) using Hisat2 (v2.1.0) (Kim et al. 2015, https://doi.org/10.1038/nmeth.3317).
The resulting alignment was annotated using the intersectBed tool from the BEDTools suite (v2.17.5) (Quinlan and Hall, 2010, https://doi.org/10.1093/bioinformatics/btq033) and the Arabidopsis annotation file TAIR10_GFF3_genes.gff (http://www.arabidopsis.org).
Reads 1 that map onto cytosolic mRNAs and their corresponding reads 2 were extracted and used for further analyses.
Reads 2 that contain the delimiter sequence were selected and subsequently trimmed from their random and delimiter sequences.
A homemade python script was then used to extract poly(A) tails (>5 nt) and their potential 3’ end modification from the remaining read 2. Poly(A) were detected with a constraint that it must begin within the first 30 cycles, so the maximum detectable 3’ end modification of poly(A) tails was limited to the last 29 nucleotides of insert.
Detected poly(A) and 3’ extensions were finally analyzed for their size and composition using a homemade script based on Python 2.7.
Genome_build: TAIR10
Supplementary_files_format_and_content: One processed data file is given. It includes the processed data for the three biological replicate.
Supplementary_files_format_and_content: For each read, we indicate read ID (read.ID), gene AGI (AGI), TruSeq index (index), biological replicate (rep), poly(A) tail sequence (polyA), poly(A) tail length (polyA.size), non-A extension sequence (modification), length of the non-A extension (modification.size), tail sequence (extension), tail size (extension.size), and a tag (classification) indicating the category of the tail.
 
Submission date Apr 09, 2020
Last update date May 28, 2020
Contact name Dominique Gagliardi
E-mail(s) [email protected]
Organization name CNRS
Department IBMP
Street address 12, rue du General Zimmer
City Strasbourg
ZIP/Postal code 67084
Country France
 
Platform ID GPL17970
Series (2)
GSE148417 Global investigation by TAILseq of mRNA 3' ends in Arabidopsis thaliana flowers
GSE148449 Molecular connection between the TUTase URT1 and decapping activators
Relations
BioSample SAMN14569696
SRA SRX8092700

Supplementary data files not provided
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap