GEO Accession viewer

NCBI > GEO > Accession Display

Not logged in | Login

GEO help: Mouse over screen elements for information.

Sample GSM5826554

Query DataSets for GSM5826554

Status

Public on May 25, 2022

Title

ISPY2_672734

Sample type

RNA

Source name

Breast cancer biopsy (pre-treatment)

Organism

Homo sapiens

Characteristics

patient id: 672734
tissue: Breast cancer biopsy (pre-treatment)
hr: 1
her2: 0
mp: 1
pcr: 0
arm: Paclitaxel + Ganitumab
adjustment method: ComBat Adjust

Extracted molecule

total RNA

Extraction protocol

Extraction was performed by Agendia Inc (Irvine, CA).

Label

Cy3

Label protocol

Labeling was performed by Agendia Inc (Irvine, CA).

Hybridization protocol

Hybridization was performed by Agendia Inc (Irvine, CA). This company is referenced by the platforms used: GPL20078 and GPL30494 (updated version of GPL16233).

Scan protocol

Scanning was performed by Agendia Inc (Irvine, CA).

Data processing

For each array, gnormalized signal (log2) data, as provided in the raw files on each patient, was extracted and aligned to its 75th quantile by Agendia as per the manufacturer’s data processing recommendations. All values indicated for non-conformity, as indicated by the 'gIsFeatNonUnifOL' column in the raw files, are NA'd out; and a fixed value of 9.5 was added to avoid negative values. Probeset level data per array were mean-collapsed to the gene level, and genes common to the two platforms identified. Expression data from the first ~900 I-SPY2 patients distributed over the two platforms GPL30494 (updated version of GPL16233; n=334) and GPL20078 (n=545) were combined into a single gene-level dataset after batch-adjusting using ComBat [PMID:16632515]. Linear adjustment factors were derived from the ComBat transformation, per platform, which can be used to batch correct raw files. The subsequent ~100 samples, assayed on GPL20078, were batch corrected using these factors and added to the original set, yielding a normalized expression dataset comprising 988 samples (987 unique patients) x 19,134 (common) genes. The linear adjustment factors and updated probe annotation files per platform are provided, as is the ComBat-normalized data we used in our analysis (supplemental file "ISPY2ResID_AgilentGeneExp_990_FrshFrzn_meanCol_geneLevel_n988.txt").
ISPY2ResID_AgilentGeneExp_990_FrshFrzn_meanCol_geneLevel_n988.txt: Normalized, gene-level, batch-/platform- adjusted pre-treatment Agilent 44K expression data for patients from 10 arms of the neoadjuvant I-SPY 2 platform trial for early stage, aggressive breast cancer (n=988 samples for 987 patients). This is the dataset we used in our pan-arm analysis, and constitutes part of the I-SPY2-990 mRNA/RPPA Data Resource. To derive this matrix, probeset level expression data from the first ~900 I-SPY2 patients distributed over the two platforms GPL30494 (updated version of GPL16233; n=334) and GPL20078 (n=545) were mean-collapsed by gene and combined into a single gene-level dataset after batch-adjusting using ComBat [PMID:16632515]. Linear adjustment factors were derived from the ComBat transformation, per platform, which can be used to batch correct new files. The subsequent ~100 samples, assayed on GPL20078, were batch corrected using these factors and added to the original set, yielding the resulting normalized expression dataset comprising 988 samples (987 unique patients) x 19,134 (common) genes.
ISPY2ResID_AgilentGeneExp_990_FrshFrzn_GPL20078_ProbeLevel_n654.txt: Normalized probeset level pre-treatment gene expression data from Agilent 44K expression arrays Agendia32627_DPv1.14_SCFGplus [with annotation GPL20078 (n=654)] from the I-SPY 2 platform trial for early stage, aggressive breast cancer. Each column represents a patient sample. Per column, green-channel (Cy3) normalized signal intensity (log2) data, as provided in the raw files on each patient, was extracted and aligned to its (within-array) 75th quantile by Agendia as per the manufacturer’s data processing recommendations. All values indicated for non-conformity are NA'd out; and a fixed value of 9.5 was added to avoid negative values.
ISPY2ResID_AgilentGeneExp_990_FrshFrzn_GPL30493.wasGPL16233_ProbeLevel_n334.txt: Normalized probeset level pre-treatment gene expression data from Agilent 44K expression arrays Agilent_human_DiscoverPrint_15746 [with annotation GPL30493 (update of GPL16233; n=334)] from the I-SPY 2 platform trial for early stage, aggressive breast cancer. Each column represents a patient sample. Per column, green-channel (Cy3) normalized signal intensity (log2) data, as provided in the raw files on each patient, was extracted and aligned to its (within-array) 75th quantile by Agendia as per the manufacturer’s data processing recommendations. All values indicated for non-conformity are NA'd out; and a fixed value of 9.5 was added to avoid negative values.
ISPY2_LinearFit_forCombiningExp_20078.txt: This file contains linear adjustment factors (intercept and linear coefficient) derived from the ComBat transformation combining data (~900 samples) from GPL20078 and GPL30493 that can be used to batch-/platform- correct new Agilent 44K expression arrays of the type Agendia32627_DPv1.14_SCFGplus [gene-collapsed using annotation GPL20078] for inclusion in the larger normalized dataset. Application of this transformation to columns from ISPY2ResID_AgilentGeneExp_990_FrshFrzn_GPL20078_ProbeLevel_n654.txt, once mean-collapsed to gene level using GPL20078, results in values nearly identical to those in ISPY2ResID_AgilentGeneExp_990_FrshFrzn_meanCol_geneLevel_n988.txt, the normalized expression matrix in the I-SPY2-990 data resource.
ISPY2_LinearFit_forCombiningExp_GPL30493_wasGPL16233.txt: This file contains linear adjustment factors (intercept and linear coefficient) derived from the ComBat transformation combining data (~900 samples) from GPL20078 and GPL30493 that can be used to batch-/platform- correct new Agilent 44K expression arrays of the type Agilent_human_DiscoverPrint_15746 [with annotation GPL30493 (update of GPL16233)] for inclusion in the larger normalized dataset. Application of this transformation to columns from ISPY2ResID_AgilentGeneExp_990_FrshFrzn_GPL30493.wasGPL16233_ProbeLevel_n334.txt, once mean-collapsed to gene level using GPL20078, results in values nearly identical to those in ISPY2ResID_AgilentGeneExp_990_FrshFrzn_meanCol_geneLevel_n988.txt.
ProbeAnnotation_ISPY2Edit_GPL30493_wasGPL16233.txt: This is the probeset annotation file for GPL30493, a minor update of GPL16233. For each probeset ID (row), the associated gene name, systematic name and DNA sequence is described (columns).

Submission date

Jan 19, 2022

Last update date

May 25, 2022

Contact name

Denise M Wolf

E-mail(s)

[email protected]

Organization name

University of California, San Francisco

Street address

2340 Sutter st

City

San Francisco

State/province

ZIP/Postal code

94143

Country

USA

Platform ID

GPL30493

Series (2)

GSE194040	I-SPY2-990 mRNA/RPPA Data Resource: mRNA component
GSE196096	I-SPY2-990 mRNA/RPPA Data Resource

Relations

Reanalysis of

GSM5481200

Supplementary file	Size	Download	File type/resource
GSM5826554_ISPY2_672734_GPL16233_SingleChannel_FullGenome+.txt.gz	653.9 Kb	(ftp)(http)	TXT
Processed data are available on Series record