MicroarrayProcessing

Microarray data processing

Welcome to MicroarrayProcessing

MicroarrayProcessing is to automate microarray data processing for multiple public project data.

Step 1. Installation.

Step 2. Download raw data from GEO

I first download raw-data files from GEO (you might be able to do this with R package GEOquery for most cases. But please note that there are some cases you can’t download raw data with GEOquery.) and saved raw data files of each project into separate folders of GSE15059_RAW and GSE28320_RAW.

Step 3. Write up a metadata file.

Then I write up a metadata file that shows information about how to process two project data. The content of this file should be in metadata.txt. There are 9 columns and 19 rows (one header and 18 samples). The descriptions of columns are:

Step 4. Run the command.

I run the R script (RunDataProcessing.R) by

Rscript RunDataProcessing.R metadata.txt.

This script will produce 18 gene expression files where the first column is ID and the second column is gene expression level.