RNA-seq data analysis: from raw reads to differentially expressed genes

by | Dec 4, 2015

This course material introduces the central concepts, analysis steps and file formats in RNA-seq data analysis. It covers the analysis from quality control to differential expression detection, and workflow construction and several data visualizations are also practised. The material consists of 10-30 minute lectures intertwined with hands-on exercises, and it can be accomplished in a day. As the user-friendly Chipster software is used in the exercises, no prior knowledge of R/Bioconductor or Unix ir required, and the course is thus suitable for everybody. Our book RNA-seq data analysis: A practical approach (CRC Press) can be used as background reading.

The following topics and analysis tools are covered:
1. Introduction to the Chipster analysis platform
2. Quality control of raw reads (FastQC, PRINSEQ)
3. Preprocessing (Trimmomatic, PRINSEQ)
4. Alignment to reference genome (TopHat2)
5. Alignment level quality control (RseQC)
6. Quantitation (HTSeq)
7. Experiment level quality control with PCA and MDS plots
8. Differential expression analysis (DESeq2, edgeR)
-dispersion estimation
-statistical testing
-controlling for batch effects, multifactor designs
-multiple testing correction
9. Visualization of reads and results
-genome browser
-Venn diagram
-volcano plot
-plotting normalized counts for a gene
-expression profiles
10. Experimental design