Skip to content

Brown CBC's Bioconductor Workshop 2 Materials

Docs nbviewer License

Goals

This workshop, 'Analyses of RNA-seq and ChiP-seq with R/Bioconductor', is primarily designed to introduce attendees DEseq and its analysis pipeline; it is an intermediate-level lecture that presupposes some knowledge of R like that found in our 'Fundamentals of R' workshop. This workshop is not a direct sequel to 'R/Bioconductor Workshop for Genomic Data Analysis', and that material is not required to understand this content.

Here, users will learn how to perform exploratory data analysis on RNA-seq data, then use that data (if appropriate) to conduct a complete differential expression and gene-set analysis. The workshop will discuss how to interpret the results of the exploratory analyses it presents and what they say about the data's quality, as well as offer advice on how to make the data more suitable for analysis. It takes the user from a raw table of counts all the way to a set of differentially expressed genes and their pathways - the traditional endpoint for an RNA-seq analysis.

Finally, this workshop also includes a brief introduction to ChIP-seq analysis. This topic exists in its own self-contained notebook (Notebook 4), for which the RNA-seq material is not a prerequisite.

Workshop Topic Overview

This workshop, 'Analyses of RNA-seq and ChiP-seq with R/Bioconductor', is designed to complement (rather than succeed) the Bioconductor training provided in the first Bioconductor workshop; it is an intermediate-level lecture that presupposes some knowledge of R like that found in our 'Fundamentals of R' workshop. The topics presented herein include the following:

  • Standard analyses for RNA-seq Data

    • Approaches to processing raw data from RNA seq experiments
    • Annotating and importing expression counts
    • Basic exploratory data analysis
    • Assessing model fit and dispersion
  • Gene-wise Testing in Depth, Including Multiple Testing

    • Differential expression analysis with the 'edgeR' package
    • Comparison of the 'edgeR' and 'DESeq2' packages for differential expression analysis
    • Hypothesis testing for count data
    • Approaches to multiple hypothesis testing result correction (e.g. false discovery rate calculation)
    • Visualizing differentially expressed genes/transcripts
  • Gene Set Analysis

    • Over-representation analysis
    • Gene set enrichment analysis
    • Network-based enrichment analysis
    • Region-based enrichment analysis
    • Resources for these types of analyses (e.g. GO, KEGG, MSigDb)
  • Analyses of ChIP-seq Data

    • Introduction to standard approaches used in ChIP Seq
    • Analyses for peak calling
    • Calculation of differential binding across samples
    • The annotation and plotting of peaks

Notebook

To access a read-only version of the workshop notebooks (complete with full output), please click the above link; you will be redirected to Jupyter's NBViewer utility. Simply click the name of the notebook and it will be rendered as HTML.

Authors

Levi Waldron

Contact

For assistance, contact cbc-help@brown.edu - this is our general help line, so please specify that your issue is with this site's contents

Original Time and Location

Date: 2/19/2018

Location: Campus Center