Next-Gen Sequence Analysis Workshop (2015)¶
This is the schedule for the 2015 MSU NGS course.
This workshop has a Workshop Code of Conduct.
Download all of these materials or visit the GitHub repository.
Meal Times: Breakfast 7-9, Lunch 12-1, Dinner 6-7 (Unless noted below)
This year we also ran a third week workshop focused on intermediate and advanced skills; please see the schedule at Week 3.
Day | Schedule |
---|---|
Monday 8/10 |
|
Tuesday 8/11 |
|
Wed 8/12 |
|
Thursday 8/13 |
|
Friday 8/14 |
|
Saturday 8/15 |
|
Sunday 8/16 |
|
Monday 8/17 |
|
Tuesday 8/18 |
|
Wed 8/19 |
|
Thursday 8/20 |
|
Friday 8/21 |
|
Dramatis personae¶
Instructors:
- C Titus Brown
- Chris Chandler
- Ian Dworkin
- Adina Howe
- Matt MacManes
- Meg Staton
TAs:
- Amanda Charbonneau
- Lisa Cohen
- Ryan Williams
- Phil Brooks
Lecturers:
- Nick Loman
- Torsten Seemann
- Erich Schwarz
She Who Drives Many Places:
- Jessica Mizzi
Papers and References¶
Books¶
Practical Computing for Biologists
This is a highly recommended book for people looking for a systematic presentation on shell scripting, programming, UNIX, etc.
RNAseq¶
Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks, Trapnell et al., Nat. Protocols.
One paper that outlines a pipeline with the tophat, cufflinks, cuffdiffs and some associated R scripts.
Statistical design and analysis of RNA sequencing data., Auer and Doerge, Genetics, 2010.
A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae. Nookaew et al., Nucleic Acids Res. 2012.
Challenges and strategies in transcriptome assembly and differential gene expression quantification. A comprehensive in silico assessment of RNA-seq experiments Vijay et al., 2012.
Computational methods for transcriptome annotation and quantification using RNA-seq, Garber et al., Nat. Methods, 2011.
Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments., Bullard et al., 2010.
A comparison of methods for differential expression analysis of RNA-seq data, Soneson and Delorenzi, BMC Bioinformatics, 2013.
Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples., Wagner et al., Theory Biosci, 2012. Also see this blog post explaining the paper in detail.
Computing and Data¶
- A Quick Guide to Organizing Computational Biology Projects, Noble, PLoS Comp Biology, 2009.
- Willingness to Share Research Data Is Related to the Strength of the Evidence and the Quality of Reporting of Statistical Results, Wicherts et al., PLoS One, 2011.
- Got replicability?, McCullough, Economics in Practice, 2007.
Also see this great pair of blog posts on organizing projects and research workflow.
Links¶
Resources¶
-
A high quality question & answer Web site.
-
A discussion and information site for next-generation sequencing.
-
A large number of open and reusable tutorials on the shell, programming, version control, etc.
Blogs¶
http://www.genomesunzipped.org/
Genomes Unzipped.
-
Titus’s blog.
-
Blue Collar Bioinformatics
-
Mass Genomics
-
Next Genetics
http://gettinggeneticsdone.blogspot.com/
Getting Genetics Done
http://omicsomics.blogspot.com/
Omics! Omics!
-
Nick Loman’s lab notebook
Complete table of contents¶
- Day 1 - Getting started with Amazon
- Day 2 – Command line & quality control
- Variant calling
- Interval Analysis and Visualization
- Running bedtools
- Understanding the SAM format
- Control Flow and loops in R
- Control Flow
- ifelse()
- Other vectorized ways of control flow.
- Simple loops
- for loop
- So for the for loop we would do the following:
- More avoiding loops
- The step above creates a vector of n NA’s. They will be replaced sequentially with the random numbers as we generate them (using a function like the above one).
- Variant calling and exploration of polymorphisms
- A complete de novo assembly and annotation protocol for mRNASeq
- Assembly with SOAPdenovo-Trans
- Mapping and Counting
- Analyzing RNA-seq counts with DESeq
- RNA-seq: mapping to a reference genome with tophat and counting with HT-seq
- RNA-seq: mapping to a reference genome with BWA and counting with HTSeq
- Booting an Amazon AMI
- Updating the operating system
- Install software
- Preparing the reference
- Mapping
- Genome comparison and phylogeny
- Interactive visual genome comparison with Mauve
- Running a genome alignment
- Booting an Amazon AMI
- Logging in & updating the operating system
- Packages to install
- Getting the E. coli genome data
- What is the nearest reference genome?
- Ordering the assembly contigs against a nearby reference
- Making a phylogeny of many E. coli assemblies
- From tree file to figures
- Automation, scripts, git, and GitHub
- MG-RAST and its API
- So you want to get some sequencing data out of NCBI?
- Looking at k-mer abundance distributions
- PacBio Tutorial
- RNASeq Transcript Mapping and Counting (BWA and HtSeq Flavor)
- Evaluating the quality of your short reads, and trimming them
- Getting started with Amazon EC2
- Instructor’s Guide to ANGUS Materials
- Workshop Code of Conduct
LICENSE: This documentation and all textual/graphic site content is licensed under the Creative Commons - 0 License (CC0) -- fork @ github. Presentations (PPT/PDF) and PDFs are the property of their respective owners and are under the terms indicated within the presentation.
comments powered by Disqus