Practical Bioinformatics for Life Scientists
Week 13, Lecture 26
11/17/2011
István Albert
Bioinformatics Consulting Center, Penn State

Visualizing high dimensionality data

ggplot2, by Hadley Wickham: http://had.co.nz/
There is nothing like it in any programming environment!

Parts of this presentation follow the ggplot2 tutorial "Getting started with ggplot2": http://had.co.nz/ggplot2/book/qplot.pdf
We will start out with example plots from this manual.
Then at the end we generate a peak distribution plot around gene start sites.

Install ggplot2
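The install step can be sketched as follows. This is a minimal example that assumes an R session with network access; the CRAN mirror URL is an illustrative choice and is not taken from the slides.

```r
# Install ggplot2 from CRAN only if it is not already present.
# The mirror below is an assumption; any CRAN mirror should work.
if (!requireNamespace("ggplot2", quietly = TRUE)) {
  install.packages("ggplot2", repos = "https://cloud.r-project.org")
}

# Load the package; this has to be repeated in every new R session.
library(ggplot2)
```

Installation is needed only once per R installation, while `library(ggplot2)` goes at the top of every script.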

diamonds.txt (data comes with ggplot2)

NOTE: For the next few slides I will be changing only line 10 (sometimes we use all of the data, sometimes just the small data).
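A rough sketch of working with the full and the small data might look like the following. It loads the `diamonds` data frame bundled with ggplot2 rather than reading diamonds.txt, whose layout is not shown here. The names `dsmall`, `p_scatter` and `p_hist`, the seed, the sample size and the bin width are all illustrative assumptions. The plots use `ggplot()` because `qplot()`, which the tutorial is built around, has been deprecated in recent ggplot2 releases.

```r
library(ggplot2)

# diamonds ships with ggplot2 as a data frame.
data(diamonds)

# A small random subset standing in for the "small data".
# The seed and the sample size of 100 are arbitrary choices.
set.seed(1410)
dsmall <- diamonds[sample(nrow(diamonds), 100), ]

# Scatterplot of price against carat on the small subset.
p_scatter <- ggplot(dsmall, aes(x = carat, y = price)) +
  geom_point()

# Histogram of carat on the full data; the bin width is an arbitrary choice.
p_hist <- ggplot(diamonds, aes(x = carat)) +
  geom_histogram(binwidth = 0.1)

print(p_scatter)
print(p_hist)
```

Swapping `dsmall` for `diamonds` (or the other way around) in the `ggplot()` call is the kind of one-line change the note above describes.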
ggplot2 concepts

geometry: what the plot looks like
faceting: how many plots/panels
statistics: transformation on the data
positioning: fine-tunes locations in the plot
scales: maps data to an x,y coordinate

Faceting - multiplots

Faceting, shapes and colors: the scripts are in the supporting data, located in the 26.tar.gz file on the website.

Recall intersecting peaks with genes from the ChIP-Seq lecture. We need an R script to prepare the data for plotting. The code is included in this week's download.

Homework 26
Generate four plots with ggplot2 that demonstrate one or more features including: histograms, shapes, colors, faceting...
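The peak distribution around gene start sites could be prepared and plotted along these lines. This is only a sketch and not the script from the download: the `genes` and `peaks` tables, their column names and all values are made up for illustration, and the nearest-start-site rule is one simple way to pair peaks with genes. It also shows faceting, here one histogram panel per strand.

```r
library(ggplot2)

# Hypothetical inputs: column names and values are invented for this sketch.
set.seed(26)
genes <- data.frame(
  chrom  = "chr1",
  start  = seq(10000, 100000, by = 10000),
  strand = rep(c("+", "-"), 5),
  stringsAsFactors = FALSE
)
genes$end <- genes$start + 2000

# The gene start site depends on the strand.
genes$tss <- ifelse(genes$strand == "+", genes$start, genes$end)

# Simulated peaks scattered around randomly chosen gene starts.
peaks <- data.frame(
  chrom = "chr1",
  pos   = sample(genes$tss, 500, replace = TRUE) + round(rnorm(500, sd = 300)),
  stringsAsFactors = FALSE
)

# For each peak, find the row of the nearest gene start on the same chromosome.
# Assumes every peak's chromosome has at least one gene.
idx <- vapply(seq_len(nrow(peaks)), function(k) {
  same <- which(genes$chrom == peaks$chrom[k])
  same[which.min(abs(genes$tss[same] - peaks$pos[k]))]
}, integer(1))

# Signed distance: negative values lie upstream of the gene start.
strand_sign <- ifelse(genes$strand[idx] == "+", 1, -1)
tss_dist <- data.frame(
  distance = strand_sign * (peaks$pos - genes$tss[idx]),
  strand   = genes$strand[idx],
  stringsAsFactors = FALSE
)

# Histogram of distances, faceted into one panel per strand.
p_peaks <- ggplot(tss_dist, aes(x = distance)) +
  geom_histogram(binwidth = 50) +
  facet_wrap(~ strand) +
  labs(x = "Distance from gene start (bp)", y = "Number of peaks")

print(p_peaks)
```

Flipping the sign on the minus strand keeps "upstream" on the same side of zero in both panels, so the two histograms can be compared directly.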
