This module contains a set of R scripts that generate distributions of summary-level metrics across all 69 standard samples in the Complete Genomics Public Genome Repository. The main file to run is analysis.summary.R, which sources all the other files. Also included are the plots generated by the scripts, in both PDF and PNG format.
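As a minimal sketch of how the main script might be run from an R session: the helper name run_summary_analysis and the directory argument are hypothetical and not part of the module. The sketch also assumes that analysis.summary.R refers to the other scripts by relative path, which is why it uses the chdir option of source().

```r
# Hypothetical helper (not part of the module): run the main script from the
# directory where the module's files were unpacked.
run_summary_analysis <- function(script_dir) {
  main_script <- file.path(script_dir, "analysis.summary.R")
  if (!file.exists(main_script)) {
    stop("analysis.summary.R not found in: ", script_dir)
  }
  # chdir = TRUE temporarily sets the working directory to the script's own
  # directory while it runs. This helps only if the script sources the other
  # files by relative path, which is an assumption here.
  source(main_script, chdir = TRUE)
  invisible(main_script)
}

# Example call; replace the placeholder with the actual unpack location:
# run_summary_analysis("path/to/unpacked/module")
```

Calling the script non-interactively with Rscript analysis.summary.R from inside the module directory should be equivalent under the same assumption.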
Note on the Complete Genomics Public Genome Repository:
Complete Genomics provides free public access to a variety of whole human genome data sets generated from Complete Genomics’ sequencing services. This public genome repository comprises genome results from both our Standard Sequencing Service (69 standard, non-diseased samples) and the Cancer Sequencing Service (two matched tumor and normal sample pairs).
Download the tool here.