Cancer Data Set

To provide the scientific community with public access to data generated from two paired tumor/normal cancer samples, Complete Genomics sequenced and analyzed cell-line samples derived from patients with breast cancer (invasive ductal carcinomas). The cell line-derived DNA samples are housed at ATCC. Three of the samples were sequenced to an average genome-wide coverage of 123X, and the fourth to 92X.

Please see our Public Genome Data Repository Service Note for further sample information and complete instructions for downloading the data from our FTP site.

The data are freely available for use in a publication with the following stipulations:

  1. The Coriell and ATCC Repository number(s) of the cell line(s) or the DNA sample(s) must be cited in publications or presentations that are based on the use of these materials.
  2. The Complete Genomics Science paper must be referenced (R. Drmanac et al., Science 327(5961), 78. [DOI: 10.1126/science.1181498]).
  3. The version number of the Complete Genomics assembly software with which the data were generated must be referenced. This can be found in the header of the summary.tsv file (# Software_Version).
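
For stipulation 3, the version can be pulled from the file header with a short script. The following is a minimal Python sketch, not an official Complete Genomics tool. It assumes the header consists of leading lines that start with "#", each holding a field name followed by whitespace (or a tab) and a value, and it matches the field name case-insensitively because the exact spelling may differ between software releases. The function name read_software_version is hypothetical.

```python
import sys


def read_software_version(lines):
    """Return the Software_Version header value, or None if it is absent.

    Assumption: header lines start with '#', followed by a field name,
    whitespace (or a tab), and the value. The header ends at the first
    non-blank line that does not start with '#'.
    """
    for line in lines:
        if not line.startswith("#"):
            if line.strip() == "":
                continue  # tolerate blank lines inside or after the header
            break  # first data line reached; the header is over
        body = line[1:].strip()
        parts = body.split(None, 1)  # split the field name from its value
        if parts and parts[0].lower() == "software_version":
            return parts[1].strip() if len(parts) > 1 else None
    return None


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python read_version.py path/to/summary.tsv
    with open(sys.argv[1], encoding="utf-8") as handle:
        print(read_software_version(handle))
```

The function accepts any iterable of lines, so it works on an open file handle as well as on a list of strings. If it returns None, open the file and inspect the header manually, since the layout assumed here may not match your release.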

For further technical assistance:


Call toll-free: 1-855-CMPLETE or 1-855-267-5383

Additional documentation can be found here.