Data Management and Analysis

Sequencing complete human genomes generates substantial amounts of data that must be managed, stored and analyzed. To address this potential need by our customers, we have built a genomics data processing facility with computing infrastructure for managing both small- and large-scale genomic sequencing projects. Our proprietary assembly software uses advanced data analysis algorithms and statistical modeling techniques to accurately reconstruct over 90% of the complete human genome from approximately two billion 70-base reads. After assembling the genomic data, we use our analysis software to identify and annotate key differences, or variants, in each genome.  Detected variants inlcude SNPs, indels, substitutions, Copy Number Variants (CNVs), Structural Variants (SVs) and mobile element insertions (MEIs).

By using our analytical tools and data management software, our customers can significantly reduce their investments in computing infrastructure. Our customers are provided with reliable access to assembled and annotated sequence data in multiple formats to ease data sharing and comparative analyses. As the reagent cost of sequencing declines, we believe that the cost and complexity of data analysis and management will emerge as the primary limiting factor for conducting complete human genome analysis.

Copyright © 2011 Complete Genomics Incorporated. All rights reserved. Use of this website signifies your agreement to the Terms of Use and Online Privacy Policy. Contact Webmaster.