Skip to main content Accessibility help
×
Hostname: page-component-8448b6f56d-c4f8m Total loading time: 0 Render date: 2024-04-19T23:39:20.537Z Has data issue: false hasContentIssue false

6 - Data visualization and the DDP process

Published online by Cambridge University Press:  05 February 2016

Ke Xu
Affiliation:
Bristol-Myers-Squibb Inc
William T. Loging
Affiliation:
Mount Sinai School of Medicine, New York
Get access

Summary

Data visualization denotes the techniques of visually presenting complex data sets to achieve goals such as displaying multiple data dimensions simultaneously, connecting related data points from data sets, or showing data distribution patterns. They are of great value for data processing, data analysis, and data presentation activities.

Genomics and functional genomics are the major driving forces for the development and utilization of visualization tools in biological fields. Following the completion of genomic sequencing projects of human and other model organisms around the beginning of this century, our knowledge of genes has jumped to the tens of thousands per species. Expression profiling microarray can generate millions of data points per experiment. The challenge of the huge data set size and the need to integrate different data sources in analyses prompted significant research and development work by both academic and industrial bioinformaticians. As a result, many visualization methods, proposals, and tools for biological data have been developed thus far. This chapter will describe the problems and solutions for the visualization of three basic and largest (thus, most challenging) genomics/functional genomics data types. More specifically, the first two sections will discuss visualization of sequence data and pathway/gene network data, which are two data types specific to genomics and other biology fields. In the third section, we will review visualization methods of numeric data, such as expression profiling data, proteomic data, and genotyping data. Most of the techniques in the section can also be applied to other areas. However, some topics, such as viewing numeric data in the context of genome or pathways, are still biology-specific.

Sequence and genomes

The genome is the complete set of genetic materials for an organism, which includes genes, regulatory and replication-related sequences, as well as non-functional intergenic regions. For most organisms other than RNA viruses, long linear or circular DNA molecules form the biochemical basis of the genome that stores all the genetic information. Visualization of the genome refers to the visual display of the DNA sequences and associated annotations. Depending on the visualization purposes, genome visualization tools can be classified into two categories: sequence viewer for visualizing sequence and annotations, and genome alignment viewer, for comparing different genomes.

Type
Chapter
Information
Publisher: Cambridge University Press
Print publication year: 2016

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Berriman, M. and Rutherford, K.Viewing and annotating sequence data with Artemis. Briefings in Bioinformatics. 2003;4(2):124–132.CrossRefGoogle ScholarPubMed
Frazer, K. A., Pachter, L., Poliakov, A., Rubin, E. M. and Dubchak, I.VISTA: Computational tools for comparative genomics. Nucleic Acids Research. 2004;32: W273–W279.CrossRefGoogle ScholarPubMed
Junker, B. H., Klukas, C. and Schreiber, F.VANTED: A system for advanced data analysis and visualization in the context of biological networks. BMC Bioinformatics. 2006;7:109–121.Google ScholarPubMed
Le Novère, N. L., Hucka, M., Mi, H., et al. The systems biology graphical notation. Nature Biotechnology, 2009;27(8):735–741.Google ScholarPubMed
Lewis, S. E., Searle, S. M., Harris, N., et al. Apollo: A sequence annotation editor. Genome Biology. 2002;3(12):research0082.1-0082.14.CrossRefGoogle ScholarPubMed
Schwartz, S., Zhang, Z., Frazer, K. A., et al. PipMaker – A web server for aligning two genomic DNA sequences. Genome Research. 2000;10:577–586.CrossRefGoogle ScholarPubMed

Save book to Kindle

To save this book to your Kindle, first ensure coreplatform@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

  • Data visualization and the DDP process
    • By Ke Xu, Bristol-Myers-Squibb Inc
  • Edited by William T. Loging, Mount Sinai School of Medicine, New York
  • Book: Bioinformatics and Computational Biology in Drug Discovery and Development
  • Online publication: 05 February 2016
  • Chapter DOI: https://doi.org/10.1017/CBO9780511989421.007
Available formats
×

Save book to Dropbox

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.

  • Data visualization and the DDP process
    • By Ke Xu, Bristol-Myers-Squibb Inc
  • Edited by William T. Loging, Mount Sinai School of Medicine, New York
  • Book: Bioinformatics and Computational Biology in Drug Discovery and Development
  • Online publication: 05 February 2016
  • Chapter DOI: https://doi.org/10.1017/CBO9780511989421.007
Available formats
×

Save book to Google Drive

To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.

  • Data visualization and the DDP process
    • By Ke Xu, Bristol-Myers-Squibb Inc
  • Edited by William T. Loging, Mount Sinai School of Medicine, New York
  • Book: Bioinformatics and Computational Biology in Drug Discovery and Development
  • Online publication: 05 February 2016
  • Chapter DOI: https://doi.org/10.1017/CBO9780511989421.007
Available formats
×