Efficient and accurate clustering for large-scale genetic mapping

Veronika Strnadová, Aydin Buluc, Jarrod Chapman, John R. Gilbert, Joseph Gonzalez, Stefanie Jegelka, Daniel Rokhsar, Leonid Oliker

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

7 Scopus citations

Abstract

High-throughput 'next generation' genome sequencing technologies are producing a flood of inexpensive genetic information that is invaluable to genomics research. Sequences of millions of genetic markers are being produced, providing genomics researchers with the opportunity to construct high-resolution genetic maps for many complicated genomes. However, the current generation of genetic mapping tools was designed for the small data setting, and is now limited by the prohibitively slow clustering algorithms employed in the genetic marker-clustering stage. In this work, we present a new approach to genetic mapping based on a fast clustering algorithm that exploits the geometry of the data. Our theoretical and empirical analysis shows that the algorithm can correctly recover linkage groups. Using synthetic and real-world data, including the grand-challenge wheat genome, we demonstrate that our approach can quickly process orders of magnitude more genetic markers than existing tools while retaining, and in some cases even improving, the quality of genetic marker clusters.

Original language: English
Title of host publication: Proceedings - 2014 IEEE International Conference on Bioinformatics and Biomedicine, IEEE BIBM 2014
Editors: Huiru Zheng, Xiaohua Tony Hu, Daniel Berrar, Yadong Wang, Werner Dubitzky, Jin-Kao Hao, Kwang-Hyun Cho, David Gilbert
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 3-10
Number of pages: 8
ISBN (Electronic): 9781479956692
State: Published - 29 Dec 2014
Externally published: Yes
Event: 2014 IEEE International Conference on Bioinformatics and Biomedicine, IEEE BIBM 2014 - Belfast, United Kingdom
Duration: 2 Nov 2014 → 5 Nov 2014

Publication series

Name: Proceedings - 2014 IEEE International Conference on Bioinformatics and Biomedicine, IEEE BIBM 2014

Conference

Conference: 2014 IEEE International Conference on Bioinformatics and Biomedicine, IEEE BIBM 2014
Country/Territory: United Kingdom
City: Belfast
Period: 2/11/14 → 5/11/14
