Abstract

Bacterial haplotype reconstruction is critical for selecting proper treatments for diseases caused by unknown haplotypes. Existing methods and tools do not work well on this task because they were usually developed for viral rather than bacterial populations. In this study, we developed BHap, a novel algorithm based on fuzzy flow networks, for reconstructing bacterial haplotypes from next-generation sequencing data. On simulated and experimental datasets, BHap reconstructed the haplotypes of bacterial populations with an average F1 score of 0.87, an average precision of 0.87, and an average recall of 0.88. We also demonstrated that BHap had a low susceptibility to sequencing errors, was capable of reconstructing haplotypes at low coverage, and could handle a wide range of mutation rates. Compared with existing approaches, BHap achieved higher F1 scores, better precision, better recall, and more accurate estimates of the number of haplotypes.

Download

The source code of BHap is available here.

Manual

The manual for running the program is available here.

Authors

Xin Li1, Samaneh Saadat1, Haiyan Hu1, and Xiaoman Li2

1Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL 32826, USA.

2Burnett School of Biomedical Science, University of Central Florida, Orlando, FL 32826, USA.
