Abstract
The bacterial haplotype reconstruction is critical for selecting proper treatments for diseases caused by unknown haplotypes. Existing methods and tools do not work well on this task, because they are usually developed for viral instead of bacterial populations. In this study, we developed BHap, a novel algorithm based on fuzzy flow networks, for reconstructing bacterial haplotypes from next generation sequencing data. Tested on simulated and experimental datasets, we showed that BHap was capable of reconstructing haplotypes of bacterial populations with an average F1 score of 0.87, an average precision of 0.87 and an average recall of 0.88. We also demonstrated that BHap had a low susceptibility to sequencing errors, was capable of reconstructing haplotypes with low coverage and could handle a wide range of mutation rates. Compared with existing approaches, BHap outperformed them in terms of higher F1 scores, better precision, better recall and more accurate estimation of the number of haplotypes.
Download
The source code of BHap is available here.
Manual
The manual for running the program is avai lable here.
Authors
Xin Li1,Samaneh Saadat1, Haiyan Hu 1, and Xiaoman Li2
1Department of Electrical Engineering and Computer Science, University Of Central Florida, Orlando, FL 328 26, USA.
2Burnett School of Biomedi cal Science, University Of Central Florida, Orlando, FL 32826, USA.