PMID- 31913436
OWN - NLM
STAT- MEDLINE
DCOM- 20201009
LR  - 20201009
IS  - 1367-4811 (Electronic)
IS  - 1367-4803 (Print)
IS  - 1367-4803 (Linking)
VI  - 36
IP  - 8
DP  - 2020 Apr 15
TI  - DeepSimulator1.5: a more powerful, quicker and lighter simulator for Nanopore sequencing.
PG  - 2578-2580
LID - 10.1093/bioinformatics/btz963 [doi]
AB  - MOTIVATION: Nanopore sequencing is one of the leading third-generation sequencing technologies. A number of computational tools have been developed to facilitate the processing and analysis of the Nanopore data. Previously, we have developed DeepSimulator1.0 (DS1.0), which is the first simulator for Nanopore sequencing to produce both the raw electrical signals and the reads. However, although DS1.0 can produce high-quality reads, for some sequences, the divergence between the simulated raw signals and the real signals can be large. Furthermore, the Nanopore sequencing technology has evolved greatly since DS1.0 was released. It is thus necessary to update DS1.0 to accommodate those changes.
      RESULTS: We propose DeepSimulator1.5 (DS1.5), all three modules of which have been updated substantially from DS1.0. As for the sequence generator, we updated the sample read length distribution to reflect the newest real reads' features. In terms of the signal generator, which is the core of DeepSimulator, we added one more pore model, the context-independent pore model, which is much faster than the previous context-dependent one. Furthermore, to make the generated signals more similar to the real ones, we added a low-pass filter to post-process the pore model signals. Regarding the basecaller, we added the support for the newest official basecaller, Guppy, which can support both GPU and CPU. In addition, multiple optimizations, related to multiprocessing control, memory and storage management, have been implemented to make DS1.5 a much more amenable and lighter simulator than DS1.0.
      AVAILABILITY AND IMPLEMENTATION: The main program and the data are available at https://github.com/lykaust15/DeepSimulator.
      SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
CI  - (c) The Author(s) 2020. Published by Oxford University Press.
FAU - Li, Yu
AU  - Li Y
AD  - Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia.
FAU - Wang, Sheng
AU  - Wang S
AD  - Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia.
AD  - Tencent AI lab, Shenzhen 518000, China.
FAU - Bi, Chongwei
AU  - Bi C
AD  - Biological and Environmental Sciences and Engineering (BESE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia.
FAU - Qiu, Zhaowen
AU  - Qiu Z
AD  - Institute of Information and Computer Engineering, Northeast Forestry University, Harbin 150040, China.
FAU - Li, Mo
AU  - Li M
AD  - Biological and Environmental Sciences and Engineering (BESE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia.
FAU - Gao, Xin
AU  - Gao X
AD  - Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia.
LA  - eng
PT  - Journal Article
PT  - Research Support, Non-U.S. Gov't
PL  - England
TA  - Bioinformatics
JT  - Bioinformatics (Oxford, England)
JID - 9808944
SB  - IM
MH  - *High-Throughput Nucleotide Sequencing
MH  - Nanopore Sequencing
MH  - *Nanopores
MH  - Sequence Analysis, DNA
MH  - Software
PMC - PMC7178411
EDAT- 2020/01/09 06:00
MHDA- 2020/10/10 06:00
PMCR- 2020/01/08
CRDT- 2020/01/09 06:00
PHST- 2019/08/12 00:00 [received]
PHST- 2019/11/17 00:00 [revised]
PHST- 2020/01/03 00:00 [accepted]
PHST- 2020/01/09 06:00 [pubmed]
PHST- 2020/10/10 06:00 [medline]
PHST- 2020/01/09 06:00 [entrez]
PHST- 2020/01/08 00:00 [pmc-release]
AID - 5698265 [pii]
AID - btz963 [pii]
AID - 10.1093/bioinformatics/btz963 [doi]
PST - ppublish
SO  - Bioinformatics. 2020 Apr 15;36(8):2578-2580. doi: 10.1093/bioinformatics/btz963.