-
PacBio long read-assembled draft genome of Pythium insidiosum strain Pi-S isolated from a Thai patient with pythiosis
- Back
Metadata
Document Title
PacBio long read-assembled draft genome of Pythium insidiosum strain Pi-S isolated from a Thai patient with pythiosis
Author
Krajaejun T. Patumcharoenpol P. Rujirawat T. Kittichotirat W. Tangphatsornruang S. Lohnoo T. Yingyong W.
Affiliations
Department of Pathology Faculty of Medicine Ramathibodi Hospital Mahidol University Bangkok Thailand; Interdisciplinary Graduate Program in Bioscience Faculty of Science Kasetsart University Bangkok Thailand; Research Center Faculty of Medicine Ramathibodi Hospital Mahidol University Bangkok Thailand; Systems Biology and Bioinformatics Research Group Pilot Plant Development and Training Institute King Mongkut抯 University of Technology Thonburi Bangkhuntien Bangkok Thailand; National Omics Center National Science and Technology Development Agency Pathum Thani 12120 Thailand
Type
Article
Source Title
BMC Research Notes
ISSN
17560500
Year
2023
Volume
16
Issue
1
Open Access
All Open Access Gold Green
Publisher
BioMed Central Ltd
DOI
10.1186/s13104-023-06532-7
Abstract
Objectives: Pythium insidiosum is the causative agent of pythiosis a difficult-to-treat condition in humans and animals worldwide. Biological information about this filamentous microorganism is sparse. Genomes of several P. insidiosum strains were sequenced using the Illumina short-read NGS platform producing incomplete genome sequence data. PacBio long-read platform was employed to obtain a better-quality genome of Pythium insidiosum. The obtained genome data could promote basic research on the pathogen抯 biology and pathogenicity. Data description: gDNA sample was extracted from the P. insidiosum strain Pi-S for whole-genome sequencing by PacBio long-read NGS platform. Raw reads were assembled using CANU (v2.1) polished using ARROW (SMRT link version 5.0.1) aligned with the original raw PacBio reads using pbmm2 (v1.2.1) consensus sequence checked using ARROW and gene predicted using Funannotate pipeline (v1.7.4). The genome completion was assessed using BUSCO (v4.0.2). As a result 840 contigs (maximum length: 1.3燤b; N 50: 229.9 Kb; L 50: 70) were obtained. Sequence assembly showed a genome size of 66.7燤b (178x coverage; 57.2% G-C content) that contained 20 375 ORFs. A BUSCO-based assessment revealed 85.5% genome completion. All assembled contig sequences have been deposited in the NCBI database under the accession numbers BBXB02000001 - BBXB02000840. ? 2023 BioMed Central Ltd. part of Springer Nature.
License
CC BY
Rights
Authors
Publication Source
WOS