Please use this identifier to cite or link to this item: http://cmuir.cmu.ac.th/jspui/handle/6653943832/67588
Full metadata record
DC FieldValueLanguage
dc.contributor.authorPrapaporn Techa-Angkoonen_US
dc.contributor.authorKevin L. Childsen_US
dc.contributor.authorYanni Sunen_US
dc.date.accessioned2020-04-02T14:56:19Z-
dc.date.available2020-04-02T14:56:19Z-
dc.date.issued2019-12-24en_US
dc.identifier.issn14712105en_US
dc.identifier.other2-s2.0-85077127673en_US
dc.identifier.other10.1186/s12859-019-3047-3en_US
dc.identifier.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85077127673&origin=inwarden_US
dc.identifier.urihttp://cmuir.cmu.ac.th/jspui/handle/6653943832/67588-
dc.description.abstract© 2019 The Author(s). Background: Gene is a key step in genome annotation. Ab initio gene prediction enables gene annotation of new genomes regardless of availability of homologous sequences. There exist a number of ab initio gene prediction tools and they have been widely used for gene annotation for various species. However, existing tools are not optimized for identifying genes with highly variable GC content. In addition, some genes in grass genomes exhibit a sharp 5 ′- 3′ decreasing GC content gradient, which is not carefully modeled by available gene prediction tools. Thus, there is still room to improve the sensitivity and accuracy for predicting genes with GC gradients. Results: In this work, we designed and implemented a new hidden Markov model (HMM)-based ab initio gene prediction tool, which is optimized for finding genes with highly variable GC contents, such as the genes with negative GC gradients in grass genomes. We tested the tool on three datasets from Arabidopsis thaliana and Oryza sativa. The results showed that our tool can identify genes missed by existing tools due to the highly variable GC contents. Conclusions: GPRED-GC can effectively predict genes with highly variable GC contents without manual intervention. It provides a useful complementary tool to existing ones such as Augustus for more sensitive gene discovery. The source code is freely available at https://sourceforge.net/projects/gpred-gc/.en_US
dc.subjectBiochemistry, Genetics and Molecular Biologyen_US
dc.subjectComputer Scienceen_US
dc.subjectMathematicsen_US
dc.titleGPRED-GC: A Gene PREDiction model accounting for 5 <sup>′</sup>- 3<sup>′</sup> GC gradienten_US
dc.typeJournalen_US
article.title.sourcetitleBMC Bioinformaticsen_US
article.volume20en_US
article.stream.affiliationsMichigan State Universityen_US
article.stream.affiliationsCity University of Hong Kongen_US
article.stream.affiliationsChiang Mai Universityen_US
Appears in Collections:CMUL: Journal Articles

Files in This Item:
There are no files associated with this item.


Items in CMUIR are protected by copyright, with all rights reserved, unless otherwise indicated.