Please use this identifier to cite or link to this item: http://cmuir.cmu.ac.th/jspui/handle/6653943832/70607
Full metadata record
DC FieldValueLanguage
dc.contributor.authorKhishigsuren Davagdorjen_US
dc.contributor.authorVan Huy Phamen_US
dc.contributor.authorNipon Theera-Umponen_US
dc.contributor.authorKeun Ho Ryuen_US
dc.date.accessioned2020-10-14T08:35:24Z-
dc.date.available2020-10-14T08:35:24Z-
dc.date.issued2020-09-02en_US
dc.identifier.issn16604601en_US
dc.identifier.issn16617827en_US
dc.identifier.other2-s2.0-85090613582en_US
dc.identifier.other10.3390/ijerph17186513en_US
dc.identifier.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85090613582&origin=inwarden_US
dc.identifier.urihttp://cmuir.cmu.ac.th/jspui/handle/6653943832/70607-
dc.description.abstract© 2020 by the authors. Licensee MDPI, Basel, Switzerland. Smoking-induced noncommunicable diseases (SiNCDs) have become a significant threat to public health and cause of death globally. In the last decade, numerous studies have been proposed using artificial intelligence techniques to predict the risk of developing SiNCDs. However, determining the most significant features and developing interpretable models are rather challenging in such systems. In this study, we propose an efficient extreme gradient boosting (XGBoost) based framework incorporated with the hybrid feature selection (HFS) method for SiNCDs prediction among the general population in South Korea and the United States. Initially, HFS is performed in three stages: (I) significant features are selected by t-test and chi-square test; (II) multicollinearity analysis serves to obtain dissimilar features; (III) final selection of best representative features is done based on least absolute shrinkage and selection operator (LASSO). Then, selected features are fed into the XGBoost predictive model. The experimental results show that our proposed model outperforms several existing baseline models. In addition, the proposed model also provides important features in order to enhance the interpretability of the SiNCDs prediction model. Consequently, the XGBoost based framework is expected to contribute for early diagnosis and prevention of the SiNCDs in public health concerns.en_US
dc.subjectEnvironmental Scienceen_US
dc.subjectMedicineen_US
dc.titleXgboost-based framework for smoking-induced noncommunicable disease predictionen_US
dc.typeJournalen_US
article.title.sourcetitleInternational Journal of Environmental Research and Public Healthen_US
article.volume17en_US
article.stream.affiliationsTon-Duc-Thang Universityen_US
article.stream.affiliationsChungbuk National Universityen_US
article.stream.affiliationsChiang Mai Universityen_US
Appears in Collections:CMUL: Journal Articles

Files in This Item:
There are no files associated with this item.


Items in CMUIR are protected by copyright, with all rights reserved, unless otherwise indicated.