Please use this identifier to cite or link to this item:
http://cmuir.cmu.ac.th/jspui/handle/6653943832/75447
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Pree Thiengburanathum | en_US |
dc.contributor.author | Phasit Charoenkwan | en_US |
dc.date.accessioned | 2022-10-16T06:59:39Z | - |
dc.date.available | 2022-10-16T06:59:39Z | - |
dc.date.issued | 2021-03-03 | en_US |
dc.identifier.other | 2-s2.0-85106616298 | en_US |
dc.identifier.other | 10.1109/ECTIDAMTNCON51128.2021.9425718 | en_US |
dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85106616298&origin=inward | en_US |
dc.identifier.uri | http://cmuir.cmu.ac.th/jspui/handle/6653943832/75447 | - |
dc.description.abstract | There are numerous tweeter user accounts in Thailand and many toxic comments are being generated every day on this platform. Sentimental Analysis can be used as a tool to identify toxic comments. In this study, two feature extraction techniques, including Bag of Words (BOW) and Term frequency-inverse document (TF-IDF), were investigated. Additionally, the performance of ten well-known traditional classifiers, along with three deep-learning approaches including Convolutional Neural Network (CNN), Long-short-Term memory (LSTM) and pretrained Bidirectional Encoder Representations (BERT), were compared with the public Toxicity Thai tweeter corpus the experiments reveal that by combining Bag of Words (BOW) with the Extra-Tree classifier, researchers were able to archive the highest F1-score of 0.72, classification accuracy rate of 72.27% and AUC value of 0.77 using the test set in contrast to other classifiers and other deep-learning techniques. Feature importance, correlation and impacts were also investigated through the use of SHapley Additive exPlanations (SHAP) diagram. | en_US |
dc.subject | Arts and Humanities | en_US |
dc.subject | Computer Science | en_US |
dc.subject | Engineering | en_US |
dc.title | A Performance Comparison of Supervised Classifiers and Deep-learning Approaches for Predicting Toxicity in Thai Tweets | en_US |
dc.type | Conference Proceeding | en_US |
article.title.sourcetitle | 2021 Joint 6th International Conference on Digital Arts, Media and Technology with 4th ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunication Engineering, ECTI DAMT and NCON 2021 | en_US |
article.stream.affiliations | Chiang Mai University | en_US |
Appears in Collections: | CMUL: Journal Articles |
Files in This Item:
There are no files associated with this item.
Items in CMUIR are protected by copyright, with all rights reserved, unless otherwise indicated.