Please use this identifier to cite or link to this item: http://cmuir.cmu.ac.th/jspui/handle/6653943832/55537
Full metadata record
DC Field | Value | Language
dc.contributor.author | Chanin Mahatthanachai | en_US
dc.contributor.author | Kanchit Malaivongs | en_US
dc.contributor.author | Nuttiya Tantranont | en_US
dc.contributor.author | Ekkarat Boonchieng | en_US
dc.date.accessioned | 2018-09-05T02:57:39Z | -
dc.date.available | 2018-09-05T02:57:39Z | -
dc.date.issued | 2016-02-08 | en_US
dc.identifier.other | 2-s2.0-84964341233 | en_US
dc.identifier.other | 10.1109/ICSEC.2015.7401423 | en_US
dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84964341233&origin=inward | en_US
dc.identifier.uri | http://cmuir.cmu.ac.th/jspui/handle/6653943832/55537 | -
dc.description.abstract | © 2015 IEEE. This research aims to develop an efficient technique for Thai word segmentation, especially for words that do not exist in dictionaries. The researchers developed a grammar- and rule-based model for Thai word segmentation to handle words not found in dictionaries. The model, intended as the best approach to word segmentation, applies a segmentation technique developed by the researchers called PTTSF (Parsing Thai Text with Syntax and Feature of Word). The system first finds the boundary of each word in a Thai sentence. When it encounters a word that does not exist in the dictionary, or a meaningless word, the longest-matching algorithm alone cannot resolve it, so rules must be specified to handle such cases. In this study, 28 rules were created, and a digraph method was used to find the word segmentation pattern with the highest probability based on grammatical principles. Once the word boundaries have been found, the correctly segmented output can be used in further processing. The system was evaluated mainly on its word segmentation accuracy. The results showed that the derived mapping technique could segment words that do not exist in the dictionary with an average accuracy of over 90% across the whole document. However, the researchers encountered a problem with ambiguous words; although it rarely occurs, it can affect the accuracy of word segmentation. | en_US
dc.subject | Computer Science | en_US
dc.subject | Decision Sciences | en_US
dc.title | Development of Thai word segmentation technique for solving problems with unknown words | en_US
dc.type | Conference Proceeding | en_US
article.title.sourcetitle | ICSEC 2015 - 19th International Computer Science and Engineering Conference: Hybrid Cloud Computing: A New Approach for Big Data Era | en_US
article.stream.affiliations | Chiang Mai Rajabhat University | en_US
article.stream.affiliations | The Royal Society of Thailand | en_US
article.stream.affiliations | Chiang Mai University | en_US
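
The abstract above says that longest matching cannot resolve words missing from the dictionary. The following is a minimal sketch of generic greedy longest matching that makes this failure visible. It is not the authors' PTTSF technique and does not reproduce their 28 rules or the digraph step. The function name `longest_match_segment`, the toy dictionary, and the Latin-letter input strings (standing in for unspaced Thai text) are all assumptions made for illustration.

```python
def longest_match_segment(text, dictionary, max_len=None):
    """Greedy left-to-right longest matching.

    Returns a list of (token, is_known) pairs. Characters that start no
    dictionary word are emitted one at a time with is_known=False.
    """
    if max_len is None:
        max_len = max((len(word) for word in dictionary), default=1)
    tokens = []
    i = 0
    while i < len(text):
        match = None
        # Try the longest candidate first, then shrink the window.
        for j in range(min(len(text), i + max_len), i, -1):
            if text[i:j] in dictionary:
                match = text[i:j]
                break
        if match is None:
            # No dictionary word starts here: flag a single unknown character.
            tokens.append((text[i], False))
            i += 1
        else:
            tokens.append((match, True))
            i += len(match)
    return tokens


# Hypothetical toy dictionary; "zog" is deliberately absent.
TOY_DICTIONARY = {"the", "cat", "sat", "them", "at"}

known = longest_match_segment("thecatsat", TOY_DICTIONARY)
unknown = longest_match_segment("thezogsat", TOY_DICTIONARY)
print(known)
print(unknown)
```

In this sketch the unknown string "zog" is broken into single flagged characters rather than kept as one word. That fragmentation is the kind of case the abstract says needs additional rules to resolve.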
Appears in Collections: CMUL: Journal Articles
