Please use this identifier to cite or link to this item:
http://cmuir.cmu.ac.th/jspui/handle/6653943832/75445
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Desheng Yang | en_US |
dc.contributor.author | Pree Thiengburanathum | en_US |
dc.date.accessioned | 2022-10-16T06:59:38Z | - |
dc.date.available | 2022-10-16T06:59:38Z | - |
dc.date.issued | 2021-03-03 | en_US |
dc.identifier.other | 2-s2.0-85106631239 | en_US |
dc.identifier.other | 10.1109/ECTIDAMTNCON51128.2021.9425701 | en_US |
dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85106631239&origin=inward | en_US |
dc.identifier.uri | http://cmuir.cmu.ac.th/jspui/handle/6653943832/75445 | - |
dc.description.abstract | This paper implements the proposed framework and evaluates open source web crawlers for scalability and robustness on e-commerce websites. Scalability is the ability of a system to adapt to a continually increasing amount of data without a decrease in performance. Robustness is the ability to handle exceptions that occur while a web crawler is crawling. Multiple testing environments were set up on e-commerce websites. Scalability testing and robustness testing were used to measure the scalability and robustness of the web crawlers; scalability attributes and the robustness failure rate were used to quantify them. Statistical methods such as the Friedman test and the Nemenyi test were used to analyze the significant differences among crawlers. The experimental results show that Heritrix, Scrapy, and Nutch have the best overall scalability. In the non-interference test, Scrapy has the best robustness. However, Webmagic, WebCollector, and Gecco have the best robustness in the interference test, based on the general test and the database test. | en_US |
dc.subject | Arts and Humanities | en_US |
dc.subject | Computer Science | en_US |
dc.subject | Engineering | en_US |
dc.title | Scalability and Robustness Testing for Open Source Web Crawlers | en_US |
dc.type | Conference Proceeding | en_US |
article.title.sourcetitle | 2021 Joint 6th International Conference on Digital Arts, Media and Technology with 4th ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunication Engineering, ECTI DAMT and NCON 2021 | en_US |
article.stream.affiliations | Chiang Mai University | en_US |
Appears in Collections: | CMUL: Journal Articles |