This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy.
Brought to you by:
Paper The following article is Open access

POS-Tagging for informal language (study in Indonesian tweets)

, , , and

Published under licence by IOP Publishing Ltd
, , Citation Endang Suryawati et al 2018 J. Phys.: Conf. Ser. 971 012055 DOI 10.1088/1742-6596/971/1/012055

1742-6596/971/1/012055

Abstract

This paper evaluates Part-of-Speech Tagging for the formal Indonesian language can be used for the tagging process of Indonesian tweets. In this study, we add five additional tags which reflect to social media attributes to the existing original tagset. Automatic POS tagging process is done by stratified training process with 1000, 1600, and 1800 of annotated tweets. It shows that the process can achieve up to 66.36% accuracy. The experiment with original tagset gives slightly better accuracy (67.39%) than the experiment with five additional tags, but will lose important informations which given by the five additional tagset.POS-Tagging for Informal Language (Study in Indonesian Tweets).

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.
10.1088/1742-6596/971/1/012055