Paper The following article is Open access

Establishing the Syntactic Rules of the Kankanaey Dialect using RNN

, , and

Published under licence by IOP Publishing Ltd
, , Citation Laurie Lynne F. Aspiras et al 2020 IOP Conf. Ser.: Mater. Sci. Eng. 803 012023 DOI 10.1088/1757-899X/803/1/012023

1757-899X/803/1/012023

Abstract

The diverse culture and ethnic groups in the Philippines creates a beautiful mixture of ideas, traditions, and practices but also makes it hard for researchers to keep track of them all. One integral part of any culture is language, with one of the most spoken languages in the Cordillera Administrative Region (CAR) being Kankanaey. Unfortunately, it has very little resources and documentation for it. This paper presents a corpus created for Kankanaey that contains 3412 words and was trained with a dataset containing 400 Kankanaey sentences in order to establish its syntactic rules. Data for the collected texts for Kankanaey were taken from public sources online and were organized into various categories based on the type of content. Training and testing was done to establish the syntactic rules using the Keras API. The rules were derived by having each word in the training sentences tagged with the corresponding POS tag. After tagging, the number of POS tags were then expanded to all possible combinations of the POS which resulted in the documenting of 1,722 syntactic rules for Kankanaey with the model having an accuracy of 64% when it was tested to identify the syntactic rules in 50 test sentences.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.
10.1088/1757-899X/803/1/012023