Towards Kikamba Computational Grammar


dc.contributor.author Kituku, Benson
dc.contributor.author Nganga, Wanjiku
dc.contributor.author Muchemi, Lawrence
dc.date.accessioned 2020-07-30T13:17:07Z
dc.date.available 2020-07-30T13:17:07Z
dc.date.issued 2019-10-22
dc.identifier.citation Kituku, B., Nganga, W. and Muchemi, L. (2019) Towards Kikamba Computational Grammar. Journal of Data Analysis and Information Processing, 7, 250-275 en_US
dc.identifier.issn 2327-7203
dc.identifier.issn 2327-7211
dc.identifier.uri http://repository.dkut.ac.ke:8080/xmlui/handle/123456789/1279
dc.description.abstract Kikamba is an under-resourced language with few language technology tools, because the more efficient and popular data-driven approaches to building such tools suffer from data sparseness caused by a lack of digitized corpora. To address this challenge, we have developed a computational grammar for Kikamba within the multilingual Grammatical Framework (GF) toolkit, which uses the interlingua rule-based translation approach. We followed a morphology-driven strategy: we first developed regular expressions for morphological inflection and then developed the syntax rules. The grammar was evaluated on one hundred sentences in both English and Kikamba, yielding an encouraging 4-gram BLEU score of 83.05% and a position-independent error rate (PER) of 10.96%. Our contributions to language technology resources for Kikamba include multilingual machine translation, a morphological analyzer, and a computational grammar that provides a platform for developing multilingual applications and can generate bilingual corpora pairing Kikamba with any language currently defined in GF, making it easier to experiment with data-driven approaches. en_US
dc.language.iso en en_US
dc.publisher Scientific Research Publishing en_US
dc.subject Grammar en_US
dc.subject Morphology en_US
dc.subject Syntax en_US
dc.subject Grammatical Framework en_US
dc.subject Under-Resourced language en_US
dc.subject Concord en_US
dc.subject Multilingual en_US
dc.subject Agglutination en_US
dc.subject Kikamba en_US
dc.title Towards Kikamba Computational Grammar en_US
dc.title.alternative A memory-based approach to Kıkamba named entity recognition
dc.type Article en_US

