基本信息来源于合作网站,原文需代理用户跳转至来源网站获取       
摘要:
The under-resourced Kikamba language has few language technology tools since the more efficient and popular data driven approaches for developing them suffer from data sparseness due to lack of digitized corpora. To address this challenge, we have developed a computational grammar for the Kikamba language within the multilingual Grammatical Framework (GF) toolkit. GF uses the Interlingua rule-based translation approach. To develop the grammar, we used the morphology driven strategy. Therefore, we first developed regular expressions for morphology inflection and thereafter developed the syntax rules. Evaluation of the grammar was done using one hundred sentences in both English and Kikamba languages. The results were an encouraging four n-gram BLEU score of 83.05% and the Position independent error rate (PER) of 10.96%. Finally, we have made a contribution to the language technology resources for Kikamba including multilingual machine translation, a morphology analyzer, a computational grammar which provides a platform for development of multilingual applications and the ability to generate a variety of bilingual corpora for Kikamba for all languages currently defined in GF, making it easier to experiment with data driven approaches.
内容分析
关键词云
关键词热度
相关文献总数  
(/次)
(/年)
文献信息
篇名 Towards Kikamba Computational Grammar
来源期刊 数据分析和信息处理(英文) 学科 文学
关键词 GRAMMAR Morphology SYNTAX GRAMMATICAL Framework Under-Resourced language Concord MULTILINGUAL AGGLUTINATION Kikamba
年,卷(期) 2019,(4) 所属期刊栏目
研究方向 页码范围 250-275
页数 26页 分类号 H31
字数 语种
DOI
五维指标
传播情况
(/次)
(/年)
引文网络
引文网络
二级参考文献  (0)
共引文献  (0)
参考文献  (0)
节点文献
引证文献  (0)
同被引文献  (0)
二级引证文献  (0)
2019(0)
  • 参考文献(0)
  • 二级参考文献(0)
  • 引证文献(0)
  • 二级引证文献(0)
研究主题发展历程
节点文献
GRAMMAR
Morphology
SYNTAX
GRAMMATICAL
Framework
Under-Resourced
language
Concord
MULTILINGUAL
AGGLUTINATION
Kikamba
研究起点
研究来源
研究分支
研究去脉
引文网络交叉学科
相关学者/机构
期刊影响力
数据分析和信息处理(英文)
季刊
2327-7211
武汉市江夏区汤逊湖北路38号光谷总部空间
出版文献量(篇)
106
总下载数(次)
0
总被引数(次)
0
论文1v1指导