Towards a Complete Kurdish NLP Pipeline: Challenges and Opportunities

Authors

  • Karwan Jacksi University of Zakho http://orcid.org/0000-0002-5220-5548
  • Dastan Maulud Duhok Polytechnic University
  • Dastan Maulud Duhok Polytechnic University
  • Ismael Ali University of Zakho
  • Ismael Ali University of Zakho

Keywords:

Text Corpus, Annotated Corpus, Kurdish Language, NLP, Semantic Web, Text Mining,

Abstract

With the rapid growth of Kurdish language content on the web, there is a high demand for making this information readable and processable by machines. In order to accomplish this, the Kurdish Natural Language Processing (KNLP) pipeline is required. Computers that can process human language use the field of Natural Language Processing (NLP). In its efforts to bridge the communication gap between humans and computers, NLP draws from a wide range of fields, including computer science and computational linguistics. There have been some notable efforts made toward creating the KNLP pipeline. However, it does not support the complete NLP tasks needed to enable semantic web and text mining applications. This paper surveys the work done in the field of NLP for the Kurdish language, its applications, and linguistic challenges.

Downloads

Published

2023-01-10

Issue

Section

Computational Intelligence