Towards a Complete Kurdish NLP Pipeline: Challenges and Opportunities
Karwan Jacksi, Dastan Maulud, Dastan Maulud, Ismael Ali, Ismael Ali
Abstract
With the rapid growth of Kurdish language content on the web, there is a high demand for making this information readable and processable by machines. In order to accomplish this, the Kurdish Natural Language Processing (KNLP) pipeline is required. Computers that can process human language use the field of Natural Language Processing (NLP). In its efforts to bridge the communication gap between humans and computers, NLP draws from a wide range of fields, including computer science and computational linguistics. There have been some notable efforts made toward creating the KNLP pipeline. However, it does not support the complete NLP tasks needed to enable semantic web and text mining applications. This paper surveys the work done in the field of NLP for the Kurdish language, its applications, and linguistic challenges.
Keywords
Text Corpus; Annotated Corpus; Kurdish Language; NLP; Semantic Web; Text Mining;