Word Sense Disambiguation for Turkish


Mert E., Dalkılıç G.

24th International Symposium on Computer and Information Sciences, Güzelyurt, Kıbrıs (Kktc), 14 - 16 Eylül 2009, ss.205-210 identifier identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası:
  • Doi Numarası: 10.1109/iscis.2009.5291849
  • Basıldığı Şehir: Güzelyurt
  • Basıldığı Ülke: Kıbrıs (Kktc)
  • Sayfa Sayıları: ss.205-210
  • Anahtar Kelimeler: Word Sense Disambiguation, Natural Language Processing, WSD for Turkish, Stemming Ambiguity
  • Dokuz Eylül Üniversitesi Adresli: Evet

Özet

Word Sense Disambiguation (WSD) is the core and one of the hardest problems of many Natural Language Processing tasks. WSD is considered as an AI-complete problem. Although there are many approaches trying to solve this problem, many of them are not adequate to solve WSD problem for Turkish. Dealing with sense ambiguity for Turkish also requires dealing with stemming ambiguity as well as polysemy, homonymy and categorical ambiguity. In this study, largely known Lesk and Simplified Lesk methods are modified and adapted to Turkish. The main aim of this project is to minimize the word sense ambiguity for Turkish and this is performed by eliminating the incorrect senses as much as possible by applying proposed methods.