Automated synonym dictionary generation tool for Turkish (ASDICT)
Türkçe için otomatik eşanlamlılar sözlüğü oluşturma aracı (ASDICT)


Aktaş Ö., Birant Ç. C., Aksu B., Çebi Y.

Bilig, vol.65, pp.47-68, 2013 (SSCI)

  • Publication Type: Article
  • Volume: 65
  • Publication Date: 2013
  • Journal Name: Bilig
  • Journal Indexes: Social Sciences Citation Index (SSCI), Scopus, TR DİZİN (ULAKBİM)
  • Page Numbers: pp.47-68
  • Keywords: Automated, Dictionary, Natural language processing, Synonyms, Turkish
  • Dokuz Eylül University Affiliated: Yes

Abstract

This paper briefly describes an Automated Synonym Dictionary Generation Tool for Turkish (ASDICT) and presents the development of its algorithms in detail. A synonym database was obtained by applying ASDICT to the data of the Contemporary Turkish Dictionary published by the Turkish Linguistic Association (TDK: Türk Dil Kurumu). The synonym dictionary was generated in four processes. At the end of these processes, words identified as definite synonyms were classified as Definite Synonym (Dn) and added to the Synonym List (SLi). Words that could not be classified as Dn were classified as Ambiguity and stored in a file called the Ambiguity File (AF), to be checked later by supervised methods in order to build a more reliable synonym database. The resulting synonym database for the Contemporary Turkish Dictionary, called the "Definite Synonyms Database (DSDB)", was built with ASDICT and is currently available on the official web site of TDK (TDK 2009).
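
The split between the Synonym List and the Ambiguity File can be illustrated with a minimal sketch. The abstract does not specify the four processes, so the rule used here is purely an assumption for illustration: a one-word sense definition is treated as a synonym candidate, and the pair counts as definite only when both headwords point to each other. The function name `classify_synonyms`, the input layout, and the toy entries are all hypothetical and are not taken from the paper or from TDK data.

```python
from collections import defaultdict


def classify_synonyms(definitions):
    """Split candidate synonym pairs into a synonym list and an ambiguity list.

    `definitions` maps each headword to a list of sense definitions.
    Assumed rule (NOT the paper's algorithm): a sense made of a single word
    is a synonym candidate; a pair is "definite" only if it is reciprocal.
    """
    # Collect one-word sense definitions as synonym candidates.
    candidates = defaultdict(set)
    for headword, senses in definitions.items():
        for sense in senses:
            tokens = sense.strip().rstrip(".").split()
            if len(tokens) == 1 and tokens[0] != headword:
                candidates[headword].add(tokens[0])

    synonym_list = defaultdict(set)   # plays the role of SLi
    ambiguity_file = []               # plays the role of AF
    for headword, words in candidates.items():
        for word in words:
            if headword in candidates.get(word, set()):
                synonym_list[headword].add(word)        # reciprocal pair
            else:
                ambiguity_file.append((headword, word))  # needs manual review
    return dict(synonym_list), sorted(ambiguity_file)


# Invented toy entries with English glosses, for illustration only.
toy_definitions = {
    "siyah": ["kara"],
    "kara": ["siyah", "land not covered by sea"],
    "ak": ["beyaz"],
    "beyaz": ["the color of snow or milk"],
}

synonym_list, ambiguity_file = classify_synonyms(toy_definitions)
print(synonym_list)
print(ambiguity_file)
```

In this toy run, "siyah" and "kara" point to each other and land in the synonym list, while the one-directional pair ("ak", "beyaz") is deferred to the ambiguity list for later checking, mirroring the role the abstract assigns to the AF.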