Mining chat conversations for sex identification


Koese C., Oezyurt O., AMANMYRADOV G.

11th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Nanjing, Çin, 22 Mayıs 2007 - 25 Mayıs 5007, cilt.4819, ss.45-55 identifier identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası: 4819
  • Basıldığı Şehir: Nanjing
  • Basıldığı Ülke: Çin
  • Sayfa Sayıları: ss.45-55
  • Karadeniz Teknik Üniversitesi Adresli: Evet

Özet

Chat mediums are becoming an important part of human life in societies and provide quite useful information about people such as their current interests, habits, social behaviors and tendencies. In this study, we have presented an identification system to identify the sex of a person in a Turkish chat medium. Here, the sex identification is taken as a base study in the information raining in chat mediums. This system acquires data from a chat medium, and then automatically detects the chatter's sex from the information exchanged between chatters and compares them with the known identities of the chatters. To do this task, a simple discrimination function is used to determine the sex of the chatters. A semantic analysis method is also proposed to enhance the performance of the system. The system with the semantic analyzer has achieved accuracy over 90% in the sex identification in the real chat medium.