Mining chat conversations for sex identification


Koese C. , Oezyurt O., AMANMYRADOV G.

11th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Nanjing, China, 22 May 2007 - 25 May 5007, vol.4819, pp.45-55 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Volume: 4819
  • City: Nanjing
  • Country: China
  • Page Numbers: pp.45-55

Abstract

Chat mediums are becoming an important part of human life in societies and provide quite useful information about people such as their current interests, habits, social behaviors and tendencies. In this study, we have presented an identification system to identify the sex of a person in a Turkish chat medium. Here, the sex identification is taken as a base study in the information raining in chat mediums. This system acquires data from a chat medium, and then automatically detects the chatter's sex from the information exchanged between chatters and compares them with the known identities of the chatters. To do this task, a simple discrimination function is used to determine the sex of the chatters. A semantic analysis method is also proposed to enhance the performance of the system. The system with the semantic analyzer has achieved accuracy over 90% in the sex identification in the real chat medium.