Detection of child exploiting chats from a mixed chat dataset as text classification task

dc.contributor.authorRahmanMiah, M. W., Yearwood, J., & Kulkarni, S.
dc.date.accessioned2017-02-10T15:11:06Z
dc.date.available2017-02-10T15:11:06Z
dc.date.issued2011
dc.description.abstractDetection of child exploitation in Internet chatting is an important issue for the protection of children from prospective online paedophiles. This paper investigates the effectiveness of text classifiers to identify Child Exploitation (CE) in chatting. As the chatting occurs among two or more users by typing texts, the text of chat-messages can be used as the data to be analysed by text classifiers. Therefore the problem of identification of CE chats can be framed as the problem of text classification by categorizing the chatlogs into predefined CE types. Along with three traditional text categorizing techniques a new approach has been made to accomplish the task. Psychometric and categorical information by LIWC (Linguistic Inquiry and Word Count) has been used and improvement of performance in some classifier has been found. For the experiments of current research the chat logs are collected from various websites open to public. Classification-via-Regression, J-48-Decision-Tree and Naïve-Bayes classifiers are used. Comparison of the performance of the classifiers is shown in the result. (Author Abstract)en_US
dc.identifier.citationRahmanMiah, M. W., Yearwood, J., & Kulkarni, S. (2011, December). Detection of child exploiting chats from a mixed chat dataset as text classification task. In Proceedings of the Australian Language Technology Association Workshop (pp. 157-165).en_US
dc.identifier.urihttps://www.aclweb.org/anthology/U11-1020.pdf
dc.identifier.urihttp://hdl.handle.net/11212/3210
dc.language.isoenen_US
dc.publisherAustralian Language Technology Associationen_US
dc.subjectchild abuseen_US
dc.subjectcyber solicitationen_US
dc.subjectonline solicitationen_US
dc.subjectgroomingen_US
dc.subjectinvestigationen_US
dc.subjectInternational Resourcesen_US
dc.subjectresearchen_US
dc.titleDetection of child exploiting chats from a mixed chat dataset as text classification tasken_US
dc.typeArticleen_US

Files