
Support Vector Machine Classifier Based on Approximate Entropy Metric for Chatbot Text-based Communication

Xuewen Mu, Bowo Prakoso, John Kirby

Abstract



A chatbot is a computer program designed to simulate conversation with human users over the Internet. Chatbots are found on a number of chat systems, including large commercial chat networks. However, their use as malicious tools has made them a growing nuisance and security concern. We present a support vector machine training algorithm for classifying humans and bots in text-based chat communications. We use data from the annual Loebner competition to distinguish between bots and humans. For each conversation, we introduce the normalized approximate entropy of two features: message size and inter-message delay. Together with the mean and the normalized Shannon entropy of the same two features, these values form the input data. Simulation results show that the support vector machine is an efficient method for chatbot data classification.

Keywords


Chatbot, support vector machine, approximate entropy, Shannon entropy, text-based communications, Loebner competition
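
Illustrative sketch

The feature construction described in the abstract (mean, normalized Shannon entropy, and approximate entropy of message size and inter-message delay for each conversation) could be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The function names, the embedding dimension m = 2, and the tolerance r = 0.2 times the standard deviation are assumed conventional choices; the Shannon entropy is normalized here by the logarithm of the number of distinct values, and the approximate entropy is left unnormalized because the abstract does not specify the normalization used.

```python
import math
from collections import Counter


def approximate_entropy(series, m=2, r=None):
    """Approximate entropy ApEn(m, r) of a numeric sequence.

    Assumption: m = 2 and r = 0.2 * (population standard deviation) are
    conventional defaults, not values taken from the paper.
    """
    n = len(series)
    if n <= m + 1:
        raise ValueError("series is too short for the chosen m")
    if r is None:
        mean = sum(series) / n
        std = math.sqrt(sum((x - mean) ** 2 for x in series) / n)
        r = 0.2 * std

    def phi(dim):
        # All overlapping windows of length `dim`.
        count = n - dim + 1
        windows = [series[i:i + dim] for i in range(count)]
        total = 0.0
        for w in windows:
            # Count windows within Chebyshev distance r (self-match included,
            # so the logarithm below is always defined).
            matches = sum(
                1 for v in windows
                if max(abs(a - b) for a, b in zip(w, v)) <= r
            )
            total += math.log(matches / count)
        return total / count

    return phi(m) - phi(m + 1)


def normalized_shannon_entropy(values):
    """Shannon entropy of the empirical distribution, scaled to [0, 1].

    Assumption: values are treated as discrete symbols and the entropy is
    divided by log(number of distinct values). Continuous delays would
    need to be binned or rounded first.
    """
    counts = Counter(values)
    k = len(counts)
    if k < 2:
        return 0.0
    n = len(values)
    h = -sum((c / n) * math.log(c / n) for c in counts.values())
    return h / math.log(k)


def conversation_features(sizes, delays):
    """Six features per conversation: mean, normalized Shannon entropy,
    and approximate entropy for message sizes, then for delays."""
    features = []
    for seq in (sizes, delays):
        features.append(sum(seq) / len(seq))
        features.append(normalized_shannon_entropy(seq))
        features.append(approximate_entropy(seq))
    return features
```

Each conversation then yields one six-value vector, and a set of such vectors with human or bot labels would serve as the training input to a support vector machine classifier.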


