DocumentCode :
243570
Title :
Cyberbullying Detection using Time Series Modeling
Author :
Potha, Nektaria ; Maragoudakis, Manolis
Author_Institution :
Dept. of Inf. & Commun. Syst. Eng., Univ. of the Aegean, Samos, Greece
fYear :
2014
fDate :
14-14 Dec. 2014
Firstpage :
373
Lastpage :
382
Abstract :
Cyber bullying is a new phenomenon resulting from the advance of new communication technologies including the Internet, cell phones and Personal Digital Assistants. It is a challenging bullying problem occurring in a new territory. Online bullying can be particularly damaging and upsetting because it´s usually anonymous or hard to trace. In this paper, the proposed method is utilizing a dataset of real world conversations (i.e. Pairs of questions and answers between cyber predator and the victim), in which each predator question is manually annotated in terms of severity using a numeric label. We approach the issue as a sequential data modelling approach, in which the predator´s questions are formulated using a Singular Value Decomposition representation. The motivation of this procedure is to study the accuracy of predicting the level of cyber bullying attack using classification methods and also to examine potential patterns between the lingustic style of each predator. More specifically, unlike previous approaches that consider a fixed window of a cyber-predator´s questions within a dialogue, we exploit the whole question set and model it as a signal, whose magnitude depends on the degree of bullying content. Using feature weighting and dimensionality reduction techniques, each signal is straightforwardly parsed by a neural network that forecasts the level of insult within a question given a window between two and three previous questions. Throughout the time series modeling experiments, an interesting discovery was made. By applying SVD on the time series data and taking into account the second dimension (since the first is usually modeling trivial dependencies between instances and attributes) we observed that its plot was very similar to the plot of the class attribute. By applying a Dynamic Time Warping algorithm, the similarity of the aforementioned signals was proved to exist, providing an immediate indicator for the severity of cyber bullying within a - iven dialogue.
Keywords :
Internet; behavioural sciences computing; singular value decomposition; social aspects of automation; support vector machines; time series; Internet; SVD; bullying problem; cell phones; classification methods; communication technologies; cyber bullying attack; cyber bullying detection; cyber predator questions; dimensionality reduction; dynamic time warping algorithm; feature weighting; lingustic style; neural network; online bullying; personal digital assistants; sequential data modelling; singular value decomposition representation; time series data; time series modeling; Data models; Feature extraction; Heuristic algorithms; Large scale integration; Media; Support vector machines; Time series analysis; SVM feature selection; cyberbullying; dynamic time warping; singular value decomposition; time series analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
Type :
conf
DOI :
10.1109/ICDMW.2014.170
Filename :
7022621
Link To Document :
بازگشت