Home   >   CSC-OpenAccess Library   >    Manuscript Information
A Novel Text Mining Approach to Sexual Harassment Detection of Case Suspects
Sundar Krishnan, Narasimha Shashidhar, Cihan Varol, ABM Rezbaul Islam
Pages - 17 - 29     |    Revised - 30-06-2022     |    Published - 01-08-2022
Volume - 11   Issue - 1    |    Publication Date - August 2022  Table of Contents
MORE INFORMATION
KEYWORDS
Digital Forensic Analytics, Digital Forensics, Sexual Harassment, Supervised Learning, Hybrid Learning, Unsupervised Learning, Legal Analytics, eDiscovery, Electronic Stored Information, Case Investigation.
ABSTRACT
Sexual harassment cases often go unreported and can be difficult for an investigator to detect when working with large volumes of digital evidence of an investigation. Artificial Intelligence can be a promising solution to help identify instances of sexual harassment, especially from written communication. In this research, a comprehensive approach to detect indicators of sexual harassment is proposed using supervised and unsupervised learning coupled with the application of Bidirectional Encoder Representations from Transformers (BERT) and Snips NLU. The models are then applied against synthetic digital forensic evidence data for detection of sexual harassment indicators from textual digital evidence.
Angelov, D. (2020). Top2Vec: Distributed Representations of Topics. Retrieved from https://arxiv.org/abs/2008.09470.
Basu, P., Singha Roy, T., Tiwari, S., & Mehta, S. (2021). CyberPolice: Classification of Cyber Sexual Harassment. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 12981 LNAI, 701-714. https://doi.org/10.1007/978-3-030-86230-5_55.
Bauer, T., Devrim, E., Glazunov, M., Jaramillo, W. L., Mohan, B., & Spanakis, G. (2020). #MeTooMaastricht: Building a chatbot to assist survivors of sexual harassment. Communications in Computer and Information Science, 1167 CCIS, 503-521. https://doi.org/10.1007/978-3-030-43823-4_41/FIGURES/7.
Cercas Curry, A., Abercrombie, G., & Rieser, V. (2021). {C}onv{A}buse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational {AI}. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (pp. 7388-7403). Online and Punta Cana, Dominican Republic: Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.emnlp-main.587.
Charney, D. A., & Russell, R. C. (1994). An overview of sexual harassment. American Journal of Psychiatry, 151(1), 10-17. https://doi.org/10.1176/AJP.151.1.10.
Coucke, A., Saade, A., Ball, A., Bluche, T., Caulier, A., Leroy, D., Dureau, J. (2018). Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces. Retrieved from https://arxiv.org/abs/1805.10190v3.
Crebbin, W., Campbell, G., Hillis, D. A., Watters, D. A., Crebbin, W., Campbell FRACS, G., … Watters FRCSEd, D. A. (2015). Prevalence of bullying, discrimination and sexual harassment in surgery in Australasia. ANZ Journal of Surgery, 85(12), 905-909. https://doi.org/10.1111/ANS.13363.
Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2018). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, 1, 4171-4186. Retrieved from https://arxiv.org/abs/1810.04805v2.
Element of Intent in Criminal Law | Office of Justice Programs. (n.d.). Retrieved April 14, 2022, from https://www.ojp.gov/ncjrs/virtual-library/abstracts/element-intent-criminal-law.
Garrett, A., & Hassan, N. (2019). Understanding the silence of sexual harassment victims through the #Whyididntreport movement. Proceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2019, 649-652. https://doi.org/10.1145/3341161.3343700.
GitHub. (n.d.). nlu-benchmark. Retrieved February 5, 2022, from https://github.com/wenjingu/nlu-benchmark.
Gyawali, D. K. (2021). Sexual Harassment AND Its effects on Mental Health OF THE Teenage School Girls in Lalitpur and rupandehi district. Journal of Balkumari College, 10(1), 39-47. https://doi.org/10.3126/JBKC.V10I1.42092.
Krishnan, S. (n.d.). Project • GitHub. Retrieved May 6, 2022, from https://github.com/kshsus.
Krishnan, S., Shashidhar, N., Varol, C., & Islam, A. R. (2022). Sentiment Analysis of Case Suspects in Digital Forensics and Legal Analytics. International Journal of Security, 13(1). Retrieved from https://www.cscjournals.org/journals/IJS/issues-archive.php.
Mackinnon, Catha. A., & Siegel, R. B. (2003). Directions in Sexual Harassment Law A Short History of Sexual Harassment.
Mclaughlin, H., Uggen, C., & Blackstone, A. (n.d.). Sexual Harassment, Workplace Authority, and the Paradox of Power. American Sociological Review, 77(4), 625-647. https://doi.org/10.1177/0003122412451728.
MeToo movement - Wikipedia. (n.d.). Retrieved April 7, 2022, from https://en.wikipedia.org/wiki/MeToo_movement.
Nisha Priya Bhatia v. Union of India & Anr. CA No. 2365/2020, S. C. of I. (n.d.). Types of Sexual Harassment. Retrieved from https://www.whatishumanresource.com/types-of-sexual-harassment.
Nova, F. F., Rifat, R., Saha, P., Ahmed, S. I., & Guha, S. (2019). Online sexual harassment over anonymous social media in Bangladesh. ACM International Conference Proceeding Series. https://doi.org/10.1145/3287098.3287107.
Open source conversational AI. (n.d.). Retrieved February 6, 2022, from https://rasa.com/.
Rezvan, M., Thirunarayan, K., Shekarpour, S., Shalin, V. L., Balasuriya, L., & Sheth, A. (2018). A quality type-aware annotated corpus and lexicon for harassment research. WebSci 2018 - Proceedings of the 10th ACM Conference on Web Science, 33-36. https://doi.org/10.1145/3201064.3201103.
Rodríguez-Rodríguez, I., & Heras-González, P. (2020). How are universities using Information and Communication Technologies to face sexual harassment and how can they improve? Technology in Society, 62, 101274. https://doi.org/10.1016/J.TECHSOC.2020.101274.
Saeidi, M., Samuel, S. B., Milios, E., Zeh, N., & Berton, L. (2020). Categorizing Online Harassment on Twitter. Communications in Computer and Information Science, 1168 CCIS, 283-297. https://doi.org/10.1007/978-3-030-43887-6_22.
Sexual Harassment - Equal Rights Advocates. (n.d.). Retrieved April 6, 2022, from https://www.equalrights.org/issue/economic-workplace-equality/sexual-harassment/.
Sexual Harassment | U.S. Equal Employment Opportunity Commission. (n.d.). Retrieved July 19, 2020, from https://www.eeoc.gov/sexual-harassment.
Snips Natural Language Understanding — Snips NLU 0.20.2 documentation. (n.d.). Retrieved February 5, 2022, from https://snips-nlu.readthedocs.io/en/latest/.
Sundar Krishnan, Shashidhar, N., Varol, C., & Islam, A. R. (2021). Evidence Data Preprocessing for Forensic and Legal Analytics. International Journal of Computational Linguistics (IJCL), 12(2), 24-34. Retrieved from https://www.cscjournals.org/library/manuscriptinfo.php?mc=IJCL-122.
The Psychological Persuasion Techniques of Sexual Predators | Psychology Today. (n.d.). Retrieved April 15, 2022, from https://www.psychologytoday.com/us/blog/the-new-teen-age/201905/the-psychological-persuasion-techniques-sexual-predators.
Mr. Sundar Krishnan
Department of Computer Science, Sam Houston State University, Huntsville, TX - United States of America
skrishnan@shsu.edu
Associate Professor Narasimha Shashidhar
Department of Computer Science, Sam Houston State University, Huntsville, TX - United States of America
Professor Cihan Varol
Department of Computer Science, Sam Houston State University, Huntsville, TX - United States of America
Associate Professor ABM Rezbaul Islam
Department of Computer Science, Sam Houston State University, Huntsville, TX - United States of America