Experiences of sexual minorities on social media: A study of sentiment analysis and machine learning approaches
Abstract
Social media has become a forum where people express their views on issues such as sexual orientation, legislation, and taxes. Sexual orientation refers to whom a person is attracted to and wishes to form relationships with. Around the world, many people have different sexual orientations. People who identify as lesbian, gay, bisexual, transgender, queer, and more (LGBTQ+) represent a range of such orientations. Because LGBTQ+ persons are publicly stigmatized, many turn to social media to express themselves, sometimes anonymously. The present study uses natural language processing (NLP) and machine learning (ML) approaches to assess the experiences of LGBTQ+ persons. The study applied lexicon-based sentiment analysis (SA) and trained six machine learning classifiers: logistic regression (LR), support vector machine (SVM), naïve Bayes (NB), decision tree (DT), random forest (RF), and gradient boosting (GB). According to the SA results, individuals are positive about LGBTQ+ concerns; however, the negative sentiment scores indicate that prejudice and harsh statements against LGBTQ+ people persist in many of the regions where they live. The LR, SVM, NB, DT, RF, and GB classifiers attained accuracy values of 97%, 96%, 88%, 100%, 92%, and 91%, respectively, and the other performance metrics showed high recall and precision values. This study will help governments, non-governmental organizations, and rights advocacy groups make informed decisions about LGBTQ+ concerns in order to ensure a sustainable future and peaceful coexistence.
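As a rough illustration of the two ideas named in the abstract, the sketch below labels text with a lexicon-based sentiment score and computes precision and recall for one class. It is a minimal sketch, not the study's implementation: the lexicon `TOY_LEXICON`, the function names, and the zero-threshold labeling rule are all hypothetical, and the abstract does not say which lexicon or thresholds the authors used.

```python
# Hypothetical toy lexicon: word -> polarity weight. The lexicon actually
# used in the study is not named in the abstract.
TOY_LEXICON = {
    "love": 1.0, "proud": 1.0, "support": 0.5,
    "hate": -1.0, "disgusting": -1.0, "afraid": -0.5,
}


def lexicon_score(text):
    """Sum the polarity weights of the words that appear in the lexicon."""
    tokens = [tok.strip(".,!?;:\"'").lower() for tok in text.split()]
    return sum(TOY_LEXICON.get(tok, 0.0) for tok in tokens)


def lexicon_label(text):
    """Map a score to a label (assumed rule: sign of the summed score)."""
    score = lexicon_score(text)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def precision_recall(y_true, y_pred, target):
    """Precision and recall for one class, computed from paired labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == target and p == target)
    fp = sum(1 for t, p in pairs if t != target and p == target)
    fn = sum(1 for t, p in pairs if t == target and p != target)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall
```

In a pipeline of this kind, labels like these could serve as targets for classifiers such as LR or SVM, with precision and recall then reported per class alongside accuracy; whether the study was arranged exactly this way is an assumption.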
DOI: https://doi.org/10.32629/jai.v6i2.623
Copyright (c) 2023 Peter Appiahene, Vijayakumar Varadarajan, Tao Zhang, Stephen Afrifa
License URL: https://creativecommons.org/licenses/by-nc/4.0