This is an unedited manuscript accepted for publication and provided as an Article in Press for early access at the author’s request. The article will undergo copyediting, typesetting, and galley proof review before final publication. Please be aware that errors may be identified during production that could affect the content. All legal disclaimers of the journal apply.
Geeta Makhija,
Bhakti Gotarne,
Rutuja Mane,
Bhakti More,
- Assistant Professor, Department of Computer Engineering, Parvatibai Genaba Moze College of Engineering, Wagholi, Pune, Maharashtra, India
- Student, Department of Computer Engineering, Parvatibai Genaba Moze College of Engineering, Wagholi, Pune, Maharashtra, India
- Student, Department of Computer Engineering, Parvatibai Genaba Moze College of Engineering, Wagholi, Pune, Maharashtra, India
- Student, Student, Maharashtra, India
Abstract
The emergence of the World Wide Web and the rapid growth of online platforms have transformed the landscape of news dissemination. However, the rise of social media has also led to an overwhelming influx of potentially unreliable information, making it increasingly challenging to verify the truthfulness of articles. This verification process has become a daunting task, necessitating a thorough examination of various domain-specific aspects to ascertain the credibility of news content. In response to this issue, machine learning algorithms have shown considerable promise in automatically detecting fake news. Researchers are actively employing a diverse range of performance metrics to evaluate the efficacy of these algorithms. NLP techniques are essential in this process, as they aid in data preprocessing, thereby improving the accuracy of machine learning models. By leveraging extracted textual properties, researchers can train and evaluate machine learning classifiers designed to distinguish between genuine and fabricated content. This includes leveraging various features and performance metrics to evaluate the effectiveness of these classifiers. Ultimately, this study aims to contribute to the ongoing efforts to combat misinformation in the digital age by providing insights into the intersection of machine learning, NLP, and news verification.
Keywords: machine learning, fake news, MultinomialNB, social media, TF-IDF
[This article belongs to Current Trends in Information Technology (ctit)]
Geeta Makhija, Bhakti Gotarne, Rutuja Mane, Bhakti More. Fake News Detection System Using MultinomialNB and Django Framework. Current Trends in Information Technology. 2024; 15(01):-.
Geeta Makhija, Bhakti Gotarne, Rutuja Mane, Bhakti More. Fake News Detection System Using MultinomialNB and Django Framework. Current Trends in Information Technology. 2024; 15(01):-. Available from: https://journals.stmjournals.com/ctit/article=2024/view=191757
References
- Wu C, Wu F, Huang Y, Xie X. Personalized news recommendation: Methods and challenges. ACM Transactions on Information Systems. 2023 Jan 10;41(1):1-50.
- Su X, Sperlì G, Moscato V, Picariello A, Esposito C, Choi C. An edge intelligence empowered recommender system enabling cultural heritage applications. IEEE Transactions on Industrial Informatics. 2019 Mar 28;15(7):4266-75.
- Zhou F, Xu X, Trajcevski G, Zhang K. A survey of information cascade analysis: Models, predictions, and recent advances. ACM Computing Surveys (CSUR). 2021 Mar 5;54(2):1-36.
- Gaillard S, Oláh ZA, Venmans S, Burke M. Countering the cognitive, linguistic, and psychological underpinnings behind susceptibility to fake news: A review of current literature with special focus on the role of age and digital literacy. Frontiers in Communication. 2021 Jul 7;6:661801.
- Della Vedova ML, Tacchini E, Moret S, Ballarin G, DiPierro M, De Alfaro L. Automatic online fake news detection combining content and social signals. In2018 22nd conference of open innovations association (FRUCT) 2018 May 15 (pp. 272-279). IEEE.
- Tandoc Jr EC, Lim ZW, Ling R. Defining “fake news” A typology of scholarly definitions. Digital journalism. 2018 Feb 7;6(2):137-53.
- Park M, Chai S. Constructing a User-Centered Fake News Detection Model by Using Classification Algorithms in Machine Learning Techniques (Jan 2023). IEEE Access. 2023 Jul 12.
- Cybenko AK, Cybenko G. AI and fake news. IEEE intelligent systems. 2018 Dec 7;33(5):1-5.
- Granik M, Mesyura V. Fake news detection using naive Bayes classifier. In2017 IEEE first Ukraine conference on electrical and computer engineering (UKRCON) 2017 May 29 (pp. 900-903). IEEE.
- Helmstetter S, Paulheim H. Weakly supervised learning for fake news detection on Twitter. In2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) 2018 Aug 28 (pp. 274-277). IEEE.
- Mandical RR, Mamatha N, Shivakumar N, Monica R, Krishna AN. Identification of fake news using machine learning. In2020 IEEE international conference on electronics, computing and communication technologies (CONECCT) 2020 Jul 2 (pp. 1-6). IEEE.
- Kaur S, Kumar P, Kumaraguru P. Automating fake news detection system using multi-level voting model. Soft Computing. 2020 Jun;24(12):9049-69.
- Ghosh S, Desarkar MS. Class specific TF-IDF boosting for short-text classification: Application to short-texts generated during disasters. InCompanion Proceedings of the The Web Conference 2018 2018 Apr 23 (pp. 1629-1637).
- Laboreiro G, Sarmento L, Teixeira J, Oliveira E. Tokenizing micro-blogging messages using a text classification approach. InProceedings of the fourth workshop on Analytics for noisy unstructured text data 2010 Oct 26 (pp. 81-88).
- Masood AD, Abdulazeez AM, Zeebaree DQ. Machine learning supervised algorithms of gene selection: A review. Machine Learning. 2020;62:03.
- Dong L, Wesseloo J, Potvin Y, Li X. Discrimination of mine seismic events and blasts using the fisher classifier, naive bayesian classifier and logistic regression. Rock Mechanics and Rock Engineering. 2016 Jan;49:183-211.
- Vereshchaka A, Cosimini S, Dong W. Analyzing and distinguishing fake and real news to mitigate the problem of disinformation. Computational and mathematical organization theory. 2020 Sep;26:350-64.
- Al-Ghadir AI, Azmi AM, Hussain A. A novel approach to stance detection in social media tweets by fusing ranked lists and sentiments. Information Fusion. 2021 Mar 1;67:29-40.
- Somasundaran S, Wiebe J. Recognizing stances in ideological on-line debates. InProceedings of the NAACL HLT 2010 workshop on computational approaches to analysis and generation of emotion in text 2010 Jun (pp. 116-124).
- Konjengbam A, Ghosh S, Kumar N, Singh M. Debate stance classification using word embeddings. InBig Data Analytics and Knowledge Discovery: 20th International Conference, DaWaK 2018, Regensburg, Germany, September 3–6, 2018, Proceedings 20 2018 (pp. 382-395). Springer International Publishing.
- Faulkner A. Automated classification of stance in student essays: An approach using stance target information and the Wikipedia link-based measure. InThe Twenty-Seventh International Flairs Conference 2014 May 3.
- Weedon J, Nuland W, Stamos A. Information operations and Facebook. Retrieved from Facebook: https://fbnewsroomus. files. wordpress. com/2017/04/facebook-and-information-operations-v1. pdf. 2017 Apr 27.
- Vlachos A, Riedel S. Identification and verification of simple claims about statistical properties. InProceedings of the 2015 Conference on Empirical Methods in Natural Language Processing 2015 (pp. 2596-2601). Association for Computational Linguistics.
- Sen A, Sinha M, Mannarswamy S, Roy S. Stance classification of multi-perspective consumer health information. InProceedings of the ACM India joint international conference on data science and management of data 2018 Jan 11 (pp. 273-281).
- Vychegzhanin SV, Kotelnikov EV. Stance detection based on ensembles of classifiers. Programming and Computer Software. 2019 Sep;45:228-40.
- Silverman C. Lies, damn lies and viral content. Tow Center for Digital Journalism, Columbia University. 2017. https://doi.org/10.7916/D8Q81RHH
- Harabagiu S, Hickl A, Lacatusu F. Negation, contrast and contradiction in text processing. InAAAI 2006 Jul 16 (Vol. 6, pp. 755-762).
- Mohammad S, Kiritchenko S, Sobhani P, Zhu X, Cherry C. Semeval-2016 task 6: Detecting stance in tweets. InProceedings of the 10th international workshop on semantic evaluation (SemEval-2016) 2016 Jun (pp. 31-41).
- Patra BG, Das D, Bandyopadhyay S. JU_NLP at SemEval-2016 task 6: detecting stance in tweets using support vector machines. InProceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016) 2016 Jun (pp. 440-444).
- Elfardy H, Diab M. Cu-gwu perspective at semeval-2016 task 6: Ideological stance detection in informal text. InProceedings of the 10th international workshop on semantic evaluation (SemEval-2016) 2016 Jun (pp. 434-439).
- Augenstein I, Rocktäschel T, Vlachos A, Bontcheva K. Stance detection with bidirectional conditional encoding. arXiv preprint arXiv:1606.05464. 2016 Jun 17.
- Wei P, Mao W, Zeng D. A target-guided neural memory model for stance detection in twitter. In2018 International Joint Conference on Neural Networks (IJCNN) 2018 Jul 8 (pp. 1-8). IEEE.
- Zhou S, Lin J, Tan L, Liu X. Condensed convolution neural network by attention over self-attention for stance detection in twitter. In2019 International Joint Conference on Neural Networks (IJCNN) 2019 Jul 14 (pp. 1-8). IEEE.
- Taulé M, Martí MA, Rangel FM, Rosso P, Bosco C, Patti V. Overview of the task on stance and gender detection in tweets on Catalan independence at IberEval 2017. InCEUR Workshop Proceedings 2017 (Vol. 1881, pp. 157-177). CEUR-WS.
- Ahmad I, Yousaf M, Yousaf S, Ahmad MO. Fake news detection using machine learning ensemble methods. Complexity. 2020;2020(1):8885861.
- Aldwairi M, Alwahedi A. Detecting fake news in social media networks. Procedia Computer Science. 2018 Jan 1;141:215-22.
- Buntain C, Golbeck J. Automatically identifying fake news in popular twitter threads. In2017 IEEE international conference on smart cloud (smartCloud) 2017 Nov 3 (pp. 208-215). IEEE.
- Conroy NK, Rubin VL, Chen Y. Automatic deception detection: Methods for finding fake news. Proceedings of the association for information science and technology. 2015;52(1):1-4.
Current Trends in Information Technology
Volume | 15 |
Issue | 01 |
Received | 25/09/2024 |
Accepted | 23/12/2024 |
Published | 31/12/2024 |