Jyoti Jayesh Chavhan,
Shoaib Hafiz Shaikh,
Suyash Kiran Ayachit,
Sheshasai Arjun Dusa,
Nivedh Vijay Anakakkil,
- Assistant Professor, Department of Artificial Intelligence Machine Learning, SIES Graduate School of Technology, Nerul, Navi-Mumbai, Maharashtra, India
- Student, Department of Artificial Intelligence Machine Learning, SIES Graduate School of Technology, Nerul, Navi-Mumbai, Maharashtra, India
- Student, Department of Artificial Intelligence Machine Learning, SIES Graduate School of Technology, Nerul, Navi-Mumbai, Maharashtra, India
- Student, Department of Artificial Intelligence Machine Learning, SIES Graduate School of Technology, Nerul, Navi-Mumbai, Maharashtra, India
- Student, Department of Artificial Intelligence Machine Learning, SIES Graduate School of Technology, Nerul, Navi-Mumbai, Maharashtra, India
Abstract
For knowledge-intensive businesses, large document libraries contain a plethora of information. Massive, chaotic, collections of documents that are unstructured have emerged from this rapid increase. Even if accessing or storing these documents has become easier, finding the required critical information in these vast document collections has become more difficult. NLP (natural language processing) is one of the techniques used to retrieve the information that is required from a huge chunk of data. The proposed system uses cutting-edge methods for document comprehension and NLP to enable users to upload PDFs, ask questions, and get timely, accurate answers. NLP techniques use the subtle layers of context and semantics, going beyond simple keyword extraction. The interface facilitates easy interaction with complex textual material by providing users with access to a multitude of information. Regardless of your experience level, level of diligence, or role as a multi-disciplinary expert, the platform meets a wide range of needs and fluidly adjusts to meet each user’s specific needs. This initiative stands out as a pathfinder as we traverse the digital world, where information is plentiful but frequently difficult to extract. It not only makes it easier to interact with textual content, but it also creates an ecosystem where insights are easily available to everyone. It is evidence of how innovation can create a knowledge environment that is inclusive of all people and productive, paving the way for a genuinely inclusive digital age.
Keywords: NLP, user-friendly, innovation, information retrieval, extraction
[This article belongs to International Journal of Computer Science Languages ]
Jyoti Jayesh Chavhan, Shoaib Hafiz Shaikh, Suyash Kiran Ayachit, Sheshasai Arjun Dusa, Nivedh Vijay Anakakkil. Advanced Document Query System. International Journal of Computer Science Languages. 2024; 02(02):26-35.
Jyoti Jayesh Chavhan, Shoaib Hafiz Shaikh, Suyash Kiran Ayachit, Sheshasai Arjun Dusa, Nivedh Vijay Anakakkil. Advanced Document Query System. International Journal of Computer Science Languages. 2024; 02(02):26-35. Available from: https://journals.stmjournals.com/ijcsl/article=2024/view=177659
References
- Jurafsky D, Martin JH. Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition with language models. 3rd ed. [Online]. Draft of August 20, 2024. Stanford University; University of Colorado at Boulder; 2024. Available from: https://web.stanford.edu/~jurafsky/slp3/ed3bookaug20_2024.pdf.
- Bird S, Klein E, Loper E. Natural Language Processing with Python. Sebastopol (CA): O’Reilly Media, Inc.; 2009.
- Manning CD, Raghavan P, Schütze H. Introduction to Information Retrieval. New York: Cambridge University Press; 2008.
- Rahman N, Mozer R, McHugh RK, Rockett IRH, Chow CM, Vaughan G. Using natural language processing to improve suicide classification requires consideration of race. Suicide Life Threat Behav. 2022;52:782–91. DOI: 10.1111/sltb.12862. PubMed: 35384040.
- Irfan S, Ghosh S. Optimization of information retrieval using evolutionary computation: A survey. 2017 International Conference on Computing, Communication and Automation (ICCCA). IEEE; 2017. p. 328–33. DOI: 10.1109/CCAA.2017.8229837.
- Ren J, Wang H, Liu T. Information retrieval based on knowledge-enhanced word embedding through dialog: A case study. Int J Comput Intell Syst. 2020;13:275–90. DOI: 10.2991/ijcis.d.200310.002.
- Dörpinghaus J, Stefan A. Optimization of retrieval algorithms on large scale knowledge graphs. 2020 15th Conference on Computer Science and Information Systems (FedCSIS), Sofia, Bulgaria. 2020. pp. 227–36. DOI: 10.15439/2020F2.
- Pan Z. Optimization of information retrieval algorithm for digital library based on semantic search engine. 2020 International Conference on Computer Engineering and Application (ICCEA), Guangzhou, China. 2020. pp. 364–7. DOI: 10.1109/ICCEA50009.2020.00085.
- Ceri S, Bozzon A, Brambilla M, Della Valle E, Fraternali P, Quarteroni S. Web Information Retrieval. Germany: Springer Berlin Heidelberg; 2013. DOI: 10.1007/978-3-642-39314-3.

International Journal of Computer Science Languages
| Volume | 02 |
| Issue | 02 |
| Received | 27/06/2024 |
| Accepted | 08/09/2024 |
| Published | 07/10/2024 |
Login
PlumX Metrics