Using Machine Learning for Key phrase Extraction in Digital Libraries

[{“box”:0,”content”:”[if 992 equals=”Open Access”]n

n

n

n

Open Access

nn

n

n[/if 992]n

n

Year : July 19, 2024 at 3:57 pm | [if 1553 equals=””] Volume :01 [else] Volume :01[/if 1553] | [if 424 equals=”Regular Issue”]Issue[/if 424][if 424 equals=”Special Issue”]Special Issue[/if 424] [if 424 equals=”Conference”][/if 424] : 02 | Page : 8-13

n

n

n

n

n

n

By

n

[foreach 286]n

n

n

Neha Sahu, Rizwan Arif,

n

    n t

  • n

n

n[/foreach]

n

n[if 2099 not_equal=”Yes”]n

    [foreach 286] [if 1175 not_equal=””]n t

  1. Research Scholar, Assistant Professor Lingaya’s Vidyapeeth, Faridabad, Lingaya’s Vidyapeeth, Faridabad Haryana, Haryana India, India
  2. n[/if 1175][/foreach]

n[/if 2099][if 2099 equals=”Yes”][/if 2099]n

n

Abstract

nMachine learning has revolutionized various aspects of information retrieval, including key phrase extraction in digital libraries. Key phrase extraction is crucial for summarizing and categorizing vast amounts of textual data, enabling efficient search and retrieval processes. This study explores the application of machine learning techniques for automatic key phrase extraction in digital libraries. We review various supervised and unsupervised learning algorithms, including deep learning models, that are employed to identify and extract key phrases from academic papers, books, and other digital documents. Specifically, algorithms such as support vector machines (SVM), neural networks, and BERT (Bidirectional Encoder Representations from Transformers) are examined for their effectiveness in this task. The research highlights the importance of feature selection, dataset quality, and algorithm performance in achieving accurate and meaningful key phrase extraction. We also discuss the integration of natural language processing (NLP) tools and the challenges associated with multilingual and domain-specific libraries. Experimental results demonstrate the effectiveness of machine learning models in enhancing the accessibility and discoverability of digital content, ultimately contributing to the advancement of digital library services. Key phrase extraction is a crucial task in managing and utilizing the vast amounts of information stored in digital libraries. By automating the identification of key phrases, it becomes possible to enhance information retrieval, indexing, and summarization processes. This paper explores the application of machine learning techniques to key phrase extraction in digital libraries. We review various supervised and unsupervised learning methods, highlighting their strengths and weaknesses in different contexts. Supervised approaches, such as decision trees, support vector machines (SVM), and neural networks, as well as advanced models like BERT, require annotated datasets and can achieve high accuracy

n

n

n

Keywords: Machine learning, Digital libraries, Text mining, Information retrieval, Deep learning

n[if 424 equals=”Regular Issue”][This article belongs to International Journal of Cheminformatics(ijci)]

n

[/if 424][if 424 equals=”Special Issue”][This article belongs to Special Issue under section in International Journal of Cheminformatics(ijci)][/if 424][if 424 equals=”Conference”]This article belongs to Conference [/if 424]

n

n

n

How to cite this article: Neha Sahu, Rizwan Arif. Using Machine Learning for Key phrase Extraction in Digital Libraries. International Journal of Cheminformatics. July 19, 2024; 01(02):8-13.

n

How to cite this URL: Neha Sahu, Rizwan Arif. Using Machine Learning for Key phrase Extraction in Digital Libraries. International Journal of Cheminformatics. July 19, 2024; 01(02):8-13. Available from: https://journals.stmjournals.com/ijci/article=July 19, 2024/view=0

nn[if 992 equals=”Open Access”] Full Text PDF Download[/if 992] n

n[if 992 not_equal=’Open Access’] [/if 992]nn n

nn[if 379 not_equal=””]n

Browse Figures

n

n

[foreach 379]n

n[/foreach]n

n

n

n[/if 379]n

n

References

n[if 1104 equals=””]n

  1. Li and M. Daoutis, “Unsupervised key-phrase extraction and clustering for classification scheme in scientific publications,” CEUR Workshop Proceedings, vol. 2831, pp. 1–8, Jan. 2021, [Online]. Available: http://arxiv.org/abs/2101.09990
  2. Boudin and Y. Gallina, “Redefining Absent Key phrases and their Effect on Retrieval Effectiveness,” in Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Mar. 2021, pp. 4185–4193, doi: 10.18653/v1/2021.naacl-main.330.
  3. Li, J. Zhu, J. Zhang, C. Zong, and X. He, “Keywords-guided abstractive sentence summarization,” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 5, pp. 8196–8203, Apr. 2020, doi: 10.1609/aaai. v34i05.6333.
  4. Varastehpour, H. Sharifzadeh, and I. Ardekani, “A comprehensive review of deep learning algorithms,” New Zealand, 2021, doi: 10.34074/ocds.092.
  5. K. Mishra, G. Y. S. Reddy, and H. Pathak, “The understanding of deep learning: A comprehensive Review,” Mathematical Problems in Engineering, pp. 1–15, Apr. 2021, doi: 10.1155/2021/5548884.
  6. Abdel-Jaber, D. Devassy, A. A. Salam, L. Hidaytallah, and M. EL-Amir, “A review of deep learning algorithms and their applications in healthcare,” Algorithms, vol. 15, no. 2, pp. 1–55, Feb. 2022, doi: 10.3390/a15020071.
  7. Yang, Y. Ge, Y. Yao, and Y. Yang, “GCN-based document representation for key phrase generation enhanced by maximizing mutual information,” Knowledge-Based Systems, vol. 243, p. 108488, May 2022, doi: 10.1016/j.knosys.2022.108488.
  8. Munz, D. Väth, P. Kuznecov, N. T. Vu, and D. Weiskopf, “Visualization-based improvement of neural machine translation,” Computers & Graphics, vol. 103, pp. 45–60, Apr. 2022, doi: 10.1016/j.cag.2021.12.003.
  9. R. Bhowmik, M. Arifuzzaman, and M. R. H. Mondal, “Sentiment analysis on Bangla text using extended lexicon dictionary and deep learning algorithms,” Array, vol. 13, pp. 1–14, Mar. 2022, doi: 10.1016/j.array.2021.100123.
  10. Pota, M. Esposito, G. D. Pietro, and H. Fujita, “Best practices of convolutional neural networks for question classification,” Applied Sciences, vol. 10, no. 14, pp. 1–27, Jul. 2020, doi: 10.3390/app10144710.
  11. Hamida, B. Cherradi, and H. Ouajji, “Handwritten Arabic words recognition system based on hog and gabor filter descriptors,” in 2020 1st International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET), Apr. 2020, pp. 1–4, doi: 10.1109/IRASET48871.2020.9092067.
  12. Lee and D. Kim, “Deep learning-based recommender system using cross convolutional filters,” Information Sciences, vol. 592, pp. 112–122, May 2022, doi: 10.1016/j.ins.2022.01.033.
  13. Young, D. Hazarika, S. Poria, and E. Cambria, “Recent trends in deep learning based natural language processing,” IEEE Computational Intelligence Magazine, vol. 13, no. 3, pp. 55–75, Aug. 2018, doi: 10.1109/MCI.2018.2840738.
  14. E. Hinton and R. R. Salakhutdinov, “Reducing the dimensionality of data with neural networks,” Science, vol. 313, no. 5786, pp. 504–507, Jul. 2006, doi: 10.1126/science.1127647.
  15. Dixit and S. Silakari, “Deep learning algorithms for cybersecurity applications: A technological and status review,” Computer Science Review, vol. 39, pp. 1–15, Feb. 2021, doi: 10.1016/j.cosrev.2020.100317.
  16. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” Nature, vol. 521, no. 7553, pp. 436–444, May 2015, doi: 10.1038/nature14539.
  17. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning representations by back-propagating errors,” Nature, vol. 323, no. 6088, pp. 533–536, Oct. 1986, doi: 10.1038/323533a0.
  18. Terrada, S. Hamida, B. Cherradi, A. Raihani, and O. Bouattane, “Supervised machine learning based medical diagnosis support system for prediction of patients with heart disease,” Advances in Science, Technology and Engineering Systems Journal, vol. 5, no. 5, pp. 269–277, 2020, doi: 10.25046/aj050533.
  19. Terrada, B. Cherradi, A. Raihani, and O. Bouattane, “Atherosclerosis disease prediction using supervised machine learning techniques,” in 2020 1st International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET), Apr. 2020, pp. 1–5, doi: 10.1109/IRASET48871.2020.9092082.
  20. Emmert-Streib, Z. Yang, H. Feng, S. Tripathi, and M. Dehmer, “An introductory review of deep learning for prediction models with big data,” Frontiers in Artificial Intelligence, vol. 3, pp. 1–23, Feb. 2020, doi: 10.3389/frai.2020.00004.

nn[/if 1104][if 1104 not_equal=””]n

    [foreach 1102]n t

  1. [if 1106 equals=””], [/if 1106][if 1106 not_equal=””],[/if 1106]
  2. n[/foreach]

n[/if 1104]

nn


nn[if 1114 equals=”Yes”]n

n[/if 1114]

n

n

[if 424 not_equal=””]Regular Issue[else]Published[/if 424] Subscription Review Article

n

n

[if 2146 equals=”Yes”][/if 2146][if 2146 not_equal=”Yes”][/if 2146]n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n

n[if 1748 not_equal=””]

[else]

[/if 1748]n

n

n

Volume 01
[if 424 equals=”Regular Issue”]Issue[/if 424][if 424 equals=”Special Issue”]Special Issue[/if 424] [if 424 equals=”Conference”][/if 424] 02
Received June 20, 2024
Accepted June 26, 2024
Published July 19, 2024

n

n

n

n

n

n nfunction myFunction2() {nvar x = document.getElementById(“browsefigure”);nif (x.style.display === “block”) {nx.style.display = “none”;n}nelse { x.style.display = “Block”; }n}ndocument.querySelector(“.prevBtn”).addEventListener(“click”, () => {nchangeSlides(-1);n});ndocument.querySelector(“.nextBtn”).addEventListener(“click”, () => {nchangeSlides(1);n});nvar slideIndex = 1;nshowSlides(slideIndex);nfunction changeSlides(n) {nshowSlides((slideIndex += n));n}nfunction currentSlide(n) {nshowSlides((slideIndex = n));n}nfunction showSlides(n) {nvar i;nvar slides = document.getElementsByClassName(“Slide”);nvar dots = document.getElementsByClassName(“Navdot”);nif (n > slides.length) { slideIndex = 1; }nif (n (item.style.display = “none”));nArray.from(dots).forEach(nitem => (item.className = item.className.replace(” selected”, “”))n);nslides[slideIndex – 1].style.display = “block”;ndots[slideIndex – 1].className += ” selected”;n}n”}]