Jayesh Chavan,
Smita Palnitkar,
Ajinkya Kaje,
Chaitanya Bhopnikar,
- Student, Department of Electronics and Telecommunication, Smt. Kashibai Navale College of Engineering, Savitribai Phule Pune University, Pune, Maharashtra, India
- Assistant Professor, Department of Electronics and Telecommunication, Smt. Kashibai Navale College of Engineering, Savitribai Phule Pune University, Pune, Maharashtra, India
- Student, Department of Electronics and Telecommunication, Smt. Kashibai Navale College of Engineering, Savitribai Phule Pune University, Pune, Maharashtra, India
- Student, Department of Electronics and Telecommunication, Smt. Kashibai Navale College of Engineering, Savitribai Phule Pune University, Pune, Maharashtra, India
Abstract
Transcriber Sum is a deep learning-based YouTube transcript summarization tool. It employs advanced machine learning techniques to automatically generate concise summaries of YouTube video transcripts, enabling users to quickly grasp the key content and insights of videos without the need to watch or read the entire transcript. This Transcriber Sum addresses the challenge of providing users with concise and informative summaries of video content by harnessing the power of deep learning techniques. The objectives encompass the creation of advanced models for natural language processing and video content analysis, building a robust framework for processing and summarizing video transcripts, collecting and annotating a diverse dataset of YouTube video transcripts, implementing data preprocessing and augmentation techniques, training and evaluating the model’s performance, developing a user- friendly application for real-time transcript summarization, and continuously refining the model based on user feedback. The methodology involves developing deep learning models for natural language understanding, utilizing advanced video content analysis techniques, and seamlessly integrating these components into an application for effortless YouTube transcript summarization. Performance evaluation is conducted through a combination of simulation using synthetic data and real-world testing with a wide range of YouTube video transcripts. Transcriber Sum aims to empower users with a time-efficient and informative tool for quickly summarizing the wealth of content available on YouTube, ultimately enhancing the overall viewing experience and accessibility of valuable information in the digital age.
Keywords: Transcript summarization, deep learning, machine learning techniques, natural language processing, real-time, AI/deep learning
[This article belongs to Journal of Artificial Intelligence Research & Advances ]
Jayesh Chavan, Smita Palnitkar, Ajinkya Kaje, Chaitanya Bhopnikar. Transcript Summarizer of YouTube Videos Using Deep Learning. Journal of Artificial Intelligence Research & Advances. 2024; 11(03):119-126.
Jayesh Chavan, Smita Palnitkar, Ajinkya Kaje, Chaitanya Bhopnikar. Transcript Summarizer of YouTube Videos Using Deep Learning. Journal of Artificial Intelligence Research & Advances. 2024; 11(03):119-126. Available from: https://journals.stmjournals.com/joaira/article=2024/view=171718
References
- Ilampiray P, Thilagavathy A, Nithin AS, Raj I. Video transcript summarizer. E3S Web Conf. 2023;399:04015. DOI: 10.1051/e3sconf/202339904015. Published online 2023 Jul 12.
- Dharmapuri S, Desu S, Alladi K, Gummadi H, Gupta H, Shareef SNM. An automated framework for summarizing YouTube videos using NLP. E3S Web Conf. EDP Sciences. 2023;430. DOI: 10.1051/e3sconf/202343001056.
- Albeer RA, Al-Shahad HF, Aleqabie HJ, Al-shakarchy ND. Automatic summarization of YouTube video transcription text using term frequency-inverse document frequency. Indones J Electr Eng Comput Sci. 2022;26:1512–9. DOI: 10.11591/ijeecs.v26.i3.pp1512-1519.
- Devi S, Nadar R, Nichat T, Lucas A. Abstractive summarizer for YouTube videos. In: International Conference on Applications of Machine Intelligence and Data Analytics (ICAMIDA 2022). Atlantis Press; 2023. p. 431–8.
- Ilampiray P, Thilagavathy A, Nithin AS, Raj I. Video transcript summarizer. E3S Web Conf. 2023;399:04015. DOI: 10.1051/e3sconf/202339904015. Published online 2023 Jul 12.
- Li H, Zhu J, Ma C, Zhang J, Zong C. Read, watch, listen, and summarize: Multi-modal summarization for asynchronous text, image, audio and video. IEEE Trans Knowl Data Eng. 2018;31:996–1009. DOI: 10.1109/TKDE.2018.2848260.
- Emad A, Bassel F, Refaat M, Abdelhamed M, Shorim N, AbdelRaouf A. Automatic video summarization with timestamps using natural language processing text fusion. In: 2021 IEEE 11th Annual Computing and Communication Workshop and Conference (CCWC). IEEE Publications; 2021. p. 0060–6.
- Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, et al. Transformers: State-of-the-art natural language processing. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations; 2020. p. 38–45. DOI: 10.18653/v1/2020.emnlp-demos.6.
- Jain A, Kulkarni G, Shah V. Natural language processing. Int J Comput Sci Eng. 2018;6:161–7. DOI: 10.26438/ijcse/v6i1.161167.
- Joseph SR, Hlomani H, Letsholo K, Kaniwa F, Sedimo K. Natural language processing: A review. Int J Res Eng Appl Sci. 2016;6:207–10.

Journal of Artificial Intelligence Research & Advances
| Volume | 11 |
| Issue | 03 |
| Received | 03/07/2024 |
| Accepted | 06/09/2024 |
| Published | 11/09/2024 |
Login
PlumX Metrics
