Survey paper on Multilingual Live Call Translation using Deep Learning

Year : 2024 | Volume :11 | Issue : 02 | Page : –
By

Vishal Tyagi

Divyansh Agrawal

Vibhu Sehgal

Nishant Shokeen

Nishi Gupta

  1. Student Department of Computer Science and Engineering, The NorthCap University, Gurgaon Haryana India
  2. Student Department of Computer Science and Engineering, The NorthCap University, Gurgaon Haryana India
  3. Student Department of Computer Science and Engineering, The NorthCap University, Gurgaon Haryana India
  4. Student Department of Computer Science and Engineering, The NorthCap University, Gurgaon Haryana India
  5. Student Department of Computer Science and Engineering, The NorthCap University, Gurgaon Haryana India

Abstract

This research paper surveys cutting-edge language translation technologies, including multi-lingual, real-time translation, voice recognition, speech-to-text conversion, and transcription in the hearing process. The paper explores the complex mechanisms behind voice call language translation, focusing on sophisticated machine learning models integrated with cloud-based or local applications to facilitate seamless communication across language barriers. Furthermore, conducting research in live communication analyzes the complexity of text and voice techniques to deliver translated content in timely written and audio formats. Through a comprehensive analysis of current developments, challenges, and future possibilities, this study is analytically valuable to researchers, practitioners, and enthusiasts. Significant advances in natural language processing and machine learning have been shown, and by including advanced methods with deep learning and neural networks, along with strengthening learning, the research aims to stimulate innovation and further development in this dynamic field.

Keywords: Language translation, Deep learning, Machine learning models, Natural language processing, Real-time translation, Speech-to-text conversion, Voice recognition

[This article belongs to Journal of Image Processing & Pattern Recognition Progress(joipprp)]

How to cite this article: Vishal Tyagi, Divyansh Agrawal, Vibhu Sehgal, Nishant Shokeen, Nishi Gupta. Survey paper on Multilingual Live Call Translation using Deep Learning. Journal of Image Processing & Pattern Recognition Progress. 2024; 11(02):-.
How to cite this URL: Vishal Tyagi, Divyansh Agrawal, Vibhu Sehgal, Nishant Shokeen, Nishi Gupta. Survey paper on Multilingual Live Call Translation using Deep Learning. Journal of Image Processing & Pattern Recognition Progress. 2024; 11(02):-. Available from: https://journals.stmjournals.com/joipprp/article=2024/view=152539

References

  1. Gambier, Yves. “Translations| Rapid and Radical Changes in Translation and Translation Studies.” International Journal of Communication 10 (2016): 20.
  2. Remael, Aline, and Gert Vercauteren. “The translation of recorded audio description from English into Dutch.” Perspectives: Studies in Translatology3 (2010): 155-171.
  3. Szarkowska, Agnieszka, et al. “Cognitive load in intralingual and interlingual respeaking–a preliminary study.” Poznan Studies in Contemporary Linguistics 52.2 (2016): 209-233.
  4. Hutchins, John. “Machine translation: A concise history.” Computer aided translation: Theory and practice29-70 (2007): 11.
  5. Sager, Juan C. “A practical course in terminology processing.” A Practical Course in Terminology Processing (1990): 1-270.
  6. Turchi M, Fantinuoli C. Proceedings of the 1st Workshop on Automatic Spoken Language Translation in Real-World Settings (ASLTRW). InProceedings of the 1st Workshop on Automatic Spoken Language Translation in Real-World Settings (ASLTRW) 2021 Aug.
  7. Arkhangorodsky, Arkady, et al. “MeetDot: Videoconferencing with live translation captions.” arXiv preprint arXiv:2109.09577 (2021).
  8. Wu, Yonghui, et al. “Google’s neural machine translation system: Bridging the gap between human and machine translation.” arXiv preprint arXiv:1609.08144(2016).
  9. Lu, Ziyao, et al. “Exploring multi-stage information interactions for multi-source neural machine translation.” IEEE/ACM Transactions on Audio, Speech, and Language Processing30 (2021): 562-570.
  10. Anguera, Xavier, et al. “Speaker diarization: A review of recent research.” IEEE Transactions on audio, speech, and language processing2 (2012): 356-370.
  11. Callison-Burch, C., Fordyce, C., Koehn, P., Monz, C., & Schroeder, J. (2007). (Meta-) evaluation of machine translation. Proceedings of the Second Workshop on Statistical Machine Translation, 136-158.
  12. Graham, Y., Baldwin, T., Moffat, A., & Zobel, J. (2013). Continuous measurement scales in human evaluation of machine translation. Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, 33-41.
  13. Brown, Tom, et al. “Language models are few-shot learners.” Advances in neural information processing systems 33 (2020): 1877-1901.
  14. Hazelwood, Kim, et al. “Applied machine learning at facebook: A datacenter infrastructure perspective.” 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA). IEEE, 2018.
  15. Gracia, Jorge, IIan Kernerman, and Besim Kabashi. “Results of the translation inference across dictionaries 2021 shared task.” CEUR workshop proc.. No. ART-2021-131934. 2021.
  16. Ribeiro, Marco Tulio, Sameer Singh, and Carlos Guestrin. “” Why should i trust you?” Explaining the predictions of any classifier.” Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. 2016.
  17. Wang, Xing, et al. “Neural machine translation advised by statistical machine translation.” Proceedings of the AAAI conference on artificial intelligence. Vol. 31. No. 1. 2017.
  18. Luong, Minh-Thang, Hieu Pham, and Christopher D. Manning. “Effective approaches to attention-based neural machine translation.” arXiv preprint arXiv:1508.04025(2015).
  19. Vaswani, Ashish, et al. “Attention is all you need.” Advances in neural information processing systems30 (2017).
  20. Chorowski, Jan K., et al. “Attention-based models for speech recognition.” Advances in neural information processing systems 28 (2015).
  21. Callison-Burch, Chris, Miles Osborne, and Philipp Koehn. “Re-evaluating the role of BLEU in machine translation research.” 11th conference of the european chapter of the association for computational linguistics. 2006.
  22. Bahdanau, Dzmitry, Kyunghyun Cho, and Yoshua Bengio. “Neural machine translation by jointly learning to align and translate.” arXiv preprint arXiv:1409.0473 (2014).
  23. Yang, Yilin, et al. “Improving multilingual translation by representation and gradient regularization.” arXiv preprint arXiv:2109.04778 (2021).
  24. Sutskever, Ilya, Oriol Vinyals, and Quoc V. Le. “Sequence to sequence learning with neural networks.” Advances in neural information processing systems 27 (2014).
  25. Callison-Burch, Chris, et al. “Findings of the 2010 joint workshop on statistical machine translation and metrics for machine translation.” Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. 2010.
  26. Ogundokun, R. O., Awotunde, J. B., Misra, S., Segun-Owolabi, T., Adeniyi, E. A., & Jaglan, V. (2021). An android based language translator application. Journal of Physics: Conference Series, 1767(1), 012032.
  27. Papineni, K., Roukos, S., Ward, T., & Zhu, W. J. (2002). BLEU: a method for automatic evaluation of machine translation. Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, 311-318.
  28. Callison-Burch, C., Osborne, M., & Koehn, P. (2006). Re-evaluating the role of BLEU in machine translation research. 11th Conference of the European Chapter of the Association for Computational Linguistics.
  29. Snover, M., Dorr, B., Schwartz, R., Micciulla, L., & Makhoul, J. (2006). A study of translation edit rate with targeted human annotation. Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, 223-231.
  30. Snover, M., Madnani, N., Dorr, B., & Schwartz, R. (2009). TER-Plus: Paraphrase, semantic, and alignment enhancements to Translation Edit Rate. Machine Translation, 23(2-3), 117-127.
  31. Banerjee, S., & Lavie, A. (2005). METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, 65-72.
  32. Denkowski, M., & Lavie, A. (2014). Meteor universal: Language specific translation evaluation for any target language. Proceedings of the 9th Workshop on Statistical Machine Translation, 376-380.
  33. Li H, Ma B, Lee KA. Spoken language recognition: from fundamentals to practice. Proceedings of the IEEE. 2013 Feb 6;101(5):1136-59.

Regular Issue Subscription Review Article
Volume 11
Issue 02
Received April 9, 2024
Accepted June 18, 2024
Published June 29, 2024