Vishal Tyagi, Divyansh Agrawal, Vibhu Sehgal, Nishant Shokeen, Nishi Gupta
- Students, Department of Computer Science and Engineering, The NorthCap University, Gurgaon, Haryana, India
Abstract
This paper surveys current language translation technologies, including multilingual real-time translation, voice recognition, speech-to-text conversion, and transcription of spoken audio. It examines the mechanisms behind language translation for voice calls, focusing on machine learning models integrated with cloud-based or local applications to enable communication across language barriers. For live communication, it analyzes the text and voice techniques needed to deliver translated content promptly in both written and audio form. Through an analysis of current developments, challenges, and future possibilities, the study offers an analytical resource for researchers, practitioners, and enthusiasts. It highlights significant advances in natural language processing and machine learning and, by covering advanced methods based on deep learning, neural networks, and reinforcement learning, aims to stimulate innovation and further development in this dynamic field.
Keywords: Language translation, deep learning, machine learning models, natural language processing, real-time translation, speech-to-text conversion, voice recognition
[This article belongs to Journal of Image Processing & Pattern Recognition Progress]
Vishal Tyagi, Divyansh Agrawal, Vibhu Sehgal, Nishant Shokeen, Nishi Gupta. Survey Paper on Multilingual Live Call Translation Using Deep Learning. Journal of Image Processing & Pattern Recognition Progress. 2024; 11(02):13-21. Available from: https://journals.stmjournals.com/joipprp/article=2024/view=152539

Journal of Image Processing & Pattern Recognition Progress
| Volume | 11 |
| Issue | 02 |
| Received | 09/04/2024 |
| Accepted | 18/06/2024 |
| Published | 29/06/2024 |