Text to Image Using Machine Learning

Year : 2024 | Volume : 11 | Issue : 02 | Page : 35 41
    By

    Aashutosh Meshram,

  • Yogini Kapse,

  • Shreedhar Mhatre,

  • Sandeep Patil,

  1. Student, Department of Electronics & Telecommunication, Smt. Kashibai Navale College of Engineering, Pune, Maharashtra, India
  2. Student, Department of Electronics & Telecommunication, Smt. Kashibai Navale College of Engineering, Pune, Maharashtra, India
  3. Student, Department of Electronics & Telecommunication, Smt. Kashibai Navale College of Engineering, Pune, Maharashtra, India
  4. Assistant Professor, Department of Electronics & Telecommunication, Smt. Kashibai Navale College of Engineering, Pune, Maharashtra, India

Abstract

In the era of digital transformation, our project addresses the convergence of computer vision and natural language processing to enhance user interaction and visual content creation. This project comprises three distinct modules: user authentication and session management, image colorization from grayscale inputs, and text-to-image generation. The login registration module provides secure access to the system, ensuring user privacy and data integrity. Once authenticated, users can utilize advanced computer vision techniques through our image colorization module, where grayscale images are transformed into vibrant, realistic color representations using deep learning models and OpenCV. Additionally, our text-to-image generation module leverages state-of-the-art natural language processing models to convert textual descriptions into visually compelling images. These modules are integrated into a Flask-based web application, ensuring seamless user interaction through intuitive interfaces and robust backend processing. The project not only showcases technical proficiency in AI-driven applications but also underscores practical applications in enhancing digital content creation and user experience.

Keywords: Computer vision, natural language processing (NLP), deep learning, flask web application, image colorization, text-to- image generation

[This article belongs to Journal of Image Processing & Pattern Recognition Progress ]

How to cite this article:
Aashutosh Meshram, Yogini Kapse, Shreedhar Mhatre, Sandeep Patil. Text to Image Using Machine Learning. Journal of Image Processing & Pattern Recognition Progress. 2024; 11(02):35-41.
How to cite this URL:
Aashutosh Meshram, Yogini Kapse, Shreedhar Mhatre, Sandeep Patil. Text to Image Using Machine Learning. Journal of Image Processing & Pattern Recognition Progress. 2024; 11(02):35-41. Available from: https://journals.stmjournals.com/joipprp/article=2024/view=159326


References

  1. Upendra Hruthik S, Shruti Suman, Vishnu Sai S, Jaya Dharani. (Department of ECE, Koneru Lakshmaiah Education Foundation, Andhra Pradesh, India). Real-Time Face Mask Detection using OpenCV and Deep Learning. International Conference on Emerging Technologies: AI, IoT, and CPS for Science & Technology Applications, NITTTR Chandigarh, India. 2021 Sep 06–07.
  2. Mishra S, Verma V, Akhtar N, Chaturvedi S, Perwej Y. An intelligent motion detection using OpenCV. Int J Sci Res Sci Eng Technol. 2022 Mar 5; 9(2): 51–63.
  3. You Y, Gong S, Liu C. Adaptive moving object detection algorithm based on back ground subtraction and motion estimation. Int J Advancements in Computing Technology (IJACT). 2013; 5(6): 357–63.
  4. Murshed M, Ramirez A, Chae O. Statistical background modeling: an edge segment based moving object detection approach. In 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance. 2010 Aug 29; 300–306.
  5. Duraimurugan N, Chokkalingam SP. Real-Time Object Detection with Yolo. Int J Eng Adv Technol (IJEAT). 2019 Feb; 8(3S): 578–81.
  6. Marengoni M, Stringhini D. High level computer vision using opencv. In 2011 IEEE 24th SIBGRAPI Conference on Graphics, Patterns, and Images Tutorials. 2011 Aug 28; 11–24.
  7. Gürel C, Erden A. Face detection algorithm with facial feature extraction for face recognition system. In The 20th Int Conf on Mechatronics and Machine Vision in Practice. 2013; 1–7.
  8. Huang P, Liu Y, Fu C, Zhao L. Multi-Semantic fusion generative adversarial network for text-to-image generation. In 2023 IEEE 8th International Conference on Big Data Analytics (ICBDA). 2023 Mar 3; 159–164.
  9. Raghavan V, Sree SR, Kaladevi R, Hariharan S, Bhanuprasad A. Black and White Image Colorization using Deep Learning. In 2022 IEEE International Conference on Disruptive Technologies for Multi-Disciplinary Research and Applications (CENTCON). 2022 Dec 22; 2: 27–30.
  10. Meshram K, Jadhav H, Narsale N, Raut R, Devkar A. Image Generation from Random Noise using Generative Adversarial Networks. In 2023 IEEE International Conference on Computational Intelligence and Sustainable Engineering Solutions (CISES). 2023 Apr 28; 381–385.
  11. Xue A. End-to-end Chinese landscape painting creation using generative adversarial networks. In Proceedings of the IEEE/CVF Winter conference on applications of computer vision. 2021; 3863–3871.
  12. Nimala K, Geetha G, Ponsam JG. Colorization of Black & White Videos & Photographs. In 2022 IEEE International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES). 2022 Jul 15; 1–6.

Regular Issue Subscription Review Article
Volume 11
Issue 02
Received 03/07/2024
Accepted 20/07/2024
Published 30/07/2024


Login


My IP

PlumX Metrics