MAC Unit Implementation on FPGA

Notice

This is an unedited manuscript accepted for publication and provided as an Article in Press for early access at the author’s request. The article will undergo copyediting, typesetting, and galley proof review before final publication. Please be aware that errors may be identified during production that could affect the content. All legal disclaimers of the journal apply.

Year : 2026 | Volume : 13 | 01 | Page :
    By

    Dnyaneshwari L. Shinde,

  • Alisha B. Mulani,

  1. Student, Electronics & Telecommunication, SKN Sinhgad College of Engineering, Pandharpur, Maharashtra, India
  2. Student, Electronics & Telecommunication, SKN Sinhgad College of Engineering, Pandharpur, Maharashtra, India

Abstract

Multiply–accumulate (MAC) computations account for a large part of machine learning accelerator operations. The pipelined structure is usually adopted to improve the performance by reducing the length of critical paths. An increase in the number of flip-flops due to pipelining, however, generally results in significant area and power increase. Using this method, we create and build a cutset-free feedforward MAC architecture that maximizes data propagation and removes superfluous pipeline registers. By reorganizing the computation flow to eliminate unnecessary dependencies, the suggested MAC unit permits continuous forward data movement without compromising correctness. In order to achieve high operating speed with low latency, pipeline and parallel processing techniques are combined to guarantee that intermediate results propagate smoothly across stages. The suggested architecture, in contrast to conventional designs, minimizes register insertion and prevents over-pipelining, resulting in a more area- and power-efficient implementation. A large number of flip-flops are often required to meet the feedforward-cutset rule. Based on the observation that this rule can be relaxed in machine learning applications, we propose a pipelining method that eliminates some of the flip-flops selectively. The simulation show that the proposed MAC unit achieved a 50% area reduction compared with the conventional pipelined MAC. In this work, we propose and implement a cutset-free feedforward MAC unit that eliminates redundant dependencies and optimizes data flow for high-speed and low-latency operation. The architecture leverages pipeline and parallel processing techniques to ensure continuous data propagation without stalling, thereby maximizing throughput.

Keywords: Feedforward, Cutset-Free, Multiply-Accumulate (MAC) Unit, FPGA Implementation, Low-Latency, High-Speed Computing, Digital Signal Processing (DSP), Parallel Processing, Power Efficiency.

How to cite this article:
Dnyaneshwari L. Shinde, Alisha B. Mulani. MAC Unit Implementation on FPGA. Journal of Microcontroller Engineering and Applications. 2026; 13(01):-.
How to cite this URL:
Dnyaneshwari L. Shinde, Alisha B. Mulani. MAC Unit Implementation on FPGA. Journal of Microcontroller Engineering and Applications. 2026; 13(01):-. Available from: https://journals.stmjournals.com/jomea/article=2026/view=238854


References

  1.  Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. Communications of the ACM. 2017 May 24;60(6):84-90.
  2. Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556. 2014 Sep 4.
  3. Moons B, Uytterhoeven R, Dehaene W, Verhelst M. 14.5 envision: A 0.26-to-10tops/w subword- parallel dynamic-voltage-accuracy-frequency-scalable convolutional neural network processor in 28nm fdsoi. In2017 IEEE International Solid-State Circuits Conference (ISSCC) 2017 Feb 5 (pp. 246-247). IEEE.
  4. Chorowski JK, Bahdanau D, Serdyuk D, Cho K, Bengio Y. Attention-based models for speech recognition. Advances in neural information processing systems. 2015;28.
  5. Parashar A, Rhu M, Mukkara A, Puglielli A, Venkatesan R, Khailany B, Emer J, Keckler SW, Dally WJ. SCNN: An accelerator for compressed-sparse convolutional neural networks. ACM SIGARCH computer architecture news. 2017 Jun 24;45(2):27-40.
  6. Stelling PF, Oklobdzija VG. Implementing multiply-accumulate operation in multiplication time. InProceedings 13th IEEE Sympsoium on Computer Arithmetic 1997 Jul 6 (pp. 99-106). IEEE.
  7. Dadda L. Some schemes for parallel multipliers. Alta frequenza. 1965;34:349-56.
  8. Rastegari M, Ordonez V, Redmon J, Farhadi A. Xnor-net: Imagenet classification using binary convolutional neural networks. InEuropean conference on computer vision 2016 Sep 17 (pp. 525-542). Cham: Springer International Publishing.
  9. Gao M, Pu J, Yang X, Horowitz M, Kozyrakis C. Tetris: Scalable and efficient neural network acceleration with 3d memory. InProceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems 2017 Apr 4 (pp. 751-764).
  10. Macsorley OL. High-speed arithmetic in binary computers. Proceedings of the IRE. 2007 Jan 22;49(1):67-91.
  11. Kambale A. Home automation using google assistant. UGC care approved journal. 2023;32(1):1071-7.
  12. Mulani AO, Bang AV, Birajadar GB, Deshmukh AB, Jadhav HM, Liyakat KK. IoT Based Air, Water, and Soil Monitoring System for Pomegranate Farming. Annals of Agri-Bio Research. 2024;29(2):71-86.
  13. Kulkarni TM, Mulani AO. Face Mask Detection on Real Time Images and Videos using Deep Learning. International Journal of Electrical Machine Analysis and Design (IJEMAD). 2024;2(1).
  14. Thigale SP, Jadhav HM, Mulani AO, Birajadar GB, Nagrale M, Sardey MP. Internet of things and robotics in transforming healthcare services. Afr J Biol Sci (S Afr). 2024;6(6):1567-75.
  15. Pol DR. Cloud Based Memory Efficient Biometric Attendance System Using Face Recognition. Stochastic Modeling & Applications. 2021;25(2).
  16. Nagtilak MA, Ulegaddi MS, Adat MA, Mulani AO. Breast Cancer Prediction using Machine Learning. International Journal of Research in Science and Engineering. 2021;1(2).
  17. Kulkarni TM, Mulani AO. Deep Learning Based Face-Mask Detection: An Approach to Reduce Pandemic Spreads in Human Healthcare. African Journal of Biological Sciences. 2024;6(6).
  18. Mulani A, Mane PB. DWT based robust invisible watermarking. Scholars& Press; 2016.
  19. Jadhav VS, Salunkhe SS, Salunkhe G, Yawle PR, Pol RS, Mulani AO, Rana M. IOT based health monitoring system for Human. African Journal of Biomedical Research. 2024 Sep:1309- 15.
  20. Nagtilak AG, Ulegaddi SN, Mane M, Mulani AO. Automatic Solar Powered Pesticide Sprayer for Farming. International Journal of Microwave Engineering and Technology. 2023;9(2):7-18.
  21. Dandage AS, Rupnar VR, Pise TA, Mulani AO. Real-Time Language Translation Application Using Tkinter. International Journal. 2025 Jan;11(1).
  22. Deo SS. Challenges in the Implementation of Legal Aid Services. Journal of Constitutional Law and Jurisprudence. 2025 Mar 25;8(2):33-40.
  23. Deo SS. Legal Aid in India: Framework and Policies. Journal of Constitutional Law and Jurisprudence. 2025 Mar 25;8(2):21-6.
  24. Deo SS. Legal Aid in Criminal Justice. Journal of Human Rights Law and Practice. 2025 Apr 3;8(2):30-6.

Ahead of Print Subscription Review Article
Volume 13
01
Received 16/07/2025
Accepted 05/12/2025
Published 19/03/2026
Publication Time 246 Days


Login


My IP

PlumX Metrics