CMOS-Based Process-Scalable Analog Circuits for Machine Learning: A Comprehensive Review and Future Directions.

Year : 2025 | Volume : 15 | Issue : 01 | Page : 8 17
    By

    Abhishek Agwekar,

  • Laxmi Singh,

  1. PhD Scholar, Department of Electronics & Communication Engineering, Rabindranath Tagore University (RNTU), Bhopal, Madhya Pradesh, India
  2. Professor & Head, Department of Electronics & Communication Engineering, Rabindranath Tagore University (RNTU), Bhopal, Madhya Pradesh, India

Abstract

Analog computing techniques are gaining attention for machine learning (ML) applications due to their ability to reduce computational complexity. Continuous operations such as addition and subtraction offer a simpler and more efficient approach compared to probabilistic product decoding, which can be sensitive to noise and inconsistent measurements. This paper presents a simulated VLSI implementation of a broadcast edge connection, independent of the MOS component model, along with experimental results. The implementation requires in-loop circuit training to address device incompatibilities and reduce artifacts. To speed up wafer-in-the-loop training, circuits are reweighted to ensure that simulated functions remain unchanged during training and inference. We propose a simulation paradigm and circuit using “positive” functions that remain invariant to transistor bias and ambient temperature variations. Additionally, we describe a 3-layer neural network based on S-AC (Shape-Based Analog Computing) with six hidden nodes. This design utilizes differential compressive input and in-memory computing to store weights, significantly reducing data movement energy. Experimental results demonstrate that S-AC circuits are tenable, conflict-free, and capable of operating under various biasing conditions and temperatures. The S-AC framework allows a trade-off between computational accuracy, speed, power, and area. This architecture is ideal for high-speed ML training in the SI regime and energy-efficient inference in the WI regime, making it suitable for both server and edge applications.

Keywords: Shape-based Analog computing, CMOS, machine learning, processor

[This article belongs to Journal of VLSI Design Tools and Technology ]

How to cite this article:
Abhishek Agwekar, Laxmi Singh. CMOS-Based Process-Scalable Analog Circuits for Machine Learning: A Comprehensive Review and Future Directions.. Journal of VLSI Design Tools and Technology. 2025; 15(01):8-17.
How to cite this URL:
Abhishek Agwekar, Laxmi Singh. CMOS-Based Process-Scalable Analog Circuits for Machine Learning: A Comprehensive Review and Future Directions.. Journal of VLSI Design Tools and Technology. 2025; 15(01):8-17. Available from: https://journals.stmjournals.com/jovdtt/article=2025/view=198472


References

1. Lars Warhammer. 12 – Integrated Circuit Design. DSP Integrated Circuits. Academic Press Series in Engineering. 1999, Pages 531-554.
2. Claeys C. Technological challenges of advanced CMOS processing and their impact on design aspects. In17th International Conference on VLSI Design. Proceedings. 2004 Jan 9 (pp. 275-282). IEEE.
3. Jouppi NP, Young C, Patil N, Patterson D, Agrawal G, Bajwa R, Bates S, Bhatia S, Boden N, Borchers A, Boyle R. In-datacentre performance analysis of a tensor processing unit. In Proceedings of the 44th annual international symposium on computer architecture 2017 Jun 24 (pp. 1-12).
4. Azmat H, Huma Z. Analog Computing for Energy-Efficient Machine Learning Systems. Aitoz Multidisciplinary Review. 2024 Dec 30;3(1):33-9.
5. Vijetha U, Geetha V. Optimizing Reinforcement Learning-Based Visual Navigation for Resource-Constrained Devices. IEEE Access. 2023 Oct 11.
6. Baranwal AR, Ullah S, Sahoo SS, Kumar A. ReLAccS: A multilevel approach to accelerator design for reinforcement learning on FPGA-based systems. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 2020 Oct 2;40(9):1754-67.
7. Gui P. Analog-to-digital converters and time-to-digital converters for high-energy physics experiments. Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment. 2023 Aug 29:168649.
8. Agrawal A, Kar M, Kim KH, Rylov S, Jung J, Munetoh S, Ho-Sokawa K, Zhang X, Hekmatshoartabari B, Carta F, Cochet M. A Switched-Capacitor Integer Compute Unit with Decoupled Storage and Arithmetic for Cloud AI Inference in 5nm CMOS. In2023 IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits) 2023 Jun 11 (pp. 1-2). IEEE.
9. Bailey S, Rigge P, Han J, Lin R, Chang EY, Mao H, Wang Z, Markley C, Izraelevitz AM, Wang A, Narevsky N. A mixed-signal risc-v signal analysis soc generator with a 16-nm finfet instance. IEEE Journal of Solid-State Circuits. 2019 Jul 17;54(10):2786-801.
10. Lee J, Yoo HJ. An overview of energy-efficient hardware accelerators for on-device deep-neural-network training. IEEE Open Journal of the Solid-State Circuits Society. 2021 Oct 13;1:115-28. .
11. Shuvo MM, Islam SK, Cheng J, Morshed BI. Efficient acceleration of deep learning inference on resource-constrained edge devices: A review. Proceedings of the IEEE. 2022 Dec 14;111(1):42-91.
12. Ambrosi J, Ankit A, Antunes R, Chalamalasetti SR, Chatterjee S, El Hajj I, Fachini G, Faraboschi P, Foltin M, Huang S, Hwu WM. Hardware-software co-design for an analog-digital accelerator for machine learning. In2018 IEEE International Conference on Rebooting Computing (ICRC) 2018 Nov 7 (pp. 1-13). IEEE.
13. Ivanov A, Dryden N, Ben-Nun T, Li S, Hoefler T. Data movement is all you need: A case study on optimizing transformers. Proceedings of Machine Learning and Systems. 2021 Mar 15;3:711-32.
14. Farooq U, Tariq N, Asim M, Baker T, Al-Shamma’a A. Machine learning and the Internet of Things security: Solutions and open challenges. Journal of Parallel and Distributed Computing. 2022 Apr 1; 162:89-104.
15. Albanese A, Nardello M, Brunelli D. Low-power deep learning edge computing platform for resource constrained lightweight compact UAVs. Sustainable Computing: Informatics and Systems. 2022 Apr 1; 34:100725.
16. Reuther A, Michaleas P, Jones M, Gadepally V, Samsi S, Kepner J. Survey of machine learning accelerators. In2020 IEEE high performance extreme computing conference (HPEC) 2020 Sep 22 (pp. 1-12). IEEE.
17. Lee KH, Verma N. A 1.2–0.55 V general-purpose biomedical processor with configurable machine-learning accelerators for high-order, patient-adaptive monitoring. In2012 Proceedings of the ESSCIRC (ESSCIRC) 2012 Sep 17 (pp. 285-288). IEEE.
18. Lebedev MA, Tate AJ, Hanson TL, Li Z, O’Doherty JE, Winans JA, Ifft PJ, Zhuang KZ, Fitzsimmons NA, Schwarz DA, Fuller AM. Future developments in brain-machine interface research. Clinics. 2011 Jan 1; 66:25-32.
19. Chen Y, Yao E, Basu A. A 128-channel extreme learning machine-based neural decoder for brain machine interfaces. IEEE transactions on biomedical circuits and systems. 2015 Dec 7;10(3):679-92.
20. Tu Y, Sadiq S, Tao Y, Shyu ML, Chen SC. A power efficient neural network implementation on heterogeneous FPGA and GPU devices. In2019 IEEE 20th international conference on information reuse and integration for data science (IRI) 2019 Jul 30 (pp. 193-199). IEEE.
21. Chen Y, Yao E, Basu A. A 128 channel 290 GMACs/W machine learning-based co-processor for intention decoding in brain machine interfaces. In2015 IEEE International Symposium on Circuits and Systems (ISCAS) 2015 May 24 (pp. 3004-3007). IEEE.
22. Yin S, Ouyang P, Tang S, Tu F, Li X, Zheng S, Lu T, Gu J, Liu L, Wei S. A high energy efficient reconfigurable hybrid neural network processor for deep learning applications. IEEE Journal of Solid-State Circuits. 2017 Dec 14;53(4):968-82.
23. Yiu M, Winstead C, Gaudet V, Schlegel C. Design for testability of CMOS analog sum-product error-control decoders. IEEE Transactions on Circuits and Systems II: Express Briefs. 2007 Jul 23;54(8):675-9.
24. Chakrabartty S. CMOS analog iterative decoders using margin propagation circuits. In2006 IEEE International Symposium on Circuits and Systems 2006 May 21 (pp. 4-pp). IEEE.
25. Loeliger HA, Lustenberger F, Helfenstein M, Tarkoy F. Probability propagation and decoding in analog VLSI. IEEE Transactions on Information Theory. 2001 Feb;47(2):837-43.
26. Burges CJ. A tutorial on support vector machines for pattern recognition. Data mining and knowledge discovery. 1998 Jun;2(2):121-67.
27. V. Vapnik, The Nature of Statistical Learning Theory, 2nd ed., Springer, 2000.
28. Zolotov V, Blaauw D, Sirichotiyakul S, Becer M, Oh C, Panda R, Grinshpon A, Levy R. Noise propagation and failure criteria for VLSI designs. InProceedings of the 2002 IEEE/ACM international conference on Computer-aided design 2002 Nov 10 (pp. 587-594).
29. Kumar P, Nandi A, Chakrabartty S, Thakur CS. Process, bias, and temperature scalable cmos analog computing circuits for machine learning. IEEE Transactions on Circuits and Systems I: Regular Papers. 2022 Nov 1;70(1):128-41.
30. Henkel J, Li H, Raghunathan A, Tahoori MB, Venkataramani S, Yang X, Zervakis G. Approximate computing and the efficient machine learning expedition. InProceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design 2022 Oct 30 (pp. 1-9).
31. Azmat H, Huma Z. Analog Computing for Energy-Efficient Machine Learning Systems. Aitoz Multidisciplinary Review. 2024 Dec 30;3(1):33-9.
32. Thakker RA, Baghini MS, Patil MB. Automatic design of low-power low-voltage analog circuits using particle swarm optimization with re-initialization. journal of low power electronics. 2009 Oct 1;5(3):291-302.
33. Nuzzo P, Sangiovanni-Vincentelli A. Robustness in analog systems: Design techniques, methodologies and tools. In2011 6th IEEE International Symposium on Industrial and Embedded Systems 2011 Jun 15 (pp. 194-203). IEEE.
34. Ramsay EP, Clark DT, Cormack JD, Murphy AE, Smith DA, Thompson RF, Young RA, Finney S. Digital and analogue integrated circuits in silicon carbide for high temperature operation. Additional Papers and Presentations. 2012 Jan 1;2012(HITEC):000373-7.
35. Rumberg B, Graham DW, Kulathumani V. Hibernets: Energy-efficient sensor networks using analog signal processing. InProceedings of the 9th ACM/IEEE International Conference on Information Processing in Sensor Networks 2010 Apr 12 (pp. 129-139).
36. Haensch W, Gokmen T, Puri R. The next generation of deep learning hardware: Analog computing. Proceedings of the IEEE. 2018 Oct 12;107(1):108-22.
37. Severo LC, Girardi A, de Oliveira AB, Kepler FN, Cera MC. Simulated annealing to improve analog integrated circuit design: Trade-offs and implementation issues. Simulated annealing-single and multiple objective problems. 2012 Oct 17:261-83.
38. Törngren M. Fundamentals of implementing real-time control applications in distributed computer systems. Real-time systems. 1998 May; 14:219-50.
39. Tarassenko L, Tombs J, Cairns G. On-chip learning with analogue VLSI neural networks. International journal of neural systems. 1993 Dec;4(04):419-26.
40. Ng W, Minasny B, Mendes WD, Demattê JA. Estimation of effective calibration sample size using visible near infrared spectroscopy: Deep learning vs machine learning. Soil Discussions. 2019 Sep 17; 2019:1-21.
41. Georgiou P, Toumazou C. ISFET characteristics in CMOS and their application to weak inversion operation. Sensors and Actuators B: Chemical. 2009 Dec 4;143(1):211-7.
42. Annema AJ, Nauta B, Van Langevelde R, Tuinhout H. Analog circuits in ultra-deep-submicron CMOS. IEEE journal of solid-state circuits. 2005 Jan 3;40(1):132-43.


Regular Issue Subscription Review Article
Volume 15
Issue 01
Received 22/01/2025
Accepted 27/01/2025
Published 30/01/2025
Publication Time 8 Days


Login


My IP

PlumX Metrics