Research – Old

Ph.D. Dissertation Research

Dissertation Title: 

  • Optimizing FPGA Architecture for Deep Learning Workloads

Highlights:

  • Adding matrix multiplier blocks to FPGAs
    • Read more
    • Impact: The FPGA industry has developed similar blocks recently like Intel AI Tensor Block and Achronix Machine Learning Processor. Although these blocks do not support matrix multiplication natively, the fact that industry is finding similar solutions demonstrates that our research is on the right track.
  • Incorporating compute-in-memory on FPGAs
    • Read more
    • Impact: Our paper proposing these memory blocks at FCCM’22 won the Best Paper Award, and companies like Rapid Silicon have expressed interest in this technology.
  • Benchmarks for FPGA architecture research
    • Read more
    • Impact: Koios is being used by many researchers worldwide and also by companies like Efinix in their regression suite.

Poster:

  • Click here for a poster that summarizes my PhD research

Other Current/Prior Research

  • Developing a FPGA-like spatial accelerator for ML workloads using compute-enabled RAMs, with a focus on easy compilation
  • Accelerating an entire different type of ML algorithm that does not use multiplication for computation: Weightless Neural Networks
  • Predicting the power consumption of an application deployed on FPGA#1 using an ML model trained on a FPGA#2

Collaborations

  • University of Toronto, Canada
  • University of California Los Angeles
  • University of New Brunswick, Canada
  • International Institute of Technology, Bengaluru, India
  • Federal University of Rio de Janeiro, Brazil
  • Federal University of Reconcavo da Bahia, Brazil

Funding

My research at UT Austin has been funded by a NSF grant for 2020-2022. I was a Graduate Research Assistant at the University of Austin.

For 2022-2023, my research is funded by the UT Austin Graduate School Fellowship. I was awarded this fellowship because of an excellent publication record.

List of Publications

Peer Reviewed (first author)

CoMeFa: Deploying Compute-in-Memory on FPGAs for Deep Learning Acceleration [PDF]

Aman Arora, Atharva Bhamburkar, Aatman Borda, Tanmay Anand, Rishabh Sehgal, Bagus Hanindhito, Pierre-Emmanuel Gaillardon, Jaydeep Kulkarni, Lizy K. John

ACM Transactions on Reconfigurable Technology and Systems (TRETS), June 2023 

Koios 2.0: Open-Source Deep Learning Benchmarks for FPGA Architecture and CAD Research [PDF]

Aman Arora, Andrew Boutros, Seyed Alireza Damghani, Karan Mathur, Vedant Mohanty, Tanmay Anand, Mohamed Elgammal, Kenneth B. Kent, Vaughn Betz, Lizy K. John

IEEE Transactions on Computer Aided Design of Integrated Circuits and Systems (TCAD)2023 

CoMeFa: Compute-in-Memory Blocks for FPGAs [PDF][Video]

Aman Arora, Tanmay Anand, Aatman Borda, Rishabh Sehgal, Bagus Hanindhito, Jaydeep Kulkarni and Lizy K. John

IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM)May 2022 

(Acceptance Rate = 20.8%, Best Paper Award

Tensor Slices: FPGA Building Blocks For The Deep Learning Era [PDF]

Aman Arora, Moinak Ghosh, Samidh Mehta, Vaughn Betz and Lizy K. John

ACM Transactions on Reconfigurable Technology and Systems (TRETS), 2022 

MathRAMs: Configurable Fused Compute-Memory Blocks for FPGAs (Poster) [PDF]

Aman Arora, Aatman Borda, Tanmay Anand, Bagus Hanindhito, Lizy K. John

ACM International Symposium on Field-Programmable Gate Arrays (FPGA), February 2022

Compute RAMs: Adaptable Compute and Storage Blocks for DL-Optimized FPGAs [PDF]

Aman Arora, Bagus Hanindhito, Lizy K. John

IEEE Asilomar Conference on Signals, Systems, and Computers (ASILOMAR), October 2021 

(Acceptance Rate = 71.5%, Student Paper Contest Finalist, Top-8 out of 477

Koios: A Deep Learning Benchmark Suite for FPGA Architecture and CAD Research [PDF][Video]

Aman Arora, Andrew Boutros, Daniel Rauch, Aishwarya Rajen, Aatman Borda, Seyed Alireza Damghani, Samidh Mehta, Sangram Kate, Pragnesh Patel, Kenneth B. Kent, Vaughn Betz, Lizy K. John

IEEE International Conference on Field-Programmable Logic and Applications (FPL), August 2021

(Acceptance Rate = 22%) 

Tensor Slices to the Rescue: Supercharging ML Acceleration on FPGAs [PDF][Video]

Aman Arora, Samidh Mehta, Vaughn Betz, Lizy K. John

ACM International Symposium on Field-Programmable Gate Arrays (FPGA), February 2021

(Acceptance Rate = 23%) 

Hamamu: Specializing FPGAs for ML Applications by Adding Hard Matrix Multiplier Blocks [PDF][Video]

Aman Arora, Zhigang Wei, Lizy K. John

IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP), July 2020

(Acceptance Rate = 24%) 

The Case for Hard Matrix Multiplier Blocks in an FPGA (Poster) [PDF]

Aman Arora, Zhigang Wei, Lizy K. John

ACM International Symposium on Field-Programmable Gate Arrays (FPGA), February 2020

(Acceptance Rate = 25% for papers, 30% for posters) 

Peer Reviewed (co-authored)

HLSDataset: Open-Source Dataset for ML-Assisted FPGA Design using High Level Synthesis []

Zhigang Wei, Aman Arora, Ruihao Li, Lizy K. John

IEEE International Conference on Application-specific Systems, Architectures, and Processors (ASAP), July 2023

COIN: Combinational Intelligent Networks (Poster) []

Igor D. S. Miranda, Aman Arora, Zachary Susskind, Josias S. A. Souza, Mugdha P. Jadhao, Luis A. Q. Villon, Diego L. C. Dutra, Priscila M. V. Lima, Felipe M. G. França, Mauricio Breternitz and Lizy K. John

IEEE International Conference on Application-specific Systems, Architectures, and Processors (ASAP), July 2023

An FPGA-Based Weightless Neural Network for Edge Network Intrusion Detection (Poster) [PDF]

Zachary Susskind, Aman Arora, Alan Bacellar, Diego L. C. Dutra, Igor D. S. Miranda, Mauricio Breternitz Jr., Priscila M. V. Lima, Felipe M. G. Franca, Lizy K. John

ACM International Symposium on Field-Programmable Gate Arrays (FPGA), February 2023

Weightless Neural Networks for Efficient Edge Inference [PDF]

Zachary Susskind, Aman Arora, Igor D. S. Miranda, Luis A. Q. Villon, Rafael F. Katopodis, Leandro S. de Araujo, Diego L. C. Dutra, Priscila M. V. Lima, Felipe M. G. Franca, Mauricio Breternitz Jr., Lizy K. John

Parallel Architectures and Compilation Techniques (PACT), October 2022 

Pruning Weightless Neural Networks [PDF]

Zachary Susskind, Alan T. L. Bacellar, Aman Arora, Luis A. Q. Villon, Renan Mendanha, Leandro S. de Araujo, Diego L. C. Dutra, Priscila M. V. Lima, Felipe M. G. Franca, Igor D. S. Miranda, Mauricio Breternitz Jr., Lizy K. John

European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), October 2022 

LogicWiSARD: Memoryless Synthesis of Weightless Neural Networks [PDF]

Igor Dantas Dos Santos Miranda, Aman Arora, Zachary Susskind, Lizy K. John, Luis Armando Quintanilla Villon, Rafael Fontella Katopodis, Diego Leonel Cadette Dutra, Priscila Machado Vieira Lima, Felipe Maia Galvao Franca, Leandro Santiago de Araujo, Mauricio Breternitz Jr.

IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP), July 2022 

(Acceptance Rate = TBD, Best Paper Candidate

Hardware-aware 3D Model Workload Selection and Characterization for Graphics and ML Applications (Poster) [PDF]

Ruihao Li, Aman Arora, Sikan Li, Qinzhe Wu and Lizy K. John

IEEE International Symposium on Quality Electronic Design (ISQED), April 2022 

LogGen: A Parameterized Generator for Designing Floating-Point Logarithm Units for Deep Learning [PDF]

Pragnesh Patel, Aman Arora, Earl Swartzlander and Lizy K. John

IEEE International Symposium on Quality Electronic Design (ISQED), April 2022 

Design Space Exploration of Softmax Implementations [PDF]

Zhigang Wei, Aman Arora, Lizy K. John

IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP), July 2020

(Acceptance Rate = 24%) 

Other Publications

A TPU-like Design for FPGA Benchmarking [PDF]

Aman Arora, Bagus Hanindhito, Harsh Gugale, Lizy K. John

Technical Report, Class Project

A novel approach to sideband signal verification with a reusable and configurable UVM agent

Aman Arora and Jeetendra Gupta

NTECH 2016 (NVIDIA’s internal tech conference)

Compile time parameter distribution for highly reusable testbenches

Aman Arora and Mark Glasser

Synopsys Users Group (SNUG) Silicon Valley 2015

RTL-Agent Switch: Implementation and Applications

Aman Arora, Nathan Wooster, Pavan Mula, Rob Porter

Synopsys Users Group (SNUG) Austin 2015

Method and apparatus to debug simulation failures for [on-chip] scan compression patterns

Anurag Jindal and Aman Arora

Defensive Publication (IPCOM #000185968D)

Testing memories through functional interface using BIST, with 100% test quality and 0% timing impact

Anurag Jindal and Aman Arora

Defensive Publication (IPCOM #000185974D)

Method and apparatus for robust verification of test views of library cells

Nikila K and Aman Arora

Defensive Publication (IPCOM #000191493D)

Testing flash memories

Aman Arora, Akshat Gupta & Mayuri Agarwal

White Paper on EETimes/Techonline 

Automated approach for efficient DFT pattern generation and simulation debug

Anurag Jindal, Aman Arora

Freescale Design Competitiveness Conference 2008

Contact Me