1.
Efficient Processing of Sparse and Compact DNN Models on Hardware Accelerators: Survey and Insights Workshop
In Annual Workshop on Sparsity in Neural Networks (SparseNN), 2022.
2.
Memory Performance Estimation of CUDA Programs Journal Article
In: ACM Transactions on Embedded Computing Systems, vol. 13, no. 21, pp. 21:1-21:22, 2013.
3.
CuMAPz: A tool to Analyze Memory Access Patterns in CUDA Proceedings Article
In: Proceedings of the 48th Design Automation Conference (DAC), pp. 128-133, 2011.