Hi! I am a Senior Research Scientist at the Architecture Research Group (ARG) at NVIDIA

Before joining NVIDIA, I completed my PhD at the ECE department at Carnegie Mellon University where I worked with Professor Brandon Lucia. My dissertation research focused on developing architectural support for optimizing irregular applications (particularly, graph processing applications)

Publications

"Symphony: Orchestrating Sparse and Dense Tensors with Hierarchical Heterogeneous Processing" ,
Michael Pellauer, Jason Clemons, Vignesh Balaji, Neal Crago, Aamer Jaleel, Donghyuk Lee, Mike O'Connor, Angshuman Parashar, Sean Treichler, Po-An Tsai, Steve W. Keckler, and Joel S. Emer,
ACM Transactions on Computer Systems (TOCS Nov 2023 issue)
[paper]

"Community-based Matrix Reordering for Sparse Linear Algebra Optimization" ,
Vignesh Balaji, Neal Crago, Aamer Jaleel, and Stephen W. Keckler,
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2023)
[paper] [slides]

"Improving Locality of Irregular Updates with Hardware Assisted Propagation Blocking" ,
Vignesh Balaji, and Brandon Lucia,
International Symposium on High Performance Computer Architecture (HPCA 2022)
(BEST PAPER NOMINEE)
[preprint] [slides] [teaser] [talk]

"P-OPT: Practical Optimal Cache Replacement for Graph Analytics" ,
Vignesh Balaji, Neal Crago, Aamer Jaleel, and Brandon Lucia,
International Symposium on High Performance Computer Architecture (HPCA 2021)
(BEST PAPER NOMINEE)
[paper] [slides] [teaser] [talk] [github]

"Optimizing Graph Processing and Preprocessing with Hardware Assisted Propagation Blocking",
Vignesh Balaji, and Brandon Lucia,
ArXiv 2020
[paper]

"Peacenik: Architecture Support for Not Failing under Fail-Stop Memory Consistency",
Rui Zhang, Swarnendu Biswas, Vignesh Balaji, Michael D. Bond, and Brandon Lucia,
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2020)
[paper] [slides] [github]

"Combining Data Duplication and Graph Reordering to Accelerate Parallel Graph Processing",
Vignesh Balaji, and Brandon Lucia,
International Symposium on High-Performance Parallel and Distributed Computing (HDPC 2019)
[paper] [slides] [github]

"When is Graph Reordering an Optimization? Studying the effect of lightweight graph reordering across applications and input graphs" ,
Vignesh Balaji, and Brandon Lucia,
IEEE International Symposium on Workload Characterization (IISWC 2018)
(BEST PAPER AWARD)
[preprint] [slides] [github]

"Flexible Support for Fast Parallel Commutative Updates",
Vignesh Balaji, Dhruva Tirumala and Brandon Lucia,
ArXiv 2018
[paper]

"An Architecture and Programming Model for Accelerating Parallel Commutative Computations via Privatization",
Vignesh Balaji, Dhruva Tirumala and Brandon Lucia,
Symposium on Principles and Practice of Parallel Programming (PPoPP 2017)
[poster]

"Intermittent Computing: Challenges and Opportunities",
Brandon Lucia, Vignesh Balaji, Alexei Colin, Kiwan Maeng, and Emily Ruppel,
Summit on Advances in Programming Languages (SNAPL 2017)
[paper]

"Overcoming the Data-flow Limit on Parallelism with Structural Approximation",
Vignesh Balaji, Brandon Lucia, and Radu Marculescu,
Workshop on Approximate Computing (WAX) colocated with (ASPLOS 2016)
[paper]

Interesting Stuff

A small collection of enlightening articles/papers/opinions...
[StrongInference]
[You-and-your-research]
[Technology-and-courage]

Contact