How a single chip causes massive power bills GPUSimPow: A GPGPU power simulator J Lucas, S Lal, M Andersch, M Alvarez-Mesa, B Juurlink 2013 IEEE International Symposium on Performance Analysis of Systems and …, 2013 | 87 | 2013 |
The neuro vector engine: Flexibility to improve convolutional net efficiency for wearable vision M Peemen, R Shi, S Lal, B Juurlink, B Mesman, H Corporaal 2016 Design, Automation & Test in Europe Conference & Exhibition (DATE …, 2016 | 23 | 2016 |
E²MC: Entropy Encoding Based Memory Compression for GPUs S Lal, J Lucas, B Juurlink 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017 | 22 | 2017 |
SYCL-bench: a versatile cross-platform benchmark suite for heterogeneous computing S Lal, A Alpay, P Salzmann, B Cosenza, A Hirsch, N Stawinoga, ... Euro-Par 2020: Parallel Processing: 26th International Conference on …, 2020 | 21 | 2020 |
GPGPU workload characteristics and performance analysis S Lal, J Lucas, M Andersch, M Alvarez-Mesa, A Elhossini, B Juurlink 2014 International Conference on Embedded Computer Systems: Architectures …, 2014 | 12 | 2014 |
A quantitative study of locality in GPU caches for memory-divergent workloads S Lal, BS Varma, B Juurlink International journal of parallel programming 50 (2), 189-216, 2022 | 10 | 2022 |
SLC: Memory access granularity aware selective lossy compression for GPUs S Lal, J Lucas, B Juurlink 2019 Design, Automation & Test in Europe Conference & Exhibition (DATE …, 2019 | 10 | 2019 |
SYCL-bench: A versatile single-source benchmark suite for heterogeneous computing S Lal, A Alpay, P Salzmann, B Cosenza, N Stawinoga, P Thoman, ... Proceedings of the International Workshop on OpenCL, 1-1, 2020 | 9 | 2020 |
Optimal DC/AC Data Bus Inversion Coding J Lucas, S Lal, B Juurlink Design, Automation & Test in Europe Conference & Exhibition (DATE), 2018 | 9 | 2018 |
Performance counters based power modeling of mobile GPUs using deep learning N Mammeri, M Neu, S Lal, B Juurlink 2019 International Conference on High Performance Computing & Simulation …, 2019 | 5 | 2019 |
A quantitative study of locality in GPU caches S Lal, B Juurlink Embedded Computer Systems: Architectures, Modeling, and Simulation: 20th …, 2020 | 4 | 2020 |
QSLC: Quantization-Based, Low-Error Selective Approximation for GPUs S Lal, J Lucas, B Juurlink | 3 | 2021 |
Accelerating metabolic pathways simulation using GPUs S Lal, K Paul, J Gomes Annual International Conference on Advances in Distributed and Parallel …, 2010 | 2 | 2010 |
Memory access granularity aware lossless compression for GPUs S Lal, M Renz, J Hartmer, B Juurlink 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022 | 1 | 2022 |
A Case for Memory Access Granularity Aware Selective Lossy Compression for GPUs S Lal, B Juurlink ACM Student Research Competition, MICRO, 2018 | 1 | 2018 |
DART: A GPU architecture exploiting temporal SIMD for divergent workloads J Lucas, S Lal, MA Mesa, A Elhossini, B Juurlink Proceedings of the 9th International Summer School on Advanced Computer …, 2013 | 1 | 2013 |
Beyond compression ratio: a throughput analysis of memory compression techniques for GPUs M Renz, S Lal 2023 IEEE 41st International Conference on Computer Design (ICCD), 255-262, 2023 | | 2023 |
An Efficient Lightweight Framework for Porting Vision Algorithms on Embedded SoCs A Ashish, S Lal, B Juurlink International Embedded Systems Symposium, 130-141, 2023 | | 2023 |
Power modeling and architectural techniques for energy-efficient GPUs S Lal | | 2019 |
Exploring GPGPUs Workload Characteristics and Power Consumption S Lal, J Lucas, M Alvarez-Mesa, A Elhossini, B Juurlink Proceedings of the 9th International Summer School on Advanced Computer …, 2013 | | 2013 |