Loop and task parallelization on heterogeneous cloud clusters using map-reduce for...
SPMD parallelization of octrees for CUDA architectures with applications to SPH an...
Modeling and simulation of relations among real-time computational tasks using gra...
Machine Learning Under The Hood: Efficient Accelerators for Deep Networks and its ...
Custom heterogeneous hardware acceleration for high-performance computing applicat...
Custom heterogeneous hardware acceleration for high-performance computing applicat...