Loop and task parallelization on heterogeneous cloud clusters using map-reduce for...
Modeling and simulation of relations among real-time computational tasks using gra...
SPMD parallelization of octrees for CUDA architectures with applications to SPH an...
Development of a virtual high performance software laboratory for nanotherapy simu...
LLM-enabled control with dynamic compute offloading of AI modules
Machine Learning Under The Hood: Efficient Accelerators for Deep Networks and its ...