Advanced search
Start date
Betweenand


The Droplet Search Algorithm for Kernel Scheduling

Full text
Author(s):
Canesche, Michael ; Rosario, Vanderson M. ; Borin, Edson ; Quintao Pereira, Fernando Magno
Total Authors: 4
Document type: Journal article
Source: ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION; v. 21, n. 2, p. 28-pg., 2024-06-01.
Abstract

Kernel scheduling is the problem of finding the most efficient implementation for a computational kernel. Identifying this implementation involves experimenting with the parameters of compiler optimizations, such as the size of tiling windows and unrolling factors. This article shows that it is possible to organize these parameters as points in a coordinate space. The function that maps these points to the running time of kernels, in general, will not determine a convex surface. However, this article provides empirical evidence that the origin of this surface (an unoptimized kernel) and its global optimum (the fastest kernel) reside on a convex region. We call this hypothesis the "droplet expectation." Consequently, a search method based on the Coordinate Descent algorithm tends to find the optimal kernel configuration quickly if the hypothesis holds. This approach-called Droplet Search-has been available in Apache TVM since April of 2023. Experimental results with six large deep learning models on various computing devices (ARM, Intel, AMD, and NVIDIA) indicate that Droplet Search is not only as effective as other AutoTVM search techniques but also 2 to 10 times faster. Moreover, models generated by Droplet Search are competitive with those produced by TVM's AutoScheduler (Ansor), despite the latter using 4 to 5 times more code transformations than AutoTVM. (AU)

FAPESP's process: 13/08293-7 - CCES - Center for Computational Engineering and Sciences
Grantee:Munir Salomao Skaf
Support Opportunities: Research Grants - Research, Innovation and Dissemination Centers - RIDC