Abstract
Large Language Models (LLMs) have attracted considerable attention owing to recent technological advances. However, they demand large amounts of memory and high processing power, which is particularly challenging for systems with a limited amount of DRAM and slow storage devices. Often, the data needed by the models cannot fit entirely in main memory (DRAM), requiring external storage systems which…