The paper presents KernelHive, a parallel computing system that optimizes data prefetching in heterogeneous high-performance computing (HPC) environments by means of a new optimizer and network-aware scheduling. It proposes NoSQL databases for data management and describes an architecture that improves performance through efficient data handling and task scheduling across diverse computing devices. Experiments show that the implementation improves performance, most notably by reducing execution time for applications that process large input data.
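To make the idea of data prefetching concrete, the following is a minimal sketch of the general technique: the next input chunk is fetched in the background while the current one is being processed. It is not KernelHive's implementation. The names `fetch_chunk`, `process_chunk`, and `run_with_prefetch` are hypothetical, and a plain dictionary stands in for the data store.

```python
from concurrent.futures import ThreadPoolExecutor


def fetch_chunk(store, key):
    """Hypothetical stand-in for reading one input chunk from a data store."""
    return store[key]


def process_chunk(chunk):
    """Hypothetical stand-in for a compute kernel; here it just sums the values."""
    return sum(chunk)


def run_with_prefetch(store, keys):
    """Process chunks in order while the next chunk is fetched in the background."""
    results = []
    if not keys:
        return results
    # A single worker thread is enough to overlap one fetch with one computation.
    with ThreadPoolExecutor(max_workers=1) as pool:
        pending = pool.submit(fetch_chunk, store, keys[0])
        for next_key in keys[1:]:
            chunk = pending.result()  # wait for the chunk requested earlier
            pending = pool.submit(fetch_chunk, store, next_key)  # prefetch the next one
            results.append(process_chunk(chunk))  # compute while the fetch runs
        results.append(process_chunk(pending.result()))  # last chunk, nothing left to prefetch
    return results


if __name__ == "__main__":
    store = {"a": [1, 2, 3], "b": [4, 5], "c": [6]}  # assumed toy data
    print(run_with_prefetch(store, ["a", "b", "c"]))
```

In this sketch the benefit appears only when fetching is slow relative to computation, for example when chunks arrive over a network, which is the situation the summary describes for applications with large input data.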