
Optimization of mapping non-uniform interacting processes onto the architecture of a computing cluster

Developed for: Intel Inc., Nizhny Novgorod, Russia

Purpose: reduction in delays during synchronization by assigning the most intensively interacting processes to computing cores with the fastest interconnect

In the course of the project, a method was developed for mapping the non-uniform interacting parallel processes of an MPI-based computing application onto the architecture of a computing cluster. The method is intended to reduce delays during synchronization by assigning the most intensively interacting processes to the computing cores with the fastest interconnect. It represents the computational task and the cluster architecture in the form of a weighted graph. A heuristic algorithm was developed that maps process numbers onto the numbers of cluster computing cores in an acceptable time. Optimizing the mapping sped up calculations by 16-20% in tests with 300 to 4800 processes, as demonstrated on the well-scaling computing package CORAL QBOX.
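The idea of placing heavily communicating processes on closely connected cores can be illustrated with a minimal sketch. This is not the project's actual algorithm, which the description does not detail. It is one possible greedy heuristic under stated assumptions: the task graph is given as a symmetric traffic matrix between processes, the cluster graph as a symmetric distance matrix between cores (a lower value means a faster interconnect), and the quantity to minimize is the sum of traffic multiplied by distance over all process pairs. The names `Matrix`, `mappingCost` and `greedyMap` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical representation: dense symmetric weight matrices.
using Matrix = std::vector<std::vector<double>>;

// Cost of a mapping: sum over process pairs of
// traffic(i, j) * distance(core of i, core of j).
double mappingCost(const Matrix& traffic, const Matrix& distance,
                   const std::vector<int>& coreOf) {
    double cost = 0.0;
    for (std::size_t i = 0; i < traffic.size(); ++i)
        for (std::size_t j = i + 1; j < traffic.size(); ++j)
            cost += traffic[i][j] * distance[coreOf[i]][coreOf[j]];
    return cost;
}

// Greedy placement (an assumed heuristic, not the project's own):
// repeatedly take the unplaced process with the heaviest traffic to the
// processes placed so far and put it on the free core that adds the least cost.
std::vector<int> greedyMap(const Matrix& traffic, const Matrix& distance) {
    const std::size_t n = traffic.size();
    assert(distance.size() >= n);  // need at least one core per process
    std::vector<int> coreOf(n, -1);
    std::vector<bool> coreUsed(distance.size(), false);

    for (std::size_t step = 0; step < n; ++step) {
        // Select the next process; on the first step use total traffic.
        int bestProc = -1;
        double bestWeight = -1.0;
        for (std::size_t p = 0; p < n; ++p) {
            if (coreOf[p] != -1) continue;
            double w = 0.0;
            for (std::size_t q = 0; q < n; ++q)
                if (step == 0 || coreOf[q] != -1) w += traffic[p][q];
            if (w > bestWeight) { bestWeight = w; bestProc = static_cast<int>(p); }
        }
        // Select the free core with the smallest added cost.
        int bestCore = -1;
        double bestAdded = std::numeric_limits<double>::infinity();
        for (std::size_t c = 0; c < distance.size(); ++c) {
            if (coreUsed[c]) continue;
            double added = 0.0;
            for (std::size_t q = 0; q < n; ++q)
                if (coreOf[q] != -1)
                    added += traffic[bestProc][q] * distance[c][coreOf[q]];
            if (added < bestAdded) { bestAdded = added; bestCore = static_cast<int>(c); }
        }
        coreOf[bestProc] = bestCore;
        coreUsed[bestCore] = true;
    }
    return coreOf;
}
```

For example, with four processes in which pairs (0, 2) and (1, 3) exchange most of the data, and four cores split across two nodes, the sketch places each heavy pair on the same node instead of splitting it across nodes as the default rank order would. A greedy pass like this runs in polynomial time but gives no optimality guarantee.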

Specification

Client: Intel Inc., Nizhny Novgorod, Russia
Area of use: optimization of mapping non-uniform interacting processes onto the architecture of a computing cluster
Type (platform): Intel MPI
Technologies and algorithms in use: MPI, C/C++
More information:

The article on our website

Similar projects

analysis of the WRF, MILC and HPC Repast packages for efficient execution on the Petastream computing cluster, which consists of computing nodes with Xeon Phi coprocessors

optimization of matrix multiplication using the Strassen algorithm on systems built on the MIC architecture