A New Hardware Counters Based Thread Migration Strategy for NUMA Systems

TítuloA New Hardware Counters Based Thread Migration Strategy for NUMA Systems
AutoresOscar García Lorenzo, Rubén Laso Rodríguez, Tomás Fernández Pena, Jose Carlos Cabaleiro Domínguez, Francisco Fernández Rivera, Juan Ángel Lorenzo del Castillo
TipoCapítulo de libro
Fonte Parallel Processing and Applied Mathematics. PPAM 2019. Lecture Notes in Computer Science, Springer, Vol. 12044, pp. 205-216 , 2020.
ISBN978-3-030-43221-8
ISSN0302-9743
DOI10.1007/978-3-030-43222-5_18
AbstractMulticore NUMA systems present on-board memory hierarchies and communication networks that influence performance when executing shared memory parallel codes. Characterising this influence is complex, and understanding the effect of particular hardware configurations on different codes is of paramount importance. In this paper, monitoring information extracted from hardware counters at runtime is used to characterise the behaviour of each thread in the processes running in the system. This characterisation is given in terms of number of instructions per second, operational intensity, and latency of memory access. We propose to use all this information to guide a thread migration strategy that improves execution efficiency by increasing locality and affinity. Different configurations of NAS Parallel OpenMP benchmarks running concurrently on multicore systems were used to validate the benefits of the proposed thread migration strategy. Our proposal produces up to 25% improvement over the OS for heterogeneous workloads, under different and realistic locality and affinity scenarios.
Palabras chaveRoofline model, Hardware counters, Performance, Thread migration