From Top-down Microarchitecture Analysis to Structured Performance Optimizations - presented by Ahmad Yasin

From Top-down Microarchitecture Analysis to Structured Performance Optimizations

Ahmad Yasin

Ahmad Yasin
From Top-down Microarchitecture Analysis to Structured Performance Optimizations
Ahmad Yasin
Ahmad Yasin
Intel Corporation

This session re-caps on the Top-down µarch Analysis (TMA) method - that is widely adopted in performance profiling tools, Microarchitecture challenges faced in out-of-order cores, and the abstraction that helped the method to be universally supported across CPU vendors (Intel as well as AMD & ARM).

Then, we show how the primary TMA metrics of Frontend Bound, Bad Speculation, Core Bound, Memory Bound and Retiring can be used to classify and direct exploitation of popular software optimizations.

The session closes with a use-case that got deployed in code generation of modern compilers. The use-case demonstrates how to mitigate Instruction Fetch Bandwidth issue through tuning of loop unrolling to speedup tight loops in recent wide-issue out-of-order cores.

References
  • 1.
    A. Yasin (2014) A Top-Down method for performance analysis and counters architecture.
  • 2.
    How TMA Addresses Challenges in Modern Servers and Enhancements Coming in IceLake. Scalable Tools Workshop, Solitude; Utah, July 2018
  • 3.
    perf-tools @ github - A collection of performance analysis tools, recipes, handy scripts, microbenchmarks & more
  • 4.
    A. Yasin et al. (2019) Tuning Performance via Metrics with Expectations. IEEE Computer Architecture Letters
  • 5.
    A. Yasin et al. (2019) A Metric-Guided Method for Discovering Impactful Features and Architectural Insights for Skylake-Based Processors. ACM Transactions on Architecture and Code Optimization
HiPEDS Centre logo
NANDA Workshop 2023
HiPEDS Centre
Cite as
A. Yasin (2023, September 11), From Top-down Microarchitecture Analysis to Structured Performance Optimizations
Share
Details
Listed event This event is open to all
Recorded Available to all
Video length 33:59
Q&A Now closed