Investigating the TLB behavior of high-end scientific applications on commodity microprocessors

Collin McCurdy, Alan L. Cox, Jeffrey Vetter

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

37 Scopus citations

Abstract

The floating point portion of the SPEC CPU suite and the HPC Challenge suite are widely recognized and utilized as benchmarks that represent scientific application behavior. In this work we show that while these benchmark suites may be representative of the cache behavior of production scientific applications, they do not accurately represent the TLB behavior of these applications. Furthermore, we demonstrate that the difference can have a significant impact on performance. In the first part of the paper we present results from implementation-independent trace-based simulations which demonstrate that benchmarks exhibit significantly different TLB behavior for a range of page sizes than a representative set of production applications. In the second part we validate these results on the AMD Opteron implementation of the x86 architecture, showing that false conclusions about choice of page size, drawn from benchmark performance, can result in performance degradations of up to nearly 50% for the production applications we investigated.

Original languageEnglish
Title of host publicationISPASS 2008 - IEEE International Symposium on Performance Analysis of Systems and Software
Pages95-104
Number of pages10
DOIs
StatePublished - 2008
EventIEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2008 - Austin, TX, United States
Duration: Apr 20 2008Apr 22 2008

Publication series

NameISPASS 2008 - IEEE International Symposium on Performance Analysis of Systems and Software

Conference

ConferenceIEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2008
Country/TerritoryUnited States
CityAustin, TX
Period04/20/0804/22/08

Fingerprint

Dive into the research topics of 'Investigating the TLB behavior of high-end scientific applications on commodity microprocessors'. Together they form a unique fingerprint.

Cite this