Please use this identifier to cite or link to this item: https://doi.org/10.1109/CGO.2004.1281683
DC FieldValue
dc.titleStatic identification of delinquent loads
dc.contributor.authorPanait, V.-M.
dc.contributor.authorSasturkar, A.
dc.contributor.authorWong, W.-F.
dc.date.accessioned2013-07-04T08:45:31Z
dc.date.available2013-07-04T08:45:31Z
dc.date.issued2004
dc.identifier.citationPanait, V.-M., Sasturkar, A., Wong, W.-F. (2004). Static identification of delinquent loads. International Symposium on Code Generation and Optimization, CGO : 303-314. ScholarBank@NUS Repository. https://doi.org/10.1109/CGO.2004.1281683
dc.identifier.isbn0769521029
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/42186
dc.description.abstractThe effective use of processor caches is crucial to the performance of applications. It has been shown that cache misses are not evenly distributed throughout a program. In applications running on RISC-style processors, a small number of delinquent load instructions are responsible for most of the cache misses. Identification of delinquent loads is the key to the success of many cache optimization and prefetching techniques. In this paper, we propose a method for identifying delinquent loads that can be implemented at compile time. Our experiments over eighteen benchmarks from the SPEC suite shows that our proposed scheme is stable across benchmarks, inputs, and cache structures, identifying an average of 10% of the total number of loads in the benchmarks we tested that account for over 90% of all data cache misses. As far as we know, this is the first time a technique for static delinquent load identification with such a level of precision and coverage has been reported. While comparable techniques can also identify load instructions that cover 90% of all data cache misses, they do so by selecting over 50% of all load instructions in the code, resulting in a high number of false positives. If basic block profiling is used in conjunction with our heuristic, then our results show that it is possible to pin down just 1.3% of the load instructions that account for 82% of all data cache misses.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1109/CGO.2004.1281683
dc.sourceScopus
dc.typeConference Paper
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.doi10.1109/CGO.2004.1281683
dc.description.sourcetitleInternational Symposium on Code Generation and Optimization, CGO
dc.description.page303-314
dc.identifier.isiut000189495000024
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.