With the -enable-bat option enabled, the effect of a second round of optimization is weakened

In our environment we cannot deploy a separate instance just for perf sampling, so we turn on the -enable-bat option to do continuous optimization.
However, we find that BoltOptBinary2 performs worse than BoltOptBinary1, even though the original code has not changed at all. Through analysis, we found that compared with BoltOptBinary1, BoltOptBinary2's iTLB-miss rate and L1-icache-load-misses rate have both increased.

OriginBinary -> [perf record] -> perf.data -> [bolt transform, -enable-bat] -> BoltOptBinary1

BoltOptBinary1 -> [perf record] -> perf.data -> [bolt transform applied to OriginBinary] -> BoltOptBinary2
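For reference, the two passes above can be sketched with the standard perf/perf2bolt/llvm-bolt tooling. This is only an illustration of our workflow, not our exact commands: the binary names, the workload placeholder, and the reordering flags are assumptions; the relevant part is `-enable-bat` and that pass 2 profiles BoltOptBinary1 but feeds the translated profile back into OriginBinary.

```shell
# Pass 1: profile the original binary and BOLT it with BAT enabled.
perf record -e cycles:u -j any,u -o perf1.data -- ./OriginBinary <workload>
perf2bolt -p perf1.data -o perf1.fdata ./OriginBinary
llvm-bolt ./OriginBinary -o ./BoltOptBinary1 -data=perf1.fdata \
    -reorder-blocks=ext-tsp -reorder-functions=hfsort -enable-bat

# Pass 2: profile the BOLTed binary in production. Because BoltOptBinary1
# carries a BAT section, perf2bolt translates its samples back to
# OriginBinary addresses, and we re-optimize OriginBinary from scratch.
perf record -e cycles:u -j any,u -o perf2.data -- ./BoltOptBinary1 <workload>
perf2bolt -p perf2.data -o perf2.fdata ./BoltOptBinary1
llvm-bolt ./OriginBinary -o ./BoltOptBinary2 -data=perf2.fdata \
    -reorder-blocks=ext-tsp -reorder-functions=hfsort -enable-bat
```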

We are curious: is this expected? Does -enable-bat make some optimizations less effective in our scenario?

Thanks for the report.
This is somewhat expected: we lose some accuracy when mapping samples back to the input binary across certain transformations (NOP removal, indirect call promotion, etc.).
Can you share the BOLT log from the last step? What profile staleness do you see?