diff options
| author | Robin Murphy <robin.murphy@arm.com> | 2018-04-24 16:25:47 +0100 |
|---|---|---|
| committer | Catalin Marinas <catalin.marinas@arm.com> | 2018-05-16 11:50:52 +0100 |
| commit | e75bef2a4fe259b779765a85589e92657d26fdc9 (patch) | |
| tree | dce158ed5840192161def2c998dd80f77307bb85 /tools/perf/scripts/python | |
| parent | arm64: Remove duplicate include (diff) | |
| download | linux-e75bef2a4fe259b779765a85589e92657d26fdc9.tar.gz linux-e75bef2a4fe259b779765a85589e92657d26fdc9.zip | |
arm64: Select ARCH_HAS_FAST_MULTIPLIER
It is probably safe to assume that all Armv8-A implementations have a
multiplier whose efficiency is comparable or better than a sequence of
three or so register-dependent arithmetic instructions. Select
ARCH_HAS_FAST_MULTIPLIER to get ever-so-slightly nicer codegen in the
few dusty old corners which care.
In a contrived benchmark calling hweight64() in a loop, this does indeed
turn out to be a small win overall, with no measurable impact on
Cortex-A57 but about 5% performance improvement on Cortex-A53.
Acked-by: Will Deacon <will.deacon@arm.com>
Signed-off-by: Robin Murphy <robin.murphy@arm.com>
Signed-off-by: Catalin Marinas <catalin.marinas@arm.com>
Diffstat (limited to 'tools/perf/scripts/python')
0 files changed, 0 insertions, 0 deletions
