aarch64: Adjust SVE vector costs for NVIDIA Olympus

After further analysis, adjust the SVE aligned load cost
(align_load_cost) for NVIDIA Olympus from 4 to 6 to more accurately
reflect the core's Software Optimization Guide (SWOG).
This leads to more sensible Advanced SIMD vs SVE autovectorization
decisions.

Bootstrapped and tested on aarch64-none-linux-gnu.

Signed-off-by: Kyrylo Tkachov <ktkachov@nvidia.com>

gcc/ChangeLog

	* config/aarch64/tuning_models/olympus.h (olympus_sve_vector_cost):
	Set align_load_cost to 6.
Kyrylo Tkachov
2026-02-06 02:34:29 -08:00
parent e55de74d50
commit 4d2af07350

@@ -77,7 +77,7 @@ static sve_vec_cost olympus_sve_vector_cost =
   2, /* store_elt_extra_cost */
   8, /* vec_to_scalar_cost */
   4, /* scalar_to_vec_cost */
-  4, /* align_load_cost */
+  6, /* align_load_cost */
   6, /* unalign_load_cost */
   1, /* unalign_store_cost */
   1 /* store_cost */
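
A load-heavy loop is one way to observe the kind of Advanced SIMD vs SVE decision this cost feeds into. The kernel below is a minimal sketch and is not part of the patch: the function name `add3` and its signature are hypothetical. It does three loads and one store per element, so the relative load cost carries noticeable weight in the vectorizer's estimate.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical test kernel (not from the patch): three vector loads and
   one vector store per iteration once vectorized.  The restrict
   qualifiers tell the compiler the arrays do not overlap, so no runtime
   alias checks are needed.  */
void
add3 (int32_t *restrict dst, const int32_t *restrict a,
      const int32_t *restrict b, const int32_t *restrict c, size_t n)
{
  for (size_t i = 0; i < n; i++)
    dst[i] = a[i] + b[i] + c[i];
}
```

Assuming a GCC build that accepts `-mcpu=olympus`, compiling with `-O3 -mcpu=olympus -fopt-info-vec -S` before and after the change and checking whether the loop body uses SVE `z` registers or Advanced SIMD `v` registers should show which vectorization mode was chosen. No particular outcome is claimed for this loop.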