commit	67275263b3b781a55ec4f297b5f42ffd783349ec
author	Simon Pilgrim <RKSimon@users.noreply.github.com>
	Thu, 23 Nov 2023 14:10:23 +0000
committer	GitHub <noreply@github.com>
	Thu, 23 Nov 2023 14:10:23 +0000
tree	87e0af9344a1f9bda0e823fd7266b4ca9c6b2b74
parent	aaae104e282505add432ccc76a4adb674087190f

[X86] X86DAGToDAGISel - attempt to merge XMM/YMM loads with YMM/ZMM loads of the same ptr (#73126)

If we are loading the same ptr at different vector widths, then reuse the larger load and just extract the low subvector.

Unlike the equivalent VBROADCAST_LOAD/SUBV_BROADCAST_LOAD folds, which can occur in DAG, this fold has to wait until DAGISel; otherwise we can hit infinite loops if constant folding recreates the original constant value.

This is mainly useful for better constant sharing.
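
As a rough illustration of the idea only, the following standalone C++ sketch models loads as plain structs and marks each narrower load that can reuse the low subvector of a wider load of the same pointer. It does not use LLVM's SelectionDAG API; `ToyLoad`, `mergeNarrowLoads` and the `LCPI0_*` labels are hypothetical names invented for this example, and the legality checks the real fold needs are not modelled.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Toy stand-in for a vector load node (hypothetical, not an LLVM type).
struct ToyLoad {
  std::string ptr;    // symbolic address, e.g. a constant-pool label
  unsigned widthBits; // 128 (XMM), 256 (YMM) or 512 (ZMM)
  int reuseFrom;      // index of the wider load to extract from, or -1
};

// For each load, find the widest other load of the same pointer. If it is
// strictly wider, record it so the narrow load can become "extract the low
// subvector" of that wider load instead of a separate memory access.
void mergeNarrowLoads(std::vector<ToyLoad> &loads) {
  for (std::size_t i = 0; i < loads.size(); ++i) {
    int best = -1;
    for (std::size_t j = 0; j < loads.size(); ++j) {
      if (i == j || loads[j].ptr != loads[i].ptr)
        continue; // only loads of the same pointer are candidates
      if (loads[j].widthBits <= loads[i].widthBits)
        continue; // the reused load must be strictly wider
      if (best < 0 || loads[j].widthBits > loads[best].widthBits)
        best = static_cast<int>(j);
    }
    loads[i].reuseFrom = best;
  }
}
```

In this model, an XMM and a YMM load of the same constant-pool entry collapse to the single YMM load, which is the constant-sharing effect described above.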

37 files changed:

llvm/lib/Target/X86/X86ISelDAGToDAG.cpp
llvm/test/CodeGen/X86/avx512-regcall-Mask.ll
llvm/test/CodeGen/X86/bfloat.ll
llvm/test/CodeGen/X86/bitcast-int-to-vector-bool-sext.ll
llvm/test/CodeGen/X86/bitcast-int-to-vector-bool-zext.ll
llvm/test/CodeGen/X86/broadcast-elm-cross-splat-vec.ll
llvm/test/CodeGen/X86/constant-pool-sharing.ll
llvm/test/CodeGen/X86/insert-into-constant-vector.ll
llvm/test/CodeGen/X86/midpoint-int-vec-512.ll
llvm/test/CodeGen/X86/pr57340.ll
llvm/test/CodeGen/X86/splat-for-size.ll
llvm/test/CodeGen/X86/subvector-broadcast.ll
llvm/test/CodeGen/X86/vec_fabs.ll
llvm/test/CodeGen/X86/vec_int_to_fp.ll
llvm/test/CodeGen/X86/vector-fshl-256.ll
llvm/test/CodeGen/X86/vector-fshl-512.ll
llvm/test/CodeGen/X86/vector-fshl-rot-256.ll
llvm/test/CodeGen/X86/vector-fshr-256.ll
llvm/test/CodeGen/X86/vector-fshr-512.ll
llvm/test/CodeGen/X86/vector-fshr-rot-256.ll
llvm/test/CodeGen/X86/vector-interleaved-load-i16-stride-7.ll
llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-4.ll
llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-5.ll
llvm/test/CodeGen/X86/vector-interleaved-load-i8-stride-6.ll
llvm/test/CodeGen/X86/vector-interleaved-store-i16-stride-3.ll
llvm/test/CodeGen/X86/vector-interleaved-store-i16-stride-5.ll
llvm/test/CodeGen/X86/vector-interleaved-store-i16-stride-7.ll
llvm/test/CodeGen/X86/vector-interleaved-store-i8-stride-5.ll
llvm/test/CodeGen/X86/vector-interleaved-store-i8-stride-6.ll
llvm/test/CodeGen/X86/vector-interleaved-store-i8-stride-7.ll
llvm/test/CodeGen/X86/vector-sext.ll
llvm/test/CodeGen/X86/vector-shuffle-combining-avx.ll
llvm/test/CodeGen/X86/viabs.ll
llvm/test/CodeGen/X86/vselect-avx.ll
llvm/test/CodeGen/X86/x86-interleaved-access.ll
llvm/test/CodeGen/X86/zero_extend_vector_inreg_of_broadcast.ll
llvm/test/CodeGen/X86/zero_extend_vector_inreg_of_broadcast_from_memory.ll
