teak-llvm

mirror of https://github.com/Gericom/teak-llvm.git synced 2025-06-19 19:45:40 -04:00

History

Changpeng Fang 2531535984 AMDGPU: Implement FDIV optimizations in AMDGPUCodeGenPrepare Summary: RCP has the accuracy limit. If FDIV fpmath require high accuracy rcp may not meet the requirement. However, in DAG lowering, fpmath information gets lost, and thus we may generate either inaccurate rcp related computation or slow code for fdiv. In patch implements fdiv optimizations in the AMDGPUCodeGenPrepare, which could exactly know !fpmath. FastUnsafeRcpLegal: We determine whether it is legal to use rcp based on unsafe-fp-math, fast math flags, denormals and fpmath accuracy request. RCP Optimizations: 1/x -> rcp(x) when fast unsafe rcp is legal or fpmath >= 2.5ULP with denormals flushed. a/b -> a*rcp(b) when fast unsafe rcp is legal. Use fdiv.fast: a/b -> fdiv.fast(a, b) when RCP optimization is not performed and fpmath >= 2.5ULP with denormals flushed. 1/x -> fdiv.fast(1,x) when RCP optimization is not performed and fpmath >= 2.5ULP with denormals. Reviewers: arsenm Differential Revision: https://reviews.llvm.org/D71293		2020-01-23 16:57:43 -08:00
..
GlobalISel	AMDGPU/GlobalISel: Fix RegBanKSelect for llvm.amdgcn.exp.compr	2020-01-23 13:30:46 -08:00
32-bit-local-address-space.ll
accvgpr-copy.mir
add3.ll
add_i1.ll
add_i64.ll
add_i128.ll
add_shl.ll
add-debug.ll
add.i16.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
add.ll
add.v2i16.ll
addrspacecast-captured.ll
addrspacecast-constantexpr.ll
addrspacecast.ll
adjust-writemask-invalid-copy.ll
agpr-register-count.ll	[AMDGPU] separate accounting for agprs	2019-10-02 00:26:58 +00:00
alignbit-pat.ll
alloca.ll
always-uniform.ll
amdgcn-ieee.ll
amdgcn.bitcast.ll
amdgcn.private-memory.ll	AMDGPU: Change boolean content type to 0 or 1	2019-11-15 13:43:47 +05:30
amdgpu-alias-analysis.ll
amdgpu-codegenprepare-fdiv.ll	AMDGPU: Implement FDIV optimizations in AMDGPUCodeGenPrepare	2020-01-23 16:57:43 -08:00
amdgpu-codegenprepare-fold-binop-select.ll	AMDGPU: Check for other uses when looking through casted select	2020-01-23 11:31:24 -05:00
amdgpu-codegenprepare-i16-to-i32.ll	AMDGPU: Generate test checks	2020-01-20 20:03:45 -05:00
amdgpu-codegenprepare-idiv.ll	AMDGPU: Don't create weird sized integers	2020-01-20 20:02:54 -05:00
amdgpu-codegenprepare-mul24.ll
amdgpu-function-calls-option.ll
amdgpu-inline.ll	[AMDGPU] Tune inlining parameters for AMDGPU target (part 2)	2019-11-19 16:33:16 +03:00
amdgpu-mul24-knownbits.ll	Fix for AMDGPU MUL_I24 known bits calculation	2019-12-16 10:25:57 +00:00
amdgpu-shader-calling-convention.ll
amdgpu-unroll-threshold.ll	[AMDGPU] Add attribute for target loop unroll threshold default	2019-11-21 09:47:28 +00:00
amdgpu.private-memory.ll	AMDGPU: Switch backend default max workgroup size to 1024	2019-11-13 07:11:02 +05:30
amdgpu.work-item-intrinsics.deprecated.ll
amdhsa-trap-num-sgprs.ll
amdpal_scratch_mergedshader.ll
amdpal-cs.ll
amdpal-es.ll
amdpal-gs.ll
amdpal-hs.ll
amdpal-ls.ll
amdpal-msgpack-cs.ll
amdpal-msgpack-es.ll
amdpal-msgpack-gs.ll
amdpal-msgpack-hs.ll
amdpal-msgpack-ls.ll
amdpal-msgpack-ps.ll
amdpal-msgpack-psenable.ll
amdpal-msgpack-vs.ll
amdpal-ps.ll
amdpal-psenable.ll
amdpal-vs.ll
amdpal.ll	AMDGPU: Update tests to use modern buffer intrinsics	2020-01-16 13:49:43 -05:00
and_or.ll
and-gcn.ll
and.ll
andorbitset.ll
andorn2.ll
andorxorinvimm.ll
annotate-kernel-features-hsa-call.ll
annotate-kernel-features-hsa.ll
annotate-kernel-features.ll
anonymous-gv.ll
any_extend_vector_inreg.ll
anyext.ll
are-loads-from-same-base-ptr.ll
array-ptr-calc-i32.ll	AMDGPU: Switch backend default max workgroup size to 1024	2019-11-13 07:11:02 +05:30
array-ptr-calc-i64.ll
ashr.v2i16.ll
at-least-one-def-value-assert.mir	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
atomic_cmp_swap_local.ll
atomic_load_add.ll
atomic_load_local.ll
atomic_load_sub.ll
atomic_optimizations_buffer.ll	AMDGPU: Update more tests to use modern buffer intrinsics	2020-01-16 14:29:38 -05:00
atomic_optimizations_global_pointer.ll
atomic_optimizations_local_pointer.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
atomic_optimizations_pixelshader.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
atomic_optimizations_raw_buffer.ll
atomic_optimizations_struct_buffer.ll
atomic_store_local.ll
atomicrmw-nand.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
attr-amdgpu-flat-work-group-size-v3.ll	AMDGPU Reduce reported maximum group size to 1024	2019-11-13 06:34:28 +05:30
attr-amdgpu-flat-work-group-size.ll	AMDGPU Reduce reported maximum group size to 1024	2019-11-13 06:34:28 +05:30
attr-amdgpu-num-sgpr.ll	[AMDGPU] Remove -amdgpu-spill-sgpr-to-smem.	2019-10-18 21:48:22 +00:00
attr-amdgpu-num-vgpr.ll
attr-amdgpu-waves-per-eu.ll
attr-unparseable.ll
barrier-elimination.ll
basic-branch.ll	[AMDGPU] Remove -amdgpu-spill-sgpr-to-smem.	2019-10-18 21:48:22 +00:00
basic-call-return.ll
basic-loop.ll
bfe_uint.ll
bfe-combine.ll
bfe-patterns.ll
bfi_int.ll
bfm.ll
big_alu.ll
bitcast-constant-to-vector.ll	[MachineScheduler] Reduce reordering due to mem op clustering	2020-01-14 19:19:02 +00:00
bitcast-v4f16-v4i16.ll
bitcast-vector-extract.ll
bitreverse-inline-immediates.ll
bitreverse.ll	[AMDGPU] SIFoldOperands should not fold register acrocc the EXEC definition	2019-09-30 15:31:17 +00:00
br_cc.f16.ll
branch-condition-and.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
branch-relax-bundle.ll
branch-relax-spill.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
branch-relaxation-debug-info.ll
branch-relaxation-inst-size-gfx10.ll
branch-relaxation.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
branch-uniformity.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
break-smem-soft-clauses.mir
break-vmem-soft-clauses.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
bswap.ll	[codegen,amdgpu] Enhance MIR DIE and re-arrange it for AMDGPU.	2020-01-14 19:26:15 -05:00
buffer-intrinsics-mmo-offsets.ll	Revert "Revert "[MIR] Target specific MIR formating and parsing""	2020-01-08 20:03:29 -08:00
buffer-schedule.ll
bug-vopc-commute.ll
build_vector.ll
build-vector-insert-elt-infloop.ll
build-vector-packed-partial-undef.ll
bundle-latency.mir	[AMDGPU] Model distance to instruction in bundle	2020-01-14 01:18:59 -08:00
byval-frame-setup.ll	[MachineScheduler] Reduce reordering due to mem op clustering	2020-01-14 19:19:02 +00:00
call_fs.ll
call-argument-types.ll	[MachineScheduler] Reduce reordering due to mem op clustering	2020-01-14 19:19:02 +00:00
call-constant.ll	AMDGPU: Don't error on calls to null or undef	2019-10-20 07:46:04 +00:00
call-constexpr.ll
call-encoding.ll
call-graph-register-usage.ll	[AMDGPU] Add handling of 160 bit registers in analyzeResourceUsage	2019-11-06 15:47:32 -08:00
call-preserved-registers.ll
call-return-types.ll
call-skip.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
call-to-kernel-undefined.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
call-to-kernel.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
call-waitcnt.ll
call-waw-waitcnt.mir	AMDGPU: Avoid overwriting saved PC	2019-10-28 10:02:22 -07:00
callee-frame-setup.ll
callee-special-input-sgprs.ll
callee-special-input-vgprs.ll
calling-conventions.ll
captured-frame-index.ll
cayman-loop-bug.ll
cc-sgpr-limit.ll
cc-sgpr-over-limit.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
cf_end.ll
cf-loop-on-constant.ll
cf-stack-bug.ll
cgp-addressing-modes-flat.ll
cgp-addressing-modes.ll	AMDGPU: Split flat offsets that don't fit in DAG	2019-10-20 17:34:44 +00:00
cgp-bitfield-extract.ll
chain-hi-to-lo.ll	[MachineScheduler] Reduce reordering due to mem op clustering	2020-01-14 19:19:02 +00:00
clamp-modifier.ll
clamp-omod-special-case.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
clamp.ll
cluster-flat-loads-postra.mir
cluster-flat-loads.mir
cndmask-no-def-vcc.ll
coalescer_distribute.ll
coalescer_remat.ll
coalescer-extend-pruned-subrange.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
coalescer-identical-values-undef.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
coalescer-subranges-another-copymi-not-live.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
coalescer-subranges-another-prune-error.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
coalescer-subranges-prune-kill-copy.mir
coalescer-subreg-join.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
coalescer-subregjoin-fullcopy.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
coalescer-with-subregs-bad-identical.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
coalescing-with-subregs-in-loop-bug.mir
code-object-v3.ll
codegen-prepare-addrmode-sext.ll
collapse-endcf2.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
collapse-endcf-broken.mir
collapse-endcf.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
collapse-endcf.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
combine_vloads.ll
combine-and-sext-bool.ll
combine-cond-add-sub.ll	Revert "AMDGPU: Try to commute sub of boolean ext"	2019-12-13 12:49:06 +00:00
combine-ftrunc.ll
comdat.ll
commute_modifiers.ll
commute-compares.ll
commute-shifts.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
complex-folding.ll
computeKnownBits-scalar-to-vector-crash.ll
computeNumSignBits-mul.ll	[AMDGPU] Update autogenerated checks	2019-12-17 16:48:02 +00:00
concat_vectors.ll
constant-address-space-32bit.ll
constant-fold-imm-immreg.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
constant-fold-mi-operands.ll
control-flow-fastregalloc.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
control-flow-optnone.ll
convergent-inlineasm.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
copy-illegal-type.ll	[codegen,amdgpu] Enhance MIR DIE and re-arrange it for AMDGPU.	2020-01-14 19:26:15 -05:00
copy-to-reg.ll
couldnt-join-subrange-3.mir	AMDGPU: Remove IR section from MIR test	2020-01-16 13:49:44 -05:00
cross-block-use-is-not-abi-copy.ll
cse-phi-incoming-val.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
csr-gfx10.ll
ctlz_zero_undef.ll
ctlz.ll
ctpop16.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
ctpop64.ll
ctpop.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
cttz_zero_undef.ll
cube.ll
cvt_f32_ubyte.ll	[DAG] Fold extract_vector_elt (scalar_to_vector), K to undef (K != 0)	2020-01-21 10:58:30 +00:00
cvt_flr_i32_f32.ll
cvt_rpi_i32_f32.ll
dag-divergence.ll
dagcomb-shuffle-vecextend-non2.ll
dagcombine-reassociate-bug.ll
dagcombine-select.ll	AMDGPU: Look through casted selects to constant fold bin ops	2020-01-22 10:16:39 -05:00
dagcombine-setcc-select.ll
dagcombiner-bug-illegal-vec4-int-to-fp.ll
dce-disjoint-intervals.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
dead_copy.mir
dead-lane.mir
dead-machine-elim-after-dead-lane.ll	[codegen,amdgpu] Enhance MIR DIE and re-arrange it for AMDGPU.	2020-01-14 19:26:15 -05:00
debug-value2.ll
debug-value-scheduler-crash.mir
debug-value.ll
debug.ll
default-fp-mode.ll
detect-dead-lanes.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
directive-amdgcn-target.ll
disable_form_clauses.ll
disconnected-predset-break-bug.ll
div_i128.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
diverge-extra-formal-args.ll
diverge-interp-mov-lower.ll
diverge-switch-default.ll
divergence-at-use.ll
divergent-branch-uniform-condition.ll	[AMDGPU] SIRemoveShortExecBranches should not remove branches exiting loops	2020-01-22 13:18:40 +09:00
divrem24-assume.ll	[IRBuilder] Update IRBuilder::CreateFNeg(...) to return a UnaryOperator	2019-10-14 15:35:01 +00:00
dpp_combine.ll	[AMDGPU] link dpp pseudos and real instructions on gfx10	2019-10-11 22:03:36 +00:00
dpp_combine.mir	[AMDGPU] Disallow dpp combining for dpp instructions without Src2 operand (when Src2 is required)	2019-10-25 21:30:37 +03:00
drop-mem-operand-move-smrd.ll
ds_read2_offset_order.ll
ds_read2_superreg.ll
ds_read2.ll	[MachineScheduler] Reduce reordering due to mem op clustering	2020-01-14 19:19:02 +00:00
ds_read2st64.ll
ds_write2.ll
ds_write2st64.ll
ds-combine-large-stride.ll
ds-combine-with-dependence.ll
ds-negative-offset-addressing-mode-loop.ll	AMDGPU: Increase vcc liveness scan threshold	2019-10-20 17:44:17 +00:00
ds-sub-offset.ll
dynamic_stackalloc.ll
early-if-convert-cost.ll
early-if-convert.ll
early-inline-alias.ll
early-inline.ll
early-tailduplicator-nophis.mir	TailDuplication: Clear NoPHIs property	2019-12-27 14:06:31 -05:00
elf-header-flags-mach.ll
elf-header-flags-sram-ecc.ll
elf-header-flags-xnack.ll
elf-header-osabi.ll
elf-notes.ll
elf.ll
elf.metadata.ll
elf.r600.ll
else.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
empty-function.ll
enable-no-signed-zeros-fp-math.ll
endcf-loop-header.ll
endpgm-dce.mir
enqueue-kernel.ll
exceed-max-sgprs.ll
extend-bit-ops-i16.ll
extload-align.ll
extload-private.ll
extload.ll
extract_subvector_vec4_vec3.ll	Revert "Revert "[MIR] Target specific MIR formating and parsing""	2020-01-08 20:03:29 -08:00
extract_vector_dynelt.ll	AMDGPU: Change boolean content type to 0 or 1	2019-11-15 13:43:47 +05:30
extract_vector_elt-f16.ll
extract_vector_elt-f64.ll
extract_vector_elt-i8.ll
extract_vector_elt-i16.ll
extract_vector_elt-i64.ll
extract-lowbits.ll
extract-subvector-equal-length.ll	AMDGPU: Remove optnone from a test	2019-10-19 01:34:59 +00:00
extract-subvector.ll	[DAGCombine] Replace `getIntPtrConstant()` with `getVectorIdxTy()`.	2020-01-14 17:03:05 -05:00
extract-vector-elt-build-vector-combine.ll
extractelt-to-trunc.ll
fabs.f16.ll
fabs.f64.ll
fabs.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
fadd64.ll
fadd-fma-fmul-combine.ll	[DAGCombiner] Check term use before applying aggressive FSUB optimisations	2019-12-23 09:37:58 +09:00
fadd.f16.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
fadd.ll
fcanonicalize-elimination.ll
fcanonicalize.f16.ll
fcanonicalize.ll
fceil64.ll
fceil.ll
fcmp64.ll
fcmp-cnd.ll
fcmp-cnde-int-args.ll
fcmp.f16.ll
fcmp.ll
fconst64.ll
fcopysign.f16.ll
fcopysign.f32.ll
fcopysign.f64.ll
fdiv32-to-rcp-folding.ll	AMDGPU: Implement FDIV optimizations in AMDGPUCodeGenPrepare	2020-01-23 16:57:43 -08:00
fdiv.f16.ll
fdiv.f64.ll
fdiv.ll	AMDGPU: Implement FDIV optimizations in AMDGPUCodeGenPrepare	2020-01-23 16:57:43 -08:00
fdot2.ll
fence-barrier.ll	AMDGPU: Increase vcc liveness scan threshold	2019-10-20 17:44:17 +00:00
fetch-limits.r600.ll
fetch-limits.r700+.ll
fexp.ll	[AMDGPU] Clean up update_llc_test_checks CodeGen tests	2019-10-24 17:35:33 -04:00
ffloor.f64.ll
ffloor.ll
fix-sgpr-copies.mir	AMDGPU: Fix infinite searches in SIFixSGPRCopies	2019-10-15 19:59:45 +00:00
fix-vgpr-copies.mir
fix-wwm-vgpr-copy.ll
flat_atomics_i64.ll
flat_atomics.ll
flat-address-space.ll	[AMDGPU] allow multi-dword flat scratch access since GFX9	2020-01-17 10:47:03 -08:00
flat-error-unsupported-gpu-hsa.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
flat-for-global-subtarget-feature.ll
flat-load-clustering.mir
flat-offset-bug.ll
flat-scratch-reg.ll
floor.ll
fma-combine.ll
fma.f64.ll	[AMDGPU] Improve fma.f64 test. NFC.	2019-09-25 18:50:34 +00:00
fma.ll
fmac.sdwa.ll
fmad.ll
fmax3.f64.ll
fmax3.ll
fmax_legacy.f16.ll
fmax_legacy.f64.ll
fmax_legacy.ll
fmax.ll
fmaxnum.f64.ll
fmaxnum.ll
fmaxnum.r600.ll
fmed3.ll
fmin3.ll
fmin_fmax_legacy.amdgcn.ll
fmin_legacy.f16.ll
fmin_legacy.f64.ll
fmin_legacy.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
fmin.ll
fminnum.f64.ll
fminnum.ll
fminnum.r600.ll
fmul64.ll
fmul-2-combine-multi-use.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
fmul.f16.ll
fmul.ll
fmuladd.f16.ll	[AMDGPU] Fix test checks	2019-10-07 10:57:41 +00:00
fmuladd.f32.ll
fmuladd.f64.ll
fmuladd.v2f16.ll
fnearbyint.ll
fneg-combines.ll	AMDGPU: Implement FDIV optimizations in AMDGPUCodeGenPrepare	2020-01-23 16:57:43 -08:00
fneg-combines.si.ll
fneg-fabs.f16.ll
fneg-fabs.f64.ll
fneg-fabs.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
fneg-fold-legalize-dag-increase-insts.ll	DAG: Stop trying to fold FP -(x-y) -> y-x in getNode with nsz	2019-12-31 22:49:51 -05:00
fneg.f16.ll
fneg.f64.ll
fneg.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
fold_acc_copy_into_valu.mir	[AMDGPU] Fix illegal agpr use by VALU	2019-10-02 23:23:46 +00:00
fold-cndmask.mir
fold-fi-mubuf.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
fold-fi-operand-shrink.mir
fold-fmul-to-neg-abs.ll
fold-imm-copy.mir	[AMDGPU] Skip additional folding on the same operand.	2019-10-24 11:30:22 -04:00
fold-imm-f16-f32.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
fold-immediate-operand-shrink-with-carry.mir
fold-immediate-operand-shrink.mir
fold-immediate-output-mods.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
fold-implicit-operand.mir
fold-multiple.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
fold-operands-order.mir
fold-operands-remove-m0-redef.mir	AMDGPU: Erase redundant redefs of m0 in SIFoldOperands	2019-10-21 19:53:46 +00:00
fold-over-exec.mir	[AMDGPU] SIFoldOperands should not fold register acrocc the EXEC definition	2019-09-30 15:31:17 +00:00
fold-readlane.mir
fold-reload-into-m0.mir	AMDGPU: Disallow spill folding with m0 copies	2019-10-30 14:56:33 -07:00
fold-sgpr-copy.mir	[AMDGPU] Enable SGPR copy folding	2019-10-25 15:08:30 -07:00
fold-sgpr-multi-imm.mir	AMDGPU: Avoid folding 2 constant operands into an SALU operation	2019-12-04 10:25:34 +00:00
fold-vgpr-copy.mir	[AMDGPU] Allow folding of sgpr to vgpr copy	2019-10-23 18:42:48 -07:00
force-alwaysinline-lds-global-address-codegen.ll
force-alwaysinline-lds-global-address.ll
fp16_to_fp32.ll
fp16_to_fp64.ll
fp32_to_fp16.ll
fp_to_sint.f64.ll
fp_to_sint.ll
fp_to_uint.f64.ll
fp_to_uint.ll
fp-atomic-to-s_denormmode.mir
fp-classify.ll
fpext-free.ll
fpext.f16.ll
fpext.ll
fpow.ll	AMDGPU: Fix legalizing f16 fpow	2020-01-06 17:21:51 -05:00
fptosi.f16.ll
fptoui.f16.ll
fptrunc.f16.ll
fptrunc.ll	[SelectionDAG] Add support for FP_ROUND in WidenVectorOperand.	2019-10-30 15:18:21 +00:00
fract.f64.ll
fract.ll
frame-index-elimination.ll	[MachineScheduler] Reduce reordering due to mem op clustering	2020-01-14 19:19:02 +00:00
frame-lowering-entry-all-sgpr-used.mir
frame-lowering-fp-adjusted.mir	Migrate function attribute "no-frame-pointer-elim" to "frame-pointer"="all" as cleanups after D56351	2019-12-24 15:57:33 -08:00
frem.ll
fsqrt.f64.ll
fsqrt.ll
fsub64.ll
fsub.f16.ll
fsub.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
ftrunc.f64.ll
ftrunc.ll
function-args.ll
function-call-relocs.ll
function-returns.ll
gds-atomic.ll
gep-address-space.ll
gfx10-vop-literal.ll
gfx902-without-xnack.ll
global_atomics_i64.ll	AMDGPU: Split flat offsets that don't fit in DAG	2019-10-20 17:34:44 +00:00
global_atomics.ll	AMDGPU: Split flat offsets that don't fit in DAG	2019-10-20 17:34:44 +00:00
global_smrd_cfg.ll
global_smrd.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
global-atomics-fp.ll	AMDGPU: Select global atomicrmw fadd	2019-11-06 16:06:38 -08:00
global-constant.ll	AMDGPU/R600: Emit rodata in text segment	2020-01-22 14:31:51 -05:00
global-directive.ll
global-extload-i16.ll
global-load-store-atomics.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
global-saddr.ll	AMDGPU: Split flat offsets that don't fit in DAG	2019-10-20 17:34:44 +00:00
global-smrd-unknown.ll
global-variable-relocs.ll
gv-const-addrspace.ll
gv-offset-folding.ll
gws-hazards.mir
half.ll
hazard-buffer-store-v-interp.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
hazard-hidden-bundle.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
hazard-in-bundle.mir
hazard-inlineasm.mir
hazard-kill.mir
hazard.mir
hoist-cond.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
hsa-default-device.ll
hsa-fp-mode.ll
hsa-func-align.ll
hsa-func.ll
hsa-globals.ll
hsa-group-segment.ll
hsa-metadata-deduce-ro-arg-v3.ll
hsa-metadata-deduce-ro-arg.ll
hsa-metadata-enqueue-kernel-v3.ll
hsa-metadata-enqueue-kernel.ll
hsa-metadata-from-llvm-ir-full-v3.ll	[AMDGPU] add support for hostcall buffer pointer as hidden kernel argument	2019-11-20 15:53:55 +05:30
hsa-metadata-from-llvm-ir-full.ll	[AMDGPU] add support for hostcall buffer pointer as hidden kernel argument	2019-11-20 15:53:55 +05:30
hsa-metadata-hidden-args-v3.ll
hsa-metadata-hidden-args.ll
hsa-metadata-hostcall-absent-v3.ll	[AMDGPU] add support for hostcall buffer pointer as hidden kernel argument	2019-11-20 15:53:55 +05:30
hsa-metadata-hostcall-absent.ll	[AMDGPU] add support for hostcall buffer pointer as hidden kernel argument	2019-11-20 15:53:55 +05:30
hsa-metadata-hostcall-present-v3.ll	[AMDGPU] add support for hostcall buffer pointer as hidden kernel argument	2019-11-20 15:53:55 +05:30
hsa-metadata-hostcall-present.ll	[AMDGPU] add support for hostcall buffer pointer as hidden kernel argument	2019-11-20 15:53:55 +05:30
hsa-metadata-images-v3.ll
hsa-metadata-images.ll
hsa-metadata-invalid-ocl-version-1-v3.ll
hsa-metadata-invalid-ocl-version-1.ll
hsa-metadata-invalid-ocl-version-2-v3.ll
hsa-metadata-invalid-ocl-version-2.ll
hsa-metadata-invalid-ocl-version-3-v3.ll
hsa-metadata-invalid-ocl-version-3.ll
hsa-metadata-kernel-code-props-v3.ll	AMDGPU: Switch backend default max workgroup size to 1024	2019-11-13 07:11:02 +05:30
hsa-metadata-kernel-code-props.ll	AMDGPU: Switch backend default max workgroup size to 1024	2019-11-13 07:11:02 +05:30
hsa-metadata-wavefrontsize.ll
hsa-note-no-func.ll
hsa.ll
huge-private-buffer.ll
i1_copy_phi_with_phi_incoming_value.mir	[AMDGPU] Fix Vreg_1 PHI lowering in SILowerI1Copies.	2019-10-26 14:37:45 +05:30
i1-copies-rpo.mir
i1-copy-from-loop.ll	AMDGPU: Update tests to use modern buffer intrinsics	2020-01-16 13:49:43 -05:00
i1-copy-implicit-def.ll
i1-copy-phi-uniform-branch.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
i1-copy-phi.ll
i8-to-double-to-float.ll
icmp64.ll
icmp-select-sete-reverse-args.ll
icmp.i16.ll
idiv-licm.ll
idot2.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
idot4s.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
idot4u.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
idot8s.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
idot8u.ll	[codegen,amdgpu] Enhance MIR DIE and re-arrange it for AMDGPU.	2020-01-14 19:26:15 -05:00
illegal-sgpr-to-vgpr-copy.ll	[AMDGPU] Fix illegal agpr use by VALU	2019-10-02 23:23:46 +00:00
image_ls_mipmap_zero.ll
image-attributes.ll
image-load-d16-tfe.ll	AMDGPU: Fix interaction of tfe and d16	2020-01-22 09:26:17 -05:00
image-resource-id.ll
image-schedule.ll
img-nouse-adjust.ll
imm16.ll
imm.ll
immv216.ll
implicit-def-muse.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
indirect-addressing-si-gfx9.ll
indirect-addressing-si-noopt.ll	[SDAG] fold extract_vector_elt with undef index	2019-10-25 19:27:26 -04:00
indirect-addressing-si-pregfx9.ll
indirect-addressing-si.ll
indirect-addressing-term.ll	AMDGPU: Stop adding m0 implicit def to SGPR spills	2019-10-21 19:42:29 +00:00
indirect-private-64.ll
infer-addrpace-pipeline.ll
infinite-loop-evergreen.ll
infinite-loop.ll
inline-asm.ll	AMDGPU: Make VReg_1 only include 1 artificial register	2019-10-28 20:51:51 -07:00
inline-attr.ll
inline-calls.ll
inline-constraints.ll	AMDGPU: Relax 32-bit SGPR register class	2019-10-18 18:26:37 +00:00
inline-maxbb.ll
inlineasm-16.ll
inlineasm-illegal-type.ll
inlineasm-packed.ll
InlineAsmCrash.ll
input-mods.ll
insert_subreg.ll
insert_vector_dynelt.ll
insert_vector_elt.ll	[amdgpu] Fix typos in a test case.	2020-01-14 20:08:39 -05:00
insert_vector_elt.v2i16.ll
insert_vector_elt.v2i16.subtest-nosaddr.ll
insert_vector_elt.v2i16.subtest-saddr.ll
insert-skip-from-vcc.mir
insert-skips-flat-vmem.mir	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
insert-skips-gws.mir	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
insert-skips-ignored-insts.mir	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
insert-skips-kill-uncond.mir	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
insert-subvector-unused-scratch.ll	[MachineScheduler] Reduce reordering due to mem op clustering	2020-01-14 19:19:02 +00:00
insert-waitcnts-callee.mir
insert-waitcnts-exp.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
inserted-wait-states.mir	AMDGPU: Prepare to use scalar register indexing	2020-01-20 17:19:16 -05:00
internalize.ll
invalid-addrspacecast.ll
invalid-alloca.ll
invariant-load-no-alias-store.ll
invert-br-undef-vcc.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
ipra-regmask.ll
ipra.ll
jump-address.ll
kcache-fold.ll
kernarg-stack-alignment.ll
kernel-args.ll	[MachineScheduler] Reduce reordering due to mem op clustering	2020-01-14 19:19:02 +00:00
kernel-argument-dag-lowering.ll	[MachineScheduler] Reduce reordering due to mem op clustering	2020-01-14 19:19:02 +00:00
known-never-nan.ll
known-never-snan.ll	AMDGPU: Implement FDIV optimizations in AMDGPUCodeGenPrepare	2020-01-23 16:57:43 -08:00
knownbits-recursion.ll
large-alloca-compute.ll
large-alloca-graphics.ll
large-constant-initializer.ll
large-work-group-promote-alloca.ll	AMDGPU Reduce reported maximum group size to 1024	2019-11-13 06:34:28 +05:30
lcssa-optnone.ll
lds_atomic_f32.ll
lds-alignment.ll
lds-bounds.ll
lds-branch-vmem-hazard.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
lds-global-non-entry-func.ll
lds-initializer.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
lds-m0-init-in-loop.ll
lds-misaligned-bug.ll
lds-oqap-crash.ll
lds-output-queue.ll
lds-relocs.ll
lds-size.ll
lds-zero-initializer.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
legalize-fp-load-invariant.ll
legalizedag-bug-expand-setcc.ll
limit-coalesce.mir
lit.local.cfg
literals.ll
liveness.mir
llvm.amdgcn.alignb.ll
llvm.amdgcn.atomic.dec.ll
llvm.amdgcn.atomic.fadd.ll
llvm.amdgcn.atomic.inc.ll
llvm.amdgcn.buffer.atomic.ll
llvm.amdgcn.buffer.load.dwordx3.ll
llvm.amdgcn.buffer.load.format.d16.ll
llvm.amdgcn.buffer.load.format.ll
llvm.amdgcn.buffer.load.ll
llvm.amdgcn.buffer.store.dwordx3.ll
llvm.amdgcn.buffer.store.format.d16.ll
llvm.amdgcn.buffer.store.format.ll
llvm.amdgcn.buffer.store.ll
llvm.amdgcn.buffer.wbinvl1.ll
llvm.amdgcn.buffer.wbinvl1.sc.ll
llvm.amdgcn.buffer.wbinvl1.vol.ll
llvm.amdgcn.class.f16.ll
llvm.amdgcn.class.ll
llvm.amdgcn.cos.f16.ll
llvm.amdgcn.cos.ll
llvm.amdgcn.cubeid.ll
llvm.amdgcn.cubema.ll
llvm.amdgcn.cubesc.ll
llvm.amdgcn.cubetc.ll
llvm.amdgcn.cvt.pk.i16.ll
llvm.amdgcn.cvt.pk.u16.ll
llvm.amdgcn.cvt.pknorm.i16.ll
llvm.amdgcn.cvt.pknorm.u16.ll
llvm.amdgcn.cvt.pkrtz.ll
llvm.amdgcn.dispatch.id.ll
llvm.amdgcn.dispatch.ptr.ll
llvm.amdgcn.div.fixup.f16.ll
llvm.amdgcn.div.fixup.ll
llvm.amdgcn.div.fmas.ll
llvm.amdgcn.div.scale.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
llvm.amdgcn.ds.append.ll	AMDGPU/GlobalISel: Select DS append/consume	2020-01-17 20:09:53 -05:00
llvm.amdgcn.ds.bpermute.ll
llvm.amdgcn.ds.consume.ll	AMDGPU/GlobalISel: Select DS append/consume	2020-01-17 20:09:53 -05:00
llvm.amdgcn.ds.gws.barrier.ll	AMDGPU/GlobalISel: Select DS GWS intrinsics	2020-01-16 11:25:10 -05:00
llvm.amdgcn.ds.gws.init.ll	AMDGPU/GlobalISel: Select DS GWS intrinsics	2020-01-16 11:25:10 -05:00
llvm.amdgcn.ds.gws.sema.br.ll
llvm.amdgcn.ds.gws.sema.p.ll
llvm.amdgcn.ds.gws.sema.release.all.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
llvm.amdgcn.ds.gws.sema.v.ll
llvm.amdgcn.ds.ordered.add.gfx10.ll
llvm.amdgcn.ds.ordered.add.ll
llvm.amdgcn.ds.ordered.swap.ll	[codegen,amdgpu] Enhance MIR DIE and re-arrange it for AMDGPU.	2020-01-14 19:26:15 -05:00
llvm.amdgcn.ds.permute.ll
llvm.amdgcn.ds.swizzle.ll
llvm.amdgcn.exp.compr.ll
llvm.amdgcn.exp.ll
llvm.amdgcn.exp.prim.ll
llvm.amdgcn.fcmp.ll
llvm.amdgcn.fdiv.fast.ll
llvm.amdgcn.fdot2.ll
llvm.amdgcn.fmad.ftz.f16.ll
llvm.amdgcn.fmad.ftz.ll
llvm.amdgcn.fmed3.f16.ll
llvm.amdgcn.fmed3.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
llvm.amdgcn.fmul.legacy.ll
llvm.amdgcn.fract.f16.ll
llvm.amdgcn.fract.ll
llvm.amdgcn.frexp.exp.f16.ll
llvm.amdgcn.frexp.exp.ll
llvm.amdgcn.frexp.mant.f16.ll
llvm.amdgcn.frexp.mant.ll
llvm.amdgcn.groupstaticsize.ll
llvm.amdgcn.icmp.ll
llvm.amdgcn.image.a16.dim.ll
llvm.amdgcn.image.atomic.dim.ll
llvm.amdgcn.image.d16.dim.ll
llvm.amdgcn.image.dim.ll
llvm.amdgcn.image.gather4.a16.dim.ll
llvm.amdgcn.image.gather4.d16.dim.ll
llvm.amdgcn.image.gather4.dim.ll
llvm.amdgcn.image.gather4.o.dim.ll
llvm.amdgcn.image.getlod.dim.ll
llvm.amdgcn.image.load.a16.d16.ll
llvm.amdgcn.image.load.a16.ll
llvm.amdgcn.image.nsa.ll
llvm.amdgcn.image.sample.a16.dim.ll
llvm.amdgcn.image.sample.d16.dim.ll
llvm.amdgcn.image.sample.dim.ll
llvm.amdgcn.image.sample.ltolz.ll
llvm.amdgcn.image.sample.o.dim.ll
llvm.amdgcn.image.store.a16.d16.ll
llvm.amdgcn.image.store.a16.ll
llvm.amdgcn.implicit.buffer.ptr.hsa.ll
llvm.amdgcn.implicit.buffer.ptr.ll
llvm.amdgcn.implicitarg.ptr.ll
llvm.amdgcn.init.exec.ll	AMDGPU/GlobalISel: Add support for init.exec intrinsics	2019-10-01 02:07:25 +00:00
llvm.amdgcn.init.exec.wave32.ll	AMDGPU/GlobalISel: Add support for init.exec intrinsics	2019-10-01 02:07:25 +00:00
llvm.amdgcn.interp.f16.ll	AMDGPU: Use CopyToReg for interp intrinsic lowering	2019-10-21 19:53:49 +00:00
llvm.amdgcn.interp.ll
llvm.amdgcn.is.private.ll
llvm.amdgcn.is.shared.ll
llvm.amdgcn.kernarg.segment.ptr.ll
llvm.amdgcn.kill.ll
llvm.amdgcn.ldexp.f16.ll
llvm.amdgcn.ldexp.ll
llvm.amdgcn.lerp.ll
llvm.amdgcn.log.clamp.ll
llvm.amdgcn.mbcnt.ll
llvm.amdgcn.mfma.ll	AMDGPU: Fix lit test checks with dag option	2019-11-28 10:01:06 +00:00
llvm.amdgcn.mov.dpp8.ll
llvm.amdgcn.mov.dpp.ll	[AMDGPU] Fixed dpp test. NFC.	2019-11-13 16:38:54 -08:00
llvm.amdgcn.mqsad.pk.u16.u8.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
llvm.amdgcn.mqsad.u32.u8.ll
llvm.amdgcn.msad.u8.ll
llvm.amdgcn.mul.i24.ll
llvm.amdgcn.mul.u24.ll
llvm.amdgcn.permlane.ll
llvm.amdgcn.ps.live.ll
llvm.amdgcn.qsad.pk.u16.u8.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
llvm.amdgcn.queue.ptr.ll
llvm.amdgcn.raw.buffer.atomic.ll
llvm.amdgcn.raw.buffer.load.format.d16.ll
llvm.amdgcn.raw.buffer.load.format.ll
llvm.amdgcn.raw.buffer.load.ll	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
llvm.amdgcn.raw.buffer.store.format.d16.ll
llvm.amdgcn.raw.buffer.store.format.ll
llvm.amdgcn.raw.buffer.store.ll	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
llvm.amdgcn.raw.tbuffer.load.d16.ll
llvm.amdgcn.raw.tbuffer.load.ll
llvm.amdgcn.raw.tbuffer.store.d16.ll
llvm.amdgcn.raw.tbuffer.store.ll
llvm.amdgcn.rcp.f16.ll
llvm.amdgcn.rcp.legacy.ll
llvm.amdgcn.rcp.ll	AMDGPU: Implement FDIV optimizations in AMDGPUCodeGenPrepare	2020-01-23 16:57:43 -08:00
llvm.amdgcn.readfirstlane.ll	AMDGPU: Relax 32-bit SGPR register class	2019-10-18 18:26:37 +00:00
llvm.amdgcn.readlane.ll	AMDGPU: Relax 32-bit SGPR register class	2019-10-18 18:26:37 +00:00
llvm.amdgcn.rsq.clamp.ll
llvm.amdgcn.rsq.f16.ll
llvm.amdgcn.rsq.legacy.ll
llvm.amdgcn.rsq.ll
llvm.amdgcn.s.barrier.ll
llvm.amdgcn.s.buffer.load.ll	[AMDGPU] Lower llvm.amdgcn.s.buffer.load.v3[i\|f]32	2019-11-15 15:01:15 +01:00
llvm.amdgcn.s.dcache.inv.ll
llvm.amdgcn.s.dcache.inv.vol.ll
llvm.amdgcn.s.dcache.wb.ll
llvm.amdgcn.s.dcache.wb.vol.ll
llvm.amdgcn.s.decperflevel.ll
llvm.amdgcn.s.get.waveid.in.workgroup.ll
llvm.amdgcn.s.getpc.ll
llvm.amdgcn.s.getreg.ll
llvm.amdgcn.s.incperflevel.ll
llvm.amdgcn.s.memrealtime.ll
llvm.amdgcn.s.memtime.ll
llvm.amdgcn.s.sleep.ll
llvm.amdgcn.s.waitcnt.ll
llvm.amdgcn.sad.hi.u8.ll
llvm.amdgcn.sad.u8.ll
llvm.amdgcn.sad.u16.ll
llvm.amdgcn.sbfe.ll
llvm.amdgcn.sdot2.ll
llvm.amdgcn.sdot4.ll
llvm.amdgcn.sdot8.ll
llvm.amdgcn.sendmsg.ll
llvm.amdgcn.set.inactive.ll
llvm.amdgcn.sffbh.ll
llvm.amdgcn.sin.f16.ll
llvm.amdgcn.sin.ll
llvm.amdgcn.softwqm.ll	AMDGPU: Update tests to use modern buffer intrinsics	2020-01-16 13:49:43 -05:00
llvm.amdgcn.struct.buffer.atomic.ll
llvm.amdgcn.struct.buffer.load.format.d16.ll
llvm.amdgcn.struct.buffer.load.format.ll
llvm.amdgcn.struct.buffer.load.ll
llvm.amdgcn.struct.buffer.store.format.d16.ll
llvm.amdgcn.struct.buffer.store.format.ll
llvm.amdgcn.struct.buffer.store.ll
llvm.amdgcn.struct.tbuffer.load.d16.ll
llvm.amdgcn.struct.tbuffer.load.ll
llvm.amdgcn.struct.tbuffer.store.d16.ll
llvm.amdgcn.struct.tbuffer.store.ll
llvm.amdgcn.tbuffer.load.d16.ll
llvm.amdgcn.tbuffer.load.dwordx3.ll
llvm.amdgcn.tbuffer.load.ll
llvm.amdgcn.tbuffer.store.d16.ll
llvm.amdgcn.tbuffer.store.dwordx3.ll
llvm.amdgcn.tbuffer.store.ll
llvm.amdgcn.trig.preop.ll
llvm.amdgcn.ubfe.ll
llvm.amdgcn.udot2.ll
llvm.amdgcn.udot4.ll
llvm.amdgcn.udot8.ll
llvm.amdgcn.unreachable.ll
llvm.amdgcn.update.dpp.ll	[AMDGPU] Support mov dpp with 64 bit operands	2019-10-15 16:41:15 +00:00
llvm.amdgcn.wave.barrier.ll
llvm.amdgcn.wavefrontsize.ll
llvm.amdgcn.workgroup.id.ll
llvm.amdgcn.workitem.id.ll
llvm.amdgcn.wqm.vote.ll	AMDGPU/GlobalISel: Select llvm.amdgcn.wqm.vote	2020-01-07 10:15:29 -05:00
llvm.amdgcn.writelane.ll	AMDGPU: Relax 32-bit SGPR register class	2019-10-18 18:26:37 +00:00
llvm.ceil.f16.ll
llvm.cos.f16.ll
llvm.cos.ll
llvm.dbg.value.ll
llvm.exp2.f16.ll
llvm.exp2.ll
llvm.floor.f16.ll
llvm.fma.f16.ll
llvm.fmuladd.f16.ll
llvm.log2.f16.ll
llvm.log2.ll
llvm.log10.f16.ll
llvm.log10.ll
llvm.log.f16.ll
llvm.log.ll
llvm.maxnum.f16.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
llvm.memcpy.ll
llvm.minnum.f16.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
llvm.pow.ll
llvm.r600.cube.ll
llvm.r600.dot4.ll
llvm.r600.group.barrier.ll
llvm.r600.read.local.size.ll
llvm.r600.recipsqrt.clamped.ll
llvm.r600.recipsqrt.ieee.ll
llvm.r600.tex.ll
llvm.rint.f16.ll
llvm.rint.f64.ll
llvm.rint.ll
llvm.round.f64.ll	[codegen,amdgpu] Enhance MIR DIE and re-arrange it for AMDGPU.	2020-01-14 19:26:15 -05:00
llvm.round.ll
llvm.sin.f16.ll
llvm.sin.ll
llvm.sqrt.f16.ll
llvm.trunc.f16.ll
load-constant-f32.ll
load-constant-f64.ll
load-constant-i1.ll
load-constant-i8.ll
load-constant-i16.ll
load-constant-i32.ll
load-constant-i64.ll
load-global-f32.ll
load-global-f64.ll
load-global-i1.ll
load-global-i8.ll
load-global-i16.ll
load-global-i32.ll
load-global-i64.ll
load-hi16.ll
load-input-fold.ll
load-lo16.ll	[AMDGPU] Clean up update_llc_test_checks CodeGen tests	2019-10-24 17:35:33 -04:00
load-local-f32-no-ds128.ll
load-local-f32.ll
load-local-f64.ll
load-local-i1.ll
load-local-i8.ll
load-local-i16.ll
load-local-i32.ll
load-local-i64.ll
load-select-ptr.ll
load-weird-sizes.ll
local-64.ll
local-atomics64.ll
local-atomics-fp.ll
local-atomics.ll
local-memory.amdgcn.ll
local-memory.ll
local-memory.r600.ll
local-stack-slot-offset.ll
loop_break.ll	[codegen,amdgpu] Enhance MIR DIE and re-arrange it for AMDGPU.	2020-01-14 19:26:15 -05:00
loop_exit_with_xor.ll	AMDGPU: Update tests to use modern buffer intrinsics	2020-01-16 13:49:43 -05:00
loop_header_nopred.mir	[MBP] Avoid tail duplication if it can't bring benefit	2019-12-06 09:53:53 -08:00
loop-address.ll
loop-idiom.ll
lower-kernargs.ll
lower-mem-intrinsics.ll
lower-range-metadata-intrinsic-call.ll	AMDGPU: Switch backend default max workgroup size to 1024	2019-11-13 07:11:02 +05:30
lshl64-to-32.ll
lshr.v2i16.ll	[SelectionDAG] ComputeKnownBits - minimum leading/trailing zero bits in LSHR/SHL (PR44526)	2020-01-13 11:08:12 +00:00
macro-fusion-cluster-vcc-uses.mir
mad24-get-global-id.ll
mad_64_32.ll
mad_int24.ll
mad_uint24.ll
mad-combine.ll
mad-mix-hi.ll
mad-mix-lo.ll
mad-mix.ll
mad.u16.ll
madak-inline-constant.mir
madak.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
madmk.ll
mai-hazards.mir
mai-inline.ll
max3.ll
max-literals.ll
max-sgprs.ll
max.i16.ll	[DAG] Fold extract_vector_elt (scalar_to_vector), K to undef (K != 0)	2020-01-21 10:58:30 +00:00
max.ll	[AMDGPU] Unnecessary -amdgpu-scalarize-global-loads=false flag removed from min/max lit tests.	2019-09-19 16:44:38 +00:00
mcp-overlap-after-propagation.mir	MCP: Fixed bug with dest overlapping copy source	2019-11-12 08:18:11 +00:00
med3-no-simplify.ll
mem-builtins.ll
memory_clause.ll
memory_clause.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
memory-legalizer-amdpal.ll
memory-legalizer-atomic-cmpxchg.ll
memory-legalizer-atomic-fence.ll
memory-legalizer-atomic-insert-end.mir	llc: Change behavior of -mcpu with existing attribute	2020-01-07 10:10:25 -05:00
memory-legalizer-atomic-rmw.ll
memory-legalizer-invalid-addrspace.mir
memory-legalizer-invalid-syncscope.ll
memory-legalizer-load.ll
memory-legalizer-local.mir
memory-legalizer-mesa3d.ll
memory-legalizer-multiple-mem-operands-atomics.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
memory-legalizer-multiple-mem-operands-nontemporal-1.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
memory-legalizer-multiple-mem-operands-nontemporal-2.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
memory-legalizer-region.mir
memory-legalizer-store-infinite-loop.ll
memory-legalizer-store.ll
merge-image-load.mir	[AMDGPU] Extend the SI Load/Store optimizer	2019-10-16 10:17:02 +00:00
merge-image-sample.mir	[AMDGPU] Extend the SI Load/Store optimizer	2019-10-16 10:17:02 +00:00
merge-load-store-physreg.mir	AMDGPU: Remove unnecessary IR from test	2019-10-14 18:30:29 +00:00
merge-load-store-vreg.mir
merge-load-store.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
merge-m0.mir
merge-store-crash.ll
merge-store-usedef.ll
merge-stores.ll
merge-tbuffer.mir	[AMDGPU][SILoadStoreOptimizer] Merge TBUFFER loads/stores	2019-11-20 22:59:30 +01:00
mesa3d.ll
mesa_regression.ll
mfma-loop.ll	[AMDGPU] Fixed mfma-loop test. NFC.	2019-11-13 16:03:54 -08:00
min3.ll
min.ll	[AMDGPU] Fix bundle scheduling	2020-01-09 15:56:36 -08:00
mir-print-dead-csr-fi.mir
misched-killflags.mir	[AMDGPU] Fix bundle scheduling	2020-01-09 15:56:36 -08:00
missing-store.ll
mixed_wave32_wave64.ll
mixed-wave32-wave64.ll
mode-register.mir
move-addr64-rsrc-dead-subreg-writes.ll
move-to-valu-atomicrmw.ll
move-to-valu-worklist.ll
movreld-bug.ll	AMDGPU: Add gfx9 run lines to a testcase	2020-01-03 15:25:50 -05:00
movrels-bug.mir
mubuf-legalize-operands.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
mubuf-legalize-operands.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
mubuf-offset-private.ll
mubuf-shader-vgpr.ll	AMDGPU: Update tests to use modern buffer intrinsics	2020-01-16 13:49:43 -05:00
mubuf.ll
mul24-pass-ordering.ll	AMDGPU: Implement FDIV optimizations in AMDGPUCodeGenPrepare	2020-01-23 16:57:43 -08:00
mul_int24.ll
mul_uint24-amdgcn.ll	[AMDGPU] Fix bundle scheduling	2020-01-09 15:56:36 -08:00
mul_uint24-r600.ll
mul.i16.ll
mul.ll
multi-divergent-exit-region.ll
multi-dword-vgpr-spill.ll
multilevel-break.ll	[AMDGPU] Fix emitIfBreak CF lowering: use temp reg to make register coalescer life easier.	2019-11-26 18:59:37 +03:00
nand.ll
nested-calls.ll
nested-loop-conditions.ll
no-hsa-graphics-shaders.ll
no-initializer-constant-addrspace.ll
no-remat-indirect-mov.mir
no-shrink-extloads.ll
noop-shader-O0.ll
nop-data.ll
nop-fold.mir	AMDGPU: Don't fold S_NOPs with implicit operands	2019-10-30 14:40:56 -07:00
nor.ll
not-scalarize-volatile-load.ll
nsa-reassign.ll
nsa-vmem-hazard.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
nullptr.ll
occupancy-levels.ll	AMDGPU: Switch backend default max workgroup size to 1024	2019-11-13 07:11:02 +05:30
offset-split-flat.ll	AMDGPU: Split flat offsets that don't fit in DAG	2019-10-20 17:34:44 +00:00
offset-split-global.ll	AMDGPU: Split flat offsets that don't fit in DAG	2019-10-20 17:34:44 +00:00
omod-nsz-flag.mir	AMDGPU: Be explicit about denormal mode in MIR tests	2019-11-19 19:55:43 +05:30
omod.ll
opencl-image-metadata.ll
opencl-printf-no-hostcall.ll	[AMDGPU] add support for hostcall buffer pointer as hidden kernel argument	2019-11-20 15:53:55 +05:30
opencl-printf.ll
operand-folding.ll	[AMDGPU] Skip additional folding on the same operand.	2019-10-24 11:30:22 -04:00
operand-spacing.ll
opt-sgpr-to-vgpr-copy.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
optimize-exec-masking-pre-ra.mir	AMDGPU: Propagate undef flag during pre-RA exec mask optimizations	2019-10-08 12:46:32 +00:00
optimize-if-exec-masking.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
optimize-negated-cond-exec-masking-wave32.mir
optimize-negated-cond-exec-masking.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
optimize-negated-cond.ll
or3.ll
or.ll
pack.v2f16.ll
pack.v2i16.ll
packed-op-sel.ll	[AMDGPU] Fix bundle scheduling	2020-01-09 15:56:36 -08:00
packetizer.ll
parallelandifcollapse.ll
parallelorifcollapse.ll
partial-sgpr-to-vgpr-spills.ll
partial-shift-shrink.ll
partially-dead-super-register-immediate.ll
peephole-opt-regseq-removal.mir
pei-reg-scavenger-position.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
pei-scavenge-sgpr-carry-out.mir	AMDGPU: Reuse carry out register during FI elimination	2019-11-28 10:13:48 -08:00
pei-scavenge-sgpr-gfx9.mir
pei-scavenge-sgpr.mir
perfhint.ll
permute.ll
phi-elimination-assertion.mir
phi-elimination-end-cf.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
pk_max_f16_literal.ll
post-ra-sched-kill-bundle-use-inst.mir
postra-machine-sink.mir	PostRA Machine Sink should take care of COPY defining register that is a sub-register by another COPY source operand	2019-12-17 15:20:43 +03:00
postra-norename.mir
power-sched-no-instr-sunit.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
predicate-dp4.ll
predicates.ll
preserve-hi16.ll	AMDGPU: Fix i16 arithmetic pattern redundancy	2019-10-08 17:36:38 +00:00
print-mir-custom-pseudo.ll
private-access-no-objects.ll
private-element-size.ll
private-memory-atomics.ll
private-memory-r600.ll	AMDGPU: Switch backend default max workgroup size to 1024	2019-11-13 07:11:02 +05:30
promote-alloca-addrspacecast.ll	AMDGPU: Switch backend default max workgroup size to 1024	2019-11-13 07:11:02 +05:30
promote-alloca-array-aggregate.ll
promote-alloca-array-allocation.ll
promote-alloca-bitcast-function.ll
promote-alloca-calling-conv.ll
promote-alloca-globals.ll
promote-alloca-invariant-markers.ll
promote-alloca-lifetime.ll
promote-alloca-mem-intrinsics.ll
promote-alloca-no-opts.ll
promote-alloca-padding-size-estimate.ll
promote-alloca-stored-pointer-value.ll
promote-alloca-to-lds-icmp.ll	AMDGPU: Switch backend default max workgroup size to 1024	2019-11-13 07:11:02 +05:30
promote-alloca-to-lds-phi.ll	AMDGPU: Switch backend default max workgroup size to 1024	2019-11-13 07:11:02 +05:30
promote-alloca-to-lds-select.ll	AMDGPU: Switch backend default max workgroup size to 1024	2019-11-13 07:11:02 +05:30
promote-alloca-unhandled-intrinsic.ll
promote-alloca-vector-to-vector.ll
promote-alloca-volatile.ll
promote-constOffset-to-imm-gfx10.mir	[AMDGPU] Keep consistent check of legal addressing mode.	2019-11-20 15:08:17 -05:00
promote-constOffset-to-imm.ll	[MachineScheduler] Reduce reordering due to mem op clustering	2020-01-14 19:19:02 +00:00
promote-constOffset-to-imm.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
propagate-attributes-bitcast-function.ll
propagate-attributes-clone.ll
propagate-attributes-single-set.ll
pv-packing.ll
pv.ll
r600-constant-array-fixup.ll	R600: Fix failing testcase	2020-01-22 16:01:35 -05:00
r600-encoding.ll
r600-export-fix.ll
r600-infinite-loop-bug-while-reorganizing-vector.ll
r600-legalize-umax-bug.ll
r600.add.ll
r600.alu-limits.ll
r600.amdgpu-alias-analysis.ll
r600.bitcast.ll
r600.extract-lowbits.ll
r600.func-alignment.ll
r600.global_atomics.ll
r600.private-memory.ll
r600.sub.ll
r600.work-item-intrinsics.ll
r600cfg.ll
rcp_iflag.ll	AMDGPU: Implement FDIV optimizations in AMDGPUCodeGenPrepare	2020-01-23 16:57:43 -08:00
rcp-pattern.ll	AMDGPU: Implement FDIV optimizations in AMDGPUCodeGenPrepare	2020-01-23 16:57:43 -08:00
read_register.ll	AMDGPU: Adjust test so it will work with GlobalISel	2019-12-27 19:37:39 -05:00
read-register-invalid-subtarget.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
read-register-invalid-type-i32.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
read-register-invalid-type-i64.ll	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
readcyclecounter.ll
readlane_exec0.mir
README
reassoc-scalar.ll
reduce-build-vec-ext-to-ext-build-vec.ll
reduce-load-width-alignment.ll
reduce-saveexec.mir
reduce-store-width-alignment.ll
reduction.ll
reg-coalescer-sched-crash.ll
regbank-reassign.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
regcoal-subrange-join-seg.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
regcoal-subrange-join.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
regcoalesce-cannot-join-failures.mir
regcoalesce-dbg.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
regcoalesce-keep-valid-lanes-implicit-def-bug39602.mir
regcoalesce-prune.mir
regcoalescing-remove-partial-redundancy-assert.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
register-count-comments.ll
rename-disconnected-bug.ll
rename-independent-subregs-mac-operands.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
rename-independent-subregs.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
reorder-stores.ll
reqd-work-group-size.ll
ret_jump.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
ret.ll
returnaddress.ll
rewrite-out-arguments-address-space.ll
rewrite-out-arguments.ll
rotl.i64.ll
rotl.ll
rotr.i64.ll
rotr.ll
rsq.ll	AMDGPU: Implement FDIV optimizations in AMDGPUCodeGenPrepare	2020-01-23 16:57:43 -08:00
rv7x0_count3.ll
s_addk_i32.ll
s_code_end.ll
s_movk_i32.ll
s_mulk_i32.ll
sad.ll
saddo.ll	[AMDGPU] Allow folding of sgpr to vgpr copy	2019-10-23 18:42:48 -07:00
salu-to-valu.ll
sampler-resource-id.ll
scalar_to_vector_v2x16.ll
scalar_to_vector.ll
scalar-branch-missing-and-exec.ll
scalar-store-cache-flush.mir
sched-assert-dead-def-subreg-use-other-subreg.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
sched-assert-onlydbg-value-empty-region.mir
sched-crash-dbg-value.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
sched-handleMoveUp-subreg-def-across-subreg-def.mir	LiveIntervals: Fix handleMoveUp with subreg def moving across a def	2019-10-18 23:24:25 +00:00
schedule-barrier.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
schedule-fs-loop-nested-if.ll
schedule-fs-loop-nested.ll
schedule-fs-loop.ll
schedule-global-loads.ll
schedule-if-2.ll
schedule-if.ll
schedule-ilp.ll
schedule-kernel-arg-loads.ll
schedule-regpressure-limit2.ll
schedule-regpressure-limit3.ll
schedule-regpressure-limit.ll
schedule-regpressure.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
schedule-vs-if-nested-loop-failure.ll
schedule-vs-if-nested-loop.ll
scheduler-handle-move-bundle.mir	[AMDGPU] Fix getInstrLatency() always returning 1	2020-01-14 01:08:30 -08:00
scheduler-subrange-crash.ll
scratch-buffer.ll
scratch-simple.ll	[AMDGPU] Fix bundle scheduling	2020-01-09 15:56:36 -08:00
sdiv64.ll	AMDGPU: Cleanup and generate 64-bit div tests	2020-01-20 17:19:39 -05:00
sdiv.ll
sdivrem24.ll
sdivrem64.r600.ll	AMDGPU: Cleanup and generate 64-bit div tests	2020-01-20 17:19:39 -05:00
sdwa-gfx9.mir
sdwa-op64-test.ll
sdwa-ops.mir
sdwa-peephole-instr-gfx10.mir
sdwa-peephole-instr.mir
sdwa-peephole.ll	AMDGPU: Fix not using v_cvt_f16_[iu]16	2020-01-07 15:10:07 -05:00
sdwa-preserve.mir
sdwa-scalar-ops.mir
sdwa-vop2-64bit.mir
select64.ll
select-fabs-fneg-extract-legacy.ll
select-fabs-fneg-extract.ll	AMDGPU: Do binop of select of constant fold in AMDGPUCodeGenPrepare	2020-01-22 10:16:39 -05:00
select-i1.ll
select-opt.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
select-undef.ll
select-vectors.ll
select.f16.ll	[codegen,amdgpu] Enhance MIR DIE and re-arrange it for AMDGPU.	2020-01-14 19:26:15 -05:00
select.ll
selectcc-cnd.ll
selectcc-cnde-int.ll
selectcc-icmp-select-float.ll
selectcc-opt.ll	[AMDGPU] Fix bundle scheduling	2020-01-09 15:56:36 -08:00
selectcc.ll
sendmsg-m0-hazard.mir
set-dx10.ll
setcc64.ll
setcc-equivalent.ll
setcc-fneg-constant.ll
setcc-limit-load-shrink.ll
setcc-opt.ll	[AMDGPU] Fix bundle scheduling	2020-01-09 15:56:36 -08:00
setcc-sext.ll
setcc.ll
seto.ll
setuo.ll
sext-eliminate.ll
sext-in-reg-failure-r600.ll
sext-in-reg.ll
sgpr-control-flow.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
sgpr-copy-duplicate-operand.ll
sgpr-copy.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
sgpr-spill-wrong-stack-id.mir
sgprcopies.ll
shader-addr64-nonuniform.ll
shared-op-cycle.ll
shift-and-i64-ubfe.ll
shift-and-i128-ubfe.ll
shift-i64-opts.ll
shift-i128.ll	[MachineScheduler] Reduce reordering due to mem op clustering	2020-01-14 19:19:02 +00:00
shl_add_constant.ll
shl_add_ptr.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
shl_add.ll
shl_or.ll
shl-add-to-add-shl.ll
shl.ll	[SelectionDAG] ComputeKnownBits - minimum leading/trailing zero bits in LSHR/SHL (PR44526)	2020-01-13 11:08:12 +00:00
shl.v2i16.ll	[SelectionDAG] ComputeKnownBits - minimum leading/trailing zero bits in LSHR/SHL (PR44526)	2020-01-13 11:08:12 +00:00
shrink-add-sub-constant.ll	AMDGPU: Apply i16 add->sub pattern with zext to i32	2020-01-07 16:36:31 -05:00
shrink-carry.mir
shrink-vop3-carry-out.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
si-annotate-cf-noloop.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
si-annotate-cf-unreachable.ll
si-annotate-cf.ll	[AMDGPU] Fix emitIfBreak CF lowering: use temp reg to make register coalescer life easier.	2019-11-26 18:59:37 +03:00
si-annotate-cfg-loop-assert.ll
si-annotatecfg-multiple-backedges.ll
si-fix-sgpr-copies.mir	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
si-i1-copies.mir	MachineSSAUpdater: insert IMPLICIT_DEF at top of basic block	2019-10-08 12:46:20 +00:00
si-instr-info-correct-implicit-operands.ll
si-lower-control-flow-kill.ll
si-lower-control-flow-unreachable-block.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
si-lower-control-flow.mir	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
si-lower-i1-copies.mir
si-lower-sgpr-spills.mir
si-scheduler.ll
si-sgpr-spill.ll
si-spill-cf.ll
si-spill-sgpr-stack.ll	[AMDGPU] Remove -amdgpu-spill-sgpr-to-smem.	2019-10-18 21:48:22 +00:00
si-triv-disjoint-mem-access.ll
si-vector-hang.ll
sibling-call.ll
sign_extend.ll
simplify-libcalls.ll	Revert "Revert "As a follow-up to my initial mail to llvm-dev here's a first pass at the O1 described there.""	2019-11-26 20:28:52 -08:00
simplifydemandedbits-recursion.ll
sint_to_fp.f64.ll	AMDGPU: Add run line to int_to_fp tests	2020-01-06 21:38:50 -05:00
sint_to_fp.i64.ll
sint_to_fp.ll	[AMDGPU] Fix bundle scheduling	2020-01-09 15:56:36 -08:00
sitofp.f16.ll	AMDGPU: Fix not using v_cvt_f16_[iu]16	2020-01-07 15:10:07 -05:00
skip-branch-taildup-ret.mir	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
skip-branch-trap.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
skip-if-dead.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
smed3.ll
smem-no-clause-coalesced.mir
smem-war-hazard.mir	AMDGPU: Fix SMEM WAR hazard for gfx10 readlane	2019-10-18 18:20:30 +00:00
sminmax.ll	[AMDGPU] Unnecessary -amdgpu-scalarize-global-loads=false flag removed from min/max lit tests.	2019-09-19 16:44:38 +00:00
sminmax.v2i16.ll	[AMDGPU] Unnecessary -amdgpu-scalarize-global-loads=false flag removed from min/max lit tests.	2019-09-19 16:44:38 +00:00
smrd_vmem_war.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
smrd-fold-offset.mir
smrd-gfx10.ll
smrd-vccz-bug.ll	[AMDGPU] Simplify VCCZ bug handling	2019-10-30 17:09:07 +00:00
smrd.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
sopk-compares.ll
sp-too-many-input-sgprs.ll
spill-agpr.ll
spill-alloc-sgpr-init-bug.ll
spill-before-exec.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
spill-cfg-position.ll
spill-csr-frame-ptr-reg-copy.ll
spill-empty-live-interval.mir
spill-m0.ll	[AMDGPU] Remove -amdgpu-spill-sgpr-to-smem.	2019-10-18 21:48:22 +00:00
spill-offset-calculation.ll
spill-scavenge-offset.ll
spill-vgpr-to-agpr.ll	[codegen,amdgpu] Enhance MIR DIE and re-arrange it for AMDGPU.	2020-01-14 19:26:15 -05:00
spill-wide-sgpr.ll	[AMDGPU] Remove -amdgpu-spill-sgpr-to-smem.	2019-10-18 21:48:22 +00:00
split-arg-dbg-value.ll	[AMDGPU] Remove update_llc_test_checks for a test	2019-10-25 11:47:33 -04:00
split-scalar-i64-add.ll
split-smrd.ll
split-vector-memoperand-offsets.ll
splitkit.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
sra.ll
sram-ecc-default.ll
srem64.ll	AMDGPU: Cleanup and generate 64-bit div tests	2020-01-20 17:19:39 -05:00
srem.ll
srl.ll
ssubo.ll
stack-pointer-offset-relative-frameindex.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
stack-realign-kernel.ll	[AMDGPU] Clean up update_llc_test_checks CodeGen tests	2019-10-24 17:35:33 -04:00
stack-realign.ll
stack-size-overflow.ll
stack-slot-color-sgpr-vgpr-spills.mir	AMDGPU: Stop adding m0 implicit def to SGPR spills	2019-10-21 19:42:29 +00:00
store_typed.ll
store-barrier.ll
store-global.ll
store-hi16.ll	AMDGPU: Split flat offsets that don't fit in DAG	2019-10-20 17:34:44 +00:00
store-local.ll
store-private.ll
store-v3i64.ll
store-vector-ptrs.ll
store-weird-sizes.ll
stress-calls.ll
structurize1.ll
structurize.ll
sub_i1.ll
sub-zext-cc-zext-cc.ll	Revert "AMDGPU: Try to commute sub of boolean ext"	2019-12-13 12:49:06 +00:00
sub.i16.ll	[AMDGPU] Fix bundle scheduling	2020-01-09 15:56:36 -08:00
sub.ll
sub.v2i16.ll	AMDGPU: Apply i16 add->sub pattern with zext to i32	2020-01-07 16:36:31 -05:00
subreg_interference.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
subreg-coalescer-crash.ll
subreg-coalescer-undef-use.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
subreg-eliminate-dead.ll
subreg-intervals.mir
subreg-split-live-in-error.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
subreg-undef-def-with-other-subreg-defs.mir	MachineScheduler: Fix missing dependency with multiple subreg defs	2019-09-20 00:09:15 +00:00
subvector-test.mir	AMDGPU: Use SGPR_128 instead of SReg_128 for vregs	2019-10-10 07:11:33 +00:00
swizzle-export.ll
syncscopes.ll
tail-call-cgp.ll
tail-dup-bundle.mir	Process BUNDLE in tail duplication	2020-01-15 15:46:57 -08:00
tail-duplication-convergent.ll
target-cpu.ll
tex-clause-antidep.ll
texture-input-merge.ll
trap.ll
trunc-bitcast-vector.ll
trunc-cmp-constant.ll
trunc-combine.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
trunc-store-f64-to-f16.ll
trunc-store-i1.ll
trunc-store.ll
trunc-vector-store-assertion-failure.ll
trunc.ll
tti-unroll-prefs.ll
twoaddr-fma.mir
twoaddr-mad.mir
uaddo.ll
udiv64.ll	AMDGPU: Cleanup and generate 64-bit div tests	2020-01-20 17:19:39 -05:00
udiv.ll
udivrem24.ll
udivrem64.r600.ll	AMDGPU: Cleanup and generate 64-bit div tests	2020-01-20 17:19:39 -05:00
udivrem.ll
uint_to_fp.f64.ll	AMDGPU: Fix not using v_cvt_f16_[iu]16	2020-01-07 15:10:07 -05:00
uint_to_fp.i64.ll
uint_to_fp.ll	[AMDGPU] Fix bundle scheduling	2020-01-09 15:56:36 -08:00
uitofp.f16.ll	AMDGPU: Fix not using v_cvt_f16_[iu]16	2020-01-07 15:10:07 -05:00
umed3.ll
unaligned-load-store.ll
undefined-physreg-sgpr-spill.mir
undefined-subreg-liverange.ll
unhandled-loop-condition-assertion.ll
uniform-branch-intrinsic-cond.ll	AMDGPU: Update tests to use modern buffer intrinsics	2020-01-16 13:49:43 -05:00
uniform-cfg.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
uniform-crash.ll
uniform-loop-inside-nonuniform.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
uniform-work-group-attribute-missing.ll
uniform-work-group-nested-function-calls.ll
uniform-work-group-prevent-attribute-propagation.ll
uniform-work-group-propagate-attribute.ll
uniform-work-group-recursion-test.ll
uniform-work-group-test.ll
unify-metadata.ll
unigine-liveness-crash.ll
unknown-processor.ll
unpack-half.ll
unroll.ll
unsupported-calls.ll	AMDGPU: Don't error on calls to null or undef	2019-10-20 07:46:04 +00:00
unsupported-cc.ll
unsupported-image-a16.ll	AMDGPU: Don't assert on a16 images on targets without FeatureR128A16	2020-01-17 11:07:00 -05:00
update-phi.ll
urem64.ll	AMDGPU: Cleanup and generate 64-bit div tests	2020-01-20 17:19:39 -05:00
urem.ll
use-sgpr-multiple-times.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
usubo.ll
v1i64-kernel-arg.ll
v1024.ll
v_cndmask.ll
v_cvt_pk_u8_f32.ll
v_mac_f16.ll
v_mac.ll
v_madak_f16.ll
v_swap_b32.mir
valu-i1.ll	[AMDGPU] SIRemoveShortExecBranches should not remove branches exiting loops	2020-01-22 13:18:40 +09:00
vccz-corrupt-bug-workaround.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
vcmpx-exec-war-hazard.mir
vcmpx-permlane-hazard.mir
vector_shuffle.packed.ll	[AMDGPU] Fix typo in SIInstrInfo::memOpsHaveSameBasePtr	2019-12-17 18:54:27 +00:00
vector-alloca-addrspacecast.ll
vector-alloca-atomic.ll
vector-alloca.ll
vector-extract-insert.ll
vector-legalizer-divergence.ll
vectorize-buffer-fat-pointer.ll
vectorize-global-local.ll
verify-sop.mir	Revert "[Support] make report_fatal_error `abort` instead of `exit`"	2020-01-15 17:52:25 -08:00
vertex-fetch-encoding.ll
vgpr-descriptor-waterfall-loop-idom-update.ll	AMDGPU: Update tests to use modern buffer intrinsics	2020-01-16 13:49:43 -05:00
vgpr-spill-emergency-stack-slot-compute.ll	[AMDGPU] Come back patch for the 'Assign register class for cross block values according to the divergence.'	2019-10-14 12:01:10 +00:00
vgpr-spill-emergency-stack-slot.ll	AMDGPU: Update tests to use modern buffer intrinsics	2020-01-16 13:49:43 -05:00
vi-removed-intrinsics.ll
virtregrewrite-undef-identity-copy.mir
vmem-to-salu-hazard.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
vmem-vcc-hazard.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
vop-shrink-frame-index.mir
vop-shrink-non-ssa.mir
vop-shrink.ll
vselect64.ll
vselect.ll
vtx-fetch-branch.ll
vtx-schedule.ll
wait.ll	AMDGPU: Update tests to use modern buffer intrinsics	2020-01-16 13:49:43 -05:00
waitcnt-back-edge-loop.mir
waitcnt-debug.mir
waitcnt-flat.ll
waitcnt-loop-irreducible.mir	[AMDGPU] Extend buffer intrinsics with swizzling	2019-10-02 17:22:36 +00:00
waitcnt-loop-single-basic-block.mir
waitcnt-looptest.ll
waitcnt-no-redundant.mir
waitcnt-overflow.mir	AMDGPU: Handle waitcnt overflow	2019-11-23 09:34:23 -08:00
waitcnt-permute.mir
waitcnt-preexisting.mir
waitcnt-vscnt.ll
waitcnt-vscnt.mir	[amdgpu] Fix scoreboard updating on `s_waitcnt_vscnt`.	2019-12-31 14:20:30 -05:00
waitcnt.mir
wave32.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
wave_dispatch_regs.ll
widen_extending_scalar_loads.ll
widen-smrd-loads.ll
widen-vselect-and-mask.ll
wqm.ll	Resubmit: [AMDGPU] Invert the handling of skip insertion.	2020-01-22 13:18:32 +09:00
wqm.mir	[AMDGPU] Remove unnecessary v_mov from a register to itself in WQM lowering.	2020-01-10 23:01:19 -05:00
write_register.ll	AMDGPU: Split test function	2020-01-12 22:44:51 -05:00
write-register-vgpr-into-sgpr.ll
wrong-transalu-pos-fix.ll
wwm-reserved.ll
xfail.r600.bitcast.ll
xnor.ll
xor3-i1-const.ll
xor3.ll
xor_add.ll
xor.ll
zero_extend.ll	[AMDGPU] Fix bundle scheduling	2020-01-09 15:56:36 -08:00
zext-i64-bit-operand.ll
zext-lid.ll

README

+==============================================================================+
| How to organize the lit tests                                                |
+==============================================================================+

- If you write a test for matching a single DAG opcode or intrinsic, it should
  go in a file called {opcode_name,intrinsic_name}.ll (e.g. fadd.ll)

- If you write a test that matches several DAG opcodes and checks for a single
  ISA instruction, then that test should go in a file called {ISA_name}.ll (e.g.
  bfi_int.ll

- For all other tests, use your best judgement for organizing tests and naming
  the files.

+==============================================================================+
| Naming conventions                                                           |
+==============================================================================+

- Use dash '-' and not underscore '_' to separate words in file names, unless
  the file is named after a DAG opcode or ISA instruction that has an
  underscore '_' in its name.