teak-llvm

mirror of https://github.com/Gericom/teak-llvm.git synced 2025-06-20 03:55:48 -04:00

Author	SHA1	Message	Date
Craig Topper	8d02be3bf3	[X86] Add 'sahf' to getHostCPUFeatures so -march=native will pick it up correctly. Summary: We probably mostly get this right due to family/model/stepping mapping to CPU names. But we should detect it explicitly. Reviewers: RKSimon, echristo, dim, spatel Reviewed By: dim Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D43418 llvm-svn: 325439	2018-02-17 16:52:49 +00:00
Craig Topper	84b26b90d1	[X86] Add intrinsic support for the RDPID instruction This adds a new instrinsic to support the rdpid instruction. The implementation is a bit weird because the intrinsic is defined as always returning 32-bits, but the assembler support thinks the instruction produces a 64-bit register in 64-bit mode. But really it zeros the upper 32 bits. So I had to add separate patterns where 64-bit mode uses an extract_subreg. Differential Revision: https://reviews.llvm.org/D42205 llvm-svn: 322910	2018-01-18 23:52:31 +00:00
Craig Topper	48176a5fb6	[X86] Minor formatting fix to getHostCPUFeatures. NFC llvm-svn: 321015	2017-12-18 19:40:11 +00:00
Evandro Menezes	5d7a9e6e54	[AArch64] Add Exynos to host detection Differential revision: https://reviews.llvm.org/D40985 llvm-svn: 320195	2017-12-08 21:09:59 +00:00
Oren Ben Simhon	fa582b075c	Control-Flow Enforcement Technology - Shadow Stack support (LLVM side) Shadow stack solution introduces a new stack for return addresses only. The HW has a Shadow Stack Pointer (SSP) that points to the next return address. If we return to a different address, an exception is triggered. The shadow stack is managed using a series of intrinsics that are introduced in this patch as well as the new register (SSP). The intrinsics are mapped to new instruction set that implements CET mechanism. The patch also includes initial infrastructure support for IBT. For more information, please see the following: https://software.intel.com/sites/default/files/managed/4d/2a/control-flow-enforcement-technology-preview.pdf Differential Revision: https://reviews.llvm.org/D40223 Change-Id: I4daa1f27e88176be79a4ac3b4cd26a459e88fed4 llvm-svn: 318996	2017-11-26 13:02:45 +00:00
Coby Tayree	d8b17bedfa	[x86][icelake]GFNI galois field arithmetic (GF(2^8)) insns: gf2p8affineinvqb gf2p8affineqb gf2p8mulb Differential Revision: https://reviews.llvm.org/D40373 llvm-svn: 318993	2017-11-26 09:36:41 +00:00
Craig Topper	47c8739b08	[X86] Move the information about the feature bits used by compiler-rt and shared by Host.cpp to a .def file and TargetParser.h so clang can make use of it. Since we keep Host.cpp and compiler-rt relatively in sync, clang can use this information as a proxy. llvm-svn: 318814	2017-11-21 23:36:42 +00:00
Craig Topper	9b03f67f9a	[X86] Sort bits in getHostCPUFeatures again. llvm-svn: 318792	2017-11-21 18:50:41 +00:00
Coby Tayree	5c7fe5df53	[x86][icelake]BITALG vpopcnt{b,w} Differential Revision: https://reviews.llvm.org/D40213 llvm-svn: 318748	2017-11-21 10:32:42 +00:00
Coby Tayree	3880f2a363	[x86][icelake]VNNI Introducing Vector Neural Network Instructions, consisting of: vpdpbusd{s} vpdpwssd{s} Differential Revision: https://reviews.llvm.org/D40208 llvm-svn: 318746	2017-11-21 10:04:28 +00:00
Coby Tayree	71e37cc9ff	[x86][icelake]vbmi2 introducing vbmi2, consisting of vpcompress{b,w} vpexpand{b,w} vpsh{l,r}d{w,d,q} vpsh{l,r}dv{w,d,q} Differential Revision: https://reviews.llvm.org/D40206 llvm-svn: 318745	2017-11-21 09:48:44 +00:00
Coby Tayree	7ca5e58736	[x86][icelake]vpclmulqdq introduction an icelake promotion of pclmulqdq Differential Revision: https://reviews.llvm.org/D40101 llvm-svn: 318741	2017-11-21 09:30:33 +00:00
Coby Tayree	2a1c02fcbc	[x86][icelake]VAES introduction an icelake promotion of AES Differential Revision: https://reviews.llvm.org/D40078 llvm-svn: 318740	2017-11-21 09:11:41 +00:00
Craig Topper	dcd69797a6	[X86] clzero check in getHostCPUFeatures should use getX86CpuIDAndInfo not getX86CpuIDAndInfoEx. This leaf doesn't take an additional argument. llvm-svn: 318634	2017-11-19 23:49:19 +00:00
Craig Topper	1af7e4424f	[X86] Reorder and reformat the feature bit checks in getHostCPUFeatues to keep the bits in order per register and encourage future additions to be in order too. llvm-svn: 318633	2017-11-19 23:30:22 +00:00
Craig Topper	4a1ec811af	[X86] Add some explanatory comments to the ProcessorFeatures enum in Host.cpp. llvm-svn: 318331	2017-11-15 20:42:49 +00:00
Craig Topper	0dadfe30d2	[X86] Add getHostCPUName support for the Gemini Lake model number which also uses Goldmont. llvm-svn: 318271	2017-11-15 06:02:43 +00:00
Craig Topper	0749186a70	[X86] Add getHostCPUName support for cannonlake. This adds an explicit model number check and fallback path to the unknown family 6 detection. llvm-svn: 318270	2017-11-15 06:02:42 +00:00
Craig Topper	c77d00e327	[X86] Add a def file to CPU vendor, type, and subtype encodings used by Host.cpp Summary: I want to leverage this to clean up some of the code in clang. This will allow us to simplify D39521 which was trying to do some of the same. If we accurately keep the code in Host.cpp synced with new CPUs added to compile-rt/libgcc we should be able to use this file as a proxy for what's implemented in the libraries. The entries for the CPUs recognized by the libraries use separate macros that define additional parameters like the name for __builtin_cpu_is and an alias string for the couple cases where __builtin_cpu_is accepts two different names. All of the macros contain an ARCHNAME that is usually the same as the __builtin_cpu_is string, but sometimes isn't. This represents the name recognized by X86.td and -march. I'm following the precedent set by ARM and AArch64 and adding this information to lib/Support/TargetParser.cpp Reviewers: erichkeane, echristo, asbirlea Reviewed By: echristo Subscribers: llvm-commits, aemerson, kristof.beyls Differential Revision: https://reviews.llvm.org/D39782 llvm-svn: 317900	2017-11-10 17:10:57 +00:00
Craig Topper	8b6600363a	[X86] Promote athlon, athlon-xp, k8, and k8-sse3 to types instead of subtypes in getHostCPUName. NFCI This removes the athlon type and simplifies the string decoding. We only really need these type/subtype breaks where we need to match libgcc/compiler-rt and these CPUs aren't part of that. I'm looking into moving some of this information to a .def file to share with clang's __builtin_cpu_is handling. And while these CPUs aren't part of that the less lines I have to deal with in the .def file the better. llvm-svn: 317354	2017-11-03 19:37:41 +00:00
Craig Topper	741e7e6a71	[X86] Initialize Type and Subtype in getHostCPUName to 0. llvm-svn: 317341	2017-11-03 18:02:44 +00:00
Craig Topper	1494915a3a	[X86] Simplify the pentium4 code in getHostCPUName to be based on feature flags. Don't use 'x86-64' ever. 'x86-64' has started to reflect a sort of generic tuning flag for more modern 64-bit CPUs. We probably shouldn't be using it as the name of an unidentifiable pentium4. So use nocona for all 64-bit pentium4s instead. llvm-svn: 317230	2017-11-02 19:13:34 +00:00
Craig Topper	a233e16cd8	[X86] Change getHostCPUName fallback code to not select 'x86-64' for unknown CPUs in family 6 that has 64-bit support but not any newer SSE features. Use 'core2' instead We know that's the earliest CPU with 64-bit support. x86-64 has taken on a role of representing a more modern 64-bit CPU so we probably shouldn't be using that when we can't identify things. llvm-svn: 317229	2017-11-02 19:13:32 +00:00
Craig Topper	3bbe24c3ca	[X86] Remove the model checks from the 486 detection code in Host.cpp This just provided a bunch of comments to read and not much else. llvm-svn: 317185	2017-11-02 03:32:50 +00:00
Craig Topper	094d7914ae	[X86] Simplify the detection of pentium-mmx in Host.cpp. Rather than looking at model numbers just check for the mmx feature flag. While there promote INTEL_PENTIUM_MMX to a CPU type instead of a subtype so that we don't have weird type with only one subtype. llvm-svn: 317184	2017-11-02 03:32:49 +00:00
Craig Topper	5d692917f4	[X86] Add initial skeleton support for knm cpu This adds Intel's Knights Mill CPU to valid CPU names for the backend. For now its an alias of "knl", but ultimately we need to support AVX5124FMAPS and AVX5124VNNIW instruction sets for it. Differential Revision: https://reviews.llvm.org/D38811 llvm-svn: 315722	2017-10-13 18:10:17 +00:00
Chad Rosier	71070856e6	[AArch64] Add basic support for Qualcomm's Saphira CPU. llvm-svn: 314105	2017-09-25 14:05:00 +00:00
Balaram Makam	a1e7ecc734	[Falkor] Add falkor CPU to host detection This returns "falkor" for Falkor CPU. llvm-svn: 313998	2017-09-22 17:46:36 +00:00
Eli Friedman	bde9fc76dd	[ARM] Add more CPUs to host detection This returns "cortex-a73" for second-generation Kryo; not precisely correct, but close enough. Differential Revision: https://reviews.llvm.org/D37724 llvm-svn: 313200	2017-09-13 21:48:00 +00:00
Yonghong Song	c6d2571031	bpf: close the file descriptor after probe inside getHostCPUNameForBPF Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 311567	2017-08-23 16:24:31 +00:00
Yonghong Song	dc1dbf6ef3	bpf: add variants of -mcpu=# and support for additional jmp insns -mcpu=# will support: . generic: the default insn set . v1: insn set version 1, the same as generic . v2: insn set version 2, version 1 + additional jmp insns . probe: the compiler will probe the underlying kernel to decide proper version of insn set. We did not not use -mcpu=native since llc/llvm will interpret -mcpu=native as the underlying hardware architecture regardless of -march value. Currently, only x86_64 supports -mcpu=probe. Other architecture will silently revert to "generic". Also added -mcpu=help to print available cpu parameters. llvm will print out the information only if there are at least one cpu and at least one feature. Add an unused dummy feature to enable the printout. Examples for usage: $ llc -march=bpf -mcpu=v1 -filetype=asm t.ll $ llc -march=bpf -mcpu=v2 -filetype=asm t.ll $ llc -march=bpf -mcpu=generic -filetype=asm t.ll $ llc -march=bpf -mcpu=probe -filetype=asm t.ll $ llc -march=bpf -mcpu=v3 -filetype=asm t.ll 'v3' is not a recognized processor for this target (ignoring processor) ... $ llc -march=bpf -mcpu=help -filetype=asm t.ll Available CPUs for this target: generic - Select the generic processor. probe - Select the probe processor. v1 - Select the v1 processor. v2 - Select the v2 processor. Available features for this target: dummy - unused feature. Use +feature to enable a feature, or -feature to disable it. For example, llc -mcpu=mycpu -mattr=+feature1,-feature2 ... Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Signed-off-by: Yonghong Song <yhs@fb.com> Acked-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 311522	2017-08-23 04:25:57 +00:00
Craig Topper	4eda7561b3	[X86] Improve the unknown stepping support for Intel CPUs in getHostCPUName This patch improves our guessing of unknown Intel CPUs to support Goldmont and skylake-avx512. Differential Revision: https://reviews.llvm.org/D35161 llvm-svn: 309246	2017-07-27 03:26:52 +00:00
Ulrich Weigand	2b3482fe85	[SystemZ] Add support for IBM z14 processor (1/3) This patch series adds support for the IBM z14 processor. This part includes: - Basic support for the new processor and its features. - Support for new instructions (except vector 32-bit float and 128-bit float). - CodeGen for new instructions, including new LLVM intrinsics. - Scheduler description for the new processor. - Detection of z14 as host processor. Support for the new 32-bit vector float and 128-bit vector float instructions is provided by separate patches. llvm-svn: 308194	2017-07-17 17:41:11 +00:00
Craig Topper	828cf302ec	[X86] Use MSVC's __cpuidex intrinsic instead of inline assembly in getHostCPUName/getHostCPUFeatures for 32-bit builds too. We're already using it in 64-bit builds because 64-bit MSVC doesn't support inline assembly. As far as I know we were using inline assembly because at the time the code was added we had to support MSVC 2008 pre-SP1 while the intrinsic was added to MSVC in SP1. Now that we don't have to support that we should be able to just use the intrinsic. llvm-svn: 308163	2017-07-17 05:16:16 +00:00
Craig Topper	f3de5eb7c6	[X86] Simplify the getHostCPUName for AMD family 6 and 15. As far as I can tell we can simply distinguish based on features rather than model number. Many of the strings we were previously using are treated the same by the backend. llvm-svn: 307884	2017-07-13 06:34:10 +00:00
Craig Topper	3a5d082a05	[X86] Synchronize the ProcessorFeatures enum used by getHostCPUName with the enum in libgcc and soon compiler-rt. This adds all the feature bits libgcc has. They will soon be added to compiler-rt as well. This adds a second 32 bit feature variable to hold the bits that are needed by getHostCPUName that are not in libgcc. libgcc had already used 31 of the 32 bits in the existing variable and we needed 3 bits so at minimum 2 bits would spill over. I chose to move all 3. llvm-svn: 307758	2017-07-12 06:49:58 +00:00
Craig Topper	f3af64e824	[X86] Sync ProcessorTypes and ProcessorSubtypes enums used by getHostCPUName with the version proposed to for compiler-rt's cpu_model.c This keeps the starting entries in the enums in sync with what's in gcc and in review D35214 for compiler-rt. llvm-svn: 307757	2017-07-12 06:49:57 +00:00
Craig Topper	3db11705f5	[X86] Cleanup the switches in getHostCPUName to remove impossible combinations. llvm-svn: 307756	2017-07-12 06:49:56 +00:00
Craig Topper	e98b65b809	[X86] Remove 'barcelona' string from getHostCPUName. Use 'amdfam10' instead. The x86 backend doesn't distinguish. llvm-svn: 307755	2017-07-12 06:49:55 +00:00
Craig Topper	ada983a555	[X86] Fix typo in comment. NFC llvm-svn: 307522	2017-07-10 06:09:22 +00:00
Craig Topper	1efd10ab75	[X86] Remove asserts from getX86CpuIDAndInfo/getX86CpuIDAndInfoEx. Restore past behavior of returning an unsupported indication to the caller instead. These asserts could only occur if we fail to properly detect the compiler, but an assert is not a good way to do that because it doesn't work in release builds. I wonder if we could use #error? llvm-svn: 307520	2017-07-10 06:04:11 +00:00
Craig Topper	52cec3894a	[X86] Remove check for AVX512 support from skylake-avx512 detection in getHostCPUName. Users of getHostCPUName should also use getHostCPUFeatures which will take care of making sure avx512 is disabled if the CPU doesn't support it. This is consistent with what we do for other CPUs. llvm-svn: 307495	2017-07-09 07:26:14 +00:00
Craig Topper	ffe672d200	[X86] In getHostCPUName, remove some code that changes some AMD CPU names based on features not being enabled. The CPU name is really just used for scheduler and other microarchitectural optimizations. The feature flags should be determined by getHostCPUFeatures which should always be used with getHostCPUName. Trying to alter CPU name strings to control features just isn't practical. Most of these types of things were removed from Intel CPUs a while ago. This is part of my plan to bring compiler-rt's cpu_model.c file up to date with the equivalent functionality in libgcc. A lot of the code in that file is copied from Host.cpp and we want to keep them reasonably in sync. llvm-svn: 307467	2017-07-08 06:44:36 +00:00
Craig Topper	1f9d3c0612	[X86] Correct the BDVER4 model numbers to include 0x70-0x7f. According to wikipedia and some other googling suggests these should also be considered as BDVER4. llvm-svn: 307466	2017-07-08 06:44:35 +00:00
Craig Topper	2ace153a82	[X86] Minor formatting fix. NFC llvm-svn: 307465	2017-07-08 06:44:34 +00:00
Craig Topper	c6bbe4becb	[X86] Use 'unsigned' instead of 'unsigned int' for consistency in the X86 portion of Host.cpp. llvm-svn: 307463	2017-07-08 05:16:14 +00:00
Craig Topper	bb8c799e1a	[X86] Cleanup some CPUID usage in getAvailableFeatures. We should make sure leaf 1 is available before accessing it. Same with leaf 0x80000001. llvm-svn: 307462	2017-07-08 05:16:13 +00:00
Alex Lorenz	3803df3dcd	[Support] sys::getProcessTriple should return a macOS triple using the system's version of macOS sys::getProcessTriple returns LLVM_HOST_TRIPLE, whose system version might not be the actual version of the system on which the compiler running. This commit ensures that, for macOS, sys::getProcessTriple returns a triple with the system's macOS version. rdar://33177551 Differential Revision: https://reviews.llvm.org/D34446 llvm-svn: 307372	2017-07-07 09:53:47 +00:00
Michael Zuckerman	4bcb9c3349	[LLVM][X86][Goldmont] Adding new target-cpu: Goldmont [LLVM SIDE] Connecting the GoldMont processor to his feature. Reviewers: 1. igorb 2. zvi 3. delena 4. RKSimon 5. craig.topper Differential Revision: https://reviews.llvm.org/D34504 llvm-svn: 306658	2017-06-29 10:00:33 +00:00
Oren Ben Simhon	f3aab2fa33	[X86] Fixing VPOPCNTDQ feature set lookup. llvm-svn: 304086	2017-05-28 11:26:11 +00:00
Oren Ben Simhon	7bf27f03f2	[X86] Adding vpopcntd and vpopcntq instructions AVX512_VPOPCNTDQ is a new feature set that was published by Intel. The patch represents the LLVM side of the addition of two new intrinsic based instructions (vpopcntd and vpopcntq). Differential Revision: https://reviews.llvm.org/D33169 llvm-svn: 303858	2017-05-25 13:45:23 +00:00
Simon Pilgrim	99b925bdf3	[X86][LWP] Add llvm support for LWP instructions (reapplied). This patch adds support for the the LightWeight Profiling (LWP) instructions which are available on all AMD Bulldozer class CPUs (bdver1 to bdver4). Reapplied - this time without changing line endings of existing files. Differential Revision: https://reviews.llvm.org/D32769 llvm-svn: 302041	2017-05-03 15:51:39 +00:00
Simon Pilgrim	a271c54324	Revert rL302028 due to accidental line ending changes. llvm-svn: 302038	2017-05-03 15:42:29 +00:00
Simon Pilgrim	b2e0464fde	[X86][LWP] Add llvm support for LWP instructions. This patch adds support for the the LightWeight Profiling (LWP) instructions which are available on all AMD Bulldozer class CPUs (bdver1 to bdver4). Differential Revision: https://reviews.llvm.org/D32769 llvm-svn: 302028	2017-05-03 15:18:34 +00:00
Teresa Johnson	9709e2b76d	[Support] Fix ErrorOr assertion when /proc/cpuinfo doesn't exist. The ErrorOr should not be dereferenced on the error path. Patch by Jacob Young Reviewers: tejohnson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32032 llvm-svn: 300267	2017-04-13 21:51:49 +00:00
Yi Kong	0b5eed24c5	Remove xgene1 from host detection This is not a supported mcpu tuning option. We should treat it as "generic" variant. Also, add record for cortex-a35. llvm-svn: 300003	2017-04-11 22:39:55 +00:00
Yi Kong	57019dc9b2	Implement host CPU detection for AArch64 This shares detection logic with ARM(32), since AArch64 capable CPUs may also run in 32-bit system mode. We observe weird /proc/cpuinfo output for MSM8992 and MSM8994, where they report all CPU cores as one single model, depending on which CPU core the kernel is running on. As a workaround, we hardcode the known CPU part name for these SoCs. For big.LITTLE systems, this patch would only return the part name of the first core (usually the little core). Proper support will be added in a follow-up change. Differential Revision: D31675 llvm-svn: 299458	2017-04-04 19:06:04 +00:00
Kristof Beyls	77ce4f6e37	Make naming in Host.h in line with coding standards. Based on post-commit review comments by Chandler Carruth on https://reviews.llvm.org/D31236. Thanks! llvm-svn: 299211	2017-03-31 13:06:40 +00:00
Kristof Beyls	7a76b315d6	Revert "Make naming in Host.h in line with coding standards." This reverts r299062, which caused build failures on Windows. It also reverts the attempts to fix the windows builds in r299064 and r299065. The introduction of namespace llvm::sys::detail makes MSVC, and seemingly also mingw, complain about ambiguity with the existing namespace llvm::detail. E.g.: C:\b\slave\sanitizer-windows\llvm\include\llvm/Support/MathExtras.h(184): error C2872: 'detail': ambiguous symbol C:\b\slave\sanitizer-windows\llvm\include\llvm/Support/PointerLikeTypeTraits.h(31): note: could be 'llvm::detail' C:\b\slave\sanitizer-windows\llvm\include\llvm/Support/Host.h(80): note: or 'llvm::sys::detail' In r299064 and r299065 I tried to fix these ambiguities, based on the errors reported in the log files. It seems however that the build stops early when this kind of error is encountered, and many build-then-fix-iterations on Windows may be needed to fix this. Therefore reverting r299062 for now to get the build working again on Windows. llvm-svn: 299066	2017-03-30 11:06:25 +00:00
Kristof Beyls	ca878c943b	Make naming in Host.h in line with coding standards. Based on post-commit review comments by Chandler Carruth on https://reviews.llvm.org/D31236. Thanks! llvm-svn: 299062	2017-03-30 09:31:59 +00:00
Kristof Beyls	9e46396ecc	Refactor getHostCPUName to allow testing on non-native hardware. This refactors getHostCPUName so that for the architectures that get the host cpu info on linux from /proc/cpuinfo, the /proc/cpuinfo parsing logic is present in the build, even if it wasn't built on a linux system for that architecture. Since the code is present in the build, we can then test that code also on other systems, i.e. we don't need to have buildbots setup for all architectures on linux to be able to test this. Instead, developers will test this as part of the regression test run. As an example, a few unit tests are added to test getHostCPUName for ARM running linux. A unit test is preferred over a lit-based test, since the expectation is that in the future, the functionality here will grow over what can be tested with "llc -mcpu=native". This is a preparation step to enable implementing the range of improvements discussed on PR30516, such as adding AArch64 support, support for big.LITTLE systems, reducing code duplication. Differential Revision: https://reviews.llvm.org/D31236 llvm-svn: 299060	2017-03-30 07:24:49 +00:00
Craig Topper	3cac763532	[X86] Remove the HLE feature flag. We only implemented it for one of the 3 HLE instructions and that instruction is also under the RTM flag. Clang only implements the RTM flag from its command line. llvm-svn: 294562	2017-02-09 06:51:02 +00:00
Craig Topper	86576bd921	[X86] Remove INVPCID and SMAP feature flags. They aren't currently used by any instructions and not tested. If we implement intrinsics for their instructions in the future, the feature flags can be added back with proper testing. llvm-svn: 294561	2017-02-09 06:50:59 +00:00
Craig Topper	50f3d1452c	[X86] Clzero intrinsic and its addition under znver1 This patch does the following. 1. Adds an Intrinsic int_x86_clzero which works with __builtin_ia32_clzero 2. Identifies clzero feature using cpuid info. (Function:8000_0008, Checks if EBX[0]=1) 3. Adds the clzero feature under znver1 architecture. 4. The custom inserter is added in Lowering. 5. A testcase is added to check the intrinsic. 6. The clzero instruction is added to assembler test. Patch by Ganesh Gopalasubramanian with a couple formatting tweaks, a disassembler test, and using update_llc_test.py from me. Differential revision: https://reviews.llvm.org/D29385 llvm-svn: 294558	2017-02-09 04:27:34 +00:00
Craig Topper	e0ac7f3beb	[X86] Remove PCOMMIT instruction support since Intel has deprecated this instruction with no plans to release products with it. Intel's documentation for the deprecation https://software.intel.com/en-us/blogs/2016/09/12/deprecate-pcommit-instruction llvm-svn: 294405	2017-02-08 05:45:39 +00:00
Craig Topper	d55b83128b	AMD family 17h (znver1) enablement Summary: This patch enables the following 1. AMD family 17h architecture using "znver1" tune flag (-march, -mcpu). 2. ISAs that are enabled for "znver1" architecture. 3. Checks ADX isa from cpuid to identify "znver1" flag when -march=native is used. 4. ISAs FMA4, XOP are disabled as they are dropped from amdfam17. 5. For the time being, it uses the btver2 scheduler model. 6. Test file is updated to check this flag. This item is linked to clang review item https://reviews.llvm.org/D28018 Patch by Ganesh Gopalasubramanian Reviewers: RKSimon, craig.topper Subscribers: vprasad, RKSimon, ashutosh.nema, llvm-commits Differential Revision: https://reviews.llvm.org/D28017 llvm-svn: 291543	2017-01-10 06:01:16 +00:00
Craig Topper	33c544bdb0	[X86] Add Intel Kaby Lake model numbers to getHostCPUName aliased to "skylake" since there are no feature differences. Model numbers found here http://www.sandpile.org/x86/cpuid.htm llvm-svn: 291086	2017-01-05 05:57:27 +00:00
Craig Topper	1ab35fa7a8	[X86] Change getHostCPUName to report Intel model 0x4e as "skylake" instead of "skylake-avx512". Add the proper 0x55 model for "skylake-avx512". Summary: Intel's i5-6300U CPU is reporting to have a model id of 78 (4e). The Host detection assumes that to be Skylake Xeon (with AVX512 support), instead of a normal Skylake machine. Patch by: Valentin Churavy Reviewers: nalimilan, craig.topper Subscribers: hfinkel, tkelman, craig.topper, nalimilan, llvm-commits Differential Revision: https://reviews.llvm.org/D28221 llvm-svn: 291084	2017-01-05 05:47:29 +00:00
Nemanja Ivanovic	c08b90d08f	[PowerPC] Add identification for POWER8NVL This CPU type was not previously recognized by LLVM which led to emitting poor (and sometimes incorrect) code in some JIT workloads on such a machine. llvm-svn: 290961	2017-01-04 13:58:09 +00:00
Mehdi Amini	db46b7d217	Add computeHostNumPhysicalCores() implementation for Darwin Differential Revision: https://reviews.llvm.org/D25800 llvm-svn: 284656	2016-10-19 22:36:07 +00:00
Benjamin Kramer	4c2582ad78	Reduce global namespace pollution. NFC. llvm-svn: 284521	2016-10-18 19:39:31 +00:00
Teresa Johnson	7943fecee8	Add interface to compute number of physical cores on host system Summary: For now I have only added support for x86_64 Linux, but other systems can be added incrementally. This is to be used for setting the default parallelism for ThinLTO backends (instead of thread::hardware_concurrency which includes hyperthreading and is too aggressive). I'll send this as a follow-on patch, and it will fall back to hardware_concurrency when the new getHostNumPhysicalCores returns -1 (when not supported for a given host system). I also added an interface to MemoryBuffer to force reading a file as a stream - this is required for /proc/cpuinfo which is a special file that looks like a normal file but appears to have 0 size. The existing readers of this file in Host.cpp are reading the first 1024 or so bytes from it, because the necessary info is near the top. But for the new functionality we need to be able to read the entire file. I can go back and change the other readers to use the new getFileAsStream as a follow-on patch since it seems much more robust. Added a unittest. Reviewers: mehdi_amini Subscribers: beanz, mgorny, llvm-commits, modocache Differential Revision: https://reviews.llvm.org/D25564 llvm-svn: 284138	2016-10-13 17:43:20 +00:00
Alina Sbirlea	9a78ebd6d8	[cpu-detection] Copy simplified version of get_cpuid_max to remove dependency to clang's implementation Summary: Attempting to fix PR30384. Take the same approach as in compiler_rt and add a simplified version of __get_cpuid_max. Including cpuid.h is no longer needed. Reviewers: echristo, joerg Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D24597 llvm-svn: 283265	2016-10-04 22:39:53 +00:00
Alina Sbirlea	b86aa17b06	Properly ifdef the use of cpuid. llvm-svn: 276156	2016-07-20 18:54:26 +00:00
Alina Sbirlea	33588b14a7	[cpu-detection] Cleanup of Host.cpp. Summary: Mirroring most cleanup changed from compiler-rt/lib/builtins/cpu_model. x86 methods are still returning a bool. Reviewers: llvm-commits, echristo, craig.topper, sanjoy Subscribers: mehdi_amini Differential Revision: https://reviews.llvm.org/D22480 llvm-svn: 276149	2016-07-20 18:15:29 +00:00
Benjamin Kramer	b308f8b812	[Support] Make helper function static. NFC. llvm-svn: 275017	2016-07-10 16:11:53 +00:00
Alina Sbirlea	d665b410c6	Reapply 272328 and 272329 as a single patch. [cpu-detection] [amdfam10] Return barcelona, and amdfam10 for all other subtypes. Address Bug 28067. Along with the refactoring of Host.cpp, getHostCPUName() was modified to return more precise types for CPUs in amdfam10. However, callers of getHostCPUName() do string matching on type, so this cannot be modified. Currently there is support in the x86 backend for barcelona. For all other subtypes the assumed return value is amdfam10. Fix: getHostCPUName() returns barcelona subtype and amdfam10 for all others. This can be extended further when support for the other subtypes is added. Differential revision: http://reviews.llvm.org/D21193 llvm-svn: 272333	2016-06-09 23:04:15 +00:00
Alina Sbirlea	e0b7bf42c2	Revert 272328 and 272329 to recommit as a single patch. llvm-svn: 272332	2016-06-09 23:04:05 +00:00
Alina Sbirlea	4acea0aa22	Keep barcelona subtype for amdfam10 llvm-svn: 272329	2016-06-09 22:47:36 +00:00
Alina Sbirlea	e76dcd2b12	[cpu-detection] Return amdfam10 for all subtypes. Address Bug 28067. Summary: Remove architecture subtype from the string returned by getHostCPUName(). String matching done on type. Reviewers: llvm-commits, echristo Subscribers: mehdi_amini Differential Revision: http://reviews.llvm.org/D21193 llvm-svn: 272328	2016-06-09 22:47:12 +00:00
Alina Sbirlea	080241b75d	[cpu-detection] Add missing break statements in outer switches Summary: Break on all switch cases for outer and inner switches. No functionality changed. Reviewers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D21158 llvm-svn: 272228	2016-06-09 00:08:15 +00:00
Alina Sbirlea	76c4a855bc	[cpu-detection] Substantial refactor of Host CPU detection code (x86) Summary: Following D20970 (committed as r271726). This is a substantial refactoring of the host CPU detection code. There is no functionality change intended, but the changes are extensive. Definitions of architecture types and subtypes are by no means exhaustive or perfectly defined, but a fair starting point. Suggestions for futher improvements are welcome. Reviewers: llvm-commits Differential Revision: http://reviews.llvm.org/D20988 llvm-svn: 271921	2016-06-06 18:29:59 +00:00
Alina Sbirlea	400eb02775	[cpu-detection] Naming convention Summary: Follow-up to D20926 (committed as r271595, r271596). This patch is in preparation for a substantial refactoring of the code. No functionality changed. Differential Revision: http://reviews.llvm.org/D20970 llvm-svn: 271726	2016-06-03 20:27:50 +00:00
Eric Christopher	b820edd58a	80-column fixup after last formatting change. llvm-svn: 271598	2016-06-02 21:32:30 +00:00
Eric Christopher	d9804bb696	Fix a couple of misformatted comments spotted in post-commit review. llvm-svn: 271596	2016-06-02 21:09:17 +00:00
Eric Christopher	5db9d661f0	This patch is in preparation for a substantial refactoring of the code. To make the diffs easier to read, clang-format everything first. No functionality changed. Patch by Alina Sbirlea! http://reviews.llvm.org/D20926 llvm-svn: 271595	2016-06-02 21:03:19 +00:00
Aaron Ballman	54269226ba	Removing an unused variable introduced in r269911; NFC. llvm-svn: 269915	2016-05-18 12:52:04 +00:00
Ashutosh Nema	348af9cc6b	Add new flag and intrinsic support for MWAITX and MONITORX instructions Summary: MONITORX/MWAITX instructions provide similar capability to the MONITOR/MWAIT pair while adding a timer function, such that another termination of the MWAITX instruction occurs when the timer expires. The presence of the MONITORX and MWAITX instructions is indicated by CPUID 8000_0001, ECX, bit 29. The MONITORX and MWAITX instructions are intercepted by the same bits that intercept MONITOR and MWAIT. MONITORX instruction establishes a range to be monitored. MWAITX instruction causes the processor to stop instruction execution and enter an implementation-dependent optimized state until occurrence of a class of events. Opcode of MONITORX instruction is "0F 01 FA". Opcode of MWAITX instruction is "0F 01 FB". These opcode information is used in adding tests for the disassembler. These instructions are enabled for AMD's bdver4 architecture. Patch by Ganesh Gopalasubramanian! Reviewers: echristo, craig.topper, RKSimon Subscribers: RKSimon, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D19795 llvm-svn: 269911	2016-05-18 11:59:12 +00:00
Nemanja Ivanovic	6e29baf7f5	[Power9] Add support for -mcpu=pwr9 in the back end This patch corresponds to review: http://reviews.llvm.org/D19683 Simply adds the bits for being able to specify -mcpu=pwr9 to the back end. llvm-svn: 268950	2016-05-09 18:54:58 +00:00
Craig Topper	de4318b928	[Support][X86] Add a few more Intel model numbers to getHostCPUName for airmont and knl. llvm-svn: 267670	2016-04-27 05:17:00 +00:00
Craig Topper	e7d743ccf8	[Support][X86] Change the case values in the Intel family 6 code to hex so its easier to compare with Intel's docs. NFC llvm-svn: 267669	2016-04-27 05:16:58 +00:00
Craig Topper	0e2f14fa83	[Support][X86] Add a couple more Broadwell CPU models numbers to getHostCPUName. llvm-svn: 267666	2016-04-27 04:40:03 +00:00
Duncan P. N. Exon Smith	91d3cfed78	Revert "Fix Clang-tidy modernize-deprecated-headers warnings in remaining files; other minor fixes." This reverts commit r265454 since it broke the build. E.g.: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_build/22413/ llvm-svn: 265459	2016-04-05 20:45:04 +00:00
Eugene Zelenko	1760dc2a23	Fix Clang-tidy modernize-deprecated-headers warnings in remaining files; other minor fixes. Some Include What You Use suggestions were used too. Use anonymous namespaces in source files. Differential revision: http://reviews.llvm.org/D18778 llvm-svn: 265454	2016-04-05 20:19:49 +00:00
Sanjoy Das	aa63dc0e9a	Fix LLVM's handling and detection of skylake and cannonlake CPUs Summary: - Rename `"skylake"` == SkylakeServerProc to `"skylake-avx512"` - Change `"skylake"` to denote SkylakeClientProc - Fix the detection of cpu family 6 and model 94 to be SkylakeClientProc instead of SkylakeServerProc - Remove the `"cnl"` for CannonLake Reviewers: craig.topper, delena Subscribers: zansari, echristo, qcolombet, RKSimon, spatel, DavidKreitzer, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D17090 llvm-svn: 261482	2016-02-21 17:12:03 +00:00
Elena Demikhovsky	29cde35b43	Added Skylake client to X86 targets and features Changes in X86.td: I set features of Intel processors in incremental form: IVB = SNB + X HSW = IVB + X .. I added Skylake client processor and defined it's features FeatureADX was missing on KNL Added some new features to appropriate processors SMAP, IFMA, PREFETCHWT1, VMFUNC and others Differential Revision: http://reviews.llvm.org/D16357 llvm-svn: 258659	2016-01-24 10:41:28 +00:00
Michael Zuckerman	9c47e0681c	[AVX512] adding AVXVBMI feature flag Fixing wrong typo (avx515) → (avx512) Review over the shoulder by asaf . Differential Revision: http://reviews.llvm.org/D16190 llvm-svn: 258041	2016-01-18 11:12:47 +00:00
Michael Zuckerman	97b6a6923e	[AVX512] adding AVXVBMI feature flag The feature flag is for VPERMB,VPERMI2B,VPERMT2B and VPMULTISHIFTQB instructions. More about the instruction can be found in: hattps://software.intel.com/sites/default/files/managed/07/b7/319433-023.pdf Differential Revision: http://reviews.llvm.org/D16190 llvm-svn: 258012	2016-01-17 13:42:12 +00:00
Asaf Badouh	5acf66ff97	[x86] adding PKU feature flag the feature flag is essential for RDPKRU and WRPKRU instruction more about the instruction can be found in the SDM rev 56, vol 2 from http://www.intel.com/sdm Differential Revision: http://reviews.llvm.org/D15491 llvm-svn: 255644	2015-12-15 13:35:29 +00:00
Craig Topper	b84b12699f	[X86] Update CPU detection to only enable XSAVE features if the OS has enabled them and the saving of YMM state. This seems to be consistent with gcc behavior. llvm-svn: 250269	2015-10-14 05:37:42 +00:00

1 2 3 4 5

231 Commits