[aarch64-port-dev ] RFR (S) 8131682: C1 should use multibyte nops everywhere (original) (raw)

Aleksey Shipilev aleksey.shipilev at oracle.com
Mon Jul 27 10:53:20 UTC 2015


On 07/27/2015 01:21 PM, Andrew Haley wrote:

On 27/07/15 10:13, Aleksey Shipilev wrote:

Thanks Goetz! Fixed the assemblerppc.inline.hpp.

Andrew/Edward, are you OK with AArch64 part? http://cr.openjdk.java.net/~shade/8131682/webrev.02/ I agree that it looks good. Please have a look to see how many NOPs take the same time as a branch.

Thanks!

I don't quite believe we should spend time trying branches for nops, at least for x86. The change we are discussing follows the Intel Optimization Reference Manual 3.5.1.10 "Using NOPs", which Assembler::align for x86 seems to implement with some bells and whistles. Agner agrees on using multi-byte nops (0F 1F ...) on modern x86 chips as well; up to the point he claims 4 insn/clock throughput for them.

Is there a vendor-recommended strategy for using something else? Even if it's so, this calls for experimenting with Assembler::align itself (that also touches C2 usages), and not the C1-specific usages this trivial change addresses.

Thanks again, -Aleksey

-------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 819 bytes Desc: OpenPGP digital signature URL: <http://mail.openjdk.java.net/pipermail/hotspot-compiler-dev/attachments/20150727/7eedfd39/signature.asc>



More information about the hotspot-compiler-dev mailing list