[llvm-dev] Can I control HSA config generated by AMDGPU backend? (original) (raw)

Changdao Dong via llvm-dev llvm-dev at lists.llvm.org
Wed Sep 5 12:26:11 PDT 2018


The target algorithm is lyra2 and the target kernel is "search2" in https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/phi2.cl. The detail is implemented in https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/lyra2mdz.cl If you have time to take a look at the assembly, I can upload them later today.

Thanks, Changdao

On Wed, Sep 5, 2018 at 11:32 AM Matt Arsenault <arsenm2 at gmail.com> wrote:

On Sep 5, 2018, at 23:17, Changdao Dong via llvm-dev <_ _llvm-dev at lists.llvm.org> wrote: Finally I kind of modified llvm to generate assembly that can run on AMDGPU pro drivers. One problem is the performance of the code generated by llvm is about 10% slower than amdgpu's online compiler. Anything I can tune _the performance up the performance of llvm?_ This is very dependent on the case you are looking at, so without a specific example or ISA comparison between the compilers there’s no guessing -Matt

-- DONG, Changdao

MP: 1-412-551-2330 dongchangdao at gmail.com <cddong at cmu.edu> -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180905/ed1da73f/attachment.html>



More information about the llvm-dev mailing list