[llvm-dev] Can I control HSA config generated by AMDGPU backend? (original) (raw)
Changdao Dong via llvm-dev llvm-dev at lists.llvm.org
Wed Sep 5 23:58:21 PDT 2018
- Previous message: [llvm-dev] Can I control HSA config generated by AMDGPU backend?
- Next message: [llvm-dev] Can I control HSA config generated by AMDGPU backend?
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Matt,
Attached are the two assembly code for kernel "search2", one is from amdgpu pro online compiler, one is from llvm-roc-1.6.x. It will be great if you can take a look and see how to improve the llvm one. In case you missed: The target algorithm is lyra2 and the target kernel is "search2" in https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/phi2.cl. The detail is implemented in https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/lyra2mdz.cl
Thanks, Changdao
On Wed, Sep 5, 2018 at 12:26 PM Changdao Dong <dongchangdao at gmail.com> wrote:
The target algorithm is lyra2 and the target kernel is "search2" in https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/phi2.cl. The detail is implemented in https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/lyra2mdz.cl If you have time to take a look at the assembly, I can upload them later today.
Thanks, Changdao On Wed, Sep 5, 2018 at 11:32 AM Matt Arsenault <arsenm2 at gmail.com> wrote:
On Sep 5, 2018, at 23:17, Changdao Dong via llvm-dev <_ _llvm-dev at lists.llvm.org> wrote: Finally I kind of modified llvm to generate assembly that can run on AMDGPU pro drivers. One problem is the performance of the code generated by llvm is about 10% slower than amdgpu's online compiler. Anything I can tune _the performance up the performance of llvm?_ This is very dependent on the case you are looking at, so without a specific example or ISA comparison between the compilers there’s no guessing -Matt -- DONG, Changdao MP: 1-412-551-2330 dongchangdao at gmail.com <cddong at cmu.edu>
-- DONG, Changdao
MP: 1-412-551-2330 dongchangdao at gmail.com <cddong at cmu.edu> -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180905/b3fa5e77/attachment-0001.html> -------------- next part -------------- A non-text attachment was scrubbed... Name: search2_llvm.isa Type: application/octet-stream Size: 623649 bytes Desc: not available URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180905/b3fa5e77/attachment-0002.obj> -------------- next part -------------- A non-text attachment was scrubbed... Name: search2_amdgpupro.isa Type: application/octet-stream Size: 1101054 bytes Desc: not available URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20180905/b3fa5e77/attachment-0003.obj>
- Previous message: [llvm-dev] Can I control HSA config generated by AMDGPU backend?
- Next message: [llvm-dev] Can I control HSA config generated by AMDGPU backend?
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]