(original) (raw)
The target algorithm is lyra2 and the target kernel is "search2" in https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/phi2.cl.
The detail is implemented in https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/lyra2mdz.cl
If you have time to take a look at the assembly, I can upload them later today.
Thanks,
Changdao
On Wed, Sep 5, 2018 at 11:32 AM Matt Arsenault <arsenm2@gmail.com> wrote:
This is very dependent on the case you are looking at, so without a specific example or ISA comparison between the compilers there’s no guessingOn Sep 5, 2018, at 23:17, Changdao Dong via llvm-dev <llvm-dev@lists.llvm.org> wrote:
Finally I kind of modified llvm to generate assembly that can run on AMDGPU pro drivers. One problem is the performance of the code generated by llvm is about 10% slower than amdgpu's online compiler. Anything I can tune the performance up the performance of llvm?\\-Matt
--