(original) (raw)

The target algorithm is lyra2 and the target kernel is "search2" in https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/phi2.cl.
The detail is implemented in https://github.com/fancyIX/sgminer-phi2-branch/blob/master/kernel/lyra2mdz.cl
If you have time to take a look at the assembly, I can upload them later today.

Thanks,
Changdao

On Wed, Sep 5, 2018 at 11:32 AM Matt Arsenault <arsenm2@gmail.com> wrote:


On Sep 5, 2018, at 23:17, Changdao Dong via llvm-dev <llvm-dev@lists.llvm.org> wrote:


Finally I kind of modified llvm to generate assembly that can run on AMDGPU pro drivers. One problem is the performance of the code generated by llvm is about 10% slower than amdgpu's online compiler. Anything I can tune the performance up the performance of llvm?\\

This is very dependent on the case you are looking at, so without a specific example or ISA comparison between the compilers there’s no guessing

-Matt


--
DONG, Changdao

MP: 1-412-551-2330
dongchangdao@gmail.com