[LLVMdev] PTX optimizations (original) (raw)
Justin Holewinski justin.holewinski at gmail.com
Sat Sep 3 04:51:33 PDT 2011
- Previous message: [LLVMdev] PTX optimizations
- Next message: [LLVMdev] The right option combination to compile into ARM/Thumb2 code.
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
On Sep 3, 2011 1:12 AM, "Adarsh HV" <adarsh.hv99 at gmail.com> wrote:
Hi everyone, I am trying to add some optimizations to LLVM's PTX backend. But i am unaware of the existing optimizations. Can you please guide me about the same?
So far, we have been focusing on code correctness and coverage, not PTX specific optimization. Unfortunately, I have not had the time I had hoped to work on this over the summer. We do collapse multiply-add pairs into FMA, but that's about the extent of our optimizations. I want to start looking into converting branches into predicated code and load/store scheduling, but I want to finish up the function call implementation first.
What kinds of optimizations are you wanting to implement? We should coordinate on this so as not to duplicate work.
Thank You:)
LLVM Developers mailing list LLVMdev at cs.uiuc.edu http://llvm.cs.uiuc.edu http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20110903/1b270ff7/attachment.html>
- Previous message: [LLVMdev] PTX optimizations
- Next message: [LLVMdev] The right option combination to compile into ARM/Thumb2 code.
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]