[llvm-dev] The AnghaBench collection of compilable programs (original) (raw)

Fernando Magno Quintao Pereira via llvm-dev llvm-dev at lists.llvm.org
Sat Feb 22 06:55:03 PST 2020


Dear LLVMers,

we, at UFMG, have been building a large collection of compilable

benchmarks. Today, we have one million C files, mined from open-source repositories, that compile into LLVM bytecodes (and from there to object files). To ensure compilation, we perform type inference on the C programs. Type inference lets us replace missing dependencies.

The benchmarks are available at: http://cuda.dcc.ufmg.br/angha/

We have a technical report describing the construction of this collection: http://lac.dcc.ufmg.br/pubs/TechReports/LaC_TechReport012020.pdf

Many things can be done with so many LLVM bytecodes. A few examples follow below:

We do not currently provide inputs for those programs. It's possible to execute the so called "leaf-functions", e.g., functions that do not call other routines. We have thousands of them. However, we do not guarantee the absence of undefined behavior during the execution.

Regards,

Fernando



More information about the llvm-dev mailing list