Locality-Conscious Nested-Loops Parallelization (original) (raw)
Related papers
Scheduling and partitioning for multiple loop nests
Proceedings of the 14th international symposium on Systems synthesis - ISSS '01, 2001
Encyclopedia of Parallel Computing, 2011
Readers Are Parallel Processors
Trends in Cognitive Sciences, 2019
Optimization of Nest-Loop Software Pipelining
Performance Technology for Complex Parallel and Distributed Systems
Scalable Computing: Practice and Experience, 2001
Optimizing nested loops with iterational and instructional retiming
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2005
Proceedings of the 12th European Conference on Software Architecture: Companion Proceedings, 2018
Time-constrained loop scheduling with minimal resources
Using Retiming to Minimize Inter-Iteration Dependencies
2001
Parallel Experimentation: A Basic Scheme for Dynamic Efficiency
Social Science Research Network, 2004
Performance Evaluation of an Irregular Application Parallelized in Java
2010 39th International Conference on Parallel Processing Workshops, 2010
Timing optimization of nested loops considering code size for DSP applications
International Conference on Parallel Processing, 2004. ICPP 2004., 2004
Combined partitioning and data padding for scheduling multiple loop nests
Proceedings of the international conference on Compilers, architecture, and synthesis for embedded systems - CASES '01, 2001
An ordered heuristic for the allocation of resources in unrelated parallel-machines
International Journal of Industrial Engineering Computations, 2015
Optimizing Data Distribution for Loops on Embedded Multicore with Scratch-Pad Memory
Journal of Computers, 2014
Optimizing Timing and Code Size Using Maximum Direct Loop Fusion
Scaling alltoall collective on multi-core systems
2008 IEEE International Symposium on Parallel and Distributed Processing, 2008
Optimizing overall loop schedules using prefetching and partitioning
IEEE Transactions on Parallel and Distributed Systems, 2000
Implementing parallelism and scheduling data flow graphs on Java virtual machine
2002
Loop Distribution and Fusion with Timing and Code Size Optimization
Journal of Signal Processing Systems, 2010
Minimizing Inter-Iteration Dependencies in Multi-Dimensional Loops
cs.uakron.edu
SUPPLE: An efficient run-time support for non-uniform parallel loops
Journal of Systems Architecture, 1999
Optimizing synchronous systems for multi-dimensional applications
Proceedings the European Design and Test Conference. ED&TC 1995
General loop fusion technique for nested loops considering timing and code size
2004