A Recipe for Loop Optimization
1. Identify the best approach to implementing the loop
- Profile the loop to identify bottlenecks.
For example, are bottlenecks caused by
- a particular execution unit?
- accesses to memory?
- Restructure the algorithm to alleviate these
bottlenecks (“algorithmic transformation”)
2. Implement this approach efficiently
using scheduling techniques
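
As a minimal sketch of the two steps, consider summing the elements of a square matrix stored in row-major order. The function names, the flat-array layout, and the unroll factor of 4 are assumptions made for illustration only; they do not come from the recipe itself. The first function stands for a loop whose bottleneck is assumed to be memory access, the second applies an algorithmic transformation (loop interchange), and the third applies one possible scheduling technique (unrolling with independent accumulators). Whether either change helps on a given machine is something only profiling can confirm.

```c
#include <assert.h>
#include <stddef.h>

/* Baseline (hypothetical): the inner loop walks down a column, so
   consecutive accesses are n doubles apart in a row-major array. */
double sum_column_order(const double *a, size_t n)
{
    assert(a != NULL);
    double sum = 0.0;
    for (size_t j = 0; j < n; j++)
        for (size_t i = 0; i < n; i++)
            sum += a[i * n + j];
    return sum;
}

/* Step 1, algorithmic transformation: interchange the loops so the
   inner loop walks along a row with unit stride. */
double sum_row_order(const double *a, size_t n)
{
    assert(a != NULL);
    double sum = 0.0;
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++)
            sum += a[i * n + j];
    return sum;
}

/* Step 2, scheduling: unroll the inner loop by 4 (an assumed factor)
   and use four independent accumulators, so the additions do not form
   a single serial dependency chain. */
double sum_unrolled(const double *a, size_t n)
{
    assert(a != NULL);
    double s0 = 0.0, s1 = 0.0, s2 = 0.0, s3 = 0.0;
    for (size_t i = 0; i < n; i++) {
        const double *row = a + i * n;
        size_t j = 0;
        for (; j + 3 < n; j += 4) {
            s0 += row[j];
            s1 += row[j + 1];
            s2 += row[j + 2];
            s3 += row[j + 3];
        }
        /* Remainder loop for row lengths that are not a multiple of 4. */
        for (; j < n; j++)
            s0 += row[j];
    }
    return (s0 + s1) + (s2 + s3);
}
```

Note that the unrolled version adds the elements in a different order, so for general floating-point data its result may differ from the other two in the last bits. All three functions take a pointer to n*n doubles and the dimension n.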