What would be the best strategy for getting a good time estimate for Hyper AutoOptimize? Should we perform a single step, benchmark the time it takes, and use that as the measure of which configuration is best?
Also, would a greedy approach do? For example, say we are deciding whether to use autodiff (AD) and whether to use the GPU. We can find this out in two ways:
- Greedy: check whether it is faster with AD or without, with the GPU turned off. Then, if it is faster with AD, we try the GPU with AD. For n parameters, we try at most 2n configurations.
- Exhaustive: we check every configuration, i.e. 2^n of them.
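
To make the trade-off between the two options concrete, here is a rough Python sketch (not the package's actual code). Everything in it is an assumption for illustration: the flag names `autodiff` and `gpu`, the helper `time_one_step`, and especially the numbers in `toy_cost`, which are made up so that the GPU only pays off when AD is also on.

```python
import itertools
import time
from typing import Callable, Dict, List, Tuple

Config = Dict[str, bool]


def time_one_step(step: Callable[[Config], None], config: Config, repeats: int = 3) -> float:
    """Best wall-clock time of step(config) over `repeats` timed runs."""
    step(config)  # untimed warm-up run, so one-off setup cost is not counted
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        step(config)
        best = min(best, time.perf_counter() - start)
    return best


def greedy_search(flags: List[str], cost: Callable[[Config], float]) -> Tuple[Config, int]:
    """Switch on one flag at a time and keep it only if the cost drops."""
    config = {name: False for name in flags}
    best = cost(config)
    evaluations = 1
    for name in flags:
        trial = dict(config)
        trial[name] = True
        trial_cost = cost(trial)
        evaluations += 1
        if trial_cost < best:
            config, best = trial, trial_cost
    return config, evaluations


def exhaustive_search(flags: List[str], cost: Callable[[Config], float]) -> Tuple[Config, int]:
    """Evaluate all 2^n on/off combinations and return the cheapest one."""
    best_config: Config = {}
    best = float("inf")
    evaluations = 0
    for values in itertools.product([False, True], repeat=len(flags)):
        trial = dict(zip(flags, values))
        trial_cost = cost(trial)
        evaluations += 1
        if trial_cost < best:
            best_config, best = trial, trial_cost
    return best_config, evaluations


def toy_cost(config: Config) -> float:
    """Made-up timings (seconds) in which the two flags interact."""
    table = {
        (False, False): 1.0,
        (True, False): 1.2,   # AD alone is slower
        (False, True): 1.1,   # GPU alone is slower
        (True, True): 0.5,    # together they are fastest
    }
    return table[(config["autodiff"], config["gpu"])]


flags = ["autodiff", "gpu"]
greedy_config, greedy_evals = greedy_search(flags, toy_cost)
full_config, full_evals = exhaustive_search(flags, toy_cost)
```

With these invented timings the greedy pass stops at the baseline because neither flag helps on its own, while the exhaustive pass finds the combination where both are on. So greedy is cheaper but can miss settings that only pay off together. In a real run, `cost` could be something like `lambda c: time_one_step(step, c)`, where `step` is one iteration of the actual problem.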