processing in batches is not very efficient when objective function execution time is not constant

Question

jschall opened this issue 7 years ago · 3 comments

My objective function execution time is quite variable. This means nodes in the cluster (once there is a cluster) will be idle a lot of the time.

Is there a way to improve this?
It is obviously trivial to fix it for the first pass, but not for the second pass...

Answer 1 · 2018-01-07T20:33:44.000Z

This is an interesting question. Unfortunately, at the moment the code relies heavily on the concept of a parallel map, which means all evaluations within a batch start simultaneously, and the next batch cannot start until the longest evaluation in the current batch is done. What you're asking for is actually not trivial, I believe. But maybe using the code as is will still work out well enough for you.
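To make the cost of that lockstep behavior concrete, here is a minimal sketch (not code from the library) that estimates how busy the workers are when evaluations run in batches and each batch waits for its slowest member. The function name `batch_utilization` and the sample durations are made up for illustration.

```python
def batch_utilization(durations, batch):
    """Fraction of available worker time spent computing when evaluations
    run in lockstep batches of size `batch` (hypothetical helper)."""
    busy = sum(durations)  # total time workers actually spend computing
    wall = 0.0
    for i in range(0, len(durations), batch):
        # Each batch lasts as long as its slowest evaluation.
        wall += max(durations[i:i + batch])
    # `batch` workers are reserved for the whole wall-clock time.
    return busy / (wall * batch)


# Illustrative durations in seconds: one slow evaluation in the first batch.
variable = [1, 1, 1, 5, 2, 2, 2, 2]
constant = [2, 2, 2, 2, 2, 2, 2, 2]

print(batch_utilization(variable, batch=4))
print(batch_utilization(constant, batch=4))
```

With constant durations every worker is busy the whole time, while a single slow evaluation leaves the other workers in its batch idle until it finishes.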

Answer 2 · 2018-01-09T16:59:35.000Z

Do you have a recommendation for initial samples, subsequent samples, and batch size?

Answer 3 · 2018-01-09T18:40:47.000Z

The total number of evaluations depends on the dimensionality of your problem. I remember using the code for a 4D case, where the total number of evaluations was on the order of 100-200. For higher dimensions you'll need more evaluations, but I can't give you exact numbers. Generally, the code is designed to find an approximation to the optimum with whatever number of evaluations you provide. Once you find a good candidate solution, you can always refine it afterwards, for example by using a more appropriate search box.

As for initial/subsequent evaluations, I'd say splitting them equally (n equals m) should work well in most cases. The batch size should be as large as possible to make efficient use of parallelism. If, say, you have 20 cores available (meaning you can run 20 evaluations in parallel), feel free to set batch=20, but no more than 20, of course.
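As a rough illustration of that advice, the sketch below turns a total evaluation budget and a core count into values for n, m and batch. `plan_budget` is a hypothetical helper, not part of the library, and rounding each phase up to a whole number of batches is my own assumption, made so that no batch runs partly empty.

```python
def plan_budget(total_evals, cores):
    """Split an evaluation budget into n initial and m subsequent
    evaluations plus a batch size (hypothetical helper)."""
    if total_evals < 2 or cores < 1:
        raise ValueError("need at least 2 evaluations and 1 core")
    half = total_evals // 2
    # Use every core, but never a batch larger than one phase.
    batch = max(1, min(cores, half))
    # Assumption: round each phase up to a multiple of the batch size,
    # which may slightly exceed the requested total.
    per_phase = -(-half // batch) * batch
    return per_phase, per_phase, batch


n, m, batch = plan_budget(total_evals=200, cores=20)
print(n, m, batch)
```

For the 20-core example above and a budget of 200 evaluations, this gives equal initial and subsequent phases with batch equal to the number of cores.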

Let me know if you have more questions. Btw, how many parameters do you have?