replicate/replicate-python

How to estimate the cost of a prediction?

haranrk opened this issue · 0 comments

To calculate the cost of a prediction 3 values are needed

  • Prediction time (returned as part of GET prediction)
  • Hardware on which the model was run
  • Price/sec of running that hardware (values present here)

Given this we can approximately calculate price as time*pricing. However, the GET prediction api does not return what hardware the prediction ran on even though the website shows this information. Would it be possible to return the hardware as part of response to GET prediction? If not, is there any other way to calculate the inference cost

image