Is reporting the model diff supported?
erksch opened this issue · 21 comments
Question
Hey there! Thanks for the awesome work on the Android front!
The demo works for training on the dataset, but I cannot see any code related to actually reporting the model diff to PyGrid. Is this already in the demo, or even supported by the library?
Regards
Hi @erksch, reporting the diffs to PyGrid is not yet functional. It should be fairly easy to add, though. We have plans to add it; we just haven't implemented it yet.
If you need it urgently, you can do it without altering the Kotlin code. Just create a plan with PySyft that evaluates the diff between two sets of tensors. You can then serialize this diff and send it to PyGrid via the report function already present in SyftJob.
Of course, if you are interested, we welcome all contributions.
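A minimal sketch of what that reporting step could look like on the Kotlin side, assuming the diff has already been produced as a State (executeDiffPlan is a hypothetical helper standing in for running such a PySyft-built plan; only report is an existing SyftJob method):

// Sketch only: executeDiffPlan is a hypothetical stand-in for running a PySyft-built diff plan
val diffState: State = executeDiffPlan(oldState, newState)
// report() serializes the diff State and sends it to PyGrid
job.report(diffState)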
Thank you for your quick response!
What do you mean by not altering the Kotlin code? I would say I'd need to calculate the diff somehow and wrap it into a State object, like in this Python example:
diff = [model_params[i] - updated_model_params[i] for i in range(len(model_params))]
diff_state = State(
    owner=hook.local_worker,
    state_placeholders=[PlaceHolder().instantiate(param) for param in diff]
)
As you mentioned, SyftJob already has a report method that takes a diff State object and sends it over the websocket. But how do I create the diff in Kotlin? I think that would be easier than creating a plan for it.
Okay, this is what I came up with:
fun computeDiff(originalState: State, newState: State): State {
    val diffTensors = mutableListOf<SyftTensor>()
    for (i in originalState.syftTensors.indices) {
        val originalTensor = originalState.syftTensors[i].getTorchTensor()
        val originalFloats = originalTensor.dataAsFloatArray
        val newTensor = newState.syftTensors[i].getTorchTensor()
        val newFloats = newTensor.dataAsFloatArray
        // Element-wise subtraction: original - updated, as in the Python example above
        val diffFloats = FloatArray(originalFloats.size)
        for (j in originalFloats.indices) {
            diffFloats[j] = originalFloats[j] - newFloats[j]
        }
        val diffTensor = Tensor.fromBlob(diffFloats, originalTensor.shape())
        diffTensors.add(SyftTensor.fromTorchTensor(diffTensor))
    }
    return State(newState.placeholders, diffTensors)
}
In full action it looks like this:
fun trainingProcess(...) {
    // Copy the state so the snapshot does not change during training
    val originalState = State(
        model.modelState!!.placeholders.toMutableList(),
        model.modelState!!.syftTensors.toMutableList()
    )
    // ... training ...
    val diff = computeDiff(originalState, model.modelState!!)
    mnistJob.report(diff)
}
I can confirm that it works!
I actually wrote a method to evaluate a diff in Kotlin, and it is quite similar to what you came up with. It's left commented out in SyftModel because it was too slow.
The problem with writing this in Kotlin is that we have to iterate over all the indices sequentially and subtract them manually. This is far too slow once the model parameters' dimensions explode. There are a few vectorization alternatives in Java, like BigDecimal, but they only handle integers in subtraction.
The only way I could find to get a highly parallelized implementation while still supporting all the Syft tensor types was via a diff plan. Yet another issue with that kind of type casting is the possibility of memory overflow when upcasting float16 to float64 or byte to int.
Thus, I am inclined towards writing a diff TorchScript and storing it in the library assets so that the user does not need to define it explicitly. This would have been a lot easier if Kotlin had a NumPy-like library. AFAIK, Kotlin math is in active development, but that is further off in the future.
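As a rough illustration of the TorchScript-in-assets idea, here is a minimal Kotlin sketch using the stock PyTorch Android API (the asset name diff.pt and the assetFilePath helper from the PyTorch Android demos are assumptions; nothing like this ships in the library today):

import org.pytorch.IValue
import org.pytorch.Module
import org.pytorch.Tensor

// A scripted module bundled in the assets that computes the element-wise difference
val diffModule = Module.load(assetFilePath(context, "diff.pt"))

fun diffViaTorchscript(old: Tensor, updated: Tensor): Tensor {
    // forward() executes the TorchScript graph, so the subtraction is vectorized natively
    return diffModule.forward(IValue.from(old), IValue.from(updated)).toTensor()
}

This would keep the heavy arithmetic inside libtorch and sidestep the per-element Kotlin loop entirely.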
Okay, I see the problem. I didn't really think about performance yet. This stuff is much easier in Python :D
If you already have some code in mind for creating a diffing plan, it would be awesome if you could share a small notebook or add it somewhere in the repo.
Crudely, I would go along the lines of how a usual plan is implemented, just removing all the custom arguments from it: https://github.com/OpenMined/PySyft/blob/v0.2.5/examples/experimental/FL%20Training%20Plan/Create%20Plan.ipynb
@sy.func2plan(args_shape=args_shape)
def diff_plan(*model_params):  # we could also feed in two lists if torchscript supports it
    num_params = len(model_params) // 2
    old_params = model_params[:num_params]
    new_params = model_params[num_params:]
    diff = [new - old for old, new in zip(old_params, new_params)]
    return tuple(diff)
Ah right, I thought the plan function would also include reporting the diff to the grid, but that could just be done in Kotlin.
But looking at this, I feel it cannot be a static file after all; we would need to generate this plan based on the model (maybe while hosting the plan?). On the Kotlin end, we would load the diff plan, and the Syft model would execute it.
Why couldn't it be a static file? Looks good to me?
The @sy.func2plan decorator requires the shape of the arguments as input, which varies from tensor to tensor.
So I looked into PySyft's compatibility for args: https://github.com/OpenMined/PySyft/blob/v0.2.5/examples/tutorials/Part%2008%20-%20Introduction%20to%20Plans.ipynb
@erksch Are you up to this? We can go the static-file route with this:
@sy.func2plan()
def diff_plan(old, new):
    diff = new - old
    return diff
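If that static plan is scripted to TorchScript and bundled, the Kotlin side could apply it once per parameter pair, roughly like this (a sketch only; diffPlan as a loaded org.pytorch.Module and the surrounding names are assumptions, not existing library API):

fun computeDiffWithPlan(diffPlan: Module, originalState: State, newState: State): State {
    val diffTensors = originalState.syftTensors.indices.map { i ->
        val old = originalState.syftTensors[i].getTorchTensor()
        val updated = newState.syftTensors[i].getTorchTensor()
        // One plan invocation per parameter tensor; the subtraction runs inside TorchScript
        val diff = diffPlan.forward(IValue.from(old), IValue.from(updated)).toTensor()
        SyftTensor.fromTorchTensor(diff)
    }.toMutableList()
    return State(newState.placeholders, diffTensors)
}

Because the plan no longer depends on args_shape, a single bundled file could cover every model.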
Nice, I'll give it a try later.
@all-contributors add @erksch for creating an issue
I couldn't determine any contributions to add, did you specify any contributions?
Please make sure to use valid contribution names.
@all-contributors fine add @erksch for creating a bug
I've put up a pull request to add @erksch!
@all-contributors add @erksch for bugs
I've put up a pull request to add @erksch!