can't save to netcdf
Hi, I'm new to Python, so maybe the question is very simple.
I built a Binder on GitHub using Pangeo Binder, which can handle the huge MITgcm LLC4320 dataset. When I use to_netcdf, only 2D (i, j) data can be transferred; when the data include k, like 2D (i, k) or 3D (i, j, k), it stops at 83% each time. I'm not sure what happened, can anyone help me? jupyter notebook address
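For context, here is a minimal sketch of the workflow described above (the model class, variable name, and region are assumptions; the actual code is in the linked notebook):

```python
import xmitgcm.llcreader as llcreader

# Open the LLC4320 model output lazily (assumed entry point; the notebook may differ)
model = llcreader.ECCOPortalLLC4320Model()
ds = model.get_dataset(varnames=['Theta'], k_chunksize=90)

# Select a small region and the top 5 vertical levels, then save.
# Saving 2D (i, j) slices works, but anything involving k stalls at ~83%.
subset = ds.Theta.isel(time=0, k=slice(0, 5),
                       face=1, i=slice(0, 500), j=slice(0, 500))
subset.to_netcdf('theta_subset.nc')
```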
Perhaps you are running out of memory? Does your session crash, or does it just hang here forever?
I can download more than 1 GB of 2D (i, j) data; this one is really small compared to that.
I think you are confusing the final file size with the intermediate memory usage. Because of the way the LLC data are stored, you may need to download a large amount of data before you can subset it.
I noticed you are using k_chunksize=90. This means that you are downloading all 90 vertical levels, then subsetting the top 5. Try instead with k_chunksize=5 or even k_chunksize=1.
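Roughly, the change being suggested (a sketch, assuming the dataset is opened through llcreader.get_dataset as in the notebook):

```python
# With k_chunksize=90, each chunk spans all 90 vertical levels, so even
# selecting k=0..4 forces full-depth chunks to be downloaded first.
# A smaller k_chunksize keeps the intermediate download close to what
# is actually needed.
ds = model.get_dataset(varnames=['Theta'], k_chunksize=5)
subset = ds.Theta.isel(time=0, k=slice(0, 5),
                       face=1, i=slice(0, 500), j=slice(0, 500))
subset.to_netcdf('theta_top5.nc')
```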
I can confirm it works for me if you change k_chunksize=90 to k_levels=[0, 1, 2, 3, 4].
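In other words (a sketch under the same assumptions as above), only the listed levels are ever requested:

```python
# k_levels restricts the dataset to the listed vertical levels up front,
# so no full-depth chunks need to be fetched before subsetting.
ds = model.get_dataset(varnames=['Theta'], k_levels=[0, 1, 2, 3, 4])
ds.Theta.isel(time=0, face=1,
              i=slice(0, 500), j=slice(0, 500)).to_netcdf('theta_top5.nc')
```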
I have a further question on this front. It seems that for me things work when k_levels is small in number (around 10) or when k_chunksize is not too small (it fails if this value is 1 or 2). I tried k_levels=range(0, 56) and that also failed. Things fail at the get_dataset step itself.
Any idea why?
Can you post the full error and traceback for "things failed"?
There is no error; the kernel just crashes/restarts. (I am using Pangeo Google Cloud, large size.)