Dask dataframe vs Polars
Closed this issue · 2 comments
More of a design question, I am curious to hear the tradeoffs between using Dask dataframes vs geopandas vs Polars for Points? I have been migrating my heavy workflows to Polars, but am also interested in potentially utilizing SpatialData, so I wanted to consider what the tradeoffs might be.
Thanks!
@LucaMarconato perhaps you can say more about the design decision initially for dask and add to anything I say.
For Polars, as far as I know Polars is meant to be single machine while this is not the case for dask. Dask also follows more the pandas API which most people are more familiar with and although Polars is catching up integrates better with other libraries in the Python ecosystem.
With regards to geopandas, of course this provides the geospatial tooling which dask does not provide (although dask geopandas does look interesting), while geopandas again is single machine.
Closing this for now but feel free to reopen if you feel like your question was not sufficiently answered.