why kdf.head() is much lower than sdf.show()?
Closed this issue · 3 comments
RainFung commented
HyukjinKwon commented
Very likely because of the default index: https://koalas.readthedocs.io/en/latest/user_guide/options.html#default-index-type . Can you try with ks.set_option('compute.default_index_type', 'distributed')
?
RainFung commented
It's much faster now. Can we set it to distributed
by default. The speed gap is too big.
HyukjinKwon commented
distributed
disables the operations between other DataFrames. It's something we should discuss. Let me close this ticket for now though.