Speed up `coerce_to_sparse_data_frame()`
Opened this issue · 0 comments
EmilHvitfeldt commented
if this works, we also speed up the tibble case.
In my mind, it shouldn't take this long
library(tidymodels)
library(textrecipes)
library(friends)
preped_rec <- recipe(season ~ text, data = friends) %>%
step_tokenize(text) %>%
step_tf(text) %>%
prep()
#> Warning in asMethod(object): sparse->dense coercion: allocating vector of size
#> 8.7 GiB
term_freq <- bake(preped_rec, new_data = NULL, composition = "dgCMatrix")
library(sparsevctrs)
tictoc::tic()
tmp <- coerce_to_sparse_data_frame(term_freq)
tictoc::toc()
#> 1.392 sec elapsed
Created on 2024-05-23 with reprex v2.1.0