Eventual-Inc/Daft
Distributed DataFrame for Python designed for the cloud, powered by Rust
RustApache-2.0
Issues
- 1
rename `to_arrow`, `as_arrow`
#2964 opened by andrewgazelka - 0
SQL: COUNT function not found, but count works
#2960 opened by jaychia - 0
- 0
Make casting image to tensor easier
#2962 opened by jaychia - 0
SQL: Column ordering after groupby
#2961 opened by jaychia - 0
Allowing GroupedDataFrame.agg_concat to also take in a delimiter like Expression.list.join
#2959 opened by MisterKloudy - 0
Exposing number of bytes to keep & hashing algorithm in the expression minhash()
#2958 opened by MisterKloudy - 0
Implementing hive-style read
#2957 opened by MisterKloudy - 1
faulty reading hudi table after it has beed altered
#2941 opened by sephib - 2
[DOCS] Documentation wishlist
#2928 opened by jaychia - 0
consider `fxhash`
#2955 opened by andrewgazelka - 2
RFC: rename `.str.endswith` and `.str.startswith` to `ends_with` and `starts_with`
#2949 opened by universalmind303 - 1
how to flatten/unnest a struct?
#2950 opened by universalmind303 - 0
`url.parse` function
#2951 opened by universalmind303 - 0
- 6
read_deltalake on Unity Catalog Table from Databricks has invalid region configuration
#2903 opened by lukaskratoch - 0
Reproducing with_column functionality in SQL
#2935 opened by jaychia - 0
add missing `url_*` functions to SQL
#2945 opened by universalmind303 - 2
fix pyo3 build thrashing cross-IDE
#2933 opened by andrewgazelka - 1
Clarify URI syntax for connecting to ray cluster
#2927 opened by kevinjqliu - 2
[feature request] Only lock `daft.context.set_runner_ray` on successful connection
#2922 opened by kevinjqliu - 0
- 1
[ActorPoolProject] Pipeline of multiple actor pool projects throttles later stages if earlier stages have low concurrency
#2900 opened by jaychia - 1
[ActorPoolProject] Implement functionality for RayRunner
#2901 opened by jaychia - 0
- 2
Support for rolling joins and other special joins
#2911 opened by GitHunter0 - 1
Add a .list.apply() expression
#2913 opened by MisterKloudy - 2
Checking file sizes
#2851 opened by dioptre - 0
improve table printing width (dynamic)
#2912 opened by andrewgazelka - 0
- 3
- 3
comparison on `Decimal` dtypes does not work
#2906 opened by universalmind303 - 1
Add fp16 type support
#2889 opened by jaychia - 0
- 0
[ActorPoolProject] Correctly allocate GPUs for each running Actor in the PyRunner
#2896 opened by jaychia - 0
[ActorPoolProject] UDFs initialized with the ActorPoolProject do not respect custom init_args/batch_size
#2899 opened by jaychia - 0
hello
#2897 opened by andrewgazelka - 0
something is not working
#2898 opened by andrewgazelka - 1
mypy error for str.concat
#2863 opened by gmweaver - 4
improve Struct DataType
#2894 opened by andrewgazelka - 3
Field name use `Arc<str>` instead of `String`
#2892 opened by andrewgazelka - 4
Add a `.list.value_counts()` expression
#2862 opened by MisterKloudy - 7
Model loads after each completed partition
#2878 opened by conceptofmind - 2
- 1
switch to conventional commits for PR labeller
#2867 opened by universalmind303 - 1
There is no date on when the benchmark was executed
#2865 opened by alberttwong - 0
write directly to huggingface
#2866 opened by universalmind303 - 1
- 0
`read_text` and `read_blob` functions
#2859 opened by universalmind303 - 1
Rename `DataFrame.where` to `DataFrame.filter`?
#2846 opened by MarcoGorelli