Issues
- 1
```[tasklist]sa
#540 opened by samran5 - 0
It looks like you're using the VS Code Issue Reporter but did not paste the text generated into the created issue. We've closed this issue, please open a new one containing the text we placed in your clipboard.
#541 opened by samran5 - 0
Add JSON / JSON lines export support
#533 opened by dtulga - 0
`datachain pull` doesn't work
#539 opened by ilongin - 0
- 0
Fix tests that are skipped
#535 opened by ilongin - 0
mutate: overwrite columns
#336 opened by dmpetrov - 1
Export to huggingface hub
#370 opened by dberenbaum - 1
DataChain objects nomeclature
#534 opened by tibor-mach - 1
Implement more window functions
#524 opened by dreadatour - 1
Finish SQL functions refactoring
#525 opened by dreadatour - 3
Implement more group_by functions
#523 opened by dreadatour - 0
Refactor back file system listing `source` and `path`
#447 opened by ilongin - 1
Window-function to select subset of the records
#522 opened by dreadatour - 3
Bug: hierarchical columns and the `.select` method
#510 opened by tibor-mach - 0
Don't allow top level fields with the same name
#519 opened by shcheklein - 8
Clarify ordering semantics
#477 opened by rlamy - 3
Feature request: Automatically cast merge keys
#509 opened by tibor-mach - 3
Feature request: `DataChain.unique`
#511 opened by tibor-mach - 0
- 2
Semantic file diff: excel
#372 opened by dmpetrov - 2
Add explode and / or dynamic model / schema
#481 opened by shcheklein - 5
Add `from_sql` / `from_database` factory method
#463 opened by shcheklein - 0
Remove `QueryGenerator`
#456 opened by rlamy - 0
SQL compilation generates too many SELECTs
#476 opened by rlamy - 7
Kaggle: UnidentifiedImageError: cannot identify image file <_io.BytesIO object at 0x78574cfee2a0>
#465 opened by Aisuko - 0
Remove `Catalog.open_object()` and refactor `Catalog.get_file_signals()` to return one specific `File` signal
#466 opened by ilongin - 0
CI failure in examples/multimodal
#462 opened by rlamy - 2
Pre-UDF table logic creates unnecessary copies
#457 opened by rlamy - 1
map/gen/agg: overwrite columns
#337 opened by dmpetrov - 3
DataChain Telemetry needed
#344 opened by jendefig - 6
- 0
Remove storages from `DatasetQuery`
#340 opened by ilongin - 0
Cleanup `src/datachain/query/schema.py`
#453 opened by ilongin - 0
Fix dataset dependencies for storages
#420 opened by ilongin - 0
Convert `index_tar` to a new-style generator
#439 opened by rlamy - 0
Refactor `Client.parse_url()`
#434 opened by ilongin - 4
- 1
Fix 'Update template' GitHub Actions workflow
#423 opened by dreadatour - 2
Handle huggingface images/audio objects as files
#369 opened by dberenbaum - 0
- 0
save metrics in real-time
#384 opened by skshetry - 0
Support Inf in `Array(Float)`
#386 opened by dberenbaum - 2
Improve management of iterator returned by `collect` (`database table is locked` error)
#377 opened by shcheklein - 6
Add `take` and / or indexing operations
#374 opened by shcheklein - 0
- 1
Support for `hf://` filesystem
#368 opened by dberenbaum - 1
Remove `Catalog.merge_datasets()`
#349 opened by ilongin - 2
New persist() method
#361 opened by dreadatour - 1
Rename exec or save() into persist()
#359 opened by skshetry