Issues
- 5
Support for multidimensional arrays in Croissant
#649 opened by pierrot0 - 3
Invalid object type for field "distribution"
#725 opened by pdurbin - 2
Data-level annotations
#737 opened by benjelloun - 0
croissant cloud
#754 opened by stubbi - 5
use XSD datatypes not schema.org datatypes
#654 opened by VladimirAlexiev - 0
Fix references definition in Croissant spec
#751 opened by benjelloun - 1
Lineage / provenance representation
#738 opened by benjelloun - 0
- 0
Uniform jsonQuery and jsonPath
#746 opened by ccl-core - 0
Semantic annotations / triplification
#739 opened by benjelloun - 3
- 0
- 0
[Apache Beam] Compute shard_sizes explicitly instead of relying on max_shard_size
#732 opened by marcenacp - 0
Can the Huggingface croissant API endpoint read croissant.json metadata created by this tool?
#724 opened by cboettig - 0
Documentation for the python tool, mlcroissant?
#723 opened by cboettig - 1
Joins as described by Croissant Format Specification are not supported by mlcroissant python library.
#683 opened by AdrianUrbanski - 4
- 0
- 0
- 1
- 4
"images/filename" should have an attribute "@type": "https://schema.org/Text". Got http://mlcommons.org/croissant/Field instead.
#651 opened by venkanna37 - 2
Example for audio dataset
#692 opened by quancs - 2
Feature Request: support for CoNLL format
#687 opened by gzhang64 - 3
- 3
- 0
Please improve documentation
#694 opened by mrsalehi - 3
Intro example not working
#695 opened by XenonLamb - 1
[NeurIPS] Variable length integer array field
#681 opened by brendon-boldt - 1
[NeurIPS] How to express data in other binary formats?
#679 opened by gcr - 1
[Neurips]: any support for large h5 files? tried various encodings but no luck.
#697 opened by jhirschm - 0
[NeurIPS] How to "ingest" multiple datasets made of .xz files as data/samples and space-separated .txt files as ground truth
#693 opened by 4ndr3aR - 4
- 8
[NeurIPS] Fileld has no source
#686 opened by gorovuha - 1
- 0
[NeurIPS] URL to dataset metedata
#691 opened by zhwang0 - 1
[NeurIPS] How do i create a Croissant File using this library for my dataset?
#688 opened by lartpang - 5
[NEURIPS] `.zip` and `.tar.gz` archives are not supported for file uploading
#663 opened by amorehead - 1
examples datasets are broken
#650 opened by seralf - 2
- 0
- 3
Supporting many bounding boxes within an image
#673 opened by Irenetema - 3
- 1
[NeurIPS]How to define (jpg, json) pairs
#676 opened by MehreenMehreen - 4
[NeurIPS] Croissant for Binary Blob Data
#674 opened by 5had3z - 2
`schema:Enumerations` does not exist
#653 opened by VladimirAlexiev - 0
Missing prefix declaration in the JSON-LD context
#662 opened by pchampin - 1
SegmentationMask, BoundingBox
#657 opened by VladimirAlexiev - 1
404 url:http://mlcommons.org/croissant/source
#659 opened by pan-2001 - 1
consider reusing CSVW and DQV
#656 opened by VladimirAlexiev - 0
Implement `key` in mlcroissant.
#655 opened by marcenacp