Issues
- 0
Add `Dataset.load_embedding`
#1080 opened by nsthorat - 0
USE_TABLE_INDEX=True when swapping a space during a reboot causes a lock issue.
#1078 opened by nsthorat - 0
Reduce polling frequency of tasks when no task is active
#1072 opened by brilee - 3
[UI] When searching a ShareGPT dataset, we should allow passing a filter of conversations.*.from= to the search.
#1067 opened by nsthorat - 0
Parallel Datasets
#1018 opened by vince62s - 1
Error while using OpenAI embeddings
#1036 opened by IhorNi - 1
Add keyboard shortcuts for fast labeling
#979 opened by nsthorat - 1
Add model selection or ability to overwrite GPT model for concept examples generation
#1019 opened by IhorNi - 0
Boolean environment flags are still truthy when set to the string "False"
#1037 opened by nsthorat - 2
Cannot create concepts in huggingface space
#1027 opened by ajms - 0
Signals and embeddings are much slower
#1029 opened by nsthorat - 1
Rendering bug when we have `a[].b[]`
#971 opened by dsmilkov - 0
Loading a dataset should not block the REST endpoint.
#1012 opened by nsthorat - 0
Allow datasets with spaces in the name.
#990 opened by nsthorat - 0
Markdown code block aware chunking.
#989 opened by nsthorat - 3
ShareGPT format: the "Compare to" menu should show the title instead of `conversation.0.value`
#988 opened by dsmilkov - 0
Default to combine_cols=True
#973 opened by nsthorat - 0
Labels stay in the UI after deletion.
#972 opened by unaidedelf8777 - 0
Add `dataset.map(embeddings=True)`
#967 opened by dsmilkov - 0
Confused by HDBSCAN in UI
#960 opened by arnicas - 4
Add `dataset.count(filters=...)` public API
#954 opened by dsmilkov - 0
DuckDB syntax error
#949 opened by tfriedel - 2
Improve the CSS for markdown rendered content.
#905 opened by nsthorat - 1
Allow subsampling datasets
#871 opened by brilee - 0
Add documentation to the guide for limit & filter.
#939 opened by nsthorat - 2
Ergonomics and small bugs with `dataset.map`
#915 opened by dsmilkov - 1
Multiprocess `map` got ~40% slower
#928 opened by dsmilkov - 2
Website: cross-wired footer follow links
#940 opened by theosanderson - 1
I want to modify the Datasets directly
#937 opened by Mokocoder - 1
Multithreaded map only shows progress from shard 0
#927 opened by dsmilkov - 0
Break up the TaskManager.
#922 opened by nsthorat - 0
Write a getting started guide.
#906 opened by nsthorat - 0
Build versioning of source & map data.
#903 opened by nsthorat - 0
Add a simple Python signal editor to the UI.
#902 opened by nsthorat - 0
Add defragmentation of signal output
#872 opened by dsmilkov