Download dataset
BenGalewsky opened this issue · 3 comments
BenGalewsky commented
As a Foundry dataset user, I want to access a dataset as a pandas DataFrame so I can perform my analysis.
Description
Create a new DatasetCache class which will be instantiated along with the foundry instance.
This class has three methods:
- flush
- download_dataset
- is_dataset_in_cache
The cache lives by default in ./data. This can be overridden with the FOUNDRY_CACHE_PATH environment variable.
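To make the proposal concrete, here is a rough sketch of what such a cache class might look like. Only the class name, the three method names, the ./data default, and the FOUNDRY_CACHE_PATH variable come from this issue. The constructor argument, the one-directory-per-dataset layout, and the `fetch` callable are assumptions made for illustration, not the actual Foundry implementation.

```python
import os
import shutil
from pathlib import Path


class DatasetCache:
    """Illustrative sketch only; the layout and signatures are assumptions."""

    def __init__(self, cache_path=None):
        # An explicit path wins, then FOUNDRY_CACHE_PATH, then the ./data default.
        self.path = Path(cache_path or os.environ.get("FOUNDRY_CACHE_PATH", "./data"))

    def _dataset_dir(self, dataset_name):
        # Assumed layout: one sub-directory per dataset.
        return self.path / dataset_name

    def is_dataset_in_cache(self, dataset_name):
        # A dataset counts as cached if its directory exists and is non-empty.
        directory = self._dataset_dir(dataset_name)
        return directory.is_dir() and any(directory.iterdir())

    def download_dataset(self, dataset_name, fetch):
        # `fetch` is a hypothetical callable that writes the dataset's file
        # into the directory it is given; the real transfer mechanism is
        # not specified in this issue.
        target = self._dataset_dir(dataset_name)
        if not self.is_dataset_in_cache(dataset_name):
            target.mkdir(parents=True, exist_ok=True)
            fetch(target)
        return target

    def flush(self):
        # Remove every cached dataset.
        if self.path.exists():
            shutil.rmtree(self.path)
```

Passing the download step in as a callable keeps the sketch testable without network access; a real implementation would presumably call the transfer client directly.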
This story is for the user to be able to download a dataset.
Add a new method to Dataset class: get_as_dict
Assumptions
- Dataset has only one file
- FoundryCache is implemented as part of this issue
- No splits
- Foundry example notebooks would need to be updated along with this implementation (also good for testing)
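Under the assumptions above (one file, no splits), get_as_dict could be as simple as the sketch below. The JSON file format, the `local_dir` attribute, and the constructor signature are hypothetical and chosen only to keep the example self-contained; the issue does not specify them.

```python
import json
from pathlib import Path


class Dataset:
    """Hypothetical stand-in for the Dataset class, for illustration only."""

    def __init__(self, name, local_dir):
        self.name = name
        # Assumed: the directory where the cache stored this dataset's file.
        self.local_dir = Path(local_dir)

    def get_as_dict(self):
        # Enforce the "only one file" assumption explicitly.
        files = [p for p in self.local_dir.iterdir() if p.is_file()]
        if len(files) != 1:
            raise ValueError(
                f"Expected exactly one file for {self.name}, found {len(files)}"
            )
        # Assumed format: the single file is JSON.
        with files[0].open() as handle:
            return json.load(handle)
```

Failing loudly when more than one file is present makes the single-file assumption visible to callers instead of silently picking one.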
Acceptance Criteria
f = Foundry()
datasets = f.search("DOI.123/445")
assert len(datasets) == 1
res = datasets[0].get_as_dict()
kjschmidt913 commented
Hi @blue442, was this part of that big PR you did? Should I mark this as complete?
blue442 commented
@kjschmidt913 yes it was - mark away!