unsplash/datasets

Elaboration of data fields and additions

cadop opened this issue · 1 comment

cadop commented

[1] photo_featured does this include the featured topic categories at the top?
[1.1] Could this field include which topics it is featured in, but also if the photo was submitted and rejected from a topic?
[1.2] Is there historic data for which images were not included as 'searchable' before the search system was replaced for everything being searchable?

[2] suggested_by_user the description mentions 'a user (human)'. At some point (maybe?) Unsplash was adding tags or keywords to its approved/moderated photos, does (could) this distinguish who added it (uploader or staff)?

[4] keyword Is this referring to the search terms used to find the photo?
[4.1] assuming it is the searching keywords, can we add a field for the position it was displayed on the website (i.e. if it was the first row first column, or it was an image that was 30 photos down that a searcher scrolled down to find and pick)

Thanks for releasing the dataset, it's a great contribution to the research community!

Hey! Sorry for addressing this so late, I must have missed the notification.

photo_featured does this include the featured topic categories at the top?

Yes, that would include topics.

Could this field include which topics it is featured in, but also if the photo was submitted and rejected from a topic?

The idea of including topics makes sense. It's something we'll think about for the next iteration. Thanks for the suggestion!

Is there historic data for which images were not included as 'searchable' before the search system was replaced for everything being searchable?

The system has changed a lot, and even today not everything is searchable. I don't think this information would make sense in the dataset. It would likely confuse most users of the dataset.

the description mentions 'a user (human)'. At some point (maybe?) Unsplash was adding tags or keywords to its approved/moderated photos, does (could) this distinguish who added it (uploader or staff)?

That's a good suggestion, we can keep it for the next iteration, thank you!

keyword Is this referring to the search terms used to find the photo?

If you're referring to search conversions then yes.
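
To illustrate reading `keyword` as the search term behind a conversion, here is a minimal sketch that groups search keywords by photo. It assumes a tab-separated conversions table with `photo_id` and `keyword` columns; the `conversion_type` column, the sample rows, and the helper name `keywords_per_photo` are made up for illustration and should be checked against the dataset documentation.

```python
import csv
import io
from collections import Counter

# Hypothetical sample rows standing in for the conversions file.
# The values are invented; only the idea (one row per search conversion) matters.
SAMPLE_TSV = (
    "photo_id\tkeyword\tconversion_type\n"
    "abc123\tmountain\tdownload\n"
    "abc123\tsunrise\tdownload\n"
    "def456\tmountain\tdownload\n"
)


def keywords_per_photo(tsv_text):
    """Map each photo_id to a Counter of search keywords that led to a conversion."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    result = {}
    for row in reader:
        # Each row is one conversion; tally the search term under its photo.
        result.setdefault(row["photo_id"], Counter())[row["keyword"]] += 1
    return result


counts = keywords_per_photo(SAMPLE_TSV)
for photo_id, keyword_counts in counts.items():
    print(photo_id, dict(keyword_counts))
```

With a real file, the same function could be fed the file's text contents instead of `SAMPLE_TSV`.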

assuming it is the searching keywords, can we add a field for the position it was displayed on the website (i.e. if it was the first row first column, or it was an image that was 30 photos down that a searcher scrolled down to find and pick)

We only recently started tracking this, so we don't have much historical data. That's why the data is not available in the dataset, but I think it could make sense to have it now.

Thank you for your feedback and suggestions, it's greatly appreciated!