openml/benchmark-suites

Come up with a new, catchy tag

to replace OpenML100

Please also note that the new suite will have only 60-80 datasets.

HighQualityData2018?

I checked the old paper, it refers to the following hyperlink:
https://www.openml.org/s/14

We could set it up so that this link refers to the 'old' OpenML100 datasets, but tag them with 'OpenML100-deprecated' or something like that. That would allow us to reuse the tag 'OpenML100' for the new study.

We are all using this term anyway; it would be hard to change it here in the lab.

According to the Google Doc, there are 81 datasets (minus some potential candidates for removal), so we really shouldn't tie the name to a number. How about creating a nice abbreviation: OpenML Small Binary Classification Benchmark Suite, or OML-SBC18 benchmark suite for short? It's not as nice as OpenML100, but it contains all the necessary information and is easy to remember.

How did I end up putting the word 'binary' in there? Well, what about:

  • OML-SASC18 (simple and small classification)
  • OML-SC18 (small classification)
  • OML-CBS18 (classification benchmark suite)

@giuseppec, @joaquinvanschoren and I decided that we should vote on the name. I suggest using a voting procedure like Condorcet with Schulze tie-breaking, as implemented here: http://www1.cse.wustl.edu/~legrand/rbvote/calc.html
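
For anyone unfamiliar with the procedure, here is a minimal sketch of Schulze ranking in Python. It is only an illustration of the general method, not the implementation behind the linked calculator, which may handle ties and incomplete ballots differently. The function name `schulze_ranking` and the ballots are made up for the example; they are not real votes from this discussion.

```python
from itertools import permutations


def schulze_ranking(candidates, ballots):
    """Rank candidates with the Schulze method (illustrative sketch).

    Each ballot lists candidates from most to least preferred.
    Assumption: candidates missing from a ballot are ranked below
    all listed ones and tied with each other.
    """
    # d[a][b] = number of voters who strictly prefer a over b
    d = {a: {b: 0 for b in candidates} for a in candidates}
    unranked = len(candidates)
    for ballot in ballots:
        rank = {c: i for i, c in enumerate(ballot)}
        for a, b in permutations(candidates, 2):
            if rank.get(a, unranked) < rank.get(b, unranked):
                d[a][b] += 1

    # p[a][b] = strength of the strongest path from a to b,
    # initialised with the direct pairwise victories only
    p = {
        a: {b: d[a][b] if d[a][b] > d[b][a] else 0 for b in candidates}
        for a in candidates
    }
    # Widest-path variant of Floyd-Warshall
    for k in candidates:
        for i in candidates:
            if i == k:
                continue
            for j in candidates:
                if j == i or j == k:
                    continue
                p[i][j] = max(p[i][j], min(p[i][k], p[k][j]))

    # a beats b if its strongest path to b is stronger than the reverse;
    # order candidates by how many opponents they beat
    wins = {
        a: sum(p[a][b] > p[b][a] for b in candidates if b != a)
        for a in candidates
    }
    return sorted(candidates, key=lambda c: wins[c], reverse=True)


# Hypothetical ballots, invented purely for illustration
names = ["OpenML-CC18", "OML-SBC18", "OpenML-Light-2018"]
ballots = (
    3 * [["OpenML-CC18", "OML-SBC18", "OpenML-Light-2018"]]
    + 2 * [["OML-SBC18", "OpenML-CC18", "OpenML-Light-2018"]]
    + 1 * [["OpenML-Light-2018", "OpenML-CC18", "OML-SBC18"]]
)
ranking = schulze_ranking(names, ballots)
print(ranking)
```

With these made-up ballots, one name wins every pairwise comparison, so it is the Condorcet winner and Schulze simply confirms it. The strongest-path step only matters when the pairwise preferences form a cycle.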

OpenML-Light-2018 (And we can use OpenML-Heavy for large datasets ;))

That name does not highlight that we do classification.

OpenML C-Scape18?

-scape means 'a wide view', which is what a benchmark should also offer :)
And Seascape is a nice word :)

We can use R-Scape for regression, M-Scape for multi-label, I-Scape for imbalanced,... :)

OpenML2018? Small and catchy

CSC18 - Curated Small Classification 2018

OML-CC18 or OpenML-CC18

OpenML-CC18 ?

Did we decide on something?

Unless anyone has a very strong objection, I propose we go with OpenML-CC18.

Closed per Skype call (@frank-hutter @giuseppec @mfeurer @janvanrijn) as no one complained about OpenML-CC18