huggingface/autotrain-advanced

[FEATURE REQUEST] Improved tooltips/explanations of fields in GUI

Opened this issue · 5 comments

Feature Request

I'm pretty bad at writing docs myself, so I can't criticize, but it would seemingly take quite a small amount of effort to add hover tooltips to the fields like "Train" and "Save total limit".

You can use HTML's built-in title attribute, so there's no need to import any libraries or anything like that. For example:

<input title="Explanation of what it does" ...>
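
As a rough sketch of how little is involved, the snippet below renders an input with a `title` tooltip from a small help-text table. It is written in Python purely for illustration; `FIELD_HELP`, `render_input`, the field keys, and the help strings themselves are assumptions, not AutoTrain's actual code or wording.

```python
from html import escape

# Hypothetical help text; the wording is illustrative, not taken from AutoTrain.
FIELD_HELP = {
    "train_split": "Name of the training split, e.g. 'train' (not the file name).",
    "save_total_limit": "Maximum number of checkpoints to keep on disk.",
}


def render_input(name: str, value: str = "") -> str:
    """Return an <input> element whose title attribute holds the tooltip."""
    attrs = {"name": name, "value": value}
    help_text = FIELD_HELP.get(name)
    if help_text:
        attrs["title"] = help_text
    # escape(..., quote=True) keeps quotes in the text from breaking the attribute.
    rendered = " ".join(
        f'{key}="{escape(val, quote=True)}"' for key, val in attrs.items()
    )
    return f"<input {rendered}>"


print(render_input("save_total_limit", "1"))
```

Fields without an entry in the table are rendered without a `title`, so tooltips could be added one field at a time.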

Motivation

Would make it much easier to use for newbies.

Additional Context

Also, every time you make a mistake in the GUI, the whole space/container crashes, and it's not clear to the user what caused it - especially via huggingface spaces, because it says "the owner has paused this". And you need to know to click on the little logs button at the top to work out what happened.

A few usability improvements would make this project a lot more useful to newbies I think.

I don't want to spam issues in your repo, so I'm just going to add some more feedback here:

  • Shouldn't "auto choose batch size" be a non-advanced feature? I.e. choosing the batch size manually seems more like an 'advanced' feature for an auto ML application. My understanding of the project is that you try to simplify things so there are fewer complicated knobs to turn to get a good result.
  • It took me a long time to work out that I had to write "train" in the "Train" field, and rename my CSV to end with _train. Each time I got it wrong, the space would crash and I'd have to wait several minutes for it to boot back up, then I'd have to re-input all the settings, and try again. Intuitively, I'd have expected to just put the filename there. When that didn't work I looked at the logs and they were saying "'train' expected" so I tried renaming it to "train", but then it complained that there were no compatible files in the dataset repo, so I renamed it to "train.csv", then it complained again with something like "'train' expected". And again, the crash every time was very annoying.
  • The error message says "look in the logs" whenever something goes wrong (in a way that doesn't cause a crash, which is unfortunately not often), but IIRC for some of the errors I had to open the browser developer tools and look in the network panel for the failed requests (e.g. "project name already exists"). It would be great if you could just display the error / error logs whenever one occurs.
  • On the huggingface.co/autotrain homepage it says "AutoTrain will find the best models for your data automatically." but, at least for the LLM SFT, it presents you with a big list of models that you need to manually guess at. I couldn't find any option to automatically find a good model for the given dataset.
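
The kind of up-front check that would avoid the crash-and-retry loop described above can be sketched as follows. This is a minimal illustration, not AutoTrain's code: `check_train_split` is a hypothetical helper, and the naming convention it enforces (the field holds a split name, and the CSV's stem is that name or ends with `_<name>`) is assumed from the report in this issue rather than taken from the project's documentation.

```python
from pathlib import PurePosixPath

DATA_SUFFIXES = (".csv",)  # assumption: only CSV files are considered here


def check_train_split(split: str, repo_files: list[str]) -> list[str]:
    """Return human-readable problems; an empty list means the input looks fine."""
    split = split.strip()
    if not split:
        return ["The split field is empty; enter a split name such as 'train'."]
    if "." in split:
        return [
            f"'{split}' looks like a file name; enter the split name, "
            "e.g. 'train', instead."
        ]
    data_files = [f for f in repo_files if f.lower().endswith(DATA_SUFFIXES)]
    matches = [
        f
        for f in data_files
        if PurePosixPath(f).stem == split
        or PurePosixPath(f).stem.endswith(f"_{split}")
    ]
    if not matches:
        found = ", ".join(data_files) or "none"
        return [f"No data file matches split '{split}'. Files found: {found}."]
    return []


# Messages like these could be shown next to the field instead of crashing.
for problem in check_train_split("train", ["mydata.csv"]):
    print(problem)
```

Returning messages rather than raising means the form can display every problem in place, and the user keeps the settings they already entered.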

There are quite a few 'basic' improvements that this project needs, along the lines of the above examples, I think. Currently it seems like it's in a draft stage, but since the project has been around for quite a while this surprises me. Presumably a lot of work has gone into this, so I'm sending this feedback in the hope that you'll see the value in doing a bit of work on the UX side of things to fully realize all the benefits of that behind-the-gui effort.

In any case, thank you for your work in making ML more accessible!

Thanks for your suggestions, I'll consider them and add more info to the docs.

> Also, every time you make a mistake in the GUI, the whole space/container crashes, and it's not clear to the user what caused it - especially via huggingface spaces, because it says "the owner has paused this". And you need to know to click on the little logs button at the top to work out what happened.

To avoid this, when creating the space, set PAUSE_ON_FAILURE to 0
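
A minimal sketch of how an app could honor such a flag, assuming `PAUSE_ON_FAILURE` reaches the process as an environment variable and that pausing is the default when it is unset. `run_guarded`, `run_job`, and `pause_space` are hypothetical stand-ins, not AutoTrain functions.

```python
import os
from typing import Callable, Mapping


def pause_on_failure_enabled(environ: Mapping[str, str] = os.environ) -> bool:
    """True unless PAUSE_ON_FAILURE is explicitly "0" (assumed default: pause)."""
    return environ.get("PAUSE_ON_FAILURE", "1").strip() != "0"


def run_guarded(
    run_job: Callable[[], None],
    pause_space: Callable[[], None],
    environ: Mapping[str, str] = os.environ,
) -> str:
    """Run a job; on failure, pause only if the flag allows it."""
    try:
        run_job()
        return "ok"
    except Exception as exc:
        # Surface the cause so the user does not have to dig through logs.
        print(f"Job failed: {exc}")
        if pause_on_failure_enabled(environ):
            pause_space()
            return "paused"
        return "failed-but-running"
```

With the flag set to 0 the failure is still reported, but the space stays up so the user can correct the input and retry.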

> To avoid this, when creating the space, set PAUSE_ON_FAILURE to 0

If you're able to make that the default, it would be a big UX improvement.

Not really. The choice is presented when creating the space. It's an advanced option for debugging. People don't want to get charged for having left the space running with nothing happening. Also, the logs are available even when the space goes to sleep mode on error.

> also, the logs are available even when space goes to sleep mode on error.

Yes, but the UX here is quite bad. It just says "The owner paused this space" or something. And you need to know to click the little icon in the top bar. It's not clear at all why it happened, and doesn't say to check the logs to find the cause.

If you want a default here for preventing unwanted spending, it'd be better to have/use the "pause if no GPU activity after N hours".

I understand that this is platform-level stuff though, and you might not have direct control over it.