Question on result
Masa-Shin opened this issue · 2 comments
The program ran without errors, but the result was far from what I expected (maybe I did something wrong?).
Here is the image I used: [deleted]
And here is the result: https://jsfiddle.net/b1zt7vsh/
What I did:
- Put the image under the Screenshot-to-code-in-Keras/floydhub/HTML/resources/images folder.
- Rewrote line 131 of HTML.ipynb (changed the path to the image).
- Rewrote line 190 of HTML.ipynb (set epochs to 300).
- Ran all cells of HTML.ipynb.
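For illustration, the two edits roughly amount to the following (the variable names, target size, and filename here are placeholders, not the notebook's actual code):

```python
# Rough sketch of the two edits; names and sizes are placeholders,
# not the actual contents of lines 131 and 190 in HTML.ipynb.
import numpy as np
from keras.preprocessing.image import load_img, img_to_array

# Edit around line 131: point the loader at the new screenshot
# placed under floydhub/HTML/resources/images/.
image_path = 'resources/images/my_screenshot.png'  # placeholder filename
image = img_to_array(load_img(image_path, target_size=(256, 256)))
image = np.array([image], dtype=float)

# Edit around line 190: raise the epoch count when fitting.
# `model`, `X`, and `y` stand in for whatever the notebook defines.
# model.fit(X, y, batch_size=1, shuffle=False, epochs=300)
```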
The value of the loss function was below 0.001.
I would greatly appreciate it if you could tell me whether what I did was correct or not.
@Masa-Shin Thanks for your question.
As mentioned in the article, the HTML version does not generalize to new images. The Bootstrap version generalizes to new images, but with a capped vocabulary. The evaluation images for the Bootstrap version are under /data/eval/. You can test it here: floydhub/Bootstrap/test_model_accuracy.ipynb
If you want to train it to generalize to a more advanced vocabulary, I'd recommend customizing it to work on the HTML set provided here: https://github.com/harvardnlp/im2markup (on FloydHub: --data emilwallner/datasets/100k-html:data)
After that, I'd recommend creating a new dataset. Create a script that generates random websites, say starting with newsletters or blog layouts. Then you can add optical character recognition, fonts, colors and div sizes as you go.
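As a rough starting point, a generator script could look something like the sketch below. The tag set, headings, and output layout are illustrative assumptions, and the generated pages would still need to be rendered (e.g. with a headless browser) into screenshot/markup pairs for training.

```python
# Minimal sketch of a random-website generator; the tag set, text, and
# output location are illustrative assumptions, not part of the project.
import random
from pathlib import Path

HEADINGS = ["Newsletter", "Weekly Digest", "Release Notes", "From the Blog"]
PARAGRAPH = "Lorem ipsum dolor sit amet, consectetur adipiscing elit."


def random_page() -> str:
    """Build one simple page: a title followed by a few random sections."""
    sections = "".join(
        f"<div><h2>{random.choice(HEADINGS)}</h2><p>{PARAGRAPH}</p></div>"
        for _ in range(random.randint(2, 5))
    )
    return f"<html><body><h1>{random.choice(HEADINGS)}</h1>{sections}</body></html>"


def generate_dataset(n, out_dir="generated_sites"):
    """Write n random pages to disk; screenshots are rendered separately."""
    out = Path(out_dir)
    out.mkdir(exist_ok=True)
    for i in range(n):
        (out / f"page_{i}.html").write_text(random_page())


if __name__ == "__main__":
    generate_dataset(10)
```

From there, the same script could be extended with random text, fonts, colors, and div sizes, as suggested above.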
If you build a version for the harvardnlp dataset or a script that generates websites, please make a pull request.
Let me know if there is anything else I can help with.
Understood. I will try the Bootstrap version. Thank you very much for the detailed explanation (and the wonderful product)!