The HuggingFace Demo has been ignoring the images I've uploaded
Opened this issue · 21 comments
https://huggingface.co/spaces/naver-ai/VisualStylePrompting
I tried 6 times.
Same, it won't take an uploaded image.
Yes, please add the functionality for the user to specify their own style image without having to modify config files.
Should be simple, pick image, type prompt, generate.
The first thing users want after running the examples is "that's cool, how can I use my own style image now?"
Please consider making it possible for users to use their own image as a style, and keep it simple to do. Many thanks. I really like this concept though, it's great.
Thanks for your contribution.
To accurately reflect the style of the user image, a description of that image is necessary. Some users may struggle to write effective descriptions, so we have not included this aspect in the demo.
We will update the demo code to support this by utilizing BLIP2.
> To accurately reflect the style of the user image, a description of that image is necessary. Some users may struggle to write effective descriptions, so we have not included this aspect in the demo.
> We will update the demo code to support this by utilizing BLIP2.
That would work. The user picks one of their images, BLIP2 captions it, the user gets an option to modify the detected caption if need be, and then the user image can be used to style any other image.
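Something along these lines for the captioning step (a rough sketch using the standard transformers BLIP-2 API; the checkpoint name and token limit are just illustrative, not taken from this repo):

```python
# Rough sketch of the captioning step with the standard transformers BLIP-2 API.
# Checkpoint name and token limit are illustrative choices, not from this repo.
import torch
from transformers import Blip2Processor, Blip2ForConditionalGeneration

processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-2.7b")
blip_model = Blip2ForConditionalGeneration.from_pretrained(
    "Salesforce/blip2-opt-2.7b", torch_dtype=torch.float16
).to("cuda")

def caption_style_image(pil_image):
    """Return a BLIP-2 caption for the user's style image (a PIL image)."""
    inputs = processor(images=pil_image, return_tensors="pt").to("cuda", torch.float16)
    # max_new_tokens rather than max_length, so the output length is explicit
    ids = blip_model.generate(**inputs, max_new_tokens=40)
    return processor.batch_decode(ids, skip_special_tokens=True)[0].strip()
```

The returned caption could then be pre-filled into a textbox so the user can edit it before generation.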
This will be very helpful. Thank you. Looking forward to working with my own images.
> To accurately reflect the style of the user image, a description of that image is necessary. Some users may struggle to write effective descriptions, so we have not included this aspect in the demo.
> We will update the demo code to support this by utilizing BLIP2.
- There is an issue with the HF GPU, which HF is currently fixing.
- For this reason, the features for user image styles have been implemented but are not enabled in the demo.
- For now, try vsp_real_script.py.
> There is an issue with the HF GPU, which HF is currently fixing.
> For this reason, the features for user image styles have been implemented but are not enabled in the demo.
> For now, try vsp_real_script.py.
Can you make an updated app.py for local running? I am trying to do this all local on Windows, so it doesn't matter if it does not run as a HF online demo.
@dhmiller123 @SoftologyPro
Locally, you can try vsp_real_script.py.
> @dhmiller123 @SoftologyPro
> Locally, you can try vsp_real_script.py.
I understand, but if you updated the gradio UI with that functionality it would make it easier for all users.
We have recently updated the demo to reflect user images. However, due to an issue with the GPU provided by Hugging Face (HF), the functionality is not performing as expected. We have no choice but to wait until HF resolves this issue.
OK, I understand that too. But I don't want to run via Hugging Face. I want to run your gradio demo locally under Windows. If you do have a version of the gradio app.py that works locally then please do share. The only version of app.py I have is from before, which has now been removed from your repo.
i.e. the attached version of app.py (renamed to app.txt as .py files do not seem to be attachable), running locally. That should get around any Hugging Face limitations?
The demo is working now.
OK, when I try to run the HF demo with my own style image I get GPU timeouts. Can you provide a working version of app.py to run locally? This is what I tried...
git clone https://huggingface.co/spaces/naver-ai/VisualStylePrompting
In app.py I had to comment out the first line (import spaces) and the other @spaces.GPU line.
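A guard at the top of app.py would also work, so the same file runs on HF and locally (just my workaround idea, not from the repo; the spaces package is only needed on Hugging Face Spaces):

```python
# Guess at a less destructive workaround: guard the Spaces-only import so the
# same app.py runs both on HF and locally (not from the repo).
try:
    import spaces  # provides the @spaces.GPU decorator on Hugging Face Spaces
except ImportError:
    class spaces:  # local stand-in: @spaces.GPU becomes a no-op
        @staticmethod
        def GPU(func=None, **kwargs):
            if func is not None:
                return func        # used as a bare decorator: @spaces.GPU
            return lambda f: f     # used with arguments: @spaces.GPU(...)
```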
Then running app.py opens the UI
I select my own style image, set a prompt, set the outputs to 1 and click Submit.
Gives these errors (same as the other issue I raised with vsp_real_script.py) #7
```
Traceback (most recent call last):
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\gradio\queueing.py", line 501, in call_prediction
output = await route_utils.call_process_api(
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\gradio\route_utils.py", line 253, in call_process_api
output = await app.get_blocks().process_api(
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\gradio\blocks.py", line 1695, in process_api
result = await self.call_function(
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\gradio\blocks.py", line 1235, in call_function
prediction = await anyio.to_thread.run_sync(
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\anyio\to_thread.py", line 56, in run_sync
return await get_async_backend().run_sync_in_worker_thread(
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\anyio\_backends\_asyncio.py", line 2144, in run_sync_in_worker_thread
return await future
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\anyio\_backends\_asyncio.py", line 851, in run
result = context.run(func, *args)
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\gradio\utils.py", line 692, in wrapper
response = f(*args, **kwargs)
File "<path to local clone>Visual Style Prompting\app.py", line 156, in style_fn
ref_prompt = blip_inf_prompt(origin_real_img)
File "<path to local clone>Visual Style Prompting\app.py", line 77, in blip_inf_prompt
generated_ids = blip_model.generate(**inputs)
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\torch\autograd\grad_mode.py", line 27, in decorate_context
return func(*args, **kwargs)
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\transformers\models\blip_2\modeling_blip_2.py", line 1830, in generate
outputs = self.language_model.generate(
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\torch\autograd\grad_mode.py", line 27, in decorate_context
return func(*args, **kwargs)
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\transformers\generation\utils.py", line 1466, in generate
self._validate_generated_length(generation_config, input_ids_length, has_default_max_length)
File "<path to local clone>venv\voc_visualstyleprompting\lib\site-packages\transformers\generation\utils.py", line 1186, in _validate_generated_length
raise ValueError(
ValueError: Input length of input_ids is 0, but `max_length` is set to -13. This can lead to unexpected behavior. You should consider increasing `max_length` or, better yet, setting `max_new_tokens`.
```
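If the problem is what the error message suggests, then passing max_new_tokens in blip_inf_prompt might avoid it (just a guess from the traceback, untested):

```python
# app.py, blip_inf_prompt (per the traceback above) -- untested guess:
generated_ids = blip_model.generate(**inputs, max_new_tokens=40)
```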
If I then click the watercolor horse/tiger example and click Submit it works.
If I then select my own style image again and click Submit it does not crash, but still uses the previous watercolor style and ignores my style image.
OK, for those wanting to run this locally, I finally got it working after trying various package versions; the ones below worked.
```
python -m pip install --upgrade pip
pip install --no-cache-dir --ignore-installed --force-reinstall --no-warn-conflicts wheel==0.41.0
pip install --no-cache-dir --ignore-installed --force-reinstall --no-warn-conflicts diffusers==0.27.0
pip install --no-cache-dir --ignore-installed --force-reinstall --no-warn-conflicts accelerate==0.28.0
pip install --no-cache-dir --ignore-installed --force-reinstall --no-warn-conflicts einops==0.7.0
pip install --no-cache-dir --ignore-installed --force-reinstall --no-warn-conflicts kornia==0.7.2
pip install --no-cache-dir --ignore-installed --force-reinstall --no-warn-conflicts gradio==4.25.0
pip install --no-cache-dir --ignore-installed --force-reinstall --no-warn-conflicts transformers==4.39.3
pip install --no-cache-dir --ignore-installed --force-reinstall --no-warn-conflicts opencv-python==4.9.0.80
pip install --no-cache-dir --ignore-installed --force-reinstall --no-warn-conflicts xformers==0.0.25 --index-url https://download.pytorch.org/whl/cu118
pip uninstall -y torch
pip uninstall -y torch
pip install --no-cache-dir --ignore-installed --force-reinstall --no-warn-conflicts torch==2.2.1+cu118 torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
```
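After those installs, a quick check that the CUDA build of torch is the one that actually ended up installed (generic sanity check, nothing specific to this repo):

```python
# Should print 2.2.1+cu118 and True if the install above worked
import torch
print(torch.__version__)
print(torch.cuda.is_available())
```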
https://softologyblog.wordpress.com/2023/10/10/a-plea-to-all-python-developers/
> To accurately reflect the style of the user image, a description of that image is necessary. Some users may struggle to write effective descriptions, so we have not included this aspect in the demo.
> We will update the demo code to support this by utilizing BLIP2.
I think BLIP may struggle to write an effective description too? Would it help to show the detected caption and allow the user to edit it before use? When an example is clicked, show the caption used for those too.
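Something like this in the gradio UI would cover it, with the textbox pre-filled from BLIP but still editable (a rough sketch; blip_caption is a placeholder for whatever captioning function the demo ends up using):

```python
# Rough sketch of an editable-caption UI; wiring is illustrative, not from app.py.
import gradio as gr

def blip_caption(pil_image):
    # placeholder: return the BLIP-2 caption for the uploaded style image
    return "a description of the style image"

with gr.Blocks() as demo:
    style_img = gr.Image(type="pil", label="Style image")
    caption = gr.Textbox(label="Detected caption (edit before generating)")
    style_img.upload(fn=blip_caption, inputs=style_img, outputs=caption)

demo.launch()
```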
Here are some "failed" results; maybe better caption text for the style images would help?
Do you think these results are due to the caption or just a bad style image choice?
The broccoli image was BLIP captioned "broccoli is a vegetable that is very popular". Would a better prompt help get a better styled result? Maybe just "broccoli".
The wave image was captioned "a large wave breaking on the ocean".
Those two and the tiger above are not as "clean" as the example results. For the tiger above I expected textures that matched the style image. Would a better caption help there?