Automatically combines LLMs to outpaint images. Serves as the backend for image processing.
- see requirements.py
Put all models in models/.
- Prompt Generation: blip-image-captioning-large
- Image Outpainting: stable-diffusion-inpainting
- Image Super-resolution: Pillow resize
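
The three stages above could be chained roughly as in the sketch below. It is not the project's actual code: the function names, the `pad` and 2x resize values, and the sub-directory names under `models/` are assumptions made for illustration. It relies on `transformers`, `diffusers`, and Pillow being installed, and imports them lazily so the canvas helper can be used on its own.

```python
from pathlib import Path


def plan_canvas(width, height, pad, multiple=8):
    """Return (canvas_size, paste_box) for padding an image by `pad` px per side.

    Canvas sides are rounded up to a multiple of 8, which the Stable
    Diffusion pipelines in diffusers require for height and width.
    """
    def round_up(value):
        return -(-value // multiple) * multiple

    canvas_w = round_up(width + 2 * pad)
    canvas_h = round_up(height + 2 * pad)
    left = (canvas_w - width) // 2
    top = (canvas_h - height) // 2
    return (canvas_w, canvas_h), (left, top, left + width, top + height)


def outpaint(image_path, pad=64, models_dir="models"):
    """Caption, outpaint, and enlarge one image (hypothetical helper)."""
    # Heavy dependencies are imported here so plan_canvas works without them.
    from PIL import Image
    from transformers import BlipProcessor, BlipForConditionalGeneration
    from diffusers import StableDiffusionInpaintPipeline

    models = Path(models_dir)
    image = Image.open(image_path).convert("RGB")

    # 1. Prompt generation: unconditional captioning (no text prefix).
    blip_dir = str(models / "blip-image-captioning-large")  # assumed folder name
    processor = BlipProcessor.from_pretrained(blip_dir)
    captioner = BlipForConditionalGeneration.from_pretrained(blip_dir)
    inputs = processor(images=image, return_tensors="pt")
    token_ids = captioner.generate(**inputs, max_new_tokens=40)
    prompt = processor.decode(token_ids[0], skip_special_tokens=True)

    # 2. Outpainting: paste the image on a larger canvas and mask the border.
    #    In the mask, white (255) is regenerated and black (0) is kept.
    canvas_size, box = plan_canvas(image.width, image.height, pad)
    canvas = Image.new("RGB", canvas_size, "black")
    canvas.paste(image, box[:2])
    mask = Image.new("L", canvas_size, 255)
    mask.paste(0, box)
    sd_dir = str(models / "stable-diffusion-inpainting")  # assumed folder name
    pipe = StableDiffusionInpaintPipeline.from_pretrained(sd_dir)
    result = pipe(prompt=prompt, image=canvas, mask_image=mask,
                  width=canvas_size[0], height=canvas_size[1]).images[0]

    # 3. Super-resolution: plain Pillow resize (2x chosen arbitrarily).
    return result.resize((result.width * 2, result.height * 2), Image.LANCZOS)
```

Calling `outpaint("input.png")` would return a `PIL.Image` that can be written out with `.save(...)`. Keeping the canvas arithmetic in `plan_canvas` makes the size logic easy to check without loading any model.
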
- fix the upscale factor bug that occurs when the input resolution is outside the supported range.
- use an unconditional image-to-text model instead of a conditioned one.
- integrate LoRA into the generation pipeline.
- allow users to choose which model to use for outpainting and for image-to-text.
- improve prompt generation: negative prompts and detailed positive prompts.
- keep image content consistent across multiple resolutions.
- find a better super-resolution method.
- train a specific LoRA.
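
For the upscale factor item above, the source does not say how the bug was resolved. One common way to guard such a value is to clamp it, sketched below. The function name, the target size, and the factor limits are all hypothetical and not taken from the project.

```python
def upscale_factor(width, height, target_long_side=2048,
                   min_factor=1.0, max_factor=4.0):
    """Factor that scales the longer side to target_long_side.

    The result is clamped to [min_factor, max_factor] so that very small
    or very large inputs never yield an out-of-range factor.
    All default values here are illustrative assumptions.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    raw = target_long_side / max(width, height)
    return max(min_factor, min(max_factor, raw))
```

With these defaults, an input already larger than the target gets a factor of 1.0 instead of being shrunk, and a tiny input is capped at 4.0 rather than being enlarged without limit.
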