/StableDiffusionHelper

Advanced automated image processing tool for selection, cropping, and standardization. (Helper for stable diffusion), now updated with GUI 🎉

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

📸 Stable Diffusion Helper - GUI 🎨

Advanced automated image processing tool for selection, cropping, and standardization

🎉 Now with a GUI 🎉

🤓 Features

  • Automatic Face Detection: Utilizes dlib's and openCV's powerful face detection to identify faces in any image.
  • Image Type Selection: Select between Realistic/Anime images for better face detection.
  • Smart Cropping: Automatically crops images to focus on the detected faces, adjusting the size and aspect ratio as needed, offering a choice between Squared or Rectangular crops.
  • Image Standardization: Converts and resizes images to a uniform format and size, making it easier to work with them in your projects.
  • Batch Processing: Process an entire folder of images in one go, saving you time and effort.
  • Manual Cropping: Crop Images one at a time or all at once using the cropping tool which also gives you the option to choose resizing size.
  • Caption Editing: Edit captions for images in a batch, allowing you to quickly update or correct captions, with an easy check-box like interface.
  • Advanced Logging Tools: Keeps track of images that failed to process, allowing you to easily review and handle exceptions, along with time taken for the total process.

🥳 Latest Update (7/7/24)

  • Image Crop Type has been replaced with sliders, allowing you to choose any aspect ratio that you want
  • Same applies to Image Resize dimensions, which is right beside it
  • Added a drop-down, Image Size Calculator for calculating the aspect ratio for any height that you provide (you can use the height box as a width box by inverting the positions of H and W in the crop ratio text box)
  • Added more steps to the [IMP Controller] slider for more control over the zoom out feature

🏃‍♂️ Getting Started

⚙️ Pre-requisites

Before you begin, ensure you have the following installed:

  • Python 3.x
    • Suggested Python 3.10.11 [3.11+ discontinues imghdr, which is a core dependancy]
  • Git

🚀 Installation

  • Fresh Install:
    • Download and run the install.bat from releases
      • Will automatically install dependancies and launch GUI/IPYNB

🔧 Let's get things running !!

  • To run the GUI, simply run the GUI.bat file.

    • The file will prompt you if you want to install remBG with GPU support, follow the on screen instructions for the same
    • If the browser does not auto-open, a link will appear on your screen, ctrl + right click on it to open it on your browser
  • To run the .ipynb file, run IPYNB.bat, more info here

  • To update if other methods fail: run the troubleshoot.bat file

🧠 Usage

There are 3 tabs to choose from:


🖼️ Image Processing


1:

  • Image Folder Location: Enter the name of the folder containing the images you want to process, this will be where the process creates folders inside of.

2:

  • Image Crop Type: Choose between Square or Rectangle for the type of crop you want Deprecated, Replaced with sliders, AutoCrop Image Height/Width, choose any free-form aspect ratio that you want
  • Image Crop Dimensions: Enter the dimensions for the crop, in a Width, Height format. For Sqaure, stick to 1:1, and for rectangle, stick to 3:4 or alternatively use Manual Cropping Deprecated, Replaced with slides, AutoCrop Images Resize Height/Width, same as above
  • Above these options, added Image Size Calculator for calculating the aspect ratio for any height that you provide (you can use the height box as a width box by inverting the positions of H and W in the crop ratio text box)
  • Image Size Threshold: Removes all images smaller than size Width, Height

3:

  • Basic Processing:

    • Duplicate Check: Check for duplicate images in the folder and moves them to a separate folder inside the folder location you provided.
    • Suitabilty Check: With references to values you define below, will get top N images that match the criteria.
    • Top N: These top N images will be selected from the folder
  • Advanced Processing:

    • Remove Background: Removes the background from the images using remBG, with an option to use GPU support.

4:

  • Style Select: Select between Realistic or Anime for the type of images in the dataset (Only used for Suitabilty Check and Face Auto-Crop).
  • Face Crop Method: Choose between Auto or Manual for the type of face cropping you want to use.
    • Face Crop[Auto]: Automatically crops image with respect to the zoom out variable defined via slider.
      • 1: Gives HIGHLY zoomed in images
      • 2: Default, Image cropped around face
      • 2+ The higher you go, the further away the crop is from the face, and if the size exceeds the image, it will be limited by the image size, so try not to go overboard
    • Face Crop[Manual]: Manually crop the image using the cropping tool, with an option to resize the image to a specific size
      • NOTE: Cropping is the last prcess in the pipeline, after all the other processes are done, a button guiding you to the manual cropping tab will appear underneath the Start Processing button, named Click to go to Manual Cropping

5:

  • Values for Crop2Face and Suitability Check:
    • Minimum Confidence: The minimum confidence level for the face detection model to consider a face
    • Minimum Size: The minimum ratio of area of the face to the total image size
    • Minimum Threshold(Image Sharpness): The minimum threshold for the image sharpness, highest value is at 100
    • Blurriness Threshold: The maximum threshold for the image blurriness, lowest value is at 100, lower the value, the more blurry the image

6:

  • Check Choices: Click the button to go through the values you've entered
  • Start Processing: This will start the processes in the pipelined order, along with a loading bar
    • Pipeline Order: Image Conversion to PNG -> Duplicate Check -> Suitability Check -> Background Removal -> Face Crop[Manual/Auto] -> Image Resize

📝 Caption Editing


1:

  • Caption Folder Location: Enter the name of the folder containing the text files you want to process

2:

  • Character Name: Enter the unique name of the character you want to train your model on
  • Max Count: The maximum number of captions you will see retrieved from the text files en masse
  • See Most Used Words: Click the button to see the most used words in the text files, this will open up a checkbox list below the button

3:

  • Selecting the words to delete: Select the checkbox next to each word that you would like removed
  • Process the files: Click this to press the changes onto the files [This is a permanant change, so be sure to check before you press, and keep a backup if you're unsure]

📸 Manual Cropping


1:

  • Ratio Value: Enter the crop box ratio in a Height, Width format, this will be the final size of the image
  • Apply Sizes: Click this button on every change of Ratio Value's
  • Upload Images: Upload either a single image or bulk images
  • Load Images: This will load the images selected, and if you are coming from Image Processing, Click this to load in the images

2:

  • Cropping: Click and drag the box to crop the image, and click Crop to crop the image
    • Troubleshooting: If the cropping box looks incorrect, drag one corner and that should snap to the ratio that you've entered

3:

  • Next Image: Clicking on this will load the next image (if any), and will save the image you've cropped to StableDiffusionHelper/Images/Outputs/ManualCrop

😖 Troubleshooting

Sometimes the GUI might crash due to unexpected reasons (has not been patched yet). Each run of this code writes a sdh_log.txt file in the root of this project, which contains the error logs. If you encounter an error, please open an issue with the log file attached.

  • Basic Fixes:
    • If the GUI crash takes place during "Suitability Check", try removing the file from the folder and proceed, as of now, it's a known issue.

✍️ Contributing

I welcome contributions! If you have a suggestion that would make this better, please fork the repo and/or create a Pull Request/Issue.

Don't forget to give the project a star! ⭐

📝 License

Distributed under the GPL-3.0 License. See LICENSE for more information.

🫂 Acknowledgements

  • dlib for the amazing face detection model.
  • OpenCV for providing the tools necessary for image processing.
  • Pillow for image manipulation.

🖼️ Thank you for checking out my project !!