Anyone can now be a fashion model and walk the runway! The future of how we dress, how we try on outfits before purchasing, and even how we find items online will be shaped by AI.
- Start by uploading an image of yourself or a friend
- Once you upload the image, the app will find 5 high fashion outfits that suit you well. Mistral generates descriptions of outfits based on who is in the photo, as determined by a caption using florence-2-large on fal.ai. Then, we use Flux from Black Forest Labs to generate an image based on the outfit description. cat-vton is the model used to try on the outfit for the given user.
- The other option here is to use a Flux LoRA which you can train here. Input the lora file into the input field and the virtual try-on images will look more like you.
- Click "Start Show"
- This will generate a video of you in those new outfits walking the runway. We use Luma Dream Machine because they recently released their API. You can also use Viggle for some fun effects on their playground. Input the AI-generated images into the text/image to video models to get a "walk the runway" video.
- Add AI-generated music from Udio to go along with your walk if you wish!
- Run Browserbase web agent script to find a similar online and add it to your shopping cart for your review before you purchase.
Add your own API keys in .env file for Fal.ai, Mistral, and Browserbase
Run streamlit run main.py
amazonSearch.js and aritziaSearch.js contain the web agent scripts. Either run it in the browserbase playground to modify it for your own taste or run it locally.
Improve web agent shopping feature - certain actions are blocked (Amazon uses sophisticated systems to detect non-human behavior). Integrating with more shopping sites without bot detection can increase scope and help find a more similar outfit. Doing a similarity search between images and not just captions will also result in better results, as will adding filters for price / brand / material / etc.
Longer runway videos are currently still expensive and object persistence needs improvement