This project is based on an assignment given by Compass Digital for a full-time position. The goal is to develop an intelligent, automated system to match external products with internal items using a Streamlit-based front-end application.
Your stakeholder operates convenience-store-like markets and receives weekly shipments from suppliers, including many new products. The current process of mapping the two product lists together is slow and manual. Your goal is to explore the datasets and develop an intelligent, automated system to match external products with internal items. Ensure that your solution includes prompt engineering as part of your tech stack. The match has to be exact, meaning the product manufacturer, name, and size must be identical.
- Final Result: A table including all external items with the corresponding mapped internal product. If no match is found, the table should indicate NULL for the internal product.
- Presentation: A short presentation to demonstrate how your system works, targeted towards a non-technical audience.
- Bonus: Develop a simple front-end app using Streamlit.
- Install Python: Ensure Python is installed on your system.
- Create Virtual Environment:
python -m venv venv
- Activate Virtual Environment:
- On Windows:
.\venv\Scripts\activate
- On macOS/Linux:
source venv/bin/activate
- On Windows:
- Install Dependencies:
pip install -r requirements.txt
- Run the Application:
streamlit run app.py
- Sidebar: Displays the total number of internal product entries.
- Product Name Matcher: Enter the external product name to find the corresponding internal product name.
- Bulk Product Matching: Upload a CSV file containing external product names to get a new CSV file with matched internal product names.
- Bulk Data Loading: Upload a CSV file to process and embed the data.
Refer to app.py
for a detailed structure of the application.
- Submitted by: Raoof Naushad
- Email: raoofnaushad.7@gmail.com
- Contact: +19029897685
The assignment involves developing an automated system to match external products with internal items. The solution should include prompt engineering and be presented in a user-friendly front-end application. The evaluation criteria include the approach to dataset exploration, solution effectiveness, and presentation clarity.