Hi! I'm Felipe Valory, a data scientist skilled at transforming data into valuable insights that drive business results.
I’m experienced in every step of creating data-driven business solutions, from understanding the problem to deploying the model in production using cloud platforms.
I’ve had the opportunity to develop solutions for important business challenges, such as:
- Forecasting property prices.
- Identifying customers at risk of churn.
- Predicting sales trends.
- Creating data-driven dashboards for management.
- Designing movie recommendation systems and more.
Click here to dive into my portfolio!
A quick introduction to who I am, including my background, experiences, and career path, along with what drives me in the world of Data Science.
A showcase of the most impactful projects I’ve developed, demonstrating my hands-on skills and ability to solve real-world problems using data. Here are some highlights:
- Description: This project aims to rank a list of potential customers based on their likelihood of purchasing car insurance (propensity score).
- Technologies Used: Python, Pandas, Sweetviz, Seaborn, Scikit-Learn, Machine Learning models, Git, Agile Methodology.
- Repository: GitHub Link
- Description: A Machine Learning model to predict customer churn (account cancellations) for a bank.
- Technologies Used: Python, Pandas, Numpy, Seaborn, Scikit-Learn, Random Forest, Git, Agile Methodology.
- Repository: GitHub Link
- Description: A Machine Learning model to predict the next six weeks of sales for Rossmann stores.
- Technologies Used: Python, Pandas, Numpy, Flask, Inflection, Seaborn, Scikit-Learn, Boruta, Linear Regression, Random Forest, XGBoost, Git, Heroku Cloud, Agile Methodology.
- Repository: GitHub Link
- Description: A dashboard to monitor strategic KPIs for a delivery company.
- Technologies Used: Python, Pandas, Matplotlib, Plotly, Streamlit, Streamlit Cloud, Git, Jupyter Lab.
- Repository: GitHub Link
- Description: Performance analysis of Regression, Classification, and Clustering models.
- Technologies Used: Regression, Classification, and Clustering Algorithms, Performance Metrics (RMSE, MSE, MAE, MAPE, Precision, Recall, ROC, F1-Score, Silhouette Score, R2), Scikit-Learn.
- Repository: GitHub Link
- Description: Data analysis for Airbnb revenue and customer behavior to generate insights.
- Technologies Used: Power Query, Power BI.
- Repository: Access the Dashboard
- Data Collect and Storage: SQL, MySQL and Postgres
- Data Process and Analysis: Python and statistics
- Development: Git, Scrum, Project Management
- Data Visualization: Power BI, Looker and Streamlit web
- Machine Learning Modeling: Classification, Regression and Clustering
- Machine Learning Deployment: AWS Cloud, Heroku, Streamlit Cloud, Google Cloud Platform (GCP)
I'm excited to discuss how my Data Science skills can add value to your team. Feel free to reach out!
Thank you for visiting my portfolio. Let’s turn data into strategic decisions together!