Senior Data Scientist | Machine Learning Engineer | Software Engineer | Econometrician

I work as a Data Science Consultant and have gained experience across a variety of industries and projects. I enjoy solving challenging problems and helping companies make the best use of their data to achieve their business goals.


Knowledge


My Tech Stack:

  • R (Since 2013)
  • Python (Since 2017)
  • SQL (Since 2018)
  • API development (Flask, FastAPI) (Since 2019)
  • AWS (S3, EC2, Lambda, Batch, SageMaker, ...) (Since 2019)
  • Git, Docker, CI/CD (Jenkins, GitHub Actions, CircleCI) (Since 2018)

My Stats Stack:

  • Univariate and multivariate time series analysis
  • Tree-based methods (XGBoost, CatBoost, ...)
  • Machine Learning / AI (ANN, CNN, LSTM, TFT, ...)
  • NLP (Transformer, BERT)
  • Multivariate Statistics / Predictive Analytics (GAM)
  • Bayesian Modelling ((R)Stan)

Languages and Tools:

python

linux bash docker git kubernetes kafka

jenkins circleci travisci

mariadb mysql postgresql sqlite redis

flask nginx fastapi

tensorflow pytorch pandas scikit_learn seaborn

aws azure


📺 Conference Talks

Performing Content: Can NLP and Deep Learning algorithms predict reader preferences?

Corona Community Pitch: Maps Matter – Realistic Hot Spot Detection Across Regional Boundaries


🎙️ Podcasts (German only)

In Numbers We Trust - Der Data Science Podcast - #21: Machine Learning Operations (MLOps)


📕 Blog Posts


Connect with me:

LinkedIn: sebastian-cattes