/CV

A resume on Github

Marek Cichy - Curriculum

My current goal

Career transition into a data scientist with focus on NLP from a Polish-Portuguese translation and interpretation role.

My background

Warsaw-based Portuguese language specialist with 8 years of experience. In this time I've:

  • translated over 4000 pages, served as an interpreter for about 400 hours.
  • created and managed Audioteka.es, the Spanish-speaking version of a Polish audiobook startup
  • cofounded Mandioca, a Latin American graphic novel publisher.

Currently I continue self-learning of Python, Machine Learning in general and NLP in particular under Piotr Migdał's guidance. I've just finished the WEC Omdena (AI4Good) energy transition project in the roles of ML Engineer and Task Manager.

Portfolio:

External projects

Private projects

  • PT/BR Classifier, classifying Portuguese texts according to their variety (European or Brasilian Portuguese)
  • NSFW-bot, Tweepy-based tweetbot consuming several image classification APIs
  • Pyster, a script for automated posting on an invite-only forum.

Articles on ML and Python

Books and courses

Talks

- "I scraped 5000 TED talks and know my sambas from my fados" on ML in PL Conference 2019

Probable next steps

Upgrading the PT/BR Classifier:

  • exploring the errors for patterns;
  • adding basic XAI to the frontend;
  • checking out an LSTM-based version of the classifier.
  • broadening the corpus (crowdsourced movie subtitles, YT comments, etc.)

Writing about:

  • PT/BR Classifier pt 2,
  • My experience with collaborative challenges in Omdena

Reading:

  • Introduction to Statistical Learning by Tibshirani et al. (under way)
  • Deep Learning in Python by Francois Chollet