/che-kir.github.io

My blog about data science, machine learning, C++, Python, and other IT-oriented things that interest me.

IT blog

Ideas for projects:

  1. Text summarization using BERT, Transformers, etc. Dataset for this task: https://github.com/mahnazkoupaee/WikiHow-Dataset (the CSV file, 590 MB, can be downloaded via the link inside the repository)
  2. Telegram bot with NLP (maybe built on idea 1, or providing some other useful info)
  3. Text analysis to detect fake news, or a lack of information and evidence (https://data-flair.training/blogs/advanced-python-project-detecting-fake-news/)

Useful links:

Awesome dataset list for NLP tasks: https://quantumstat.com/dataset/dataset.html

http://nlpprogress.com/english/summarization.html

https://www.streamlit.io/

Deployment process for DS/ML: https://martinfowler.com/articles/cd4ml.html

FastAPI

https://pathmind.com/wiki/open-datasets

https://www.deeplearningbook.org

Data Science Interview Questions:

https://twitter.com/Al_Grigor/status/1230818076578459649

C++ notes:

Interview question quotes:

I like to just have a chat to start: what do they like about C++, dislike, how it compares to other languages, do they follow newer standards, what do they think of UB. If I'm asking C++ specific coding, I've asked them to implement something like std::function.
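As a rough idea of what "implement something like std::function" can look like, here is a minimal sketch based on type erasure through a virtual base class. The names `Function`, `CallableBase`, `CallableImpl`, `add_one`, and `usage_example` are made up for illustration. The real std::function also has a small-buffer optimization, `target()` access, and other details that this sketch leaves out, and the sketch assumes the stored callable is copyable.

```cpp
#include <cassert>
#include <functional>  // std::bad_function_call
#include <memory>
#include <utility>

// Primary template is declared but never defined; only the
// R(Args...) specialization below is usable.
template <typename Signature>
class Function;

template <typename R, typename... Args>
class Function<R(Args...)> {
    // Type-erased interface: hides the concrete callable type.
    struct CallableBase {
        virtual ~CallableBase() = default;
        virtual R invoke(Args... args) = 0;
        virtual std::unique_ptr<CallableBase> clone() const = 0;
    };

    // Concrete holder for one specific callable type F.
    template <typename F>
    struct CallableImpl final : CallableBase {
        explicit CallableImpl(F f) : fn(std::move(f)) {}
        R invoke(Args... args) override {
            return fn(std::forward<Args>(args)...);
        }
        std::unique_ptr<CallableBase> clone() const override {
            return std::make_unique<CallableImpl>(fn);  // requires copyable F
        }
        F fn;
    };

    std::unique_ptr<CallableBase> callable_;

public:
    Function() = default;  // empty, like a default-constructed std::function

    // Accepts lambdas, function pointers, and functors (taken by value).
    template <typename F>
    Function(F f)
        : callable_(std::make_unique<CallableImpl<F>>(std::move(f))) {}

    // Deep copy through the virtual clone().
    Function(const Function& other) {
        if (other.callable_) callable_ = other.callable_->clone();
    }
    Function(Function&&) noexcept = default;

    // By-value parameter covers both copy and move assignment.
    Function& operator=(Function other) noexcept {
        callable_ = std::move(other.callable_);
        return *this;
    }

    explicit operator bool() const noexcept {
        return static_cast<bool>(callable_);
    }

    // Throws std::bad_function_call when empty, mirroring std::function.
    R operator()(Args... args) const {
        if (!callable_) throw std::bad_function_call();
        return callable_->invoke(std::forward<Args>(args)...);
    }
};

// Hypothetical free function used only by the example below.
inline int add_one(int x) { return x + 1; }

inline void usage_example() {
    Function<int(int)> f = add_one;          // function pointer
    assert(f(1) == 2);

    int base = 10;
    f = [base](int x) { return base + x; };  // capturing lambda
    assert(f(5) == 15);

    Function<int(int)> g = f;                // independent deep copy
    assert(g(1) == 11);
}
```

Good follow-up topics for the same interview conversation: why the heap allocation is needed here, how a small-buffer optimization would avoid it, and what changes if move-only callables must be supported.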

Nice to have in the future:

  • Experience with any message queue services (RabbitMQ, Apache Kafka, etc.).
  • Deep learning: PyTorch, MXNet
  • Experience with Golang
  • Templates, initializer_list in different cases
  • OpenGL, DirectX
  • Docker / Kubernetes
  • Understanding of L2/L3 cache and optimization at the microsecond level

Other useful links:

http://rus-linux.net/lib.php?name=/MyLDP/hard/memory/memory.html

https://codereview.stackexchange.com

https://opensource.zalando.com/restful-api-guidelines/index.html

https://www.amazon.com/Advanced-C-Compiling-Milan-Stevanovic/dp/1430266678

https://www.threadingbuildingblocks.org/docs/help/reference/containers_overview/concurrent_queue_cls.htm

https://help.github.com/en/github/writing-on-github/basic-writing-and-formatting-syntax

Better code search in Visual Studio:

ReSharper C++, Visual Assist

Books

Professional C++. Author: Marc Gregoire

Mastering C++ Multithreading. Author: Maya Posch

Professional Multicore Programming: Design and Implementation for C++ Developers. Authors: Cameron Hughes, Tracey Hughes

Programming Multicore and Many-core Computing Systems. Editors: Sabri Pllana, Fatos Xhafa

C++ Multithreading Cookbook. Author: Miloš Ljumović

C++ Concurrency in Action: Practical Multithreading. Author: Anthony Williams

https://www.packtpub.com/free-learning