zaaleksey's Stars
great-expectations/great_expectations
Always know what to expect from your data.
anton-k/ru-neophyte-guide-to-scala
Перевод на русский серии статей Daniel Westheide "The Neophyte's Guide to Scala"
zaleslaw/Spark-Tutorial
How to build your first Spark application with MLlib, StructuredStreaming, GraphFrames, Datasets and so on? Answer is here!
Dzeru/distributed-data-processing-systems
Курс распределенных систем обработки данных, 4 курс КНиИТ, 2022 год
activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
mtth/hdfs
API and command line interface for HDFS
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
Yelp/mrjob
Run MapReduce jobs on Hadoop or Amazon Web Services
karpenkovarya/airflow_for_beginners
pallets/click
Python composable command line interface toolkit
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Yorko/mlcourse.ai
Open Machine Learning Course
VictoriaGurkova/Split-Merge-Queueing-System-Optimization-Methods
dwmkerr/hacker-laws
💻📖 Laws, Theories, Principles and Patterns that developers will find useful. #hackerlaws