Medium - https://medium.com/@kaustubhyerkade/getting-started-with-computer-science-d3a2418ff39c
DE Stuff - Programming & DSA in Python Distrubuted computation & storage -Hadoop - imp - spark,spark sql file format - json,avro,parquet type of data - structiured & semi sturectured processing mechniasm - batch & real time pub sub -kafka or AWS Kinesis data warehuse desginng amazon - ecomerce, netflix sql complex transactional & no sql databases - key value ,document,columanr base,graph base db,-mongodb,caseendra,hbase - data visualization tools- ETL tools cloud services- experience
๐ฅ๐๐บ๐ฝ๐ผ๐ฟ๐๐ฎ๐ป๐ ๐ฅ๐ฒ๐๐ผ๐๐ฟ๐ฐ๐ฒ๐๐ฅ
๐ด Data Engineer Roadmap Document : โก๏ธhttps://docs.google.com/document/d/1g...โ
๐ด Python : โก๏ธ https://www.youtube.com/watch?v=_uQrJ...โ โก๏ธ https://www.programiz.com/python-prog...โ
๐ด Scala : โก๏ธ https://www.youtube.com/watch?v=LQVDJ...โ โก๏ธ http://allaboutscala.com/โ
๐ด Java : โก๏ธ https://www.youtube.com/watch?v=eIrMb...โ โก๏ธ https://beginnersbook.com/java-tutori...โ
๐ด Linux, Unix, Shell Scripting : โก๏ธ https://practice.geeksforgeeks.org/ba...โ โก๏ธ Above one is free course with invitation code - ELEARNINGBLINUX
๐ด Data Structures & Algorithms : โก๏ธ https://www.youtube.com/watch?v=5_5oE...โ โก๏ธ https://www.geeksforgeeks.org/โ โก๏ธ https://leetcode.com/โ
๐ด DBMS : โก๏ธ https://www.youtube.com/watch?v=kBdlM...โ โก๏ธ https://www.studytonight.com/dbms/โ
๐ด SQL Scripting : โก๏ธ https://www.youtube.com/watch?v=HXV3z...โ โก๏ธ https://www.youtube.com/watch?v=7S_tz...โ โก๏ธ https://www.w3schools.com/sql/โ
๐ด Basic Terminologies In BigData : โก๏ธ https://data-flair.training/blogs/wha...โ โก๏ธ https://www.edureka.co/blog/what-is-b...โ
๐ด Data Exploration Libraries : โก๏ธ Pandas - https://www.youtube.com/watch?v=UB3DE...โ โก๏ธ NumPy - https://www.youtube.com/watch?v=DI8wg...โ
๐ด Data Warehousing Concepts : โก๏ธ https://www.youtube.com/watch?v=J326L...โ โก๏ธhttps://www.tutorialspoint.com/dwh/dw...โ.
๐ด BigData Frameworks (Hadoop, Hive, Spark, Sqoop, Nifi, Flume) : โก๏ธ https://www.youtube.com/results?searc...โ โก๏ธ https://www.youtube.com/user/edurekaINโ โก๏ธ https://data-flair.training/โ โก๏ธ https://www.edureka.co/โ
๐ด Workflow Schedulers, Dependency Management : โก๏ธ https://www.youtube.com/watch?v=niJ06...โ โก๏ธ https://www.youtube.com/watch?v=6RebQ...โ
๐ด NoSQL Databases : โก๏ธ HBase - https://www.youtube.com/watch?v=NOX6-...โ โก๏ธ Cassandra - https://www.youtube.com/watch?v=iDhIj...โ โก๏ธ Elastic Search - https://www.youtube.com/watch?v=1Envk...โ โก๏ธ MongoDB - https://www.youtube.com/watch?v=pWbMr...โ
๐ด Apache Kafka : โก๏ธ https://www.youtube.com/watch?v=daRyk...โ
๐ด Dashboarding Tools : โก๏ธ Tableau - https://www.youtube.com/watch?v=aHaOI...โ โก๏ธ PowerBI - https://www.youtube.com/watch?v=3u7MQ...โ โก๏ธ Grafana - https://www.youtube.com/watch?v=CjABE...โ โก๏ธ Kibana - https://www.youtube.com/watch?v=gQ1c1...โ
๐ด BigData Services in Cloud (AWS) : โก๏ธ https://www.youtube.com/watch?v=k1RI5...โ โก๏ธ https://www.youtube.com/watch?v=8PyLr...โ โก๏ธ https://www.simplilearn.com/aws-big-d...โ
๐๐บ๐ฝ๐ผ๐ฟ๐๐ฎ๐ป๐ ๐๐ผ๐ฝ๐ถ๐ฐ๐ ๐ถ๐ป ๐ฆ๐ค๐ :
๐ Joins ๐ Group By ๐ Nested joins ๐ Case-When conditions ๐ Window functions
๐ง๐ฒ๐ฐ๐ต ๐๐๐ฎ๐ฐ๐ธ ๐ณ๐ผ๐ฟ ๐ฟ๐ฒ๐ฎ๐น๐๐ถ๐บ๐ฒ ๐ฑ๐ฎ๐๐ฎ ๐ฝ๐ถ๐ฝ๐ฒ๐น๐ถ๐ป๐ฒ๐ :
๐ Apache Kafka ๐ Apache Flink ๐ Apache Storm ๐ AWS Kinesis ๐ Spark Streaming
๐๐ถ๐ด๐๐ฎ๐๐ฎ ๐ณ๐ฟ๐ฎ๐บ๐ฒ๐๐ผ๐ฟ๐ธ๐ ๐ฎ๐ป๐ฑ ๐๐ฎ๐ฑ๐ผ๐ผ๐ฝ ๐ฒ๐ฐ๐ผ๐๐๐๐๐ฒ๐บ :
๐ Hadoop architecture, Map-Reduce, HDFS, Yarn ๐ Apache Spark ๐ Hive ๐ Flume ๐ Sqoop ๐ Zookeeper ๐ Ambari, Hue ๐ Oozie, Airflow, Azkaban
๐๐ฎ๐๐ฎ ๐ฉ๐ถ๐๐๐ฎ๐น๐ถ๐๐ฎ๐๐ถ๐ผ๐ป ๐ง๐ผ๐ผ๐น๐ :
๐ Tableau ๐ Power BI ๐ Qlik Sense ๐ Grafana ๐ Kibana
๐ง๐ฟ๐ฎ๐ป๐๐ฎ๐ฐ๐๐ถ๐ผ๐ป๐ฎ๐น ๐๐ฎ๐๐ฎ๐ฏ๐ฎ๐๐ฒ๐ :
๐ Amazon Aurora ๐ PostgreSQL ๐ MySQL ๐ MariaDB ๐ Oracle ๐ SQL Server
๐ก๐ผ-๐ฆ๐ค๐ ๐๐ฎ๐๐ฎ๐ฏ๐ฎ๐๐ฒ๐ :
๐ DynamoDB ๐ Cassandra ๐ MongoDB ๐ ElasticSearch ๐ HBase ๐ Couchbase ๐ Redis
Book - Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python 2nd Edition,