kaustubhyerkade/kaustubh-yerkade
DE Stuff - Programming & DSA in Python Distrubuted computation & storage -Hadoop - imp - spark,spark sql file format - json,avro,parquet type of data - structiured & semi sturectured processing mechniasm - batch & real time pub sub -kafka or AWS Kinesis data warehuse desginng amazon - ecomerce, netflix sql complex transactional & no sql databases - key value ,document,columanr base,graph base db,-mongodb,caseendra,hbase - data visualization tools- ETL tools cloud services- experience ๐ฅ๐๐บ๐ฝ๐ผ๐ฟ๐๐ฎ๐ป๐ ๐ฅ๐ฒ๐๐ผ๐๐ฟ๐ฐ๐ฒ๐๐ฅ ๐ด Data Engineer Roadmap Document : โก๏ธhttps://docs.google.com/document/d/1g...โ ๐ด Python : โก๏ธ https://www.youtube.com/watch?v=_uQrJ...โ โก๏ธ https://www.programiz.com/python-prog...โ ๐ด Scala : โก๏ธ https://www.youtube.com/watch?v=LQVDJ...โ โก๏ธ http://allaboutscala.com/โ ๐ด Java : โก๏ธ https://www.youtube.com/watch?v=eIrMb...โ โก๏ธ https://beginnersbook.com/java-tutori...โ ๐ด Linux, Unix, Shell Scripting : โก๏ธ https://practice.geeksforgeeks.org/ba...โ โก๏ธ Above one is free course with invitation code - ELEARNINGBLINUX ๐ด Data Structures & Algorithms : โก๏ธ https://www.youtube.com/watch?v=5_5oE...โ โก๏ธ https://www.geeksforgeeks.org/โ โก๏ธ https://leetcode.com/โ ๐ด DBMS : โก๏ธ https://www.youtube.com/watch?v=kBdlM...โ โก๏ธ https://www.studytonight.com/dbms/โ ๐ด SQL Scripting : โก๏ธ https://www.youtube.com/watch?v=HXV3z...โ โก๏ธ https://www.youtube.com/watch?v=7S_tz...โ โก๏ธ https://www.w3schools.com/sql/โ ๐ด Basic Terminologies In BigData : โก๏ธ https://data-flair.training/blogs/wha...โ โก๏ธ https://www.edureka.co/blog/what-is-b...โ ๐ด Data Exploration Libraries : โก๏ธ Pandas - https://www.youtube.com/watch?v=UB3DE...โ โก๏ธ NumPy - https://www.youtube.com/watch?v=DI8wg...โ ๐ด Data Warehousing Concepts : โก๏ธ https://www.youtube.com/watch?v=J326L...โ โก๏ธhttps://www.tutorialspoint.com/dwh/dw...โ. ๐ด BigData Frameworks (Hadoop, Hive, Spark, Sqoop, Nifi, Flume) : โก๏ธ https://www.youtube.com/results?searc...โ โก๏ธ https://www.youtube.com/user/edurekaINโ โก๏ธ https://data-flair.training/โ โก๏ธ https://www.edureka.co/โ ๐ด Workflow Schedulers, Dependency Management : โก๏ธ https://www.youtube.com/watch?v=niJ06...โ โก๏ธ https://www.youtube.com/watch?v=6RebQ...โ ๐ด NoSQL Databases : โก๏ธ HBase - https://www.youtube.com/watch?v=NOX6-...โ โก๏ธ Cassandra - https://www.youtube.com/watch?v=iDhIj...โ โก๏ธ Elastic Search - https://www.youtube.com/watch?v=1Envk...โ โก๏ธ MongoDB - https://www.youtube.com/watch?v=pWbMr...โ ๐ด Apache Kafka : โก๏ธ https://www.youtube.com/watch?v=daRyk...โ ๐ด Dashboarding Tools : โก๏ธ Tableau - https://www.youtube.com/watch?v=aHaOI...โ โก๏ธ PowerBI - https://www.youtube.com/watch?v=3u7MQ...โ โก๏ธ Grafana - https://www.youtube.com/watch?v=CjABE...โ โก๏ธ Kibana - https://www.youtube.com/watch?v=gQ1c1...โ ๐ด BigData Services in Cloud (AWS) : โก๏ธ https://www.youtube.com/watch?v=k1RI5...โ โก๏ธ https://www.youtube.com/watch?v=8PyLr...โ โก๏ธ https://www.simplilearn.com/aws-big-d...โ ๐๐บ๐ฝ๐ผ๐ฟ๐๐ฎ๐ป๐ ๐๐ผ๐ฝ๐ถ๐ฐ๐ ๐ถ๐ป ๐ฆ๐ค๐ : ๐ Joins ๐ Group By ๐ Nested joins ๐ Case-When conditions ๐ Window functions ๐ง๐ฒ๐ฐ๐ต ๐๐๐ฎ๐ฐ๐ธ ๐ณ๐ผ๐ฟ ๐ฟ๐ฒ๐ฎ๐น๐๐ถ๐บ๐ฒ ๐ฑ๐ฎ๐๐ฎ ๐ฝ๐ถ๐ฝ๐ฒ๐น๐ถ๐ป๐ฒ๐ : ๐ Apache Kafka ๐ Apache Flink ๐ Apache Storm ๐ AWS Kinesis ๐ Spark Streaming ๐๐ถ๐ด๐๐ฎ๐๐ฎ ๐ณ๐ฟ๐ฎ๐บ๐ฒ๐๐ผ๐ฟ๐ธ๐ ๐ฎ๐ป๐ฑ ๐๐ฎ๐ฑ๐ผ๐ผ๐ฝ ๐ฒ๐ฐ๐ผ๐๐๐๐๐ฒ๐บ : ๐ Hadoop architecture, Map-Reduce, HDFS, Yarn ๐ Apache Spark ๐ Hive ๐ Flume ๐ Sqoop ๐ Zookeeper ๐ Ambari, Hue ๐ Oozie, Airflow, Azkaban ๐๐ฎ๐๐ฎ ๐ฉ๐ถ๐๐๐ฎ๐น๐ถ๐๐ฎ๐๐ถ๐ผ๐ป ๐ง๐ผ๐ผ๐น๐ : ๐ Tableau ๐ Power BI ๐ Qlik Sense ๐ Grafana ๐ Kibana ๐ง๐ฟ๐ฎ๐ป๐๐ฎ๐ฐ๐๐ถ๐ผ๐ป๐ฎ๐น ๐๐ฎ๐๐ฎ๐ฏ๐ฎ๐๐ฒ๐ : ๐ Amazon Aurora ๐ PostgreSQL ๐ MySQL ๐ MariaDB ๐ Oracle ๐ SQL Server ๐ก๐ผ-๐ฆ๐ค๐ ๐๐ฎ๐๐ฎ๐ฏ๐ฎ๐๐ฒ๐ : ๐ DynamoDB ๐ Cassandra ๐ MongoDB ๐ ElasticSearch ๐ HBase ๐ Couchbase ๐ Redis Book - Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python 2nd Edition,