This started because of a tweet :)

The Data Seminar

Managing data:

Database Management Systems (DBMS)

Week-01

Week-02

  • Indexing, transactions, isolation levels, and table locks in SQL
  • What file descriptors are and how they relate to DB performance
  • Types of DB replication in MariaDB
  • MariaDB async and semi-sync replication exercise
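
To make the file descriptor topic concrete, here is a minimal sketch that reads the per-process descriptor limit. It assumes a Unix-like system (the `resource` module is not available on Windows), and the open-descriptor count relies on `/proc/self/fd`, which is Linux-specific. The function names `fd_limits` and `open_fd_count` are made up for this illustration.

```python
import os
import resource  # Unix-only standard library module


def fd_limits():
    """Return the (soft, hard) limit on open file descriptors for this process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def open_fd_count():
    """Count descriptors currently open, or return None where /proc is missing."""
    fd_dir = "/proc/self/fd"  # assumption: Linux-style procfs
    if not os.path.isdir(fd_dir):
        return None
    return len(os.listdir(fd_dir))


soft, hard = fd_limits()
print(f"soft limit: {soft}, hard limit: {hard}, open now: {open_fd_count()}")
```

A database server spends a descriptor on every client connection and every open data file, so a low soft limit can cap connections or force files to be closed and reopened. MariaDB exposes a related server variable, `open_files_limit`.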

Week-03

  • MariaDB sync replication using Libvirt/KVM and Ansible

Week-04

  • MongoDB and MongoDB replica set exercises

Week-05

  • DB Backup and Recovery
  • Data Partitioning and Sharding
  • CAP theorem
  • Consistent Hashing and Quorums
  • HTTPS for Docker
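
As a rough illustration of the consistent hashing and quorum topics above, here is a minimal sketch, not a production implementation. The class name `HashRing`, the node names, the use of MD5, and the 100 virtual nodes per server are all assumptions made for this example.

```python
import bisect
import hashlib


def ring_hash(key: str) -> int:
    """Map a string to a point on the ring (MD5 chosen only for illustration)."""
    return int(hashlib.md5(key.encode("utf-8")).hexdigest(), 16)


class HashRing:
    def __init__(self, nodes=(), vnodes=100):
        self.vnodes = vnodes  # virtual nodes per server, to even out the load
        self._points = []     # sorted ring positions
        self._owner = {}      # ring position -> node name
        for node in nodes:
            self.add_node(node)

    def add_node(self, node):
        for i in range(self.vnodes):
            point = ring_hash(f"{node}#{i}")
            self._owner[point] = node
            bisect.insort(self._points, point)

    def remove_node(self, node):
        self._points = [p for p in self._points if self._owner[p] != node]
        self._owner = {p: n for p, n in self._owner.items() if n != node}

    def get_node(self, key):
        if not self._points:
            raise LookupError("ring is empty")
        # The first ring position clockwise from the key's hash owns the key.
        idx = bisect.bisect_right(self._points, ring_hash(key)) % len(self._points)
        return self._owner[self._points[idx]]


def quorums_overlap(n, r, w):
    """True when any read quorum of size r must intersect any write quorum of size w."""
    return r + w > n


ring = HashRing(["db-a", "db-b", "db-c"])
keys = [f"user:{i}" for i in range(1000)]
before = {k: ring.get_node(k) for k in keys}

ring.add_node("db-d")
moved_keys = [k for k in keys if ring.get_node(k) != before[k]]
print(f"{len(moved_keys)} of {len(keys)} keys moved after adding db-d")
```

Adding a server only moves the keys that the new server takes over; every other key stays where it was. With naive `hash(key) % number_of_servers` placement, most keys would move instead.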

To be decided

Deep Dive into NoSQL Databases:

  • Types of NoSQL databases: Key-Value, Document, Column, Graph

  • Popular NoSQL databases: MongoDB, Redis, Cassandra

  • Real-world applications of NoSQL

NoSQL Database Uses in Scalability and High Availability

(WHY use NoSQL)

  • Scalability challenges and solutions in NoSQL

  • High availability concepts in NoSQL databases

  • Replication and sharding

Enterprise Solutions and NoSQL (HOWs and WHYs):

  • Overview of enterprise NoSQL solutions: Aerospike, Cassandra, Elasticsearch, etc.

  • How enterprises utilize NoSQL for performance and scalability

System Design Concepts around NoSQL

  • Data modeling for NoSQL databases

  • Indexing strategies in NoSQL

  • Partitioning and sharding in detail

Database Internals:

  • Understanding the internals of SQL and NoSQL databases

  • Storage engines: How data is stored and retrieved

  • The role of caching in database performance

Distributed Systems and Databases:

  • Fundamentals of distributed systems

  • The role of databases in distributed architectures

  • Current trends in distributed databases

Cache databases

  • How cache databases work

  • How to implement Redis
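
To give a feel for how a cache database sits in front of a primary store, here is a minimal cache-aside sketch. It uses an in-memory dictionary as a stand-in for Redis so it runs without a server. `TTLCache`, `get_with_cache`, and `slow_db_lookup` are hypothetical names invented for this example. With the redis-py client, the analogous calls would be `get` and `set` with an `ex` expiry.

```python
import time


class TTLCache:
    """Tiny in-memory cache with per-entry expiry (a stand-in for Redis)."""

    def __init__(self, ttl_seconds, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock
        self._store = {}  # key -> (value, expires_at)

    def get(self, key):
        entry = self._store.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if self.clock() >= expires_at:
            del self._store[key]  # expired: drop it and report a miss
            return None
        return value

    def set(self, key, value):
        self._store[key] = (value, self.clock() + self.ttl)


def get_with_cache(key, cache, load_from_db):
    """Cache-aside read: try the cache first, fall back to the database on a miss."""
    value = cache.get(key)
    if value is None:
        value = load_from_db(key)
        cache.set(key, value)
    return value


# Demo with a fake clock and a fake "database" that records its calls.
now = [0.0]
db_calls = []


def slow_db_lookup(key):
    db_calls.append(key)
    return f"row-for-{key}"


cache = TTLCache(ttl_seconds=30, clock=lambda: now[0])
first = get_with_cache("user:1", cache, slow_db_lookup)   # miss, goes to the DB
second = get_with_cache("user:1", cache, slow_db_lookup)  # hit, served from cache
now[0] = 31.0                                             # the entry has expired
third = get_with_cache("user:1", cache, slow_db_lookup)   # miss again
```

The expiry bounds how stale a cached value can get. A shorter TTL means fresher data but more trips to the database.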