AI-MongoDB-for-Data-analysis-and-Machine-Learning

MongoDB for Data Analysts and Machine Learning

MongoDB is today's fastest-growing database platform and the leading "NoSQL" database in the world. While MongoDB has long been the established standard for web and mobile application development, it is also increasingly being used in broader contexts such as genomics, real-time analytics and IoT.

Analyzing data in MongoDB presents unique challenges because of MongoDB's non-SQL interfaces and non-relational data models. However, MongoDB includes native analytical capabilities, and there are connectors available to familiar analytic tools.

In this course, you'll learn how to interact with MongoDB, use the built-in MongoDB aggregation framework, and analyze MongoDB data with familiar analytics tools such as Python, Excel, Tableau, and Spark.

Introduction (10 minutes)

  • About the instructor
  • Scope of the course
  • Prerequisites

Module 1: MongoDB introduction (60 minutes)

  • What is MongoDB
  • Document model vs relational model
  • Advantages of MongoDB
  • JSON document structure
  • Challenges for data analysts and data scientists
  • Connecting using the shell
  • Inserting and changing documents
  • Find() commands
  • Introduction to the Aggregation framework
  • Using the Compass UI
  • Connecting from languages and tools: Python, R
  • Exercise1

Module 2: The aggregation Framework (40 minutes)

  • Manipulating documents: $match, $unwind, $project
  • Grouping and sorting
  • Joining collections
  • Creating aggregations in Compass
  • Exercise 2

Module 3: Advanced Aggregation (30 minutes)

  • Graph operations
  • Bucketing and facets
  • Windowing functions – moving averages, time series support

Module 4: The MongoDB BI connector (45 minutes)

  • Configuring the BI connector
  • Connecting to MongoDB Atlas BI connector
  • Using the BI connector with analytic tools: Tableau, Excel

Module 5: MongoDB machine learning with Spark (45 minutes)

  • Configuring the spark connector
  • Using Spark SQL
  • Using Spark
  • Using the Spark

Module 6: Wrap up (10 minutes)

  • Summary
  • Further resources