/Introduction_to_Data_Mining_R_Examples

R Code to accompany the book Introduction to Data Mining by Tan, Steinbach and Kumar (Code by Michael Hahsler)

Primary LanguageJupyter NotebookCreative Commons Attribution Share Alike 4.0 InternationalCC-BY-SA-4.0

R Companion for Introduction to Data Mining

This repository contains slides and documented R examples to accompany several chapters of the popular data mining text book:

Pang-Ning Tan, Michael Steinbach, Anuj Karpatne and Vipin Kumar, Introduction to Data Mining, Addison Wesley, 1st or 2nd edition.

The slides and examples are used in my course CS 7331 - Data Mining taught at SMU and will be regularly updated and improved. The code examples are now compiled into the free online book An R Companion for Introduction to Data Mining which is published under the creative commons attribution license and you can share and adapt them freely. Please open an issue
for corrections or to suggest improvements.

Covered Chapters

Chapter Slides R Code Sample Book Chapters
1. Introduction Slides R Code
2. Data Slides: Data, Exploration R Code
3. Classification: Basic Concepts and Techniques Slides R Code Read Chapter 3
4. Classification: Alternative Techniques Slides R Code
5. Association Analysis: Basic Concepts and Algorithms Slides R Code Read Chapter 5
7. Cluster Analysis: Basic Concepts and Algorithms Slides R Code Read Chapter 7

The raw R code and the Powerpoint files can be found in the repository directories code and slides.

License

Creative Commons License All code and documents in this repository are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

For questions please contact Michael Hahsler.