/workshop-random-forests

UNSW codeRs workshop: Introduction to Classification Trees and Random Forests in R. These documents will walk you through examples to fit classification trees and random forest models in R.

Primary LanguageHTMLCreative Commons Attribution 4.0 InternationalCC-BY-4.0

Workshop: randomForest in R

License: CC BY 4.0

José R. Ferrer-Paris (@jrfep) for UNSW codeRs.

Abstract

"Random Forests" are used everywhere, and for good reason! Random Forest is a powerful and versatile machine learning algorithm that grows and combines multiple decision trees to create a "forest". It sounds very complex, but learning to use them is very intuitive, specially if you got a USNW codeRs workshop to help you. JR will give a beginner friendly introduction to Random Forests applied to classification problems.

We will start our workshop using decision trees to describe rules for classifying data, and discuss how multiple, randomized trees can get us to more accurate classifications. We will use guide you through the code needed to fit classification trees and Random Forests using popular R packages. Before you know it, you will be talking about 'bagging', 'variable randomization' and 'ensembles prediction' like a pro.

Come with the latest version of R loaded on your laptop, or come a few minutes early so we can help you load it!

Bio

José Rafael Ferrer-Paris (a.k.a. JR) is currently Research Fellow at the Center for Ecosystem Science at UNSW, and a member of the International Union for the Conservation of Nature (IUCN) Thematic Group on Red List of Ecosystems. JR has studied and worked in Venezuela, Germany and South Africa with biodiversity data, spatial and temporal ecological data and geographical information systems. He is currently working with Prof. David Keith on global risk assessment of ecosystems. He has been using R since version 1.0.0, and also likes working with other command, scripting and programming languages like PHP, Bash, Python, JS, Java or Perl and all kinds of databases (SQL, XML, GDB).

Material based on:

R-markdown

To generate or update the R-scripts from R-markdown documents use:

knitr::purl("01-hands-on-classification-tree.Rmd",documentation=2)
knitr::purl("02-hands-on-random-forest.Rmd",documentation=2)