
NeurIPS 2022 AutoGluon Workshop. See website: https://autogluon.github.io/neurips2022-autogluon-workshop/


AutoGluon: Empowering (Multimodal) AutoML for the Next 10 Million Users

Automated machine learning (AutoML) promises to translate raw data into accurate predictions without significant human effort, expertise, or manual experimentation. In this workshop, we introduce AutoGluon, a state-of-the-art, easy-to-use toolkit that empowers multimodal AutoML. Unlike most AutoML systems, which focus on tabular tasks with categorical and numerical features, we consider supervised learning tasks on various types of data, including tabular features, text, images, and time series, as well as their combinations. We will introduce the real-world problems that AutoGluon can help you solve within three lines of code, along with the fundamental techniques adopted in the toolkit. Rather than diving deep into the mechanisms underlying each individual ML model, we emphasize how you can take advantage of a diverse collection of models to build an automated ML pipeline. Our workshop will also emphasize the techniques behind automatically building and training deep learning models, which are powerful yet cumbersome to manage manually.

Join us at NeurIPS 2022 at the New Orleans Ernest N. Morial Convention Center on Monday, November 28, at 2:00 PM CST in Room 293.

Schedule

Each section ends with a 10-15 minute Q&A. In addition, hands-on notebooks will be available after each session for attendees to try out asynchronously.

| Section | Speaker | Time (CST) | Slides | Cheatsheet |
| --- | --- | --- | --- | --- |
| Introduction + AutoGluon Tabular | Nick Erickson | 2:00PM -- 2:55PM | TBA | tabular-cheatsheet |
| Break | - | 2:55PM -- 3:05PM | - | - |
| AutoGluon Multimodal | Xingjian Shi, Yi Zhu | 3:05PM -- 4:00PM | TBA | multimodal-cheatsheet |
| Break | - | 4:00PM -- 4:10PM | - | - |
| AutoGluon Timeseries | Caner Turkmen | 4:10PM -- 4:50PM | TBA | timeseries-cheatsheet |
| Additional Q&A + Feedback | All speakers | 4:50PM -- 5:00PM | - | - |

Section Outline and Materials

AutoGluon Tabular

  • AutoML Basics: Discussion of core AutoML principles
  • History of competition ML and how it influenced the design of modern AutoML systems
  • Discussion of model combination strategies (stacking, bagging, model aggregation)
  • Constraint satisfaction and engineering for a performance envelope (accuracy, speed, compute resources)
  • Benchmark comparisons showcasing the advancement of AutoML systems in recent years, compared with both earlier AutoML systems and human data scientists (4 AutoML frameworks, 104 OpenML datasets, 10 Kaggle datasets)
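
As a rough illustration of one of the model combination strategies listed above, the sketch below shows bagging in its classic bootstrap form: several copies of a simple model are fit on resampled data and their predictions are averaged. This is not AutoGluon's implementation; the helper names (`fit_line`, `bagged_fit`, `bagged_predict`) and the toy data are hypothetical, and a one-feature least-squares line stands in for a real base model.

```python
import random
from statistics import mean


def fit_line(xs, ys):
    """Fit y = a*x + b by ordinary least squares on a single feature."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0:
        # Degenerate sample (all x equal): fall back to a constant model.
        return 0.0, my
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return a, my - a * mx


def bagged_fit(xs, ys, n_models=10, seed=0):
    """Train n_models base models, each on a bootstrap resample of the data."""
    rng = random.Random(seed)
    n = len(xs)
    models = []
    for _ in range(n_models):
        # Sample n indices with replacement (the bootstrap step).
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(fit_line([xs[i] for i in idx], [ys[i] for i in idx]))
    return models


def bagged_predict(models, x):
    """Aggregate by averaging the predictions of all base models."""
    return mean(a * x + b for a, b in models)


# Hypothetical toy data lying exactly on y = 2x + 1.
xs = [float(i) for i in range(8)]
ys = [2.0 * x + 1.0 for x in xs]
models = bagged_fit(xs, ys)
prediction = bagged_predict(models, 10.0)
```

Averaging over resampled fits mainly reduces variance, which is why bagging tends to help most with unstable base models such as decision trees. Stacking goes one step further by training a second-level model on the base models' predictions.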

AutoGluon Multimodal

  • Real-world multimodal problems (life beyond captioning images)
  • Foundation models for image and text
  • Fusion techniques
  • Object detection
  • Hyper-parameter optimization
  • Zero-shot image classification
  • Multimodal matching
  • Handling larger foundation models
    • Training: Parameter-efficient finetuning
    • Deployment: Model distillation
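
To make the zero-shot image classification topic above more concrete, here is a minimal sketch of the usual embedding-based approach: an image embedding is compared with the embeddings of candidate label texts, and the most similar label is chosen. The three-dimensional vectors below are made-up stand-ins for the outputs of a pretrained image-text model, and `zero_shot_classify` is a hypothetical helper, not part of AutoGluon.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def zero_shot_classify(image_emb, label_embs):
    """Return the label whose text embedding is closest to the image embedding."""
    return max(label_embs, key=lambda label: cosine(image_emb, label_embs[label]))


# Hypothetical embeddings; a real system would obtain these from a
# pretrained image encoder and text encoder sharing one embedding space.
label_embs = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.1, 0.9, 0.0],
    "car": [0.0, 0.1, 0.9],
}
image_emb = [0.8, 0.3, 0.1]

predicted = zero_shot_classify(image_emb, label_embs)
```

No classifier is trained here: adding a new class only requires embedding its label text, which is what makes the approach "zero-shot".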

AutoGluon Timeseries

  • Time series forecasting in a nutshell
  • An overview of machine learning for forecasting
  • AutoML in time series and unique challenges
  • Forecasting with AutoGluon-TimeSeries
  • Looking forward in time series AutoML
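
For readers new to forecasting, the sketch below shows a seasonal naive forecast, a common baseline against which machine learning forecasters are often compared: each future value is predicted to equal the value observed one season earlier. The function name `seasonal_naive` and the toy series are hypothetical and are not taken from the workshop materials.

```python
def seasonal_naive(history, horizon, season_length):
    """Forecast `horizon` steps by repeating the last observed season."""
    if season_length <= 0:
        raise ValueError("season_length must be positive")
    if len(history) < season_length:
        raise ValueError("need at least one full season of history")
    last_season = history[-season_length:]
    # Step h ahead reuses the value at the same position in the last season.
    return [last_season[h % season_length] for h in range(horizon)]


# Hypothetical series with a season of length 4.
history = [1, 2, 3, 4, 5, 6, 7, 8]
forecast = seasonal_naive(history, horizon=6, season_length=4)
```

A baseline like this costs almost nothing to compute, so it gives a useful floor: a learned model that cannot beat it on held-out data is probably not worth its training cost.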

Hands-on Notebooks

For hands-on tutorials, we provide notebooks for you to try out AutoGluon via SageMaker Studio Lab or Google Colab.

More details will be added.

Check out the AutoGluon website and get started!