[Direct ENTER] for bootcamp materials. Apply for the boot camp (Deadline 11:59pm 12/6/2019)

collaboratory logo

4th Annual Data Science Boot Camp

The Columbia Data Science Institute and Columbia Enterpreneuship are organizing the Fourth Winter-Intensive Collaboratory Data Science Boot Camp

The Collaboratory@Columbia is pleased to announce our 4th annual free Winter-Break Data Science Boot Camp. Designed by faculty, research scientists, and postdoctoral scholars from Columbia University’s Data Science Institute, the curriculum includes on-line learning material, introductory lectures and hands-on laboratory experiences.

The boot camp's mission is to enable Columbia faculty members, postdoctoral researchers and senior Ph.D. students to adopt more state-of-the-art data-science tools in their research and educational activities. To achieve this goal, we have designed a new modular model that is different from the previous week-long format. On January 15th-17th, 2020, we will have a data science skill camp that consists of three one-day long modules taught by instructors on machine learning, Bayesian modeling and text mining.

Apply for the boot camp (Deadline 11:59pm 12/6/2019)


  • When:

    • [January 15th – 17th, 2020] Data Science Skill Camp
  • Where:

  • What: This year, to encourage participation from Columbia faculty, we have adopted a modular structure for the boot camp with four stand-alone modules:

  • Who should apply: Columbia full-time faculty, postdoctoral researchers and senior Ph.D. students who have working knowledge of basic probability and statistics, programming and data analytics, and are ready to sharpen their skills to include a training in data science research skills, and/or to create a data science teaching module in a course they teach, and/or to initiate a project in data science. Seats are limited (50 for each module), and we will give priority to faculty who have applied to or are considering applying to the Collaboratory fellows fund.

  • Click here to START (detailed course information, pre-requisites, and setup instructions.)

Apply for the boot camp (Deadline 11:59pm 12/6/2019)

deadline


About the instructors and organizers

andy Ben Wayne tian

Andreas Mueller (@amueller) is an associate research scientist at the Data Science Institute at Columbia University and author of the O’Reilly book “Introduction to Machine Learning with Python”, describing a practical approach to machine learning with python and scikit-learn. He is one of the core developers of the scikit-learn machine learning library, and he has been co-maintaining it for several years. He is also a Software Carpentry instructor. In the past, he worked at the NYU Center for Data Science on open source and open science, and as Machine Learning Scientist at Amazon. You can find his full cv here. His mission is to create open tools to lower the barrier of entry for machine learning applications, promote reproducible science and democratize the access to high-quality machine learning algorithms.

Ben Goodrich is a core instructor of QMSS and teaches Missing Data, Bayesian Statistics, Data Mining, Data Analysis, and Theory and Methodology at QMSS. Previously, he was a Post-doctoral Researcher working with Andrew Gelman at the Applied Statistics Center at Columbia University (primarily on the mi R package for missing Data). He received his PhD in Government and Social Policy from Harvard University in 2010 where his dissertation, It’s Not All About the Benjamins: Political Economy and Social-Psychology Theories of Welfare State Preferences, derived two new estimators and applies them to cross-country survey data to test competing theories of preferences for redistribution and other welfare state programs. He previously served as a research assistant at the Peterson Institute for international Economics. His research interests include methodology, comparative politics and political economy.

Wayne Tai Lee is a Lecturer in Discipline, Department of Statitics, Columbia University. He is a Taiwanese American Statistician with a passion for dancing. He was a practicing data scientist for 6 years in industries like social networks (LinkedIn), agriculture (The Climate Corporation), and heavy manufacturing (United Technologies) before joining the Columbia Statistics department as a lecturer in discipline. Wayne is on a quest to make the world a better place through statistics education. His training has been focused on uncertainty quantification for global climate models and his industry experience is mostly focused on AB testing and business-centered data quality issues.

Tian Zheng (@tz33cu) is Professor and Chair of Statistics, Associate Director for Education of Data Science Institute at Columbia University. She develops novel methods for exploring and understanding patterns in complex data from different application domains such as biology, psychology, climatology, and etc. Her current projects are in the fields of statistical machine learning, spatiotemporal modeling and social network analysis. Professor Zheng’s research has been recognized by the Outstanding Statistical Application Award from the American Statistical Association (ASA), the Mitchell Prize from ISBA and a Google research award. She became a Fellow of American Statistical Association in 2014. Professor Zheng is the receipt of 2017 Columbia’s Presidential Award for Outstanding Teaching. In 2019, she is the chair for ASA’s section on Statistical Learning and Data Science. She is on the advisory board for STATS at Sense About Science America that targets to develop a statistical literate citizenry.

Jointly founded by Columbia University’s Data Science Institute and Columbia Entrepreneurship, The Collaboratory Program is a university-wide program dedicated to supporting collaborative curricula innovations designed to ensure that all Columbia University students receive the education and training that they need to succeed in today’s data rich world.