Your browser is ancient!
Upgrade to a different browser to experience this site.

Introduction to Machine Learning in Sports Analytics

What You'll Learn

  • Gain an understanding of how classification and regression techniques can be used to enable sports analytics across athletic activities and events.
4 Modules
12 Hours
3 hrs per module (approx.)
Rating

About Introduction to Machine Learning in Sports Analytics

In this course students will explore supervised machine learning techniques using the python scikit learn (sklearn) toolkit and real-world athletic data to understand both machine learning algorithms and how to predict athletic outcomes. Building on the previous courses in the specialization, students will apply methods such as support vector machines (SVM), decision trees, random forest, linear and logistic regression, and ensembles of learners to examine data from professional sports leagues such as the NHL and MLB as well as wearable devices such as the Apple Watch and inertial measurement units (IMUs). By the end of the course students will have a broad understanding of how classification and regression techniques can be used to enable sports analytics across athletic activities and events.

Skills You'll Gain

  • Data Analysis
  • Data Visualization
  • Machine Learning
  • Sports Analytics
  • Statistical Analysis

What You'll Earn

Certificate of Completion
Certificates of completion acknowledge knowledge acquired upon completion of a non-credit course or program.
Experience Type
100% Online
Format
Self-Paced
Subject
  • Data Science
  • Education
Platform
Coursera
Welcome Message

Welcome to Introduction to Machine Learning in Sports Analytics, a course that applies supervised machine learning techniques to real-world sports data. Learners use Python and scikit-learn to explore classification and regression methods for predicting athletic performance and outcomes. This is the final course in the Sports Performance Analytics Specialization.

This abbreviated syllabus description was created with the help of AI tools and reviewed by staff. The full syllabus is available to those who enroll in the course.

Course Schedule

Module 1: Machine Learning Concepts

  • Ungraded Lab: JupyterLab
  • Video: Introduction
  • Reading: Help Us Learn More About You
  • Reading: Course Syllabus
  • Video: What is Machine Learning?
  • Video: The Machine Learning Workflow
  • Video: Our First Model: NHL Game Outcomes
  • Video: Building the Logistic Regression Model
  • Video: Considerations in Deploying The Model
  • Video: Wrap Up
  • Reading: Assignment 1 Programming Solution
  • Graded: Assignment 1

Module 2: Support Vector Machines

  • Video: Introduction to Support Vector Machines (SVMs)
  • Video: Polynomial Support Vector Machines
  • Video: Cross Validation
  • Video: A Real World SVM Model: Boxing Punch Classification
  • Reading: (Optional) - An evaluation of wearable inertial sensor configuration and supervised machine learning models for automatic punch classification in boxing
  • Reading: Assignment 2 Programming Solution
  • Graded: Assignment 2

Module 3: Decision Trees

  • Video: Decision Trees
  • Video: A Multiclass Tree Approach
  • Video: Model Trees
  • Video: Tuning and Inspecting Model Trees
  • Reading: Assignment 3 Programming Solution
  • Reading: UM Master of Applied Data Science (optional)
  • Graded: Assignment 3

Module 4: Ensembles & Beyond

  • Video: Ensembles
  • Video: Additional Machine Learning Concepts
  • Video: Baseball Hall of Fame Prediction
  • Video: Baseball Hall of Fame Demonstration Part 1
  • Video: Baseball Hall of Fame Demonstration Part 2
  • Reading: Free Deepnote Notebook Service
  • Reading: Putting Your Skills to the Test!
  • Reading: Post Course Survey
  • Graded: Assignment 4
Grading Policy

Learners must earn 80% or higher on all assessments. Each module contains one graded assignment worth 25% of your total grade.

Course content developed by U-M faculty and managed by the university. Faculty titles and affiliations are updated periodically.

Intermediate Level

Learners should have some familiarity with Python before starting this course. We recommend the Python for Everybody Specialization.

Course Video

Enrollment Options

Individuals

This experience is available to individual learners on the following platforms:

U-M Community

Free access is only available to current U-M students, alumni, faculty, and staff.

Organizations

Special pricing and tailored programming bundles available for organizational partners.

What are Coursera and edX?

Michigan Online learning experiences may be hosted on one or more learning platforms. Platform features may vary, including payment models, social communities, and learner support.

Coursera

  • Hosts online courses, series, and Teach-Outs from Michigan Online
  • Enroll and preview courses anytime
  • May earn a non-credit certificate from Coursera

edX

  • Hosts online courses and series from Michigan Online
  • Many offer a free (limited) audit option
  • May earn a non-credit certificate from edX

For more information visit the What are Coursera and edX? FAQ section

Reviews and Ratings

4.6

21 Ratings from Coursera

Most Recent Reviews

Read all reviews
I completed this course as the last course in the Sports Performance Analytics specialization so my review will look at the course in this context. My main observation would be that the issues with (the absence) of overall specialization design for the Sports Performance Analytics specialization have really come to the forefront during this course. What basic assumptions were made about learners when this specialization was designed? Who is this specialization for? Who is the target audience? Along the specialization, we move from "coding babysitting" to pretty advanced codes with no hint or explanation as for where the wild variation in coding skills is supposed to come from on the side of the learner. The same can be said for some of the maths content of the specialization as well. Sadly, there seems to be no real target audience here that would elevate all the courses onto the same platform - it really seems like there is a large amount of complete randomness here - which shows lack of design. If in some earlier courses learners are spoon-fed through some pretty basic coding steps, then why is a coding hurricane unleashed on them in Course 5? If learners are supposed to be able to grasp the coding content in Course 5, then why do they have to endure the code toddle in earlier courses? Etc. Something is gravely missing from the basic design template of this whole specialization, or maybe, it is the basic design template in its entirety that is missing. Course 5, this current course, also suffers some pretty sloppy assignment content, which seems to have caused a lot of resentment among earlier learners, yet remain largely unaddressed by the original course authors (even though some folks, mysteriously labeled as "Staff" in the module discussion forums, do work pretty hard to make the course and assessment content more accessible to learners - thanks, Brian, whoever you may be!), together with some other instances of sloppy design, course content, or editing. There also seems to be a rather wild discrepancy in this course between what is delivered in the lectures and what is assessed in the assignments. It also seems to be the case to me that this current course is badly adjusted for distance learning. Many of the issues encountered could be effectively addressed in a brick uni setting but are incredibly hard to manage in an online setting. This shows to further course design inefficiencies. Well, what can I say. I have completed the specialization. The content itself was fascinating, the lecturers (mostly) very competent and pleasant to listen to. The problems are numerous, and the question marks are even more numerous. I do not regret enrolling, but I must observe that with a little more care and attention, this course and the entire specialization could have been presented in a much more learner-friendly way, leading to much more rewarding and trouble-free learner experiences, and the fact that that little extra was not invested in the course and the specialization is downright disturbing. I also found the amount of advertising included in this current course quite disconcerting.
The Introduction to Machine Learning in Sports Analytics course is an exceptional offering for anyone interested in blending data science with sports. This course provides a thorough and practical introduction to supervised machine learning techniques, using the Python scikit-learn toolkit, and is designed to build on prior knowledge from earlier courses in the specialization. One of the standout features of this course is its focus on real-world athletic data, including data from professional sports leagues like the NHL and MLB, as well as from wearable devices such as the Apple Watch and inertial measurement units (IMUs). This makes the course not only highly relevant but also engaging, as students get hands-on experience applying machine learning algorithms to actual sports data. Throughout the course, students are introduced to a variety of core machine learning methods including support vector machines (SVM), decision trees, random forests, linear and logistic regression, and ensemble methods. The course provides a balanced mix of theory and practical application, allowing students to not only understand the algorithms but also learn how to apply them effectively to predict athletic outcomes. The structure of the course is well-paced, and the content is presented in a clear and accessible way, even for those with limited experience in machine learning. The practical coding assignments are particularly valuable, as they help solidify concepts and enhance learning through direct application. By the end of the course, students will have a solid foundation in both classification and regression techniques, and will understand how these methods can be applied to real-world sports analytics. Whether you are a beginner in machine learning or looking to further develop your skills, this course offers an excellent opportunity to gain expertise in a rapidly growing field. Overall, I highly recommend the Introduction to Machine Learning in Sports Analytics course for anyone interested in the intersection of data science and sports. It offers a comprehensive, hands-on, and engaging learning experience that will equip you with valuable skills for tackling real-world sports analytics challenges.
Provide solid foundation for beginning supervised ML
Entirely different difficulty than the other courses. It seems like a whole another level, starts from a very high complexity. The quizzes ask questions which are much more deep level than the videos or the commentary.

Michigan Online
For You

Sign up for a Michigan Online account to customize your experience!