Understand basic concepts, tasks, and procedures of data mining.
Formulate real- world information using basic data representations: itemsets, vectors, matrices, sequences, time series, and networks.
Use data mining algorithms to extract patterns and similarities from real-world datasets.
Calculate the importance of patterns and prepare for downstream machine- learning tasks.
4 Modules
56 Hours
14 hrs per module (approx.)
Rating
About Data Mining in Python
In “Data Mining in Python,” you will learn how to extract useful knowledge from large-scale datasets. This course introduces basic concepts and general tasks for data mining. You will explore a wide range of real-world data sets, including grocery store, restaurant reviews, business operations, social media posts, and more.
You will learn how to formally describe real-world information with general data representations (e.g., itemsets, vectors, matrices, sequences, and more). You will then learn how to formulate data in the wild with one or more of these representations.
This course will teach you how to characterize and explain your data by looking for patterns and similarities, which are basic building blocks for advanced analysis and machine learning models.
This is the first course in “More Applied Data Science with Python,” a four-course series focused on helping you apply advanced data science techniques using Python. It is recommended that all learners complete the Applied Data Science with Python specialization prior to beginning this course.
Skills You'll Gain
Data Mining
Data Presentation
Machine Learning
Python For Data Analysis
Python (Programming Language)
What You'll Earn
Certificate of Completion:
Certificates of completion acknowledge knowledge acquired upon completion of a non-credit course or program.
Data Mining in Python, part of the More Applied Data Science with Python series, introduces core concepts and techniques for discovering patterns in complex datasets. Learners work with itemsets, vectors, matrices, and sequences while applying similarity measures and algorithms through quizzes and programming assignments focused on real-world data representations.
This abbreviated syllabus description was created with the help of AI tools and reviewed by staff. The full syllabus is available to those who enroll in the course.
Course Schedule
Module 1: Basic Concepts of Data Mining
Video: Welcome to Data Mining in Python
Reading: MADSwPY Certificate Roadmap
Reading: Course Syllabus
Discussion Prompt: Meet Your Fellow Learners
Reading: Help Us Learn About You
Video: What is Data Mining
Reading: Introduction to the Basic Functionalities of Data Mining
Video: Data Mining Functionalities (Part 1)
Video: Data Mining Functionalities (Part 2)
Video: Data Mining Functionalities (Part 3)
Graded Assignment: Knowledge Check: Basic Functionalities of Data Mining
Reading: Introduction to Basic Data Representations
Video: Representing Itemsets, Vectors, and Matrices
Video: Representing Sequences
Graded Assignment: Knowledge Check: Basic Data Representations (Part 1)
Video: Representing Time-Series and Spatial/Temporal Data
Video: Representing Graph Data
Video: Representing Stream Data
Reading: Case Study: Representations of Real-World Text Data
Graded Assignment: Knowledge Check: Basic Data Representations (Part 2)
Reading: Introduction to Patterns and Similarities
Video: Data Mining Based on Patterns
Video: Data Mining Based on Similarities
Reading: Introduction to Module 1 Programming Assignment: Visualizing Different Data
The course grade is based on four quizzes worth 20% (5% each), and four programming assignments. The first is worth 5%, and the remaining three are worth 25% each.
Course content developed by U-M faculty and managed by the university.
Faculty titles and affiliations are updated periodically.
Advanced Level
A basic understanding of linear algebra, and completing the courses of the “More Applied Data Science with Python” series in order, is recommended.
Enrollment Options
Individuals
This experience is available to individual learners on the following platforms:
U-M Community
Students, faculty, staff, and alumni of the University of Michigan get free access.
Organizations
Special pricing and tailored programming bundles available for organizational partners.
Reviews and Ratings
4.3
3 Ratings from Coursera
What Learners Are Saying
Each course in this series will help you build data analytics skills using Python and increase your understanding of the role of data in shaping decisions.
Qiaozhu MeiProfessor of Information, Associate Dean for Research and Innovation, School of Information
Michigan Online For You
Sign up for a Michigan Online account to customize your experience!