Associate Professor
Your browser is ancient!
Upgrade to a different browser to experience this site.
In this MOOC, you will be introduced to advanced machine learning and natural language
processing techniques to parse and extract information from unstructured text documents in
healthcare, such as clinical notes, radiology reports, and discharge summaries. Whether you are an aspiring data scientist or an early or mid-career professional in data science or information technology in healthcare, it is critical that you keep up-to-date your skills in information extraction and analysis.
To be successful in this course, you should build on the concepts learned through other intermediate-level MOOC courses and specializations in Data Science offered by the University of Michigan, so you will be able to delve deeper into challenges in recognizing medical entities in health-related documents, extracting clinical information, addressing ambiguity and polysemy to tag them with correct concept types, and develop tools and techniques to analyze new genres of health information.
By the end of this course, you will be able to:
Identify text mining approaches needed to identify and extract different kinds of information from health-related text data
Create an end-to-end NLP pipeline to extract medical concepts from clinical free text using one terminology resource
Differentiate how training deep learning models differ from training traditional machine learning models
Configure a deep neural network model to detect adverse events from drug reviews
List the pros and cons of Deep Learning approaches."
Information Extraction from Free Text Data in Health introduces machine learning and natural language processing techniques for analyzing unstructured healthcare data. Learners develop hands-on skills to extract clinical information from medical text while addressing ambiguity, privacy, and real-world healthcare data challenges.
This abbreviated syllabus description was created with the help of AI tools and reviewed by staff. The full syllabus is available to those who enroll in the course.
Module 1: What is Information Extraction
Module 2: Named Entity Recognition (NER)
Module 3: Sequential Classification
Module 4: Introduction to Advanced Approaches to NER in Health
Learners must earn at least 80% overall to pass. Four hands-on programming exercises account for 95% of the grade, with a module 1 quiz worth 5% completing the total course grade.
Associate Professor
Course content developed by U-M faculty and managed by the university. Faculty titles and affiliations are updated periodically.
Intermediate Level
Some related experience required