The Text Mining and Natural Language Processing Pathway provides an introduction to working with and analyzing unstructured text data from the UC Davis Datalab: Data Science and Informatics.
Topics include:
- Getting Started with Textual Data,
- Natural Language Processing (in R or Python),
- PDF Scraping, and more.
This pathway takes 20-40 hours to complete and consists of 4 required training sessions as well as at least 1 additional elective. Participants are expected to complete code submissions, case studies, and reports in order to earn badges.
How to Enroll:
To enroll in the Text Mining and Natural Language Processing pathway please complete the form on the right. You may enroll at any time, even if you have already completed some or all of the requirements. You will be enrolled in 1-2 business days. If you have any questions, please contact gpi@ucdavis.edu.
Pathway Progression Requirements:
Microbadge: Introduction to Unix Command Line Submission Guide
Microbadge: Python Basics Submission Guide
Microbadge: Natural Language Processing in Python Submission Guide
Microbadge: R Basics Submission Guide
Microbadge: Natural Language Processing in R Submission Guide
Microbadge: Getting Started with Textual Data Submission Guide
Microbadge: OCR and Working with Messy Text Submission Guide
Microbadge: Textual Data Structures Submission Guide
Microbadge: PDF Scraping Submission Guide