In the 2021 Applied Data Analytics workshop in Texas, participants worked in teams with high school, post-secondary, and employment agency data. The workshop focused on connecting high school and/or post-secondary completion to workforce outcomes in Texas. The program provided instruction in big data tools, including SQL and R. Participants received training in core data concepts such as record linkage and data visualization, as well as cutting-edge training in machine learning.
This repository contains the class materials for the Texas Applied Data Analytics program.
Datasets Used in the Class:
- Texas Education Agency Graduates Data
- Texas Education Agency Post Secondary and Licensure Data
- Texas Higher Education Coordinating Board Enrollments Data
- Texas Higher Education Coordinating Board Graduations Data
- Texas Higher Education Coordinating Board Institutions Data
- Texas Higher Education Coordinating Board Licensure and Certifications Data
- Texas Workforce Commission Eligible Training Provider (ETP) Data
- Texas Workforce Commission Participant Individual Record Layout (PIRL) Data
- Texas Workforce Commission Unemployment Insurance (UI) Wage Record Data
Class Program
Day 1 - Overview, Project Scoping, and Privacy and Confidentiality
Day 2 - Dataset Introduction
Day 3 - Applications of Dataset Exploration
Day 4 - Record Linkage
Day 5 - Measurement - Definitions, Matching, and Time
Day 6 - Basics and Applications of Data Visualization
Day 7 - Introduction to Machine Learning
Day 8 - Project Status Presentations
Day 9 - Machine Learning - Prediction
Day 10 - Machine Learning - Evaluation
Day 11 - Inference and Imputation
Day 12 - Privacy, Confidentiality, Ethics, and Exporting Data