DS2002 Data Science Systems

Neal Magee, Ph.D.
Associate Professor, Data Science
University of Virginia, Spring 2025

Course Schedule

Week # Topic(s)
01 Course introduction: Data Engineering Basics
02 Environments: The Linux command line and filesystem
03 Environments: Git & GitHub
04 Environments: Scripting & pipelines
Quiz 1: Environments
05 Databases: Data Management
06 Databases: SQL & Relational
07 Databases: NoSQL & Unstructured Data
Quiz 2: Data / Databases
08 Infrastructure: Storage
3/10/25 - 3/14/25 No Class - Spring Break
09 Infrastructure: Manage your compute resources
10 Infrastructure: Cloud Resources
11 Infrastructure: Containers
Quiz 3: Infrastructure
12 Data Pipelines: Scaling & Big Data
13 Data Pipelines: Jobs
14 Data Pipelines: Automation
15 Course Wrap-Up
Quiz 4: Data Pipelines