Data Science can be defined as the intersection of Mathematics/Statistics, Computer Science, and Communication. In this course, you will learn the entire Data Science process: Import, Clean, Explore, Model, and Communicate. This area of study applies to students of any major and provides a competitive edge in the modern world. There will be one extensive group project that gives all students the opportunity to apply their critical thinking and creativity to a dataset of their choice.
Instructor: Mario Giacomazzo
Lab Instructors:
Course Syllabus:
Lab Sections:
Office Hours:
Attendance: UNC Check-in App
University Approved Absences: Online Form
Textbook: R for Data Science (R4DS)
Date | Lecture | Slides | Supplement |
---|---|---|---|
MAY 14 | Introduction | Slides | |
MAY 14 | Data Visualization | Slides | Preview(.zip) |
MAY 14 | Workflow in RMarkdown | Slides | |
MAY 15 | Data Transformation I | Slides | |
MAY 15 | Data Transformation II | Slides | |
MAY 15 | Data Transformation III | Slides | |
MAY 15 | Data Transformation IV | Slides | |
MAY 19 | Exploratory Data Analysis I | Slides | |
MAY 19 | Exploratory Data Analysis II | Slides | |
MAY 19 | Final Project I | Slides | |
MAY 20 | Data Import | Slides | |
MAY 20 | Tidy Data I | Slides | |
MAY 20 | Tidy Data II | Slides | |
MAY 21 | Web Scraping | Slides | Preview(.zip) |
MAY 22 | Joins I | Slides | |
MAY 22 | Joins II | Slides | |
MAY 26 | Memorial Day (No Class) | Remember Me | |
MAY 27 | Factors | Slides | |
MAY 27 | Programming I | Slides | Preview(.zip) |
MAY 28 | Programming II | Slides | Preview(.zip) |
MAY 28 | Programming III | Slides | |
MAY 29 | Final Project II | Slides | |
MAY 29 | Midterm 1 | | |
JUNE 2 | Modeling 1 | Slides | |
JUNE 2 | Modeling 2 | Slides | Preview(.zip) |
JUNE 3 | Modeling 3 | Slides | Preview(.zip) |
JUNE 3 | Modeling 4 | Slides | Preview(.zip) |
JUNE 4 | Modeling 5 | Slides | |
JUNE 4 | Modeling 6 | Slides | |
JUNE 5 | Modeling 7 | Slides | Preview(.zip) |
JUNE 5 | Modeling 8 | Slides | Preview(.zip) |
JUNE 9 | Modeling 9 | Slides | Preview(.zip) |
JUNE 10 | R Shiny | Slides | Preview(.zip) |
JUNE 12 | Midterm 2 | | |
JUNE 17 | Work on Project | | |
All HW, Lab, and Analysis assignments are to be submitted via Canvas. Unzip the folder and complete your homework using the Rmd file. Midterms will be taken on paper in class. The table below shows all assignments sorted by assigned date.
Assigned | Lab (L) | Homework (HW) | Analysis (A) | Due Date (Time) |
---|---|---|---|---|
MAY 14 | | HW1(.zip) | | MAY 16 (5:00 PM) |
MAY 16 | L1 | | | MAY 16 (11:59 PM) |
MAY 16 | | HW2(.zip) | | MAY 19 (5:00 PM) |
MAY 19 | | | A1(.zip) | MAY 21 (5:00 PM) |
MAY 21 | | HW3(.zip) | | MAY 23 (5:00 PM) |
MAY 23 | L2 | | | MAY 23 (11:55 PM) |
MAY 23 | | HW4(.zip) | | MAY 28 (5:00 PM) |
MAY 28 | | | A2(.zip) | MAY 30 (5:00 PM) |
MAY 30 | L3 | | | MAY 30 (11:59 PM) |
MAY 30 | | HW5(.zip) | | JUNE 2 (5:00 PM) |
JUNE 2 | | | A3(.zip) | JUNE 4 (5:00 PM) |
JUNE 4 | | HW6(.zip) | | JUNE 6 (5:00 PM) |
JUNE 6 | L4 | | | JUNE 6 (11:59 PM) |
JUNE 6 | | HW7(.zip) | | JUNE 9 (5:00 PM) |
JUNE 9 | | | A4(.zip) | JUNE 13 (5:00 PM) |
JUNE 13 | L5 | | | JUNE 13 (11:59 PM) |
For the final project, students in STOR 320 will be divided into research groups of 5 or 6. To ensure fairness, students will be assigned to groups randomly. I will also try to ensure that all students in your group are in your lab section.
If you want to find your research group, see the table below:
Although everyone is responsible for the entire project, each member of the group will be assigned a specific role for accountability and consistency. The four roles are described as follows:
The Creator: Schedule and Meet with Dr. Mario to Propose Your Group’s Research Idea, Lead Designer in Slides
The Interpreter(s): Schedule and Meet with Dr. Mario to Share Findings from Exploratory Analysis, Evaluate Practice Presentation
The Orator(s): Give a Captivating 3-5 Minute Slideshow Presentation During Final Exam Day
The Deliverer: Deliver Assignments to Canvas, Polished and On-time
The final project will be divided into four parts worth a total of 100 points. Each part will have a clear rubric that is as objective as possible. The parts, along with total point values, are found below:
Part | Description | Method of Submission | Involvement Survey | Due Date (Time) |
---|---|---|---|---|
P1 | Project Proposal | Meeting + Canvas | Survey 1 | MAY 22 (11:59 PM) |
P2 | Exploratory Data Analysis | Meeting + Canvas | Survey 2 | JUNE 5 (11:59 PM) |
P3 | Final Written Paper | Canvas | Survey 3 | JUNE 17 (11:59 PM) |
P4 | Final Presentation (Section 2) | Canvas + Class | Survey 4 | JUNE 20 (3:00 PM) |
R for Data Science (2E) (R4DS2)
R Programming: Zero to Pro (RPZP)
Hands-On Programming with R (HOPR)
ModernDive (MD)
This page was last updated on 2025-05-13 15:21:42.897048 Eastern Time.