Principles of Data Science
DSC 10, Fall 2024 at UC San Diego
Janine Tiefenbruck (she/her)
Lecture(s): (A) MWF 9-9:50AM in Center 109, (B) MWF 10-10:50AM in Center 113, (C) MF 1-1:50PM in Solis 104 (No live lectures at 1PM on Wednesdays)
Because of holidays, the schedule of important dates is a bit different in the last few weeks! Next week, we have Quiz 4 on Monday, Homework 6 due on Tuesday, and Discussion 8 on Wednesday.
Week 0 – Welcome to DSC 10!
- Fri Sep 27
- Keywords: course logistics, syllabus, Little Women demo, Jupyter notebooks, expressions
- Sun Sep 29
SUR Welcome Survey
SYL Syllabus Check
PRE Pretest
Week 1 – Python Basics
- Mon Sep 30
LEC 2 Variables and Data Types
Keywords: variables, assignment, functions, import, methods, int, float, string
- Wed Oct 2
- Keywords: mean, median, lists, arrays, array arithmetic, array methods, np.arange
- Thu Oct 3
- Fri Oct 4
- Keywords: read_csv, .get, .assign, .sort_values, .iloc, .loc, .set_index, US states
Week 2 – DataFrames and Visualization
- Mon Oct 7
- Keywords: Booleans, querying, .shape, &, |, .take, .groupby, aggregation, .drop
DISC 2 Arrays and DataFrames
- Wed Oct 9
- Keywords: numerical vs. categorical, scatter plot, line plot, bar chart, exoplanets
QUIZ 1 Quiz 1 covers Lectures 1-4
- Thu Oct 10
LAB 1 Arrays and DataFrames
- Fri Oct 11
LEC 7 Distributions and Histograms
Keywords: distributions, density histograms, binning, total area, overlaid plots
- Sun Oct 13
Week 3 – Functions and Control Flow
- Mon Oct 14
- Keywords: functions, arguments, print vs. return, .apply, .reset_index
- Wed Oct 16
LEC 9 Grouping on Multiple Columns, Merging
Keywords: .groupby([col_1, col_2, …]), subgroups, MultiIndex, .merge, number of rows
- Thu Oct 17
- Fri Oct 18
LEC 10 Conditional Statements and Iteration
Keywords: in, not, and, or, if, else, elif, for-loops, np.append, accumulator pattern
- Sun Oct 20
Week 4 – Probability and Simulation
- Mon Oct 21
- Keywords: event, conditional prob., multiplication and addition rules, independence
- Wed Oct 23
- Keywords: np.random.choice, replacement, np.count_nonzero, coin flipping, Monty Hall
QUIZ 2 Quiz 2 covers Lectures 5-10
- Thu Oct 24
- Fri Oct 25
LEC 13 Distributions and Sampling
Keywords: probability vs. empirical distribution, SRS, .sample, parameter, statistic
- Sun Oct 27
HW 3 DataFrames, Control Flow, and Probability
Week 5 – Midterm Exam
- Mon Oct 28
DISC 5 Probability and Simulation
- Wed Oct 30
EXAM Midterm Exam covers Lectures 1-12
- Fri Nov 1
LEC 15 Bootstrapping and Confidence Intervals
Keywords: inference, bootstrapping, resample, np.percentile, confidence interval
- Sun Nov 3
PROJ Midterm Project
Week 6 – Confidence Intervals and the Normal Distribution
- Mon Nov 4
LEC 16 Confidence Intervals, Center, and Spread
Keywords: interpreting CIs, robust vs. sensitive, center, standard deviation
- Wed Nov 6
LEC 17 Standardization and the Normal Distribution
Keywords: Chebyshev, standard units, normal distribution, CDF, inflection points
- Thu Nov 7
- Fri Nov 8
LEC 18 The Central Limit Theorem
Keywords: distribution of the sample mean, square root law, CLT-based CIs
- Sun Nov 10
Week 7 – Central Limit Theorem
- Mon Nov 11
No Lecture (Veterans Day 🎖️)
- Wed Nov 13
LEC 19 Choosing Sample Sizes, Statistical Models
Keywords: standard deviation of 0s and 1s, np.random.multinomial, Robert Swain jury
QUIZ 3 Quiz 3 covers Lectures 13, 15-18
- Thu Nov 14
- Fri Nov 15
- Keywords: null and alternative hypotheses, test statistic, fair or unfair coin
- Sun Nov 17
Week 8 – Hypothesis and Permutation Testing
- Mon Nov 18
LEC 21 Hypothesis Testing and Total Variation Distance
Keywords: fair or unfair coin, p-value, midterm exam scores, Alameda County jury, TVD
DISC 7 Choosing Sample Sizes, Hypothesis Testing, and Permutation Testing
- Wed Nov 20
LEC 22 TVD, Hypothesis Testing, and Permutation Testing
Keywords: confidence intervals for hypothesis testing, body temperature, smoking/babies
- Thu Nov 21
LAB 6 Hypothesis Testing
- Fri Nov 22
LEC 23 Permutation Testing
Keywords: smoking/babies, np.random.permutation, shuffling, Deflategate
Week 9 – Prediction
- Mon Nov 25
LEC 24 Correlation
Keywords: association, correlation coefficient (r), predicting heights, regression line (su)
QUIZ 4 Quiz 4 covers Lectures 19-23
- Tue Nov 26
- Wed Nov 27
LEC 25 Regression and Least Squares
Keywords: regression line in original units, outliers, errors, RMSE, best fit, least squares
DISC 8 Hypothesis Testing and Permutation Testing
- Fri Nov 29
No Lecture (Thanksgiving 🦃)
Week 10 – Review
- Mon Dec 2
LEC 26 Residuals and Inference
Keywords: residuals, residual plots, patterns, datasaurus dozen, prediction intervals
DISC 9 Regression
- Tue Dec 3
PROJ Final Project
- Wed Dec 4
LEC 27 Review
- Thu Dec 5
LAB 7 Regression
- Fri Dec 6
LEC 28 Review, Conclusion
- Sat Dec 7
EXAM Final Exam (11:30-2:30PM)
SUR SETs and End-of-Quarter Survey (due 8AM)