Principles of Data Science
DSC 10, Summer 2026 at UC San Diego
Welcome to DSC 10! Make sure to read this website thoroughly and complete the items in the Getting Started checklist.
Our first lecture will be held on Monday, June 29th at 11:00 AM in HDSI 355 and over Zoom.
Discussion will meet immediately after lecture over Zoom.
Week 1 – Python Basics and DataFrames
- Mon Jun 29
- Keywords: data science, course structure, policies, syllabus, Little Women demo
-
LEC 2 Expressions and Data Types
Keywords: Jupyter notebooks, expressions, variables, assignment, functions, int, float -
LEC 3 Strings, Lists, and Arrays
Keywords: string methods, mean, median, lists, arrays, array arithmetic -
DISC 1 Getting Started with Jupyter Notebooks
- Tue Jun 30
-
LAB 0 Expressions and Data Types
-
SUR Welcome Survey
-
SYL Syllabus Check
- Wed Jul 1
-
LEC 4 Arrays and DataFrames
Keywords: array methods, np.arange, .read_csv, .get, .assign, .sort_values, .iloc, .loc, index -
LEC 5 Querying and Grouping
Keywords: .set_index, Booleans, querying, .shape, &, |, .take, .groupby, aggregation -
LEC 6 Grouping and Data Visualization
Keywords: .groupby, numerical vs. categorical, scatter plot, line plot, bar chart -
DISC 2 Arrays and DataFrames
- Thu Jul 2
-
LAB 1 Arrays and DataFrames
-
PRE Pretest
- Fri Jul 3
-
HW 1 Basic Python, Arrays, and DataFrames
Week 2 – Functions, Control Flow, and Probability
- Mon Jul 6
-
LEC 7 Distributions and Histograms
Keywords: distributions, density histograms, binning, total area, overlaid plots -
LEC 8 Functions and Applying
Keywords: functions, arguments, print vs. return, .apply, .reset_index -
LEC 9 Grouping on Multiple Columns, Merging
Keywords: .groupby([col_1, col_2, …]), subgroups, MultiIndex, .merge, number of rows -
DISC 3 Querying, Grouping, and Plotting
-
QUIZ 1 Quiz 1 covers Lectures 1-6
- Tue Jul 7
-
LAB 2 Data Visualizations and Functions
- Wed Jul 8
-
LEC 10 Conditional Statements and Iteration
Keywords: in, not, and, or, if, else, elif, for-loops, np.append, accumulator pattern -
LEC 11 Probability
Keywords: event, conditional prob., multiplication and addition rules, independence -
LEC 12 Simulation
Keywords: np.random.choice, replacement, np.count_nonzero, coin flipping, Monty Hall -
DISC 4 Functions, DataFrames, and Control Flow
-
HW 2 DataFrames, Data Visualization, and Functions
- Thu Jul 9
-
LAB 3 DataFrames, Control Flow, and Probability
- Fri Jul 10
-
HW 3 DataFrames, Control Flow, and Probability
Week 3 – Sampling and Midterm
- Mon Jul 13
-
LEC 13 Distributions and Sampling
Keywords: probability vs. empirical distribution, SRS, .sample, parameter, statistic -
LEC 14 Bootstrapping and Confidence Intervals
Keywords: inference, bootstrapping, resample, np.percentile, confidence interval -
REV Review
-
DISC 5 Probability and Simulation
- Wed Jul 15
-
EXAM Midterm Exam covers Lectures 1-12
- Fri Jul 17
-
LAB 4 Simulation, Sampling, & Bootstrapping
Week 4 – Normal Distribution and Hypothesis Testing
- Mon Jul 20
-
LEC 15 Confidence Intervals, Center, and Spread
Keywords: interpreting CIs, robust vs. sensitive, center, standard deviation -
LEC 16 Standardization and the Normal Distribution
Keywords: Chebyshev, standard units, normal distribution, CDF, inflection points -
LEC 17 The Central Limit Theorem
Keywords: distribution of the sample mean, square root law, CLT-based CIs -
DISC 6 Sampling, Bootstrapping, and Confidence Intervals
-
PROJ Midterm Project
- Tue Jul 21
-
HW 4 Simulation, Sampling, & Bootstrapping
- Wed Jul 22
-
LEC 18 Choosing Sample Sizes, Statistical Models
Keywords: standard deviation of 0s and 1s, np.random.multinomial, Robert Swain jury -
LEC 19 Hypothesis Testing
Keywords: null and alternative hypotheses, test statistic, fair or unfair coin -
LEC 20 Hypothesis Testing and Total Variation Distance
Keywords: fair or unfair coin, p-value, midterm exam scores, Alameda County jury, TVD -
DISC 7 The Normal Distribution and the CLT
- Thu Jul 23
-
LAB 5 Variability and the Normal Distribution
- Fri Jul 24
-
HW 5 The Normal Distribution and the Central Limit Theorem
Week 5 – Prediction and Final Exam
- Mon Jul 27
-
LEC 21 TVD, Hypothesis Testing, and Permutation Testing
Keywords: confidence intervals for hypothesis testing, body temperature, smoking/babies -
LEC 22 Permutation Testing
Keywords: smoking/babies, np.random.permutation, shuffling, Deflategate -
LEC 23 Correlation
Keywords: association, correlation coefficient (r), predicting heights, regression line (su) -
DISC 8 Hypothesis Testing and Permutation Testing
- Tue Jul 28
-
LAB 6 Hypothesis Testing
- Wed Jul 29
-
LEC 24 Regression and Least Squares
Keywords: regression line in original units, outliers, errors, RMSE, best fit, least squares -
LEC 25 Residuals and Inference
Keywords: residuals, residual plots, patterns, datasaurus dozen, prediction intervals -
REV Review
-
DISC 9 Regression
-
HW 6 Hypothesis Testing and Permutation Testing
- Thu Jul 30
-
LAB 7 Regression
- Fri Jul 31
-
EXAM Final Exam