What and Why of Statistics?
Science of collecting, describing, analyzing and making decisions based on data
- Sampling
 
- Exploratory Data Analysis
 
- Inference
 
 
Allows us to make informed decisions in the face of uncertainty and let’s us take an unbiased and evidence-based viewpoint
Data: Cases and Variables
Data are a set of measurements taken on a set of individual units
 
Data is stored and presented in a dataset that comprises of variables measured on cases
- A variable is any characteristic that is recorded for each case
 
 
EducationLiteracy dataset from Lock5
Each row = case & Each column = variable
Categorical Versus Quantitative
Variables are classified as either categorical or quantitative:
- A categorical variable divides the cases into groups. e.g. gender, country, state etc.
 
- A quantitative variable measures a numerical quantity for each case, e.g. age, height, sleep hours, blood pressure etc
 
 
Kidney cancer
If the cases in the kidney cancer dataset are people, then the measured variable is categorical
- We categorize each person as either having kidney cancer or not which is categorical.
 
Kidney cancer
If the cases in the kidney cancer dataset are counties, then the measured variable is quantitative
- Data collected at the county level is aggregated across all people living in the county. We then get rates of cancer which are numbers (quantitative).
 
Variable manipulations
Can use numbers to code categories of categorical variable
- e.g Gender (1 for male and 2 for female)
 
 
 
Can convert quantitative variable into categorical groups
- e.g. Income (0-50000 as Low, 50000+ as High)
 
 
Explanatory and Response Variable
When one variable helps us understand or predict values of another variable, we call the former the explanatory variable and the latter the response variable
Does meditation help reduce stress?
- explanatory variable: meditation
 
- response variable: stress level
 
 
 
Does sugar consumption increase hyperactivity?
- explanatory variable: sugar consumption
 
- response variable: hyperactive behavior