Techniques of Statistical
Analysis I
Lect_1: Breaking-ice session
Bruno Arpino
Practical information
Course goals
Course content
Outline
2
Course content
Instructor:
Bruno Arpino
E-mail: [email protected]
https://sites.google.com/site/brunoarpino
Office hours:
Some practical information
3
Office hours:
Monday 18:00 -19:00 or by appointment
Building Jaume I - room 20.182
Communications and materials will be given through the
“Aula Global” (provided I will be able to use it!)
Suggestions
before the lecture…
take an “Happy Pill”
4
Provide the basic statistical knowledge for
quantitative data analysis
in social science
research
Make students able to structure and conduct
Course goals
5
Make students able to structure and conduct
autonomously an empirical research project with the
use of the statistical software
Stata
Support the compulsory “Final Research Project” due
at the end of the master’s degree and provide a
Design - Planning/Implementing a study
Sample survey or experiment?How to choose people (subjects) for the study, and how many?
Description – Graphical and numerical methods for
summarizing the data
Statistics provides methods for
6
summarizing the data
Inference
– Methods for making statements about a
population (total set of subjects of interest), based on a
sample (subset of the sample on which study collects
data)
It’s not a statistical course stricto sensu
It’s a course about statistical methods
applied
to
social science
Focus will be on
What this course is and is not about...
7
Focus will be on
INTERPETATION of results (also from software
output)
Andy Field: …You need stats to answer questions. Scientists are
curious people, and you probably are too. … to answer interesting questions, you need two things: data and an explanation of those data…
http://www.well.ox.ac.uk/~kanishka/Lectures/OUCS/Notes/Chapter%201%20-Why is my evil lecturer forcing me to learn
statistics?
8
http://www.well.ox.ac.uk/~kanishka/Lectures/OUCS/Notes/Chapter%201%20-%20Discovering%20Statistics%20using%20SPSS%20by%20Andy%20Field.pdf
To measure the real world
To assess relationship between variables
To find empirical evidence to support theoretical hypothesis To make informed decisions
The sexy job of the next 10 years (Hal Varian, chief economist at
The research process
Statistics
9
From: Any Field, DISCOVERING STATISTICS USING SPSS, Ch. 1. Available at:
http://www.well.ox.ac.uk/~kanishka/Lectures/OUCS/Notes/Chapter%201%20-%20Discovering%20Statistics%20using%20SPSS%20by%20Andy%20Field.pdf
United Nations Population Information Network
http://www.un.org/popin/data.html
Some data from the web
10
What is the percentage of people that consider
themselves Democrats?
Is the average income of immigrants lower than that of
natives?
Examples of questions you will be
able to answer
11
natives?
How (if at all) is happiness associated with income, job
satisfaction, family situation, social life, religious beliefs,
political ideology?
Predict how the weather will be tomorrow
(“Prediction is very difficult, especially about the
future.” Niels Bohr)
Unfortunately you will not be able to ...
12
Predict who will win the Champions League
Basic knowledge of descriptive statistics (mean,
variance, graphs…)
Maths required: 4 basic arithmetic operations, square
root, logarithms
Prerequisites
13
1. Principles of
Statistical Inference
. Estimators
and their properties. Confidence interval and test of
hypothesis for a mean and for a proportion.
Comparing means and proportions. ANalysis Of
Variance (ANOVA).
Course content
14
Variance (ANOVA).
or … How to use information on a sample to make
(probabilistic) statements about a population
2.
Linear regression
model in social science
research. Ordinary Least Square (OLS)
estimator. Assumptions. Interpretation of the
coefficients. Model building. Goodness of fit.
Transformation of variables. Categorical
Course content (cont’d)
15
Transformation of variables. Categorical
explanatory variables. Interactions.
3. When the OLS is not appropriate: some examples.
Introduction to longitudinal and multilevel modelling
or… “If anything can go wrong, it will” (Murphy's law)
Course content (cont’d)
16
or… “If anything can go wrong, it will” (Murphy's law)
Slides
and other materials provided during the course trough the “Aula Global” (For the theory) Agresti, A. and Finlay, B. (2008).Statistical
Methods for the Social Sciences
, 4th edition, Allyn &Compulsory readings
17
Bacon.
(Whatever statistical text written for social scientists can substitute or complement the book by Agresti and Finlay. You can find some examples in the general bibliography)
(For Stata) Web (free) book: Chen, X., Ender, P., Mitchell, M. andWells, C. (2003).
Regression with Stata
, UCLA Academic Technology Services, downloadable fromData Analysis
Using Stata
, Stata Press (and one in spanish if you prefer) Modesto Escobar Mercado, EnriqueFernández Macías, and Fabrizio Bernardi (2009),
Cuadernos
Additional readings
18
Fernández Macías, and Fabrizio Bernardi (2009),
Cuadernos
Metodológicos: Análisis de datos con Stata
.
Moore, D.S and G. McCabe (2006), Introduction to the
Practice of Statistics, 5th edition. New York: W.H.Freeman
Lewis-Beck, M.S. Data Analysis: an introduction. SAGE
Series on Quantitative Applications in the Social Sciences
n.103. University Press
General bibliography
19
n.103. University Press
Dougherty, C. (2007), Introduction to Econometrics, third
edition. New York: Oxford University Press.
Three
homework
assignments.
They will be a mixture
of conceptual and applied activities. Assignments will be
individual.
(20%)
An applied
research project
.
Students are required to
Assessment
20
An applied
research project
.
Students are required to
design an applied research project. At the beginning of the
course, working groups (2-3 persons) will be established.
Details on the research project will be provided in a separate
handout.
(30%)
At the end of the course, students will demonstrate their
achievements in a two-hour
final exam
(questions and
First, calculate a simple average of the 3 assignments grades.
If, for example, the grades are 5, 6, 10 then the average is
(5+6+10) / 3 = 7
Then we need to calculate a weighed average of the three
overall grade components. If, for example, the project grade is
8 and the overall exam grade is 6, then:
Overall grade calculation: an example
21
8 and the overall exam grade is 6, then:
Overall grade = 7*0.2 + 8*0.3 + 6*0.5 = 6.8
Only the overall grade will be rounded (to the closest integer)
To pass, the overall grade must be not smaller than 5/10 AND
8 Lab sessions
Assignments areto be delivered the scheduled day
in class or sent by email
Schedule
1 22 Sep Thu 15-17 20.137 Lect2
28 Sep Wed 15-17 40.245 LAB
29 Sep Thu 15-17 20.137 Lect
3
5 Oct Wed 15-17 40.245 LAB
6 Oct Thu 15-17 20.137 Lect
4
12 Oct Wed Holiday
13 Oct Thu 15-17 20.137 Lect Assignment 1
5
19 Oct Wed 15-17 40.245 LAB
20 Oct Thu 15-17 20.137 Lect
22
in class or sent by email (by midnight of the
same day)
The project has tobe sent by email by
the midnight of the day BEFORE the exam AND a paper copy delivered the day of the exam
5 20 Oct Thu 15-17 20.137 Lect
6
26 Oct Wed 15-17 40.245 LAB
27 Oct Thu 15-17 20.137 Lect
7
2 Nov Wed 15-17 40.245 LAB Assignment 2
3 Nov Thu 15-17 20.137 Lect
8
9 Nov Wed 15-17 40.245 LAB
10 Nov Thu 15-17 20.137 Lect
9
16 Nov Wed 15-17 40.245 LAB
17 Nov Thu 15-17 20.137 Lect
10
23 Nov Wed 15-17 40.245 LAB Assignment 3
24 Nov Thu 15-17 20.137 Lect
week 19-23 December.
Date to be confirmed!
Schedule (cont’d)
Date Time Room
19 Dec Mon t.b.c. t.b.c.
20 Dec Tue t.b.c. t.b.c.
21 Dec Wed t.b.c. t.b.c.
22 Dec Thu t.b.c. t.b.c.
23 Dec Fri t.b.c. t.b.c.
Students must attend at least 80% of the scheduled
classes
80% of 20 = 16 classess
Attendance
The joy of Stats
http://www.gapminder.org/videos/the-joy-of-stats/
Do you think statistics is boring?
25
Population growth explained with IKEA boxes
… to you and me!
Good luck!!!
If something is not clear
(or you find mistakes in the slides)
27