Econometricks
- An art invented by economists: \[\underbrace{y}_{\text{dependent variable}} = a \underbrace{x}_{\text{explanatory variable}} + b\]
- Main challenge:
- given dataset \((x_i, y_i)\)
- find \(a\) while controlling for \(b\)
- understand robustness of results
- predict new values of \(y\) for new values of \(x\)
Econometricks: Example 1
Check out the following website: How happy are you?
What is \(x\) ? What is \(y\) ?
Econometricks: Example 2
\[\underbrace{y}_{\text{dependent variable}} = a \underbrace{x}_{\text{explanatory variable}} + b\]
- A famous study:
- young men who go to war receive in average lower wages when they return than men who didn’t go to war
- … is it because they skipped college?
- … or did they choose to go to war because they were less skilled for college?
- How to know which is right?
Econometricks: Example 2
\[\underbrace{y}_{\text{dependent variable}} = a \underbrace{x}_{\text{explanatory variable}} + b\]
How to know which is right?
- find a way to extract causality
- instrumental variables
This was worth a Nobel Prize! (D. Card, J. Angrist, G.W. Imbens)
Big Data Era and Machine Learning (1)
- Data has become very abundant
- Large amounts of data of all kinds
- structured (tables, …)
- unstructured (text, images, …)
- Machine learning:
- a set of powerful algorithms…
- … so powerful some call it artificial intelligence
- they learn by processing data
- … to extract information and relations in large data sets
Big Data Era and Machine Learning (2)
- Machine learning:
- a set of powerful algorithms…
- … so powerful some call it artificial intelligence
- they learn by processing data
- … to extract information and relations in large data sets
- …
- Comparison with econometrics
- ML has it own, partially redundant, jargon
- harder to study causality, standard deviation (precision)
Machine Learning
\[\underbrace{y}_{\text{predicted variable}} = f( \underbrace{x}_{\text{feature}} , a)\]
- Challenge:
- given dataset \((x_i, y_i)\)
- find \(a\), that is find a nonlinear relationship between \(a\) and \(b\)
- predict new values of \(y\) given new values of \(x\)
- What is the difference with econometrics?
Big Data Era and Machine Learning (1)

Sentiment analysis: predict population’s optimism by analyzing tweets.
Check sentiment viz
Big Data Era and Machine Learning (2)
![]()
Beautiful people (from NVIDIA presentation)
Big Data Era and Machine Learning (2)
![]()
Beautiful people (from NVIDIA presentation)
Task: predict second and third columns from the first one.
Solution: deep learning with artificial neural nets