March 17, 2026
The goal of the project is to write an original research output. This means that you will need to:
.tex.The introduction should include the following elements:
A motivation (2/3 paragraphs) State why the question you tackle is important, why is it a subject that is relevant for the policy leaders.
A research question (1 paragraph) Precisely state what is the research question.
A brief summary of the paper (3 paragraphs) Precise: the data you use, the context, the method you used, the key results.
A literature review (2 paragraphs) Cite some papers that are close to what you do and state how you compare to them in terms of results and methods.
Tip
A reader who wants to know the main message of your paper, the context, and the punchline results needs to read only the introduction.
Data description: Data sources (with the appropriate references), the time span, the geographical level of analysis, the countries/contexts. Say some words about the context you are working on (developing/developed countries, political contexts, etc.)
Summary statistics: Make a table with the summary statistics (min,Q1,mean,median,Q3,max,nb.observations) of the key variables in your analysis
Descriptive evidence: You should make non-causal graphs (scatter or line) and/or maps to support your intuition
Model specification: State the econometric model you want to estimate. Describe the threats to identification (simultaneity, OVB, or measurement error…)
Strategy: Write your method to address them (IV, DiD, etc.) and the assumptions needed for it to be valid, the specification.
OLS results: Export the results with coefficient tables with the relevant information (standard errors, N, R2, proper labelling, etc.). Interpret the results.
Extensions results: If you think you can/need to implement an IV or another strategy, you should present and interpret the results.
Tip
Check the papers that we saw in class to know how to format the figures and tables. The presentation is standard and you should mimic it.
The conclusion is short and quickly summarizes the results.
You will need to make slides (7 to 8’ presentation) to present your key results. The structure should be the following
You will need to provide a replication package in a zip file containing:
Tip
For the code, you should use several R files to split the tasks. For instance, you can have three scripts: 1_clean_data.R, 2_descriptive_evidence.R and 3_reduced_form.R.
Mostly French examples, but you can find similar databases for other countries:
These platforms combine many datasets.