Homework 2

Author
Affiliation

Alex Cardazzi

Old Dominion University

All materials can be found at alexcardazzi.github.io.

Completion Requirements: Complete the following questions in RStudio via the homework template. When you are ready, submit your rendered html to Canvas.

Grading Criteria: Full credit will be given to correct, well formatted, and detailed answers. Partial credit will be given if I can follow your work and/or see your thought process via code, comments, and text. Point totals are listed next to each question.


Besides New Hampshire and Virginia1, car insurance is mandated in every state. The point of this policy is to protect citizens from the often large financial burden resulting from a car crash. However, forcing individuals to purchase insurance may generate negative outcomes through unintended consequences. The often cited reason for this is moral hazard. In other words, if individuals are shielded from the negative consequences of dangerous behavior, they may be more likely to engage in such behavior. Think of this from the opposite point of view: if you knew your vehicle would blow up if you got into a fender bender, you’d probably drive much more safely (or not at all) than you do now.

In this homework, we are going to explore the question: how does auto insurance affect traffic safety?

There are many ways to think through this problem, but let’s start with the following factors that might be related to auto insurance and traffic safety:

Now, consider the following relationships between the variables:

Question 1

Using dagitty, Construct and plot a DAG to depict the relationships between these variables. Be sure to indicate which variable is the outcome and which is the treatment. (4 Points)

Question 2

Simulated data for this setting can be found here.

  1. Read the data into an object called safety in R. (1 Point)

  2. Create summary statistics using functions from the modelsummary package. (1 Point)

Question 3

Use functions from dagitty to evaluate and explore your DAG. This will help inform your empirical specification and identification strategy. You may use code, words, or both to answer these questions.

  1. List each of the paths in the DAG and tell which are “open”. (1 Point)

  2. For each path, discuss why it is open or closed. Be sure to use terms like “confounder”, “collider”, “mediator”, etc. (2 Points)

  • AI -> CR:
  • AI -> RDB -> CR:
  • AI -> RDB <- A -> CR:
  • AI -> RDB <- TE -> CR:
  1. What does it mean, and why is it significant, for a path to be open/closed? (1 Point)
  1. Which factors do you need to control for to estimate the total causal effect of auto insurance on crash risk? (1 Point)
  1. Which factors do you need to control for to estimate the direct causal effect of auto insurance on crash risk? You may use dagitty to help you, but you must provide explanations for dagitty’s output. (1 Point)

Question 4

Using safety, estimate the (direct and total) causal effects of auto insurance on crash risk.

  1. Estimate the models below. (1 Point)

  2. Display the coefficient estimates in a table generated via functions in modelsummary. (1 Point)

  3. Interpret each coefficient, besides the intercept/constant, in each model. (2 Points)

Question 5

Consider this question using data in real life. What difficulties might you have in estimating these models in reality? (2 Points)

Question 6

Think of two additional variables and/or edges that may belong in this DAG. Make an argument for why they belong in the DAG. (2 Points)

Footnotes

  1. Virginia allows individuals to opt out of insurance, but they then must pay a $500 uninsured fee.↩︎