Homework 1

Author
Affiliation

Alex Cardazzi

Old Dominion University

All materials can be found at alexcardazzi.github.io.

Completion Requirements: Complete the following questions in RStudio via the homework template. When you are ready, submit your rendered html to Canvas.

Grading Criteria: Full credit will be given to correct, well formatted, and detailed answers. Partial credit will be given if I can follow your work and/or see your thought process via code, comments, and text. Point totals are listed next to each question.


Understanding labor market outcomes is crucial for gaining insights into economic dynamics and societal structures. The 1976 Panel Study of Income Dynamics (PSID) serves as a valuable dataset for such an analysis, capturing a comprehensive snapshot of American households at a particular time. These data document labor market outcomes for women with additional information about their partners’ labor market outcomes. Investigating the factors influencing female wages provides a nuanced perspective on gender dynamics, labor market trends, and the distribution of economic opportunities within households.

Analyzing wages between spouses allows us to explore questions related to gender equity, workforce participation, and the impact of educational attainment. Educational levels, for instance, may influence both the individual’s and the spouse’s earning potential, shedding light on the interplay between education and labor market outcomes within a family unit. Additionally, considering the presence of children in the household introduces a complex dynamic, as it may affect labor force participation, career choices, and subsequently, wages.

Regression analysis emerges as a powerful tool for examining these relationships, enabling the quantification of the impact of various factors on wages. By controlling for relevant characteristics, such as education and the labor market status of partners, regression models can provide a clearer understanding of the specific influences shaping labor market outcomes for women. This exploration not only contributes to our comprehension of societal structures but also provides empirical insights essential for crafting informed policies aimed at fostering equality and economic well-being.

In this homework, you will explore PSID data (documentation) and generate empirical insights about the labor market for women in 1976.

Question 1

Use the data linked above to answer the following.

  1. Read the data into an object called psid. (0 Points)

  2. Subset the data.frame to include only the following columns: (0 Points)

  • wage: wife’s hourly wage in 1975 dollars.
  • hours: wife’s hours worked in 1975.
  • education: wife’s education measured in number of years.
  • age: wife’s age.
  • youngkids: number of children under six years old.
  • oldkids: number of children between six and eighteen.
  • hwage: husband’s hourly wage in 1975 dollars.
  • hhours: husband’s hours worked in 1975.
  • heducation: husband’s education measured in number of years.
  • hage: husband’s age.
  • unemp: unemployment rate in county of residence, in percentage points.
  • city: whether the couple lives in a large city.
  1. Convert city into a binary variable equal to one if the couple lived in a large city and zero otherwise. (0 Points)

  2. Create an employed variable equal to 1 if the woman worked in 1975. (0 Points)

  3. Using functions from the modelsummary package, create a summary statistics table. Then, using this table, discuss some empirical facts.1 (2 Points)

Question 2

In this question, you will explore some of the variables in the data.

  1. Use the density() function to plot the wage distribution for husbands of employed and unemployed women in 1975. What do you learn from this figure? (2 Points)
  1. Tabulate the total number of children for the women sampled. What do you learn from this figure? (1 Point)
  1. Calculate the average number of young and old children (separately) by age. Plot the relationship between children and age. What do you learn from this figure? (2 Points)
  1. Generate a scatterplot between a woman’s wages and her husband’s wages. Be sure to include axis labels, etc. Add a diagonal line that denotes y = x. What are some things you can learn from this plot? (1 Point)

Question 3

For this question, you will estimate two regressions. First, you will estimate a model to explain a woman’s wages using characteristics pertaining to her and/or her husband. Second, estimate the same model using only the subset of women with non-zero wages. At the end, use functions from the modelsummary package to present the results.

  1. Estimate a model where a woman’s wages is a function of factors pertaining to her and/or her husband. You may use any variable, or combination of variables, except for the variable that indicates whether a woman is employed. For each variable you choose to include in the model, provide a short rationale for why you believe it belongs, and the effect you anticipate it having on wages. (2 Points)
  1. Re-estimate the model using only the subset of employed women. (1 Point)

  2. Display the coefficients from both estimations in a single table. (1 Point)

  3. Interpret the coefficients in the regressions. (2 Points)

  1. Do any of the coefficients change between regressions? Why or why not? Which regression do you prefer, and why? (1 Point)

Question 4

The exercise in this question will be very similar to the last one, except the dependent variable will be binary.

  1. Estimate a model that explains female employment status. Again, you may choose which variables you would like to use in the model, but you may not use the woman’s wages as an explanatory variable. For each variable you choose to include in the model, provide a short rationale for why you believe it belongs, and the effect you anticipate it having on employment. (1 Point)
  1. Display the coefficients in a table. (1 Point)

  2. Interpret each coefficient in the regression. (2 Points)

Question 5

Discuss yours findings from the regressions above. What does this tell you about women from the mid 1970s? Did anything surprise you? How do you think these regressions would differ if the sample contained data collected from women of today? (1 Point)

Footnotes

  1. For example, what is the average age gap within couples? What is the average wage gap? What percent of sampled females were employed in 1975? What is the average number of children of the women sampled?↩︎