# Homework 4

Due by

**10:00am**on Monday, April 10, 2023**Initial due date, 4/3****Revision date, 4/5**

# Instructions

In this assignment, you’ll again work with the Medicare Advantage data. These data are described in detail in the Medicare Advantage GitHub Repo. We worked with a subset of these data back in assignment 1; however, this assignment requires that you work with a more complete version of the Medicare Advantage data, but for fewer years. Once you have the data downloaded and the code running, answer the following questions:

# Summarize the data

- Remove all SNPs, 800-series plans, and prescription drug only plans (i.e., plans that do not offer Part C benefits). Provide a box and whisker plot showing the distribution of plan counts by county over time. Do you think that the number of plans is sufficient, too few, or too many?
- Provide bar graphs showing the distribution of star ratings in 2009, 2012, and 2015. How has this distribution changed over time?
- Plot the average benchmark payment over time from 2009 through 2015. How much has the average benchmark payment risen over the years?
- Plot the average share of Medicare Advantage (relative to all Medicare eligibles) over time from 2009 through 2015. Has Medicare Advantage increased or decreased in popularity? How does this share correlate with benchmark payments?

# Estimate ATEs

For the rest of the assignment, we’ll use a regression discontinuity design to estimate the average treatment effect from receiving a marginally higher rating. We’ll focus only on 2009, as this is the year in which the star rating running variable is easiest to replicate.

- Calculate the running variable underlying the star rating. Provide a table showing the number of plans that are rounded up into a 3-star, 3.5-star, 4-star, 4.5-star, and 5-star rating.
- Using the RD estimator with a bandwidth of 0.125, provide an estimate of the effect of receiving a 3-star versus a 2.5 star rating on enrollments. Repeat the exercise to estimate the effects at 3.5 stars, 4 stars, and 4.5 stars. Summarize your results in a table.
- Repeat your results for bandwidhts of 0.1, 0.12, 0.13, 0.14, and 0.15. Show all of the results in a graph. How sensitive are your findings to the choice of bandwidth?
- Examine (graphically) whether contracts appear to manipulate the running variable. In other words, look at the distribution of the running variable before and after the relevent threshold values. What do you find?
- Similar to question 4, examine whether plans just above the threshold values have different characteristics than contracts just below the threshold values. Use HMO and Part D status as your plan characteristics.
- Summarize your findings from 1-5. What is the effect of increasing a star rating on enrollments? Briefly explain your results.