Skip to content

Placebo tests and extended plotting functionality #88

Open
n-delaney wants to merge 47 commits intoebenmichael:masterfrom
n-delaney:pr_merge
Open

Placebo tests and extended plotting functionality #88
n-delaney wants to merge 47 commits intoebenmichael:masterfrom
n-delaney:pr_merge

Conversation

@n-delaney
Copy link

The proposed changes:

  • Extend the functionality of the augsynth package by including the options for permuted placebo tests (raw and RMPSE-adjusted)
  • Move inference out of summary.augsynth() and into the augsynth() function by default
  • Extend plotting options, including placebo permutation plots and gap plots showing the divergence between the treated units and synthetic control over time

All of the changes to the augsynth package itself take place in augsynth.R and a new file, permutation.R.

The tobacco_replication directory contains a vignette demonstrating the new functionality and replicating the results of Abadie et al. (2010).

Rmd file and data for replication of Abadie et al. (2010)
-updates functions in augsynth.R
-adds permutation.R (also contains some plotting functions)
-updates package documentation
Updates the github directory in tobacco replication vignette
Replaces convention of inf = F with inf_type = 'none'.
- Separates out inference from estimation. Sets augsynth(inf = "none") by default, which should be backwards compatible.
- Adds plot_type as an argument for plot.augsynth (allows user to specify the "name" of the desired outputted plot).
- Adds the option to generate a donor table (synthetic control weights and RMSPEs for any inference type)
- Ensures that all existing tests execute successfully.
Updates to abadie et al 2010 replication notebook
Adds warnings to multi-outcome and multisynth models if a user attempts to pass in `inf_type` at the time of model fitting.
@n-delaney n-delaney closed this Feb 14, 2024
@n-delaney n-delaney reopened this Feb 14, 2024
Copy link
Owner

@ebenmichael ebenmichael left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall I think there needs to be a structural change to keep inference calculations in summary.augsynth. The new add_inference function seems to be necessary only to add inference output to the augsynth function, but that should be restricted to the summary.augsynth function. I've called out parts of the code where this can be changed, but mostly it means that all of the new plotting work should happen in the plot.summary.augsynth function, and then the augsynth object never needs any inference added to it.




test_that( "MDES_table corresponds to default treatment table", {
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can this test be broken up into subtests, since it looks like there are a few distinct things.

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Another good test would be that the placebo gaps agree with manually changing the treatment variable in the data and re-running augsynth.

lmiratrix and others added 14 commits August 28, 2024 18:06
- adds a test for update_augsynth() ensuring that non-binding update criteria (ex. RMPSE multiple of infinity) return the same donor weights as the original model.
- removes superfluous datasets froom the tobacco_replication.Rmd file
- updates a test in test-donor_control.R to resolve an error from a variable that is called but not defined (or used as part of the test, as far as I can tell)
Copy link
Owner

@ebenmichael ebenmichael left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for all of these changes. I left reviews throughout. Once those are done (and the merge conflict is fixed) we can fold this in.


# These should be the same? Or close to the same?
# (They were not under ridge, probably due to the cross-validation procedure.)
gaps7 <- augsynth:::get_placebo_gaps(syn7)
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you test this with ridge with a fixed lambda parameter? If they should be same then the test should pass.

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@n-delaney @lmiratrix the question marks here make me nervous

@lmiratrix
Copy link

Trying to put in my comments

Copy link
Owner

@ebenmichael ebenmichael left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

An overarching request: Please make sure that the top-level calls to the summary and plot functions don't change their default behavior and that the exisiting vignette still works as it should (it doesn't give the same results as-is). If the vignette needs to be updated (e.g. to incorporate the new plot_type argument, please do that).

Comment on lines +398 to +403
plot.augsynth <- function(augsynth,
cv = FALSE,
plot_type = 'estimate',
inf_type = NULL, ...) {

plot_augsynth_results( augsynth=augsynth, cv=cv, plot_type=plot_type, inf_type=inf_type, ... )
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My understanding is that the cv option no longer does anything, and instead that is subsumed into plot_type correct? Can you update that throughout the code so that it isn't passing in cv if that isn't doing anything?

#' @param ... Optional arguments
plot_augsynth_results <- function( augsynth,
plot_type = 'estimate',
inf_type = NULL, ...) {
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The default behavior of the package is to do conformal inference. Can you make whatever changes are needed so that this continues to be the case? Right now if you follow the vignette it doesn't produce the prediction intervals.

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, the plots used to have the inf argument that governed whether uncertainty intervals were included. That should stay in to not break any existing code. It also simplifies the plot_type argument because then there doesn't need to be the 'estimate only' option.

aes(x = !!as.name(augsynth$time_var), y = !!as.name(measure),
color = trt_status, linetype = !!as.name(augsynth$unit_var))) +
ggplot2::geom_line() +
ggplot2::geom_vline(lty = 2, xintercept = treat_year) +
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For some reason the vertical line doesn't appear when you get here by running plot rather than permutation_plot.

e.g. the following creates two slightly different plots

syn <- augsynth(lngdpcapita ~ treated, fips, year_qtr, kansas,
                progfunc = "None", scm = T)
# no vertical line at treatment time
plot(syn, plot_type = "placebo")
# yes vertical line at treatment time
permutation_plot(syn)

Comment on lines +909 to +912
} else if (plot_type == 'outcomes') {
p <- augsynth_outcomes_plot(augsynth, measure = 'synth')
} else if (plot_type == 'outcomes raw average') {
p <- augsynth_outcomes_plot(augsynth, measure = c('synth', 'average'))
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These should have the option to add the uncertainty intervals (i.e. with inf = TRUE following the original logic of the code)

Comment on lines +943 to +945
p <- p + ggplot2::geom_ribbon(ggplot2::aes(ymin = Estimate - 2 * Std.Error,
ymax = Estimate + 2 * Std.Error),
alpha = 0.2)
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Something is happening here where if you just run plot(syn) for a augsynth object syn, the default behavior doesn't include uncertainty quantification but this line of geom_ribbon is running with a warning about all missing inputs. This might go away if the default behavior is to plot conformal inference intervals, but still it comes up using plot(syn, inf_type = NULL)


#' Calculate MDES
#'
#' Use our method for Calculating SEs, p-values, and MDEs by looking
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok can you just strip this out here then? Alternatively, you can out in a citation etc. Something more than "our" method, since I'd like for it to be clear what different functions are doing.


del = as.numeric( diff( q ) )
z = -qnorm( alpha )
SE_imp = del / (2*z)
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please remove it if it is not being used.

units$weights[ units$ever_Tx == 1 ] = 0

# confirm that we have placebos for every entity over every observed time period
stopifnot("Placebos do not cover every entity over every observed time period." = (ncol(augsynth$data$X) + ncol(augsynth$data$y)) * nrow(augsynth$data$y) == nrow(lest))
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@lmiratrix @n-delaney does this ever happen or were you adding in a hypothetical check?



# Impact is difference between observed and imputed control-side outcome
lest$impact = lest$Yobs - lest$Yhat
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please remove it if it isn't needed


# These should be the same? Or close to the same?
# (They were not under ridge, probably due to the cross-validation procedure.)
gaps7 <- augsynth:::get_placebo_gaps(syn7)
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@n-delaney @lmiratrix the question marks here make me nervous

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants