Add Comparative Interrupted Time Series documentation with geo-experimentation example #595

Copilot · 2025-12-19T16:01:53Z

Plan: Comparative Interrupted Time Series Documentation

All tasks completed successfully!

Original prompt

This section details the original issue you should resolve

<issue_title>New docs example of Comparative Interrupted Time Series</issue_title>
<issue_description>* We can use the existing InterruptedTimeSeries class. But if we add in one (or more) control units as predictors then we essentially have Comparative Interrupted Time Series (CITS).

See @juanitorduz's example blog post, which could serve as the basis for a new docs page. This could be done using the existing InterruptedTimeSeries class and the formula treated_scaled ~ 1 + control_scaled. Note: the blog post cites (and is partially based on) the paper "Estimating Ad Effectiveness using Geo Experiments in a Time-Based Regression Framework" (https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/45950.pdf).

Highlight the similarities and differences to regular interrupted time series (no control units in predictors) and also synthetic control (no sum to 1 constraint, can have intercept).

To be clear, when predictors are not control units, then we have regular ITS. When we have one or more control units in the predictor list then we have CITS.

An acceptable PR would add another docs page in the concise style of our existing docs, but focused on the example in @juanitorduz's blog post. It should be concise and to the point, but also approachable and clear. It should emphasise use of the existing InterruptedTimeSeries class and educate the reader on the similarities and differences between ITS, CITS, and SC.

No changes to code or tests are required here.

Juan is happy for this docs page to be based on his blog post. Data is available here https://github.com/juanitorduz/website_projects/blob/master/data/zipcodes_data.csv and the data generating process (code to generate the data) is here https://github.com/juanitorduz/website_projects/blob/master/Python/zipcode_data.py. But I don't think we need to look at this, we can just use the saved csv.

Maybe we want to port that csv file into CausalPy and add it into the data folder. We would in fact have to update load_data in datasets.py and we should add an attribution in the docstring with a link back to Juan's blog post URL.

The new docs page should be added into the existing Interrupted Time Series section in the docs.

The title of the notebook should be "Geo-Experimentation with Comparative Interrupted Time Series". The introduction should obviously make clear that the example is based on Juan's blog post.</issue_description>
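
For readers new to the idea, here is a minimal sketch of what the formula `treated ~ 1 + control` amounts to. It does not use CausalPy or the real zipcode data: it fits an ordinary least squares line on synthetic pre-campaign data and projects it forward as the counterfactual. The series, the period lengths, the effect size `true_lift`, and the helper `fit_simple_ols` are all assumptions made for illustration.

```python
import math
import random


def fit_simple_ols(x, y):
    """Closed-form least squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


rng = random.Random(0)   # fixed seed so the sketch is reproducible
n_pre, n_post = 90, 30   # hypothetical pre/post period lengths
true_lift = 2.0          # hypothetical campaign effect on the treated unit

# Control unit: weekly seasonality plus noise, never exposed to the campaign.
control = [
    10 + 2 * math.sin(2 * math.pi * t / 7) + rng.gauss(0, 0.3)
    for t in range(n_pre + n_post)
]
# Treated unit: tracks the control, plus the lift once the campaign starts.
treated = [
    1 + 1.5 * c + rng.gauss(0, 0.3) + (true_lift if t >= n_pre else 0.0)
    for t, c in enumerate(control)
]

# Fit treated ~ 1 + control on pre-campaign data only.
intercept, slope = fit_simple_ols(control[:n_pre], treated[:n_pre])

# Counterfactual: what the treated unit would have done without the campaign.
counterfactual = [intercept + slope * c for c in control[n_pre:]]
impact = [obs - cf for obs, cf in zip(treated[n_pre:], counterfactual)]
avg_impact = sum(impact) / len(impact)

print(f"slope={slope:.2f}, estimated average lift={avg_impact:.2f}")
```

Unlike synthetic control, nothing here constrains the coefficient on the control to sum to 1, and an intercept is allowed, which is the distinction the issue asks the docs page to highlight.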

Comments on the Issue (you are @copilot in this section)

Fixes New docs example of Comparative Interrupted Time Series #594


Co-authored-by: drbenvincent <[email protected]>

codecov · 2025-12-19T17:46:15Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 93.21%. Comparing base (dd7dfc4) to head (b5a2c17).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #595   +/-   ##
=======================================
  Coverage   93.21%   93.21%           
=======================================
  Files          35       35           
  Lines        5511     5511           
  Branches      358      358           
=======================================
  Hits         5137     5137           
  Misses        246      246           
  Partials      128      128


drbenvincent

We already have an entry in the glossary for Comparative interrupted time-series. But we should add a sphinx reference back to the glossary in the first mentions of the term in the notebook. We should also slightly expand that glossary description with a bit more detail.

Under "Key characteristics of CITS" add that additional predictors (like temperature or time of year) can also be added

None of the outputs in the notebook are displaying in the rendered docs html. Do you need me to manually execute the notebook? I can push the executed notebook if so.

docs/source/notebooks/its_pymc_comparative.ipynb

- Expanded glossary entry for CITS with implementation details
- Added note about additional predictors (temperature, time of year)
- Replaced markdown links with proper Sphinx citations (:cite:)
- Added glossary term references using {term} syntax
- Executed notebook to display outputs in rendered docs

Co-authored-by: drbenvincent <[email protected]>

Copilot · 2025-12-19T18:02:58Z

We already have an entry in the glossary for Comparative interrupted time-series. But we should add a sphinx reference back to the glossary in the first mentions of the term in the notebook. We should...

All addressed in commit a5f944c:

✅ Added glossary references using {term} syntax - first mention of CITS now links to glossary, plus ITS and Synthetic Control terms
✅ Expanded glossary entry with implementation details, formula syntax, and differences from Synthetic Control
✅ Added bullet point under "Key characteristics": "Additional predictors (like temperature, time of year, or other covariates) can also be included alongside control units"
✅ Notebook executed - all outputs now display in rendered docs (8 code cells with outputs, 2 plots visible)

Warning

Firewall rules blocked me from connecting to one or more addresses

I tried to connect to the following addresses, but was blocked by firewall rules:

docs.python.org
- Triggering command: /home/REDACTED/.local/bin/sphinx-build sphinx-build -b html docs/source docs/_build lude (dns block)
docs.scipy.org
- Triggering command: /home/REDACTED/.local/bin/sphinx-build sphinx-build -b html docs/source docs/_build lude (dns block)
docs.xarray.dev
- Triggering command: /home/REDACTED/.local/bin/sphinx-build sphinx-build -b html docs/source docs/_build lude (dns block)
matplotlib.org
- Triggering command: /home/REDACTED/.local/bin/sphinx-build sphinx-build -b html docs/source docs/_build lude (dns block)
numpy.org
- Triggering command: /home/REDACTED/.local/bin/sphinx-build sphinx-build -b html docs/source docs/_build lude (dns block)
pandas.pydata.org
- Triggering command: /home/REDACTED/.local/bin/sphinx-build sphinx-build -b html docs/source docs/_build lude (dns block)
python.arviz.org
- Triggering command: /home/REDACTED/.local/bin/sphinx-build sphinx-build -b html docs/source docs/_build lude (dns block)
scikit-learn.org
- Triggering command: /home/REDACTED/.local/bin/sphinx-build sphinx-build -b html docs/source docs/_build lude (dns block)
www.pymc.io
- Triggering command: /home/REDACTED/.local/bin/sphinx-build sphinx-build -b html docs/source docs/_build lude (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

drbenvincent

pre-commit checks are failing. If you run the pre commit checks it should auto resolve some or all issues.

Referring back to Juan's blog post, there is no campaign_end. We should remove that variable, and from the plot.

And the start date of the campaign is wrong. In the original blog post it is listed as start_campaign = datetime(2022, 7, 1). Check correctness of dates in the text.

Remove the line az.style.use("arviz-doc")

- Updated campaign start date to July 1, 2022 (was incorrectly May 2)
- Updated text to reflect July 1-31, 2022 campaign period
- Removed campaign_end variable (not in original blog post)
- Removed campaign end line from visualization
- Removed az.style.use("arviz-doc") line
- Fixed formatting issues (pre-commit checks now pass)

Co-authored-by: drbenvincent <[email protected]>

Copilot · 2025-12-19T18:32:50Z

pre-commit checks are failing. If you run the pre commit checks it should auto resolve some or all issues.

Referring back to Juan's blog post, there is no campaign_end. We should remove that variab...

All addressed in commit 3ed8d81:

✅ Pre-commit checks - all now pass (ruff format auto-fixed formatting)
✅ Removed campaign_end variable and corresponding line from plot
✅ Fixed campaign start date to July 1, 2022 (was incorrectly May 2)
✅ Updated text to reflect "July 1 to July 31, 2022" campaign period
✅ Removed az.style.use("arviz-doc") line

The notebook now correctly matches Juan's blog post with a July 1-31, 2022 campaign period.

drbenvincent

change the section from "Visualize Pre-Treatment Trends" to "Visualise data"

In that section, add a new scatter plot of just pre-treatment data. Control on the x axis, treatment on the y-axis. Add some explanation to point out the correlation of the two.

Please use the following text to update the end of the notebook, probably the "when to use CITS" section

Differences (why CITS is often preferred when feasible)

Control for concurrent events
• ITS: vulnerable if something else changes at the same time (e.g., a new guideline, coding change, economic shift).
• CITS: the control series helps account for those shared shocks, improving internal validity.
Core identifying assumption
• ITS: assumes that, absent the intervention, the pre-intervention trend would have continued (and no other intervention affects the series at that time).
• CITS: assumes that, absent the intervention, the treated and control would have followed similar trajectories (often framed as a “parallel trends” idea, but allowing rich pre-trends and seasonality).
Data requirements
• ITS: one series is enough.
• CITS: needs a credible control series (and must check for spillovers/contamination).
Failure modes
• CITS can be biased if the control is poorly chosen (different underlying drivers), if the intervention indirectly affects the control (spillover), or if other changes affect treated and control differently.

⸻

When to use which
• Use ITS when a good control group/series does not exist, but you have many pre- and post-intervention observations and can argue no major coincident changes.
• Use CITS when you can identify a credible comparison series and want stronger protection against coincident events and secular trends.
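
The "control for concurrent events" point can be illustrated with a small simulation. This is only a sketch under stated assumptions: the data are synthetic, the true intervention effect is zero, and a hypothetical `shared_shock` hits both series at the intervention time. ITS is approximated as a linear time trend fitted on the pre-period, and CITS as a regression of the treated series on the control series; neither uses CausalPy.

```python
import math
import random


def fit_simple_ols(x, y):
    """Closed-form least squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def mean_gap(observed, predicted):
    """Average difference between observed values and a counterfactual."""
    return sum(o - p for o, p in zip(observed, predicted)) / len(observed)


rng = random.Random(1)   # fixed seed for reproducibility
n_pre, n_post = 90, 30   # hypothetical period lengths
shared_shock = 3.0       # hypothetical coincident event affecting both units
time = list(range(n_pre + n_post))

# Control: trend + weekly seasonality + noise, plus the shock after n_pre.
control = [
    10 + 0.02 * t + 2 * math.sin(2 * math.pi * t / 7) + rng.gauss(0, 0.3)
    + (shared_shock if t >= n_pre else 0.0)
    for t in time
]
# Treated: tracks the control (so it inherits the shock); true effect is zero.
treated = [1 + c + rng.gauss(0, 0.3) for c in control]

# ITS-style counterfactual: extrapolate the treated unit's own pre-period trend.
its_a, its_b = fit_simple_ols(time[:n_pre], treated[:n_pre])
its_effect = mean_gap(treated[n_pre:], [its_a + its_b * t for t in time[n_pre:]])

# CITS-style counterfactual: predict the treated unit from the control unit.
cits_a, cits_b = fit_simple_ols(control[:n_pre], treated[:n_pre])
cits_effect = mean_gap(treated[n_pre:], [cits_a + cits_b * c for c in control[n_pre:]])

print(f"ITS estimate:  {its_effect:.2f} (mistakes the shared shock for an effect)")
print(f"CITS estimate: {cits_effect:.2f} (shock is absorbed by the control)")
```

The same setup also shows the failure mode listed above: if the shock hit only the treated unit, or the intervention spilled over into the control, the CITS estimate would be biased too.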

- Changed section title from "Visualize Pre-Treatment Trends" to "Visualise data"
- Added scatter plot showing correlation between treatment and control in pre-treatment period
- Added explanation text highlighting the correlation between units
- Completely rewrote "When to Use CITS" section with detailed comparisons:
  - Differences between ITS and CITS (concurrent events, assumptions, data requirements, failure modes)
  - Clear guidance on when to use ITS vs CITS vs Synthetic Control
- Improved structure and clarity of key takeaways

Co-authored-by: drbenvincent <[email protected]>

Copilot · 2025-12-19T19:10:01Z

change the section from "Visualize Pre-Treatment Trends" to "Visualise data"

In that section, add a new scatter plot of just pre-treatment data. Control on the x axis, treatment on the y-axis. Add so...

All addressed in commit a27ec36:

✅ Changed section title from "Visualize Pre-Treatment Trends" to "Visualise data"
✅ Added scatter plot showing pre-treatment correlation between control (x-axis) and treatment (y-axis) with correlation coefficient displayed
✅ Added explanatory text highlighting the correlation between units
✅ Completely rewrote "When to Use CITS" section with your provided content:
- Differences between ITS and CITS (concurrent events, assumptions, data requirements, failure modes)
- "When to Use Which Method" section with guidance on ITS, CITS, and Synthetic Control
✅ Pre-commit checks pass

The scatter plot shows strong correlation (r ≈ 0.976) between treatment and control in the pre-treatment period, supporting the use of CITS.
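
As a rough sketch of the check behind that number, the pre-treatment correlation is just the Pearson coefficient computed on observations before the campaign start. The example below uses synthetic series rather than the zipcode data, so it will not reproduce r ≈ 0.976; the helper `pearson_r` and all values are illustrative assumptions.

```python
import math
import random


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)


rng = random.Random(2)   # fixed seed for reproducibility
n_pre = 90               # hypothetical number of pre-treatment observations

# Synthetic pre-treatment series: the treated unit tracks the control unit.
control_pre = [
    10 + 2 * math.sin(2 * math.pi * t / 7) + rng.gauss(0, 0.3)
    for t in range(n_pre)
]
treated_pre = [1 + 1.5 * c + rng.gauss(0, 0.3) for c in control_pre]

r = pearson_r(control_pre, treated_pre)
print(f"pre-treatment correlation r = {r:.3f}")
```

A high pre-treatment correlation suggests the control is a useful predictor of the treated unit, but it does not by itself rule out spillover or diverging post-treatment drivers.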

drbenvincent

The original blog post used a Student T likelihood. We should also do that, and we can do that through the use of custom priors with the Prior class. There is probably an example in one of the other notebooks.

Change the notebook title to "Comparative Interrupted Time Series: a geo-experimentation example"

- Changed title to "Comparative Interrupted Time Series: a geo-experimentation example"
- Added import for Prior from pymc_extras.prior
- Updated LinearRegression model to use Student T likelihood (nu=4) matching original blog post
- Model now uses robust Student T distribution instead of Normal for observations
- Pre-commit checks pass, notebook executes successfully

Co-authored-by: drbenvincent <[email protected]>

Copilot · 2025-12-19T19:39:32Z

The original blog post used a Student T likelihood. We should also do that, and we can do that through the use of custom priors with the Prior class. There is probably an example in one of the other...

Both addressed in commit 41346bf:

✅ Changed title to "Comparative Interrupted Time Series: a geo-experimentation example"
✅ Updated model to use Student T likelihood (nu=4) matching the original blog post
- Added from pymc_extras.prior import Prior import
- Modified LinearRegression model to use custom priors with StudentT distribution for y_hat
- Uses robust Student T instead of Normal distribution for observations

The model now matches the approach from Juan's original blog post with Student T likelihood for more robust inference.
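
To make "more robust" concrete, the sketch below compares the log-density that a Normal likelihood and a Student T likelihood with nu=4 assign to residuals of different sizes. It is plain Python, not the `Prior` or CausalPy API, and the unit scale and residual values are assumptions chosen for illustration.

```python
import math


def normal_logpdf(x, mu=0.0, sigma=1.0):
    """Log-density of a Normal(mu, sigma) distribution at x."""
    z = (x - mu) / sigma
    return -0.5 * z * z - math.log(sigma) - 0.5 * math.log(2 * math.pi)


def student_t_logpdf(x, nu=4.0, mu=0.0, sigma=1.0):
    """Log-density of a location-scale Student T distribution at x."""
    z = (x - mu) / sigma
    return (
        math.lgamma((nu + 1) / 2)
        - math.lgamma(nu / 2)
        - 0.5 * math.log(nu * math.pi)
        - math.log(sigma)
        - (nu + 1) / 2 * math.log1p(z * z / nu)
    )


# Residuals measured in units of the likelihood scale (illustrative values).
for resid in (0.5, 2.0, 6.0):
    print(
        f"residual={resid:>4}: "
        f"Normal logp={normal_logpdf(resid):8.2f}, "
        f"StudentT(nu=4) logp={student_t_logpdf(resid):8.2f}"
    )
```

Because the Student T density has heavier tails, a single large residual is penalised far less than under a Normal likelihood, so outlying observations pull the regression coefficients around less.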

drbenvincent · 2025-12-19T21:06:50Z

Bear in mind I've pushed some commits to update and re-run the notebook

drbenvincent · 2025-12-19T21:10:24Z

bugbot run

cursor · 2025-12-19T21:10:29Z

PR Summary

Add multiple CITS/geo-experimentation and ITS-related citations to docs/source/references.bib.

Written by Cursor Bugbot for commit 188e814.

cursor

✅ Bugbot reviewed your changes and found no bugs!

Streamlined and clarified markdown explanations in the CITS notebook for improved readability and conciseness. Combined and reworded sections to better explain comparative interrupted time series, its implementation, assumptions, and practical guidance, while removing redundant bullet points and enhancing the overall narrative flow.

Initial plan

7f319b2

Copilot AI assigned Copilot and drbenvincent Dec 19, 2025

Copilot started work on behalf of drbenvincent December 19, 2025 16:02 View session

Add CITS documentation: data, notebook, and references

edc6285

Co-authored-by: drbenvincent <[email protected]>

Copilot AI changed the title ~~[WIP] Add documentation example for Comparative Interrupted Time Series~~ Add Comparative Interrupted Time Series documentation with geo-experimentation example Dec 19, 2025

Copilot finished work on behalf of drbenvincent December 19, 2025 16:18

Copilot AI requested a review from drbenvincent December 19, 2025 16:18

drbenvincent requested changes Dec 19, 2025


docs/source/notebooks/its_pymc_comparative.ipynb

Copilot started work on behalf of drbenvincent December 19, 2025 17:53 View session

Copilot AI requested a review from drbenvincent December 19, 2025 18:03

Copilot finished work on behalf of drbenvincent December 19, 2025 18:03

drbenvincent requested changes Dec 19, 2025


Copilot started work on behalf of drbenvincent December 19, 2025 18:26 View session

Copilot AI requested a review from drbenvincent December 19, 2025 18:33

Copilot finished work on behalf of drbenvincent December 19, 2025 18:33

drbenvincent requested changes Dec 19, 2025


Copilot started work on behalf of drbenvincent December 19, 2025 19:02 View session

Copilot AI requested a review from drbenvincent December 19, 2025 19:10

Copilot finished work on behalf of drbenvincent December 19, 2025 19:10

drbenvincent requested changes Dec 19, 2025


Copilot started work on behalf of drbenvincent December 19, 2025 19:33 View session

Copilot AI requested a review from drbenvincent December 19, 2025 19:40

Copilot finished work on behalf of drbenvincent December 19, 2025 19:40

drbenvincent added 3 commits December 19, 2025 20:52

add references, more info, and cell-tags

203d589

text about custom likelihood + adjust table

5fc1e0b

rerun notebook

188e814

drbenvincent requested review from NathanielF and juanitorduz December 19, 2025 21:08

cursor bot reviewed Dec 19, 2025


drbenvincent added 3 commits December 19, 2025 21:27

Clarify and illustrate parallel trends assumption + re-run notebook

72dae9e

Put notebook in a new CITS section of the example notebooks

b5a2c17

drbenvincent added the documentation and enhancement labels Dec 20, 2025
