feat: Add Weighted Average Quantile (WAQ), a non-parametric estimator, as an analysis method for when CUPED is not available

## Motivation

Online experimentation metrics (revenue, payments) are often thick-tailed, 
which inflates the variance of the standard difference-in-means estimator and 
leads to wide confidence intervals.

Athey, Bickel, Chen, Imbens & Pollmann (2023) — *Semiparametric Estimation of 
Treatment Effects in Randomized Experiments* — propose two semiparametrically 
efficient estimators for this setting. This issue proposes implementing the 
simpler of the two: the **Weighted Average Quantile (WAQ)** estimator.

WAQ is particularly valuable when CUPED is not available, for example in 
e-commerce contexts where purchase frequency is low (think Zalando, Zara) and 
pre-experiment data is scarce.

## What is WAQ?

Under a constant additive treatment effect assumption, the WAQ estimator 
computes a weighted average of sorted quantile differences between treatment 
and control. The weights are proportional to minus the second derivative of the 
log density of the control outcome distribution, estimated nonparametrically 
via adaptive kernel density estimation.
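
The description above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the paper's or the R package's implementation: it swaps the adaptive KDE for a fixed-bandwidth Gaussian KDE (Silverman's rule of thumb), evaluates the weights on a grid of interior quantile levels, and clips negative weight estimates to zero. The names `waq_estimate`, `neg_d2_log_kde`, and `n_quantiles` are hypothetical and exist only for this example.

```python
import numpy as np


def neg_d2_log_kde(points, sample, bandwidth):
    """Minus the second derivative of the log of a Gaussian KDE at `points`."""
    z = (points[:, None] - sample[None, :]) / bandwidth
    k = np.exp(-0.5 * z**2)                      # Gaussian kernel, constants dropped
    f = np.maximum(k.sum(axis=1), np.finfo(float).tiny)  # density (up to a constant)
    d1 = (-z * k).sum(axis=1) / bandwidth        # first derivative, same constant
    d2 = ((z**2 - 1.0) * k).sum(axis=1) / bandwidth**2   # second derivative
    # (log f)'' = f''/f - (f'/f)^2; the dropped constant cancels in both ratios.
    return -(d2 / f - (d1 / f) ** 2)


def waq_estimate(treatment, control, n_quantiles=99):
    """Weighted average of quantile differences (illustrative sketch only)."""
    treatment = np.asarray(treatment, dtype=float)
    control = np.asarray(control, dtype=float)

    # Interior quantile levels; the extreme tails are deliberately left out.
    levels = np.arange(1, n_quantiles + 1) / (n_quantiles + 1)
    q_t = np.quantile(treatment, levels)
    q_c = np.quantile(control, levels)

    # Assumption: fixed Silverman bandwidth instead of an adaptive KDE.
    std = control.std(ddof=1)
    iqr = np.subtract(*np.percentile(control, [75, 25]))
    scale = min(std, iqr / 1.34) if iqr > 0 else std
    bandwidth = 0.9 * scale * control.size ** (-0.2)

    # Weights proportional to minus the second derivative of the log density
    # of the control outcomes, evaluated at the control quantiles.
    raw = neg_d2_log_kde(q_c, control, bandwidth)
    raw = np.clip(raw, 0.0, None)                # assumption: drop negative weights
    if raw.sum() > 0:
        weights = raw / raw.sum()
    else:
        weights = np.full(levels.size, 1.0 / levels.size)

    return float(weights @ (q_t - q_c)), weights


# Simulated thick-tailed outcomes with a constant additive effect of 0.5.
rng = np.random.default_rng(0)
control = rng.standard_t(3, size=2000)
treatment = rng.standard_t(3, size=2000) + 0.5

estimate, _ = waq_estimate(treatment, control)
print("WAQ sketch:         ", estimate)
print("difference in means:", treatment.mean() - control.mean())
```

A fixed bandwidth tends to undersmooth the tails, which is presumably one reason the paper uses an adaptive KDE. The sketch also omits standard errors entirely.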

Properties:
- For Normal outcomes: reduces to difference-in-means (no loss)
- For thick-tailed outcomes: substantially lower variance than difference-in-means
- Retains a causal interpretation even under mild misspecification (estimates a 
  weighted average of quantile treatment effects)
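
The first property follows directly from the weight definition. As a short worked check, write $f$ and $F$ for the control density and CDF, $q_1(u)$ and $q_0(u)$ for the treatment and control quantile functions, and $w(u)$ for the weight at quantile level $u$ (this notation is assumed here for illustration and may differ from the paper's):

$$
w(u) \;\propto\; -\frac{d^2}{dy^2} \log f(y)\,\Big|_{y = F^{-1}(u)}, \qquad \int_0^1 w(u)\,du = 1 .
$$

For a Normal control distribution with mean $\mu$ and variance $\sigma^2$:

$$
\log f(y) = -\frac{(y-\mu)^2}{2\sigma^2} - \tfrac{1}{2}\log(2\pi\sigma^2)
\quad\Longrightarrow\quad
-\frac{d^2}{dy^2}\log f(y) = \frac{1}{\sigma^2},
$$

which does not depend on $y$. The normalized weights are therefore uniform, $w(u) = 1$, and the weighted average of quantile differences collapses to

$$
\int_0^1 \bigl(q_1(u) - q_0(u)\bigr)\,du = \mathbb{E}[Y \mid \text{treatment}] - \mathbb{E}[Y \mid \text{control}],
$$

that is, the difference in means.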

## Proposed usage
```python
plan = AnalysisPlan.from_metrics_dict({
    "metrics": [{"name": "revenue", "metric_type": "simple"}],
    "variants": [
        {"name": "control", "is_control": True},
        {"name": "treatment", "is_control": False}
    ],
    "variant_col": "variant",
    "analysis_type": "waq"
})
```

## Caveats

- CUPED outperforms WAQ when pre-experiment data is available; WAQ is the 
  better choice when it isn't
- Computationally heavier than OLS due to adaptive KDE; we may need to explore 
  parallelization dependencies for large datasets

## Out of scope for this issue (that the R package adds)
- EIF (influence function) estimator
- Sample splitting / cross-fitting variant
- Proportional treatment effect model

## References
- Paper: https://arxiv.org/abs/2109.02603
- R implementation: https://github.com/michaelpollmann/parTreat
