weighted_least_squares_wls

Weighted Least Squares (WLS) and Ordinary Least Squares (OLS). Covers heteroscedasticity, Feasible WLS, Huber robust regression, and validates methods with a Monte Carlo simulation

Overview

This project presents an in-depth analysis of Weighted Least Squares (WLS) regression, comparing it with Ordinary Least Squares (OLS) and alternative estimation methods in the presence of heteroscedasticity. Through systematic implementation, diagnostic evaluation, and empirical validation, this project provides a comprehensive understanding of regression techniques for heteroscedastic data.

Key Objectives

Investigate the impact of heteroscedasticity on regression estimation
Compare WLS and OLS performance across multiple metrics
Implement practical alternatives including Feasible WLS and robust methods
Validate findings through Monte Carlo simulation
Provide reproducible Python implementations with thorough diagnostics

Core Methodologies

1. Weighted Least Squares (WLS)

Implementation with known variance weights
Comparison of efficiency gains over OLS
Prediction interval analysis

2. Feasible Weighted Least Squares (FWLS)

Two-stage estimation when variance structure is unknown
Residual-based weight estimation
Performance comparison with true WLS

3. Robust Regression Methods

Iterative reweighting approaches
Huber's M-estimator for outlier resistance
Convergence behavior analysis

4. Diagnostic Framework

Residual analysis and heteroscedasticity detection
Q-Q plots for normality assessment
Comprehensive model comparison metrics

5. Empirical Validation

Monte Carlo simulation (1000 iterations)
Bias, variance, and MSE comparison
Small-sample performance evaluation

Technical Implementation

Dependencies

import numpy as np
import matplotlib.pyplot as plt
import statsmodels.api as sm
from scipy import stats
from statsmodels.iolib.table import SimpleTable

Data Generation

Artificial dataset with controlled heteroscedasticity
Two variance groups (low/high) with 3:1 standard deviation ratio
Quadratic true relationship with linear estimation (intentional misspecification)

Model Performance Ranking

Feasible WLS (FWLS) - Best overall when variance is unknown
WLS with known weights - Optimal when variance structure is known
Huber Robust Regression - Excellent outlier resistance
OLS with HC corrections - Moderate improvement over standard OLS
Standard OLS - Least efficient under heteroscedasticity

Practical Insights

FWLS performs comparably to WLS with known weights
Iterative reweighting requires careful implementation to avoid instability
Model diagnostics are crucial for identifying remaining issues
Monte Carlo validation confirms theoretical efficiency advantages

Getting Started

Prerequisites

pip install numpy matplotlib statsmodels scipy

Basic Usage

Clone the repository
Install required dependencies
Open and run the Jupyter notebook
Modify parameters to explore different scenarios

Parameters to Experiment With

Variance ratio between groups
Sample size
Degree of model misspecification
Heteroscedasticity patterns
Number of Monte Carlo iterations

Interpretation Guidelines

When to Use WLS/FWLS

Heteroscedasticity detected in residual plots
Prior knowledge of variance structure available
Prediction precision is a primary concern
Efficient parameter estimation required

When to Consider Alternatives

Outliers present → Huber robust regression
Variance structure unknown → FWLS
Limited sample size → OLS with HC corrections
Computational simplicity needed → Standard OLS

Diagnostic Checklist

Residual plots - Check for heteroscedasticity patterns
Q-Q plots - Assess normality assumption
Standard error comparison - Evaluate efficiency gains
Prediction intervals - Compare precision
Model selection criteria - AIC/BIC comparison
Monte Carlo results - Validate small-sample performance

Theoretical Background

Mathematical Foundation

WLS minimizes: $$\sum_{i=1}^{n} w_i (y_i - \hat{y}_i)^2$$ where $w_i = 1/\sigma_i^2$, giving less weight to observations with higher variance.

Efficiency Considerations

Under heteroscedasticity, WLS achieves the Gauss-Markov property (Best Linear Unbiased Estimator), while OLS remains unbiased but inefficient.

Learning Outcomes

Through this analysis, users will understand:

The impact of heteroscedasticity on regression estimation
Practical implementation of WLS and alternatives
Diagnostic techniques for model validation
Empirical performance evaluation methods
Trade-offs between different estimation approaches

Contributing

Contributions are welcome! Please feel free to:

Report issues or bugs
Suggest enhancements or additional methods
Improve documentation
Share use cases or applications

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Statsmodels development team for comprehensive statistical tools
Academic references listed in the notebook
Open-source community for invaluable resources and support

Note

This notebook is designed for educational and research purposes. Real-world applications may require additional considerations and validation.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
weighted_least_squares_wls.ipynb		weighted_least_squares_wls.ipynb

Folders and files

Latest commit

History

Repository files navigation

weighted_least_squares_wls

Overview

Key Objectives

Core Methodologies

1. Weighted Least Squares (WLS)

2. Feasible Weighted Least Squares (FWLS)

3. Robust Regression Methods

4. Diagnostic Framework

5. Empirical Validation

Technical Implementation

Dependencies

Data Generation

Model Performance Ranking

Practical Insights

Getting Started

Prerequisites

Basic Usage

Parameters to Experiment With

Interpretation Guidelines

When to Use WLS/FWLS

When to Consider Alternatives

Diagnostic Checklist

Theoretical Background

Mathematical Foundation

Efficiency Considerations

Learning Outcomes

Contributing

License

Acknowledgments

Note

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages