Adversarial Model Extraction on Graph Neural Networks

Implementation of the paper "Adversarial Model Extraction on Graph Neural Networks" by David DeFazio and Arti Ramesh (arXiv:1912.07721v1).

Overview

This project implements a model extraction attack on Graph Neural Networks (GNNs), demonstrating how an adversary can steal a GCN model with only:

API access to victim model predictions
A small 2-hop subgraph (10-150 nodes)
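
The 2-hop subgraph is the set of nodes reachable from a chosen center node in at most two edges, together with the edges among them. A minimal sketch of that idea is below, assuming the graph is stored as a plain adjacency dictionary; the function names `k_hop_nodes` and `induced_subgraph` are illustrative and are not taken from this repository's `src/utils/` code.

```python
from collections import deque


def k_hop_nodes(adjacency, center, k=2):
    """Return the set of nodes within k hops of `center` (breadth-first search).

    `adjacency` maps each node id to an iterable of neighbor ids.
    """
    seen = {center}
    frontier = deque([(center, 0)])
    while frontier:
        node, depth = frontier.popleft()
        if depth == k:
            continue  # nodes at depth k are kept but not expanded further
        for neighbor in adjacency.get(node, ()):
            if neighbor not in seen:
                seen.add(neighbor)
                frontier.append((neighbor, depth + 1))
    return seen


def induced_subgraph(adjacency, nodes):
    """Keep only the edges whose endpoints are both in `nodes`."""
    return {n: sorted(m for m in adjacency.get(n, ()) if m in nodes) for n in nodes}


# Toy path graph 0-1-2-3-4 (hypothetical data, not Cora).
toy = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
nodes = k_hop_nodes(toy, center=0)
sub = induced_subgraph(toy, nodes)
```

On the toy path graph, the 2-hop neighborhood of node 0 contains nodes 0, 1, and 2; the edge from 2 to 3 is dropped because node 3 lies outside the subgraph.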

Key Result: The extracted model reaches 71.8% fidelity on the Cora dataset with 10 samples per class, compared with roughly 80% reported in the paper.

Note on Development Timeline:

This project was developed locally from September 2024 to April 2025 and published to GitHub in March 2025. Commit timestamps reflect when the work was done locally rather than when it was pushed to GitHub.

Quick Start

1. Install Dependencies

pip install -r requirements.txt

2. Train Victim Models

python scripts/train_victim_cora.py
python scripts/train_victim_pubmed.py

3. Run Extraction Attack

python experiments/run_extraction.py

Project Structure

.
├── src/
│   ├── data/              # Dataset loaders (Cora, Pubmed)
│   ├── models/            # GCN architecture
│   ├── extraction/        # Algorithm 1 & 2 implementation
│   └── utils/             # Graph utilities, feature sampling
├── experiments/           # Experiment scripts
│   ├── run_extraction.py      # Single extraction experiment
│   ├── cora_experiments.py    # Batch Cora experiments
│   └── pubmed_experiments.py  # Batch Pubmed experiments
├── scripts/               # Training scripts
└── models/                # Saved victim models

Key Features

✓ Algorithm 1: Core extraction using subgraph sampling
✓ Algorithm 2: Approximate inaccessible nodes
✓ Fidelity Measurement: Compare victim vs extracted predictions
✓ Multiple Datasets: Cora (7 classes) and Pubmed (3 classes)
✓ Configurable: Samples per class, epochs, noise parameters
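
Fidelity here means agreement between the victim's predictions and the extracted model's predictions, not agreement with ground-truth labels. A minimal sketch of that measurement follows, assuming both models' predictions are available as equal-length sequences of class ids; the `fidelity` function is illustrative and may differ from the helper used in this repository.

```python
def fidelity(victim_preds, extracted_preds):
    """Fraction of nodes on which the extracted model agrees with the victim."""
    if len(victim_preds) != len(extracted_preds):
        raise ValueError("prediction sequences must have the same length")
    if len(victim_preds) == 0:
        raise ValueError("cannot compute fidelity on zero predictions")
    agreements = sum(v == e for v, e in zip(victim_preds, extracted_preds))
    return agreements / len(victim_preds)


# Hypothetical predictions for five nodes; the models disagree on one of them.
victim = [0, 1, 2, 2, 1]
extracted = [0, 1, 2, 0, 1]
score = fidelity(victim, extracted)
```

An extracted model can score high on fidelity while being wrong wherever the victim is wrong, which is why fidelity is reported separately from accuracy.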

Results

Dataset	Samples/Class	Our Fidelity	Paper Fidelity
Cora	10	71.8%	~80%
Cora	50	75.0%	~82%

See RESULTS.md for detailed analysis.

How It Works

1. Victim Training: Train GCN on citation network
2. Subgraph Access: Extract 2-hop neighborhood around center node
3. Sample Generation: Create synthetic samples for each class
4. Query Victim: Get predictions for perturbed subgraphs
5. Train Extraction: Learn model that mimics victim predictions
6. Measure Fidelity: Test agreement on full graph
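
The sample generation and victim query steps above can be sketched as a loop that perturbs the center node's features, sends the perturbed subgraph to the victim, and records the returned label as a training target. This is a rough illustration under stated assumptions: features are binary bag-of-words vectors, perturbation is an independent bit flip with a fixed probability, and the victim is any callable taking `(features, node)`. The names `perturb_features`, `collect_labeled_samples`, and `toy_victim` are hypothetical, the per-class sampling is not modeled, and the noise model in `src/extraction/` may differ.

```python
import random


def perturb_features(features, flip_prob, rng):
    """Flip each binary feature independently with probability `flip_prob`."""
    return [1 - x if rng.random() < flip_prob else x for x in features]


def collect_labeled_samples(query_victim, features, center, num_samples,
                            flip_prob=0.1, seed=0):
    """Query the victim on perturbed copies of the subgraph features.

    Returns a list of (perturbed_center_features, victim_label) pairs.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(num_samples):
        perturbed = dict(features)  # shallow copy; only the center entry is replaced
        perturbed[center] = perturb_features(features[center], flip_prob, rng)
        label = query_victim(perturbed, center)  # stands in for the prediction API
        samples.append((perturbed[center], label))
    return samples


def toy_victim(features, node):
    """Stand-in victim (assumption): class 1 if most of the node's features are set."""
    vec = features[node]
    return int(sum(vec) * 2 > len(vec))


# Hypothetical two-node subgraph with 5-dimensional binary features.
features = {0: [1, 1, 0, 0, 1], 1: [0, 0, 0, 1, 0]}
samples = collect_labeled_samples(toy_victim, features, center=0, num_samples=4)
```

The collected pairs would then serve as the training set for the extracted model, with the victim's labels taking the place of ground truth.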

Documentation

DOCUMENTATION.md - Detailed usage guide
RESULTS.md - Experimental results and analysis

Citation

@article{defazio2019adversarial,
  title={Adversarial Model Extraction on Graph Neural Networks},
  author={DeFazio, David and Ramesh, Arti},
  journal={arXiv preprint arXiv:1912.07721},
  year={2019}
}

License

This is a research implementation for educational purposes.
