Computer Science Master's Degree Thesis

Title: Leveraging Spatio-Temporal Traffic Patterns to Enhance Travel Time Estimation

Institution: University of Bonn, Institute for Informatics 3

Summary

This thesis focuses on improving travel time estimation (TTE), a key component in smart city mobility applications. The work introduces a pattern-aware ensemble approach that combines state-of-the-art TTE methods (TEMP, LightGBM, DeepTTE) through a novel weight calculation mechanism informed by spatio-temporal traffic patterns. A custom clustering algorithm was developed using a distance matrix that captures the interplay of multiple traffic-related features. Experiments on the Porto Taxi Dataset show that the proposed method achieves a mean absolute error of 51.39 seconds, outperforming the conventional ensemble baseline as well as two state-of-the-art models. While the approach does not consistently surpass the best single method across all metrics, results highlight the potential of incorporating traffic pattern analysis into ensemble TTE systems and point toward promising directions for future research.

The main goal of this thesis is to enhance travel time estimation by integrating an ensemble-averaging approach with pattern extraction. More precisely, we aim to extract patterns in a given trip data by clustering, and use this information to build targeted predictive models in an ensemble. Throughout the thesis, we investigate the following questions:

How can we learn (spatio-temporal) patterns from trip data?
How can we incorporate the information from learned patterns into an ensemble approach to improve travel time estimation?
Can the pattern-based ensemble approach enhance travel time estimation?
How does changing the group of features for extracting patterns change the overall performance of the approach?

The thesis, LaTeX source code and defense slides

Here is a link to the Pdf file to the thesis. Here you can find the LaTeX source code for the thesis. The presentation for my thesis defense can be found here.

The model architecture

The architecture of the proposed model consists of four main building blocks, as shown in the following figure: data preprocessing, prediction generation, pattern extraction and pattern-based weight computation.

The preprocessing block is responsible for data cleaning, feature engineering, feature transformation, feature selection and partitioning.
In the prediction generation block, $n$ state-of-the-art methods for travel time estimation are implemented and trained on the preprocessed data independently from each other. Each predictor returns a prediction $p_i$ for $i ∈ [1, n]$.
The pattern extraction block employs a clustering algorithm to extract patterns from the dataset, and divides it into qualitative classes. The clustering ensures that similar trips are grouped together within the same pattern class.
The resulting class labels and predictions are combined in the pattern-based weight computation, which decides on how strongly or weakly a predictor should affect the final prediction of a trip based on its performance in the pattern class it belongs to.

Pattern Extraction

We propose the following distance measure: $D_{i,j}^{\text{all}} = \alpha_{\text{traj}} D_{i,j}^{\text{traj}} + \alpha_{\text{temp}} D_{i,j}^{\text{temp}} + \alpha_{\text{categ}} D_{i,j}^{\text{categ}}$ where $D_{i,j}^{\text{traj}}$ is the distance between trip $i$ and $j$ based on their trajectory, $D_{i,j}^{\text{temp}}$ based on their temporal features, $D_{i,j}^{\text{categ}}$ based on their categorical features. For more information on the distance metric and it's motivation, please have a look at Chapter 4 of the thesis.

Name		Name	Last commit message	Last commit date
Latest commit History 126 Commits
Clustering		Clustering
DeepTTE		DeepTTE
Ensemble		Ensemble
Master thesis - LaTeX		Master thesis - LaTeX
TEMP		TEMP
__pycache__		__pycache__
lightGBM		lightGBM
.gitignore		.gitignore
DeepTTE-B128-K3-A01-sampled_traj.out_		DeepTTE-B128-K3-A01-sampled_traj.out_
DeepTTE-B128-K3-A01-sampled_traj_c.out_		DeepTTE-B128-K3-A01-sampled_traj_c.out_
Master_Thesis_Defense_Hayal_Deniz_Özer.pdf		Master_Thesis_Defense_Hayal_Deniz_Özer.pdf
Master_Thesis_Hayal_Deniz_Özer.pdf		Master_Thesis_Hayal_Deniz_Özer.pdf
README.md		README.md
TEMP_find_enlarge_test_val.out_		TEMP_find_enlarge_test_val.out_
TEMP_tau_0.8.out_		TEMP_tau_0.8.out_
TEMP_tau_0.9.out_		TEMP_tau_0.9.out_
data_processing.ipynb		data_processing.ipynb
data_processing.py		data_processing.py
run.py		run.py
run.sh		run.sh
sampling_trajectories.out_		sampling_trajectories.out_
sgn-dbscan-max-min.out_		sgn-dbscan-max-min.out_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Computer Science Master's Degree Thesis

Summary

The thesis, LaTeX source code and defense slides

The model architecture

Pattern Extraction

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Computer Science Master's Degree Thesis

Summary

The thesis, LaTeX source code and defense slides

The model architecture

Pattern Extraction

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages