Skip to content

technometrics-lab/17-Cyber-risk_and_the_cross-section_of_stock_returns

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cyber risk and the cross-section of stock returns

Goal

This study aims to build a cyber risk factor based on the 10-K disclosures of public firms and show that this factor is robust to all factors' benchmarks.

Abstract

We extract firms’ cyber risk with a machine learning algorithm measuring the proximity between their disclosures and a dedicated cyber corpus. Our approach outperforms dictionary methods, uses full disclosure and not devoted-only sections, and generates a cyber risk measure uncorrelated with other firms’ characteristics. We find that a portfolio of US-listed stocks in the high cyber risk quantile generates an excess return of 18.72% p.a. Moreover, a long-short cyber risk portfolio has a significant and positive risk premium of 6.93% p.a., robust to all factors’ benchmarks. Finally, using a Bayesian asset pricing method, we show that our cyber risk factor is the essential feature that allows any multi-factor model to price the cross-section of stock returns.


Documents

For more information please refer to:


Code

The code is a mix of notebooks and a py file:

  • the py file contains the functions needed in the notebooks;
  • the notebooks explain all the steps.

Data:

10-K statements from SEC EDGAR.

Cyber corpus from MITRE ATTACK.

Stock information from WRDS.


Files description:

Short description of the files:

File name Short Description
function_definitions.py defines of all the functions used in the other files
Reproduce_Florackis.ipynb reproduces the work of Florackis, Michaely and Weber
Project_data_acquisition.ipynb downloads and merges data of stock returns, stock characteristics and 10-k statements
Project_doc2vec.ipynb trains doc2vec models, computes the vector representation of the 10-k statements and the corresponding cyber risk scores
Project_analysis1.ipynb displays properties of the cyber risk scores, performs portfolio sorts and robustness tests
Project_analysis2.ipynb performs Fama-Macbeth regressions, Bayesian factor model selection and instrumented principal component analysis
Project_analysis3.ipynb compares my cyber risk measure to the one of Florackis, Michaely and Weber and performs double sorts
Project_analysis4.ipynb displays properties of the cyber risk scores, performs portfolio sorts and robustness tests using a cyber risk score built on mean similarities
Project_analysis5.ipynb displays properties of the cyber risk scores, performs portfolio sorts and robustness tests using a cyber risk score that allows negative similarities
Project_analysis6.ipynb displays properties of the cyber risk scores, performs portfolio sorts and robustness tests using a cyber risk score built on only non Item 1A paragraphs
Project_analysis7.ipynb displays properties of the cyber risk scores, performs portfolio sorts and robustness tests using a cyber risk score built on only Item 1A paragraphs
tests/... folder containing test files (testing BERT, doc2vec,...)

Each file contains more details and comments.


Hints of bibliography:

Please find the complete list on the bibliography of the master thesis.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •