From 242c9db61b02ec0ce2605be89ced297677258a2e Mon Sep 17 00:00:00 2001 From: Vadim Liventsev Date: Thu, 3 Dec 2020 18:57:38 +0100 Subject: [PATCH] Add a kaggle dataset Signed-off-by: Vadim Liventsev --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 17dfffb..bb18b01 100644 --- a/README.md +++ b/README.md @@ -395,6 +395,7 @@ A curated list of awesome research papers, datasets and software projects devote - [NL2Bash](https://github.com/TellinaTool/nl2bash) - This dataset contains a set of ~10,000 bash one-liners collected from websites such as StackOverflow and their English descriptions written by Bash programmers, as described in the [paper](https://arxiv.org/abs/1802.08979). - [GitHub JavaScript Dump October 2016](https://archive.org/details/javascript-sources-oct2016.sqlite3) - Dataset consisting of 494,352 syntactically-valid JavaScript files obtained from the top ~10000 starred JavaScript repositories on GitHub, with licenses, and parsed ASTs. - [BigCloneBench](https://jeffsvajlenko.weebly.com/bigcloneeval.html) - Clone detection benchmark of 8 million function clone pairs in the IJaDataset. +- [Codechef Competitive Programming](https://www.kaggle.com/arjoonn/codechef-competitive-programming?select=questions.csv) - Kaggle dataset of competitive programming tasks with problem statements in English an solutions in various programming languages ## Credits