Tokyo-Azure-Spark

This project uses Azure, Apache Spark, and Python to process and analyze Olympic data. Below are the main components and technologies used in the project:

Technologies Used

Azure: Microsoft's cloud platform used for data storage and processing.
Apache Spark: A unified analytics engine for processing large volumes of data.
Python: The programming language used to write data processing and analysis scripts.

Project Description

The goal of this project is to process and analyze Olympic data to extract valuable information about athletes, their coaches, teams, and events. The data is stored in Azure and processed using Apache Spark to efficiently handle large volumes of data.

Project Structure

CSVs/: Contains all the CSV files with Olympic data.
- Athletes.csv
- Coaches.csv
- EntriesGender.csv
- Medals.csv
- Teams.csv
tokyo_olympic.ipynb: The Jupyter notebook containing the data processing and analysis scripts.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.ipynb_checkpoints		.ipynb_checkpoints
CSVs		CSVs
README.md		README.md
ad		ad
tokyo_olympic.ipynb		tokyo_olympic.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tokyo-Azure-Spark

Technologies Used

Project Description

Project Structure

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Tokyo-Azure-Spark

Technologies Used

Project Description

Project Structure

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages