This project contains code for column name recommender for .csv and .nc file
The process contains three steps:
-
dataset raw files are analyzed and generating
columns_linked_file.pkl -
column name and min/max values are access
-
creating graph of names are values
-
giving a .csv/.nc file to get unknown column recommendation,
s3_check_file.pycontains the example