Similarity script is for searching duplicates in master data. The script use Levenshtein distance to find similar names.
INSTALLATION
Create csv files for compare (replace the names in conf JSON ("duplicates.csv") for file to load and ("result.csv") for result csv)
If neccessary, the additional file can be added, important for intergrating data set isAddFileToCheck = True and replace "additional.csv" for file name