-- First, set the parameters (input file, number of clusters) in main.py
-- To run: python main.py
-- Final output file format: clusterlabel truelabel text
-- Output file location: data/stackoverflow/traintest
-- Optionally, run generate_initial_labels.py to generate initial labels for a dataset (not mandatory)
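
To inspect the output, something like the following sketch may help. It is not part of this repository. It assumes the three fields (clusterlabel, truelabel, text) are separated by whitespace and that the text is everything after the first two fields; if the file actually uses another delimiter, the split has to be adjusted. The helper names parse_line, cluster_sizes and purity are hypothetical, and the output path is passed on the command line rather than hard-coded.

```python
import sys
from collections import Counter, defaultdict


def parse_line(line):
    """Split one output line into (cluster_label, true_label, text).

    Assumption: fields are whitespace-separated, and the text is
    everything that follows the first two fields.
    """
    parts = line.strip().split(None, 2)
    if len(parts) < 2:
        raise ValueError("expected 'clusterlabel truelabel text', got: %r" % line)
    text = parts[2] if len(parts) > 2 else ""
    return parts[0], parts[1], text


def cluster_sizes(lines):
    """Count how many texts were assigned to each cluster label."""
    return Counter(parse_line(line)[0] for line in lines if line.strip())


def purity(lines):
    """Fraction of texts whose true label is the majority label of their cluster."""
    per_cluster = defaultdict(Counter)
    total = 0
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        cluster_label, true_label, _ = parse_line(line)
        per_cluster[cluster_label][true_label] += 1
        total += 1
    if total == 0:
        return 0.0
    majority = sum(max(counts.values()) for counts in per_cluster.values())
    return majority / total


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage (hypothetical script name): python inspect_output.py <output file>
    with open(sys.argv[1], encoding="utf-8") as handle:
        rows = handle.readlines()
    print("cluster sizes:", dict(cluster_sizes(rows)))
    print("purity:", purity(rows))
```

Purity is used here only as a quick sanity check on the clustering; it is not necessarily the metric that main.py reports.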