--at first change the parameters (input file, number of clusters) in the main.py
--to run: python main.py
--final output file format:clusterlabel	truelabel	text 
--output file location: data/stackoverflow/traintest
--You can run generate_initial_labels.py to generate initial labels for a dataset (not mandatory)