Enhancing the performance of high-dimensional automatic data clustering on stock market data using a Particle Swarm Optimization (PSO) algorithm combined with an autoencoder.

Primary language: Python

PSO-Kmeans Hybrid for High Dimensional Data Clustering with Autoencoder

  1. Design and develop a PSO algorithm for automatic data clustering.
  2. Design and develop a PSO algorithm that employs an autoencoder for data clustering.
  3. Compare the performance of the plain PSO and the autoencoder-based PSO clustering algorithms using different validity indices.
  4. Apply the algorithm to stock market data and draw inferences from the results.
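
The first two objectives can be sketched roughly as follows. This is a minimal illustration, not the repository's implementation: it assumes a fixed number of clusters `k` (automatic selection of `k` is not shown), uses the mean distance to the nearest centroid as the fitness function, and takes conventional PSO coefficients (`w`, `c1`, `c2`). The function names `pso_centroids` and `kmeans_refine` are hypothetical, and the synthetic array `Z` stands in for the latent features an autoencoder would produce.

```python
import numpy as np


def distances_to_centroids(X, centroids):
    """Return the (n_samples, k) matrix of Euclidean distances to each centroid."""
    return np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)


def quantization_error(X, centroids):
    """Fitness (assumed): mean distance from each point to its nearest centroid."""
    return distances_to_centroids(X, centroids).min(axis=1).mean()


def pso_centroids(X, k, n_particles=20, n_iter=100, w=0.72, c1=1.49, c2=1.49, seed=0):
    """Search for k centroids with a basic global-best PSO (k is fixed here)."""
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    # Each particle encodes k centroids, initialised from random data points.
    pos = X[rng.integers(0, len(X), size=(n_particles, k))]  # (n_particles, k, dim)
    vel = np.zeros_like(pos)
    pbest = pos.copy()
    pbest_fit = np.array([quantization_error(X, p) for p in pos])
    best = pbest_fit.argmin()
    gbest, gbest_fit = pbest[best].copy(), pbest_fit[best]

    for _ in range(n_iter):
        r1, r2 = rng.random(pos.shape), rng.random(pos.shape)
        # Standard velocity update: inertia + cognitive pull + social pull.
        vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
        pos = pos + vel
        fit = np.array([quantization_error(X, p) for p in pos])
        improved = fit < pbest_fit
        pbest[improved] = pos[improved]
        pbest_fit[improved] = fit[improved]
        best = pbest_fit.argmin()
        if pbest_fit[best] < gbest_fit:
            gbest, gbest_fit = pbest[best].copy(), pbest_fit[best]
    return gbest


def kmeans_refine(X, centroids, n_iter=10):
    """Polish PSO centroids with a few Lloyd (K-Means) iterations."""
    X = np.asarray(X, dtype=float)
    centroids = centroids.copy()
    for _ in range(n_iter):
        labels = distances_to_centroids(X, centroids).argmin(axis=1)
        for j in range(len(centroids)):
            members = X[labels == j]
            if len(members):  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(axis=0)
    labels = distances_to_centroids(X, centroids).argmin(axis=1)
    return centroids, labels


# Example usage on synthetic data standing in for autoencoder latent features.
rng = np.random.default_rng(42)
Z = np.vstack([rng.normal(0.0, 0.5, (50, 2)), rng.normal(10.0, 0.5, (50, 2))])
centroids, labels = kmeans_refine(Z, pso_centroids(Z, k=2))
print(labels[:5], labels[-5:])
```

Running PSO first and K-Means second is one common way to build such a hybrid: the swarm explores the centroid space globally, and the Lloyd iterations then converge quickly from the swarm's best position. Other orderings, such as seeding one particle with a K-Means solution, are equally plausible.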

Methodology

System Design

Results

Method     K-Means PSO                     K-Means PSO with Autoencoders
Dataset    DB Index    Silhouette Index    DB Index    Silhouette Index
High       0.99316     0.044056            0.499879    0.598376
Low        0.98635     0.079333            0.492837    0.694484
Close      0.98474     0.046373            0.474543    0.634368
Open       0.93643     0.056383            0.547732    0.745483
Volume     0.99736     0.043367            0.498746    0.648464

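A lower Davies-Bouldin (DB) Index and a higher Silhouette Index indicate better clustering. On every feature, adding the autoencoder lowers the DB Index (from roughly 0.94-1.00 to 0.47-0.55) and raises the Silhouette Index (from roughly 0.04-0.08 to 0.60-0.75).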
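
For reference, the two validity indices can be computed as in the sketch below. It is an illustrative NumPy version based on the standard definitions with Euclidean distance, not the code used to produce the table. The function names `davies_bouldin` and `silhouette` are hypothetical, and the four-point dataset is made up for the example. scikit-learn offers equivalent functions, `davies_bouldin_score` and `silhouette_score`, in `sklearn.metrics`.

```python
import numpy as np


def davies_bouldin(X, labels):
    """Davies-Bouldin index: mean over clusters of the worst scatter/separation ratio."""
    X, labels = np.asarray(X, dtype=float), np.asarray(labels)
    ids = np.unique(labels)
    centroids = np.array([X[labels == c].mean(axis=0) for c in ids])
    # Scatter of a cluster: mean distance of its members to its centroid.
    scatter = np.array([
        np.linalg.norm(X[labels == c] - centroids[i], axis=1).mean()
        for i, c in enumerate(ids)
    ])
    separation = np.linalg.norm(centroids[:, None] - centroids[None], axis=2)
    np.fill_diagonal(separation, np.inf)  # exclude a cluster's ratio with itself
    ratios = (scatter[:, None] + scatter[None, :]) / separation
    return ratios.max(axis=1).mean()


def silhouette(X, labels):
    """Mean silhouette coefficient; needs at least two clusters. O(n^2) memory."""
    X, labels = np.asarray(X, dtype=float), np.asarray(labels)
    ids = np.unique(labels)
    dist = np.linalg.norm(X[:, None] - X[None], axis=2)
    scores = np.zeros(len(X))
    for i in range(len(X)):
        own = labels == labels[i]
        n_own = own.sum()
        if n_own == 1:
            continue  # singleton clusters score 0 by convention
        a = dist[i, own].sum() / (n_own - 1)  # mean distance within own cluster
        b = min(dist[i, labels == c].mean() for c in ids if c != labels[i])
        scores[i] = (b - a) / max(a, b)
    return scores.mean()


# Toy example: two tight pairs of points that are far apart.
points = np.array([[0.0, 0.0], [0.0, 1.0], [10.0, 0.0], [10.0, 1.0]])
good = np.array([0, 0, 1, 1])  # groups each tight pair together
bad = np.array([0, 1, 0, 1])   # splits each tight pair across clusters
print(davies_bouldin(points, good), silhouette(points, good))
print(davies_bouldin(points, bad), silhouette(points, bad))
```

The `good` labelling should score a lower DB Index and a higher Silhouette Index than the `bad` one, which matches the direction of improvement described above.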

Conclusion

Combining PSO and K-Means with autoencoders has produced encouraging clustering results. Our study indicates that, on benchmark datasets such as Nifty 100, the method outperforms K-Means PSO without autoencoders on both validity indices reported above. One possible future direction is to investigate different autoencoder variants for dimensionality reduction. Another is to apply the PSO and K-Means algorithm to other, larger datasets. It would also be interesting to apply the method to practical problems in fields such as healthcare, banking, and image and video processing.