Simple Data Visualization and Exploratory Data Analysis of the Shapley Galaxy dataset using Python libraries such as numpy, pandas, matplotlib, seaborn and astropy.
The Shapley Supercluster or Shapley Concentration (SCl 124) is the largest concentration of galaxies in our nearby universe that forms a gravitationally interacting unit, thereby pulling itself together instead of expanding with the universe. It appears as a striking overdensity in the distribution of galaxies in the constellation of Centaurus. It is 650 million light-years away.
The dataset has the following columns:
- R.A. = Right ascension: Coordinate in the sky similar to longitude on Earth, 0 to 360 degrees
- Dec. = Declination: Coordinate in the sky similar to latitude on Earth, -90 to +90 degrees
- Mag = Magnitude: An inverted logarithmic measure of galaxy brightness in the optical band. A Mag=17 galaxy is 100-times fainter than a Mag=12 galaxy. Value is missing for some galaxies.
- V = Velocity: Speed of the galaxy moving away from Earth, after various corrections are applied
- SigV = Sigma of velocity: Heteroscedastic measurement error known for each individual velocity mesurement
###Using Astronomical Coordinate System the galaxies are plotted against Right ascension and Declination, which in result shows their position in the sky.
###To map the location of galaxy, we need to plot the cartesian space coordinate from the equatorial coordinate. This one formulated from the galactic coordinate, known distance, galactic longitude, and galactic latitude. Then we calculate the redshift and distance of the galaxies and add their respective columns in the existing dataframe.
###After we know the cartesian coordinate of galaxies, we could plot it in space. Zero indicates our galaxy (exactly our sun, which is negligible distance in mpc scale).
###Redshift and Distance Distrubution