/Minor-Data-Analysis

This repository deals with analyzing data from a CSV file, using 'pandas library'.

Minor-Data-Analysis

The following csv data file contains data of diamonds with following characteristics. "diamonds.csv"

carat - diamond's apparent size. cut - how good the diamond shaping is . color - colour of a diamond as letter representation. clarity - clarity of diamond. depth - depth of diamond. table - top surface of diamond. price - price of diamond. x - x dimension. y - y dimension. z - z dimension.

Read the file and find the following:

  1. Drop the unnamed column from the dataset once you read it.
  2. The occurences of all unique colors of diamonds, bar chart of the same.
  3. Average price of each cut of diamond.
  4. Find the volume of every diamond and store it in a new column in the dataset. (volume = xyz)
  5. Histograms of Price and depths colums as two separate plots
  6. A scatter plot of carat vs Price, volume vs Price, depth vs Price in a subplot of 1x3.
  7. Convert the categorical values of color, to numbers. i.e D, E, F, H, I, J to 0, 1, 2, 3, 4, 5 using Pandas methods only
  8. Which clarity(i.e the category) diamond has maximum mean depth?
  9. Find the diamond(i.e row) with minimum price per volume.