/ReCo2

Statistical Comparison of Renewable Energy Production and Levels of CO2 produced; performed with R, using data from Our World in Data.

Primary LanguageRMIT LicenseMIT

Renewable Energy and Environmental Footprint

Investigating the Reverse Correlation between Renewable Energy Production and Global Carbon Emissions: A Statistical Analysis of CO2 Levels


Table of Contents

Summary

This research project aims to conduct a comprehensive statistical analysis of the correlation between renewable energy production and the environmental footprint, with a specific focus on CO2 emissions. Key features of the study include:

  • Analyzing the relationship between renewable energy production and CO2 emissions
  • Utilizing a global dataset from "Our World in Data" to provide a broad perspective
  • Conducting pre-processing on the collected data to ensure quality and consistency
  • Performing statistical analysis using R software, including descriptive statistics and the Pearson Coefficient
  • Visualizing the results through linear charts

Data Collection

To conduct this analysis, a comprehensive dataset was collected, encompassing information on renewable energy production and CO2 emissions over a specified timeframe. The datasets include data from the "Our World in Data" organization1, covering a range of countries and regions, allowing for a global perspective on the topic.

Data Processing

Before performing the statistical analysis, the collected data underwent a pre-processing process to ensure its quality and consistency. The following steps were taken:

  • Handling missing values
  • Removing outliers
  • Standardizing the data format
  • Data transformation, including aggregating early data and converting units

Two subsets were created: "filtered_owid" for CO2 levels and "filtered_energy" for the TWh of energy produced in a specific year.

Statistical Analysis

The statistical analysis consisted of several steps, starting from descriptive statistics to gain a general understanding of the dataset. R software and libraries were used for the analysis, including:

These tools allowed for a comprehensive comparison and visualization of the datasets, providing an overview of the phenomenon under study. Linear charts were used to facilitate the comparison of the resulting values.

Additionally, the Pearson Coefficient was computed to measure the correlation between renewable energy production and CO2 emissions. The Pearson Coefficient is a statistical measure that quantifies the linear relationship between two variables. A result of -0.8866505 suggests a strong inverse correlation between renewable energy production and CO2 emissions2.

References

Papers and Articles

  • Umair, M., Yousuf, M.U., Ul-Haq, J., Hussain, Z. & Visas, H. 2023, "Revisiting the environmental impact of renewable energy, non-renewable energy, remittances, and economic growth: CO2 emissions versus ecological footprint for top remittance-receiving countries," Environmental Science and Pollution Research, vol. 30, no. 23, pp. 63565-63579. Read more

R Software Manuals & Libraries

Datasets

Authors

This statistical study was conducted by Francesco Silvano and Giacomo Noghera. The data and libraries used belong to their respective owners.

License

This project is licensed under the MIT License. See the LICENSE file for more information.

Footnotes

  1. https://ourworldindata.org

  2. https://articles.outlier.org/pearson-correlation-coefficient