/Graduating-Programs-Brazil

This repository aims to show the Brazilian Graduating Courses and the method to get and organize these information.

Primary LanguageRGNU General Public License v3.0GPL-3.0

GitHub commit activity GitHub last commit GitHub repo size

Summary

This project aims to share the current state and location of Brazilian Graduating Programs in Computer Science and related areas.

Motivation

The purpose is to realize an exploratory analysis of available data about Computing Programs and generate meaningful visualizations.

The project is separated in three stages:

  • Stage one: get the data from pdf and save into a dataframe and .csv file.
  • Stage two: validate and integrate new data to generate maps with Brazilian computer graduating programs. Check here to see only the maps.
  • Stage three: using shiny to provide an interactive interface for the users.

Tools

I used R language and RStudio IDE to perform all analysis.

What is data about?

The data is about Brazilian Computing Graduating Programs, including universities names, research topics, among others.

What data is available?

  • First analysis is based on this document made by CAPES, the Brazilian organization responsible for Higher Education Programs evaluation.

  • Sucupira Website. contains information about Brazilian Computing Programs.

  • From here, we got Brazilian states and cities information.

  • Shapefiles used was got from here. You can get from here too.

How to use?

You can clone/download/fork the project and use for your own goals.

  • R: all source code used for this analysis;
  • SQL: code to create a database to store all these data;
  • csv: source and manually processed data;
  • images: has all images generated from this analysis;
  • pdf: document used in first analysis;
  • images: all images produced in this analysis;

About R folder:

  • all_in_one.R: full code of a shinyapp produced;
  • data_integration_visualization.R: code of second stage (maps and data analysis);
  • get_data.R: code to generate data used for shinyapp;
  • pdf_reading.R: code of first stage (PDF data extraction);
  • shinyapp folder: shiny app ready to use, that is the third stage;

Contribute

Everyone is free and encouraged to contribute with this project through Pull Requests, to keep all this data up to date with Sucupira website. Anyone who wants to help, leave a message and let's work together.

Soon a guideline will be provided.

Live Demo

Data Science Broon Shinyapp

License

GNU General Public License v3.0.