Awesome Public Industrial Datasets

Below is a list of public datasets from industry, each with a short description. Clicking a link takes you to that dataset's description page. The list and its descriptions are updated periodically.

Tags

  • Sector

  • Label

  • Time-series

  • Miscellaneous

  • Simulation

List of Datasets

Semicon

Chemical

  • Gas Sensor Array Drift: This archive contains 13910 measurements from 16 chemical sensors exposed to 6 different gases at various concentration levels.
  • Chemical Detection Platform: The dataset contains 18000 time-series recordings from a chemical detection platform at six different locations in a wind tunnel facility in response to ten high-priority chemical gaseous substances.
  • Dynamic Gas Mixtures: The data set contains the recordings of 16 chemical sensors exposed to two dynamic gas mixtures at varying concentrations. For each mixture, signals were acquired continuously for 12 hours.

Mechanical

Steel

Power

  • Appliance Energy: Experimental data used to create regression models of appliance energy use in a low-energy building.

  • Combined Cycle Power Plant: Data points collected from a combined cycle power plant over 6 years.

  • GREEND: GREEND is an energy dataset containing power measurements collected from multiple households in Austria and Italy. It provides detailed energy profiles on a per-device basis with a sampling rate of 1 Hz.

  • ECO (Electricity Consumption & Occupancy): The ECO data set is a comprehensive data set for non-intrusive load monitoring and occupancy detection research.

  • UK-DALE dataset: This dataset records the power demand from five houses. In each house, both the whole-house mains power demand and the power demand from individual appliances are recorded every six seconds. In three of the five houses (houses 1, 2 and 5), the whole-house voltage and current are also recorded at 16 kHz.

  • BLUED dataset: The dataset consists of voltage and current measurements for a single-family residence in the United States, sampled at 12 kHz for a whole week.

  • REDD: A Public Data Set for Energy Disaggregation Research: A freely available data set containing detailed power usage information from several homes, aimed at furthering research on energy disaggregation (the task of determining the component appliance contributions from an aggregated electricity signal).
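
To make the energy disaggregation task mentioned above concrete, here is a minimal toy sketch in Python. It is not taken from REDD or any other dataset listed here: the appliance names, their power ratings, and the `disaggregate` helper are all hypothetical, and the brute-force search over on/off combinations is only one simple baseline, assuming each appliance draws a fixed power when on.

```python
from itertools import product

# Hypothetical appliance power ratings in watts (illustrative only,
# not taken from any dataset in this list).
RATINGS = {"fridge": 120, "kettle": 2000, "tv": 90}


def disaggregate(total_watts, ratings=RATINGS):
    """Return the on/off assignment whose summed ratings best match total_watts."""
    names = list(ratings)
    best_states, best_error = None, float("inf")
    # Try every on/off combination; feasible only for a handful of appliances.
    for states in product((0, 1), repeat=len(names)):
        estimate = sum(on * ratings[name] for on, name in zip(states, names))
        error = abs(total_watts - estimate)
        if error < best_error:
            best_states, best_error = states, error
    return dict(zip(names, best_states))


# Made-up aggregated mains readings, one value per sampling interval.
mains = [125, 2110, 215, 0]
for reading in mains:
    print(reading, disaggregate(reading))
```

Real household data is noisier and appliances have multiple power states, so methods evaluated on these datasets are considerably more involved than this sketch.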

Battery

Etc

  • Hill-Valley: This is NOT a manufacturing dataset, but it looks useful for testing pattern detection methods.

  • APS System Failures: The dataset's positive class consists of component failures for a specific component of the APS system. The negative class consists of trucks with failures for components not related to the APS.

Contributors

About MakinaRocks

http://www.makinarocks.ai/