dataintegration

There are 21 repositories under the dataintegration topic.

  • mansik95/IMDB-Analysis

    This repository contains analysis of IMDB data from multiple sources and analysis of movies/cast/box office revenues, movie brands and franchises.

    Language:TSQL300015
  • reedelk-runtime

    codecentric/reedelk-runtime

    Reedelk Runtime Platform Community Edition

    Language:Java28737
  • RushikeshShinde14/Hospital-Database-Management-System-SQL-Project

    Hospital Database Management System (DBMS) is a comprehensive SQL project designed to streamline and optimize the management of hospital operations. This project aims to provide an efficient and user-friendly solution for storing, retrieving, and manipulating various types of healthcare-related data.

  • GauravPandeyLab/ensemble_integration

    Integrating multimodal data through heterogeneous ensembles

    Language:Python2205
  • george-mountain/Data-Extraction-Integration-and-Analysis---Clustering-Operations

    This repository documents a project that walks step by step through scraping data, integrating data from various sources, and analysing the combined data. It also shows how APIs can be harnessed for data engineering operations. In this project, the Foursquare API was used for the location data.

    Language:Jupyter Notebook2200
  • saifsafsf/IMDb-Data-Integration-with-Talend

    Uses Rapid API to fetch IMDb data, filters it, and uploads it to different tables in a MySQL database in one click using Talend.

    Language:Python2100
  • camara94/introduction-to-data-engineering

    Describe the different entities that form a modern data ecosystem. Describe and differentiate between the role and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering lifecycle. Describe what a day in the life of a Data Engineer looks like.

  • doshiharmish/Driving-Customer-Insights-Post-Merger

    Project involves merging customer reviews from Fudgemart and FudgeFlix to create a unified data warehouse using Kimball's approach. Utilizing Power BI, it aims to extract actionable insights for Fudge Inc., guiding strategic decisions, product enhancements, and market expansion based on comprehensive business intelligence.

    Language:TSQL1100
  • farrmt/IDM

    Farr, M. T., D. S. Green, K. E. Holekamp, and E. F. Zipkin. 2020. Integrating distance sampling and presence-only data to estimate species abundance. Ecology 00(00):e03204. 10.1002/ecy.3204

    Language:R1203
  • ianthropos88/Enterprise_Data_Architecture

    The pragmatic technology journey for an Enterprise Data Model serving reporting, analytical, advanced data science and other digital use cases with integrated data from a variety of sources.

  • Uniquenetra/ml-based-ontology-matching

    A project to enhance ontology matching accuracy using Large Language Models (LLMs) like S-BERT.

    Language:Jupyter Notebook1100
  • adeelnasir0405/Data-Integration-with-Talend

    Integrates data from "Orderline.csv" and "Product.csv" using Talend, filtering on price and performing inner and left joins to extract insights and support data warehouse integration with Microsoft SQL Server.

  • BHPepper/HormoneTherapy-DSS-BreastCancer

    Hormone Therapy Decision Support System for Breast Cancer

    Language:R0100
  • pkorat/Extract.Transform.Load-Life_Expectancy

    An implementation of the data integration process Extract, Transform, Load (ETL)

    Language:Jupyter Notebook0000
  • thedummyprogrammer/TDP.Robot

    Software for automating tasks, monitoring, and data integration on Windows systems through a graphical interface.

    Language:C#0100
  • ZG3Z/BTS-Weather-Clustering

    Language:Jupyter Notebook0100
  • CJ-Nieto/Hack-para-la-gestion-del-conocimiento-entre-agencias-espanol

    Project for the Microsoft Innovation Challenge Hackathon, using public data to improve knowledge management in global health. We facilitate inter-institutional collaboration and evidence-based decisions among agencies, companies, and organizations.

    Language:Python00
  • Kush-Trivedi/Logistic-Regression-K-NN-for-Heart-Attack

    Uses various predictor factors to forecast heart disease in patients, applying logistic regression and K-nearest neighbors to build a model that predicts whether a patient has heart disease, followed by some basic visualizations.

    Language:R10
  • tashi-2004/Global-Ecommerce-Retail-Trends-Analysis

    The Global E-commerce & Retail Analysis project involves data preprocessing, dimensionality reduction with PCA 📉, CLV calculation, and What-If analysis. Key insights include effective PCA for data reduction, detailed CLV analysis across segments, and the impact of pricing strategies on sales.

    Language:Jupyter Notebook
  • zsomborjoel/python-data

    Data integration and other data related programs

    Language:Python10