
Database comparison platform: compares Hive table data and MySQL data through automatic configuration, so you do not have to keep writing SQL by hand

Primary language: Java. License: Apache License 2.0 (Apache-2.0)

dataCompare

EN doc CN doc

Introduction

dataCompare is a database comparison platform. It supports data comparison for Hive tables as well as MySQL and Doris, and it configures comparisons automatically so you do not have to keep writing SQL by hand. Support for CK (ClickHouse) and other engines is under consideration

image

Features

(1) Low-code configuration: compare data tables without cumbersome SQL development

(2) Currently supported functions: magnitude (row count) comparison, consistency comparison, and difference case discovery. MySQL, Hive, and Doris are already supported

(3) Planned support: indicator detection for unfamiliar tables, including enumeration value detection, range value detection, and primary key ID hash detection
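To make the comparison types above more concrete, here is a minimal sketch of the kind of SQL a magnitude comparison and a difference case discovery could boil down to. It is not taken from the dataCompare source: the class name ComparisonSqlSketch, its method names, and the example table and column names are all hypothetical, and it assumes the null-safe `<=>` operator, which MySQL and Hive both provide.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper, not part of dataCompare: builds illustrative comparison SQL. */
final class ComparisonSqlSketch {

    /** Magnitude comparison: fetch the row counts of both tables in one query. */
    static String rowCountSql(String left, String right) {
        return "SELECT (SELECT COUNT(*) FROM " + left + ") AS left_cnt, "
             + "(SELECT COUNT(*) FROM " + right + ") AS right_cnt";
    }

    /**
     * Difference case discovery: rows of the left table whose key is missing
     * from the right table, or whose compared columns differ.
     * Identifiers are concatenated directly, so they must come from trusted
     * configuration and never from raw user input.
     */
    static String diffSql(String left, String right, String key, List<String> columns) {
        StringBuilder where = new StringBuilder("b." + key + " IS NULL");
        for (String c : columns) {
            // <=> is null-safe equality, so NULL vs NULL counts as equal
            where.append(" OR NOT (a.").append(c).append(" <=> b.").append(c).append(")");
        }
        return "SELECT a.* FROM " + left + " a LEFT JOIN " + right + " b"
             + " ON a." + key + " = b." + key + " WHERE " + where;
    }

    public static void main(String[] args) {
        // Example table and column names are assumptions for illustration only
        System.out.println(rowCountSql("orders_src", "orders_dst"));
        System.out.println(diffSql("orders_src", "orders_dst", "id",
                Arrays.asList("name", "amount")));
    }
}
```

A consistency comparison can then be read as "the difference query returns no rows in either direction", which is why the sketch only needs these two building blocks.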

Software Architecture

image

Technology stack:

Backend: Spring Boot + MyBatis

Database: MySQL

Parsing Engine: ANTLR

Big Data: Hive, Spark

System flowchart


img_1.png

img.png

Demonstration of system functionality

Home image

DbConfig

MySQL config

image

Hive config

image

job config image

Comparison results image

image


System running environment

Java JDK 8

MySQL 5.7.36

Running configuration

(1) Run the SQL files in the SQL directory against your database to create the database and tables

(2) Download the release jar (https://github.com/zhugezifang/dataCompare/releases) or build the jar from the project source code

(3) Edit the database configuration in application.yml

(4) Run java -jar -Dspring.config.location=application.yml dataCompare.jar (application.yml and the jar must be in the same directory)

(5) Visit http://127.0.0.1/ (UserName: admin, PassWord: admin123)
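Because step (4) only works when application.yml and the jar sit in the same directory, a small pre-flight check can save a confusing startup failure. The sketch below is an illustration, not part of dataCompare: the class LaunchSketch and its method are hypothetical, and it assumes the jar is named dataCompare.jar exactly as in the command above.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;

/** Hypothetical pre-flight helper, not part of dataCompare. */
final class LaunchSketch {

    /**
     * Verifies that dataCompare.jar and application.yml are both present in
     * the given directory, then returns the launch command from step (4).
     */
    static List<String> launchCommand(Path dir) {
        Path jar = dir.resolve("dataCompare.jar");
        Path config = dir.resolve("application.yml");
        if (!Files.isRegularFile(jar) || !Files.isRegularFile(config)) {
            throw new IllegalStateException(
                "dataCompare.jar and application.yml must both be in " + dir);
        }
        // Same arguments as the documented command; run with `dir` as the working directory
        return Arrays.asList("java", "-jar",
                "-Dspring.config.location=application.yml", "dataCompare.jar");
    }
}
```

The returned list could be handed to new ProcessBuilder(launchCommand(dir)).directory(dir.toFile()).inheritIO().start(), so that the relative file names resolve against the directory that was just checked.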

Environment installation configuration

(1) To configure Hive data comparison, install a Hive environment first (for installation, see this guide to quickly setting up a Hive environment with Docker: https://blog.csdn.net/ifenggege/article/details/107860477)

(2) After installation, when creating a new data source connection, select Hive and enter the address jdbc:hive2://ip:10000
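The address in step (2) follows the standard HiveServer2 JDBC form jdbc:hive2://host:port, optionally followed by /database. As a rough sketch of how that address is put together, the helper below builds it from its parts; the class HiveUrlSketch and its method are hypothetical, and the host value in the example is a placeholder, not a real server.

```java
/** Hypothetical helper, not part of dataCompare: assembles a HiveServer2 JDBC address. */
final class HiveUrlSketch {

    /** Builds jdbc:hive2://host:port, appending /database when one is given. */
    static String hiveJdbcUrl(String host, int port, String database) {
        if (host == null || host.isEmpty()) {
            throw new IllegalArgumentException("host must not be empty");
        }
        if (port < 1 || port > 65535) {
            throw new IllegalArgumentException("port out of range: " + port);
        }
        String suffix = (database == null || database.isEmpty()) ? "" : "/" + database;
        return "jdbc:hive2://" + host + ":" + port + suffix;
    }

    public static void main(String[] args) {
        // 10000 is the port used in the address above; the host is a placeholder
        System.out.println(hiveJdbcUrl("192.168.1.10", 10000, ""));
        System.out.println(hiveJdbcUrl("192.168.1.10", 10000, "default"));
    }
}
```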

Technical Communication

image

Thanks

Thanks to ruoyi for providing the front-end services