Pinned Repositories
Spark
Spark Club
AirDataX
AirDataX-3.0
Chinese
Tools and resources for Chinese texts preprocessing. Validated in two papers, one CCF C, EI indexing and one CCF B, SCI indexing.
clickhouse-
confluent
DataX
DataX 是阿里巴巴集团内被广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、HDFS、Hive、OceanBase、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
esun-flumeng-kafka-plugin
newly plugin for flume2kafka support offset control
flumeng-kafka-plugin
flumeng-kafka-plugin
kafka-connect-storage-common
Shared software among connectors that target distributed filesystems and cloud storage.
500com's Repositories
500com/clickhouse-
500com/Chinese
Tools and resources for Chinese texts preprocessing. Validated in two papers, one CCF C, EI indexing and one CCF B, SCI indexing.
500com/AirDataX-3.0
500com/kafka-connect-storage-common
Shared software among connectors that target distributed filesystems and cloud storage.
500com/confluent
500com/esun-flumeng-kafka-plugin
newly plugin for flume2kafka support offset control
500com/flumeng-kafka-plugin
flumeng-kafka-plugin
500com/sparklint
A tool for monitoring and tuning Spark jobs for efficiency.
500com/AirDataX
500com/sparkoscope
Enabling Spark Optimization through Cross-stack Monitoring and Visualization
500com/Spark
Spark Club
500com/DataX
DataX 是阿里巴巴集团内被广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、HDFS、Hive、OceanBase、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
500com/kafka-exactly-once