Pinned Repositories
aal-common-utils
一个公共的工具类项目
acid-file-formats
Code for Apache Hudi, Apache Iceberg and Delta Lake analysis
AdBlockerWebview
An ordinary webview that can block basic ads.
adblockplusandroid
Adblock Plus app for Android
AdMonitor
以对业务实现层最少侵入为原则,在SDK层实现对Android原生广告View的曝光监听上报、点击监听上报,做到业务层只需调用统一注册方法告知SDK层该View为广告控件,剩余功能逻辑由SDK层内部完成。
CN_POI_Data
抓取中国地区的所有POI数据,基于http://www.poi86.com/
DataX
DataX 是阿里巴巴集团内被广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、HDFS、Hive、OceanBase、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
kafkaAndspark-streaming
Apache Log file analysis
spark_sql-learning
学习 spark sql 的一些练习
SpringCloud-Learning
Spring Cloud教程
lyBigdata's Repositories
lyBigdata/Chat2DB
智能的通用数据库工具和SQL客户端(General-purpose database tools and SQL clients with AI (ChatGPT))
lyBigdata/chitu-sdp-1
赤兔实时计算平台是基于 Apache Flink 构建的企业级、一站式、高性能、低门槛大数据实时计算平台,广泛适用于流式数据应用开发场景。
lyBigdata/chitu-sdp-website
lyBigdata/crabc
Crabc是低代码开发平台,企业级API接口发布系统,采用SpringBoot、JWT、Mybatis等框架和SPI插件机制实现
lyBigdata/cube-studio
cube studio开源云原生一站式机器学习/深度学习AI平台,支持sso登录,多租户/多项目组,数据资产对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式算法训练,超参搜索,推理服务VGPU,多集群调度,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型一键微调,llmops,私有知识库,AI应用商店,支持模型一键开发/推理/微调,私有化部署,支持国产cpu/gpu芯片,支持rdma,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式
lyBigdata/DataLink
⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink
lyBigdata/dataService
dataService platform is a low-code platform, which only needs to write SQL to realize the development of API services, solve the unification of data services, facilitate the governance of data services, and unify the caliber of indicators. It can improve the development efficiency of business and face business changes faster
lyBigdata/datasophon
It is committed to rapidly implementing the deployment, management, monitoring and automatic operation and maintenance of the big data cloud native platform, helping you quickly build a stable, efficient, elastic and scalable big data cloud native platform.
lyBigdata/datatunnel
DataTunnel 是一个基于spark引擎的超高性能的分布式数据集成软件,支持海量数据的同步。基于spark extensions 扩展的DSL语法,结合的Spark SQL,更加便捷融入数仓 ETLT 过程中,简单易用。
lyBigdata/flink-catalog-in-jdbc
lyBigdata/flink-deployer
flink部署器,支持flink on yarn/k8s,基于Flink自带ClusterDescriptor的不同实现进行通用封装
lyBigdata/flink-http-connector
Http Connector for Apache Flink. Provides sources and sinks for Datastream , Table and SQL APIs.
lyBigdata/flink-jobserver
REST job server for Apache Flink
lyBigdata/flink-platform-backend
Define and schedule workflow, support Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.
lyBigdata/flink-submitter-api
this project can help you to submit,query,kill flink task by java api
lyBigdata/flink-yun
Streaming data analysis platform based on Flink(至流云-打造流数据分析平台)
lyBigdata/FreeAskInternet
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a multi engine search and combine the search result to LLM and generate the answer based on search results. It's all FREE to use.
lyBigdata/icebergManager
基于hdfs、iceberg、spark、flink做的一个iceberg管理客户端
lyBigdata/jiron-cloud
该项目整合了多款优秀的开源产品,构建了一个功能全面的数据开发平台。平台提供了强大的数据集成、数据开发、数据查询、数据服务、数据质量管理、工作流调度和元数据管理功能。#dinky #dolphinscheduler #datavines #flinkcdc #openmetadata #flink #数据开发 #数据平台 # 数据开发平台 #大数据
lyBigdata/jt808-server
JT808、JT808协议解析;支持TCP、UDP,实时兼容2011、2013、2019版本协议,支持分包。支持JT/T1078音视频协议,T/JSATL12苏标主动安全协议,T/GDRTA002粤标主动安全协议,支持Android客户端编解码。
lyBigdata/jts
The JTS Topology Suite is a Java library for creating and manipulating vector geometry.
lyBigdata/lightning-catalog
3rd party metastore opensourced by Zetaris for the preparing data in ad-hoc analytics, data pipeline and ML project
lyBigdata/ollama
Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.
lyBigdata/plugin-submitter-api
this project can help you to submit,query,kill flink or spark task by java
lyBigdata/ruoyi-tdesign
基于RuoYi-Vue-Plus的重构版本。UI后台管理系统使用TDesign;定期同步RuoYi-Vue-Plus功能.
lyBigdata/sedona
A cluster computing framework for processing large-scale geospatial data
lyBigdata/spark-clickhouse-connector
Spark ClickHouse Connector build on DataSourceV2 API
lyBigdata/spark-jobserver
REST job server for Apache Spark
lyBigdata/spark-yun
Big data computing platform based on Spark(至轻云-打造大数据计算平台)
lyBigdata/sqltool
一个提供动态结构化查询语言(DSQL)解析和执行的通用ORM框架(连接池在分布式环境适用),支持包括MySQL、PostgreSQL、Oracle、SQLServer在内的多种数据库。该仓库为镜像仓库,参与贡献请前往 https://gitee.com/tenmg/sqltool