spancer
Big data practitioner, data architect of the smart factory. Expert in big data architecture, search engine, big data analysis, agile development.
changsha
Pinned Repositories
bigdata-docker-builds
Docker images for building hadoop3.2, hive 3.1, hbase2.3, presto 0.247, flink1.11.3 on yarn, etc.
bigdata-docker-compose
Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.
CS-Notes
:books: 技术面试必备基础知识、Leetcode 题解、Java、C++、Python、后端面试、操作系统、计算机网络、系统设计
elasticlake
open source data lake build on top of apache iceberg
elasticsearch-ansj-analysis-plugin
ansj analysis elasticsearch plugin
FiboRulex
FiboRulex - 实时AI智能决策引擎、规则引擎、风控引擎、数据流引擎。 通过可视化界面进行规则配置,无需繁琐开发,节约人力,提升效率,实时监控,减少错误率,随时调整; 支持规则集、评分卡、决策树,名单库管理、机器学习模型、三方数据接入、定制化开发等;
flink-es-demo
基于ES快速实现车辆碰撞分析、套牌车分析、尾随分析。
flink-iceberg-demo
flink iceberg integration tests, jobs running on yarn.
prestodb-hbase-connector
prestodb hbase connector, using zookeepr to hold the metadata.
zeus
Zeus is an open-source, analytical engine for big data hold in data lake; it was designed to provide OLAP (Online Analytical Processing) capability in the big data era. You can use Zeus to store, query, analysis, and manage data.
spancer's Repositories
spancer/bigdata-docker-compose
Deploy bigdata platform using docker compose. Big data components include hadoop, hive, hbase, presto, flink, es, kafka, etc.
spancer/awesome-quant
**的Quant相关资源索引
spancer/CF-Workers-docker.io
这个项目是一个基于 Cloudflare Workers 的 Docker 镜像代理工具。它能够中转对 Docker 官方镜像仓库的请求,解决一些访问限制和加速访问的问题。
spancer/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
spancer/ChatGPT-Midjourney
🍭 一键拥有你自己的 ChatGPT+Midjourney 网页服务 | Own your own ChatGPT+Midjourney web service with one click
spancer/ChatGPT-Next-Web
一键拥有你自己的 ChatGPT 网页服务。 One-Click to deploy your own ChatGPT web UI.
spancer/chatgpt-on-wechat
Wechat robot based on ChatGPT, which using OpenAI api and itchat library. 使用ChatGPT搭建微信聊天机器人,基于 GPT3.5/GPT4.0/Claude/文心一言/讯飞星火/LinkAI,支持个人微信、公众号、企业微信部署,能处理文本、语音和图片,访问操作系统和互联网,支持基于知识库定制专属机器人。
spancer/dagster-playground
Data Engineering Stack with Dagster, Trino, iceberg and jupyter.
spancer/Data-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
spancer/data-stack
modern data stack
spancer/ezdata
基于python开发的数据处理和任务调度系统。 支持数据源管理,数据模型管理,数据集成,数据查询API接口封装,低代码自定义数据处理任务模版,单任务及dag任务工作流调度等功能。集成了数据大屏系统实现数据可视化。集成了chatgpt等llm模块实现了数据对话问答,交互式数据分析功能。
spancer/free-for-dev
A list of SaaS, PaaS and IaaS offerings that have free tiers of interest to devops and infradev
spancer/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
spancer/llama
Inference code for LLaMA models
spancer/metahuman_overview
数字人资料整理
spancer/MLOps1
Master Thesis Project - Open Source MLOps: How to Unlock the Potential of Machine Learning
spancer/modern-data-stack-docker
modern data stack in docker compose
spancer/ollama
Get up and running with Llama 2, Mistral, Gemma, and other large language models.
spancer/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
spancer/OpenGPT-4o
OpenGPT 4o is a fee alternative to OpenAI GPT 4o
spancer/pygwalker
PyGWalker: Turn your pandas dataframe into a Tableau-style User Interface for visual analysis
spancer/QGIS
QGIS is a free, open source, cross platform (lin/win/mac) geographical information system (GIS)
spancer/stock
stock股票系统.爬取stock股票关键数据,计算stock股票各种指标,识别stock股票K线形态,内置多种stock股票策略,支持stock股票验证回测及stock股票自动交易,是量化投资工具。captures key daily data of stocks, calculates various stock indicators, K-line pattern recognition, has a variety of built-in stock selection strategies, stock selection verification back test, Automated Trading. quantitative investment tool.
spancer/tools-Auto_Mac_Author
爬取b站热榜,人工智能写文案,自动生成复数麦克阿瑟视频 next_step: 字幕自动换行+输出文案副本
spancer/tools-gen-txt-to-image
一款文生视频应用,用于小说推文,生成漫画等视频。使用主流大模型,结合Stable Diffusion,实现文生图,图生视频本地化私有部署。
spancer/tools-video-subtitle-remover
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
spancer/win11-
CloudMoe Windows 10/11 Activation Toolkit get digital license, the best open source Win 10/11 activator in GitHub. GitHub 上最棒的开源 Win10/Win11 数字权利(数字许可证)激活工具!
spancer/WindTerm
A professional cross-platform SSH/Sftp/Shell/Telnet/Serial terminal.
spancer/Youtube-ETL-Pipeline
💜🌈📊 A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Docker 🌺
spancer/zero_nlp
中文nlp解决方案(大模型、数据、模型、训练、推理)