dewenkong's Stars
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
xingshaocheng/architect-awesome
后端架构师技术图谱
fengdu78/Coursera-ML-AndrewNg-Notes
吴恩达老师的机器学习课程个人笔记
MLEveryday/100-Days-Of-ML-Code
100-Days-Of-ML-Code中文版
Vonng/ddia
《Designing Data-Intensive Application》DDIA中文翻译
matomo-org/matomo
Empowering People Ethically with the leading open source alternative to Google Analytics that gives you full control over your data. Matomo lets you easily collect data from websites & apps and visualise this data and extract insights. Privacy is built-in. Liberating Web Analytics. Star us on Github? +1. And we love Pull Requests!
fengdu78/deeplearning_ai_books
deeplearning.ai(吴恩达老师的深度学习课程笔记及资源)
afshinea/stanford-cs-229-machine-learning
VIP cheatsheets for Stanford's CS 229 Machine Learning
pyecharts/pyecharts
🎨 Python Echarts Plotting Library
apache/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
karpathy/char-rnn
Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch
apachecn/Interview
Interview = 简历指南 + 算法题 + 八股文 + 源码分析
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
intel-analytics/ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc
apache/iceberg
Apache Iceberg
sql-machine-learning/sqlflow
Brings SQL and AI together.
glados-network/GLaDOS
status of GLaDOS
salesforce/TransmogrifAI
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
DTStack/flinkStreamSQL
基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
byzer-org/byzer-lang
Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.
intuit/wasabi
Wasabi A/B Testing service is an open source project that is no longer under active development or being supported
piskvorky/bounter
Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.
shunfei/indexr
An open-source columnar data format designed for fast & realtime analytic with big data.
douban/paracel
Distributed training framework with parameter server
shunfei/sfmind
dantezhao/data-group
hit-computer/char-rnn
Recurrent Neural Networks(GRU) for character-level language models on Chinese, in Python/Theano
fengdu78/DataSciComp
A collection of popular Data Science Competitions
rocky1001/pykylin
Python DBAPI Driver and Sqlalchemy Dialect for Apache Kylin, the "Extreme OLAP Engine for Big Data"
raymondchen-byte/Netflix-Recommender-with-Spark
基于Apache Spark的Netflix电影的离线与实时推荐系统