joswlv's Stars
timeplus-io/proton
A stream processing engine and database, and a fast and lightweight alternative to ksqlDB and Apache Flink, 🚀 powered by ClickHouse
ibis-project/ibis
the portable Python dataframe library
eriksencosta/money
Monetary calculations and allocations made easy
twosigma/flint
A Time Series Library for Apache Spark
PKUFlyingPig/Self-learning-Computer-Science
the resources I use to learn computer science in my spare time
redpanda-data/kminion
KMinion is a feature-rich Prometheus exporter for Apache Kafka written in Go. It is lightweight and highly configurable so that it will meet your requirements.
apache/polaris
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
apache/spark-kubernetes-operator
Apache Spark Kubernetes Operator
seglo/kafka-lag-exporter
Monitor Kafka Consumer Group Latency with Kafka Lag Exporter
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
jillesvangurp/kt-search
Multiplatform Kotlin client for Elasticsearch & OpenSearch with easily extendable Kotlin DSLs for queries, mappings, bulk, and more.
G-Research/spark-extension
A library that provides useful extensions to Apache Spark and PySpark.
cube-js/cube
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
sjrusso8/spark-connect-rs
Apache Spark Connect Client for Rust
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
risingwavelabs/risingwave
Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time ETL pipelines, and feature stores in minutes. Unified streaming and batch. PostgreSQL compatible.
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
ryo-ma/github-profile-trophy
🏆 Add dynamically generated GitHub Stat Trophies on your readme
p0deje/Maccy
Lightweight clipboard manager for macOS
Aider-AI/aider
aider is AI pair programming in your terminal
metabase/metabase
The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
plokhotnyuk/jsoniter-scala
Scala macros for compile-time generation of safe and ultra-fast JSON codecs + circe booster
duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
unitycatalog/unitycatalog
Open, Multi-modal Catalog for Data & AI
mrpowers-io/quinn
PySpark methods to enhance developer productivity 📣 👯 🎉
nebula-plugins/nebula-release-plugin
Release opinions based around gradle-git
backstage/backstage
Backstage is an open framework for building developer portals
jetbrains-infra/log4j-json-layout
Log4J Layout to Format Logs into Logstash Json Format
defog-ai/sqlcoder
SoTA LLM for converting natural language questions to SQL queries
Openpanel-dev/openpanel
All the goodies from both Mixpanel and Plausible combined into one tool.