maver1ck's Stars
Moosync/Moosync
Music player capable of playing local audio or from Youtube, Spotify and many more
srbhr/Resume-Matcher
Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
PostHog/posthog
🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.
EmilStenstrom/django-components
Create simple reusable template components in Django.
better/jsonschema2db
Generate tables dynamically from a JSON Schema and insert data
microsoft/Analysis-Services
Git repo for Analysis Services samples and community projects
apache/dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
opendatadiscovery/odd-platform
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Netflix/mantis
A platform that makes it easy for developers to build realtime, cost-effective, operations-focused applications
EqualExperts/dbt-unit-testing
This dbt package contains macros to support unit testing that can be (re)used across dbt projects.
gimral/flink-kafka-catalog
podhmo/alchemyjsonschema
convert sqlalchemy model to jsonschema.
fastapi/sqlmodel
SQL databases in Python, designed for simplicity, compatibility, and robustness.
datafold/data-diff
Compare tables within or across databases
re-data/re-data
re_data - fix data issues before your users & CEO would discover them 😊
TobikoData/sqlmesh
Efficient data transformation and modeling framework that is backwards compatible with dbt.
kestra-io/kestra
Orchestration and automation platform to execute millions of scheduled and event-driven workflows declaratively in code and from the UI
metriql/metriql
The metrics layer for your data. Join us at https://metriql.com/slack
n8n-io/n8n
Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
LeapBeyond/scrubadub
Clean personally identifiable information from dirty dirty text.
ubisoft/mobydq
:whale: Tool to automate data quality checks on data pipelines
Swiple/swiple
Swiple enables you to easily observe, understand, validate and improve the quality of your data
fal-ai/dbt-fal
do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
speakleash/speakleash
ScalefreeCOM/datavault4dbt
Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including the Staging Area, DV2.0 main entities, PITs and Snapshot Tables.
apache/flink-kubernetes-operator
Apache Flink Kubernetes Operator
langchain-ai/streamlit-agent
Reference implementations of several LangChain agents as Streamlit apps
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—foundation models
airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.