scottsunsh's Stars
openl-tablets/openl-tablets
OpenL Tablets Business Rules Management System
writer/writer-framework
No-code in the front, Python in the back. An open-source framework for creating data apps.
quickwit-oss/quickwit
Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
py-why/EconML
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.
py-why/dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
ccsb-scripps/AutoDock-Vina
AutoDock Vina
Unleash/unleash
Open-source feature management solution built for developers.
jupyterhub/binderhub
Run your code in the cloud, with technology so advanced, it feels like magic!
elyra-ai/elyra
Elyra extends JupyterLab with an AI centric approach.
openfaas/faas
OpenFaaS - Serverless Functions Made Simple
dremio/dremio-oss
Dremio - the missing link in modern data
MaterializeInc/materialize
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
qubole/streamx
kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
qubole/rubix
Cache File System optimized for columnar formats and object stores
qubole/sparklens
Qubole Sparklens tool for performance tuning Apache Spark
treeverse/lakeFS
lakeFS - Data version control for your data lake | Git for data
spectacles-ci/spectacles
A continuous integration tool for Looker and LookML.
PostHog/posthog
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
terminusdb/terminusdb
TerminusDB is a distributed database with a collaboration model
hoppscotch/hoppscotch
Open source API development ecosystem - https://hoppscotch.io (open-source alternative to Postman, Insomnia)
great-expectations/great_expectations
Always know what to expect from your data.
allure-framework/allure2
Allure Report is a flexible, lightweight multi-language test reporting tool. It provides clear graphical reports and allows everyone involved in the development process to extract the maximum of information from the everyday testing process
MarquezProject/marquez
Collect, aggregate, and visualize a data ecosystem's metadata
finos/perspective
A data visualization and analytics component, especially well-suited for large and/or streaming datasets.
ncbi-nlp/BioSentVec
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
aimclub/FEDOT
Automated modeling and machine learning framework FEDOT
rudderlabs/rudder-server
Privacy and Security focused Segment-alternative, in Golang and React
neo4j-contrib/neo4j-etl
Data import from relational databases to Neo4j.
CamDavidsonPilon/lifetimes
Lifetime value in Python