jhintonrsos's Stars
just-modeling/cdc-debezium-engine-release
A repo for publishing releases of the cdc-debezium-engine project. The original repository is https://github.com/just-modeling/cdc-debezium-engine
JohnSnowLabs/spark-nlp
State of the Art Natural Language Processing
huggingface/smolagents
🤗 smolagents: a barebones library for agents that think in python code.
cyclops-ui/cyclops
Developer Friendly Kubernetes 👁️
aspect-build/bazel-examples
Bazel examples
abiosoft/colima
Container runtimes on macOS (and Linux) with minimal setup
andrewyng/aisuite
Simple, unified interface to multiple Generative AI providers
awesome-spark/awesome-spark
A curated list of awesome Apache Spark packages and resources.
MrPowers/mack
Delta Lake helper methods in PySpark
soniaai/rules_poetry
Bazel rules that use Poetry for Python package management
databrickslabs/geoscan
Geospatial clustering at massive scale
databrickslabs/lsql
Lightweight SQL execution wrapper built only on top of the Databricks SDK
databricks/cli
Databricks CLI
justinbreese/databricks-gems
Some random how-to examples relating to Databricks.
databrickslabs/pytester
Python Testing for Databricks
databrickslabs/pylint-plugin
Databricks Plugin for PyLint
palantir/pyspark-style-guide
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
Cocoon-Data-Transformation/cocoon
Data management with LLMs
ben-nour/SQL-tips-and-tricks
SQL tips and tricks
pyspark-ai/pyspark-ai
English SDK for Apache Spark
andre-salvati/databricks-template
Project Template for Spark/Databricks with Python packaging and CI/CD automation
nicolattuso/DLT_Template
A template repository for Delta Live Tables projects
databrickslabs/blueprint
Baseline for Databricks Labs projects written in Python
souvik-databricks/dlt-with-debug
A lightweight helper utility that lets developers do interactive pipeline development with a single source code base for both DLT runs and non-DLT interactive notebook runs.
databrickslabs/dbx
🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
chisasaw/redcache-ai
A memory framework for Large Language Models and Agents.
databrickslabs/mosaic
An extension to the Apache Spark framework that allows easy and fast processing of very large geospatial datasets.
databricks/notebook-best-practices
An example showing how to apply software engineering best practices to Databricks notebooks.
databricks-demos/dbconnect-examples
Build data apps with Databricks dbconnect
tobymao/sqlglot
Python SQL Parser and Transpiler