Pinned Repositories
databricks-hail-installation
databricks-nlp-audio-azure
db-chromium
Public repository showing how it's possible to use Chromium with Databricks
db-logging-example
A small example setting Python's logging configuration using a module invoked from a notebook.
eclipse-icon-enlarger
Scales Eclipse icons (PNG and GIF) to double their size for QHD laptops.
example-nifi-csv-cleaner
Example custom processor to clean a large CSV with unconventional seperators
public-sector-bootcamp
A set of introductory exercises to walk through some of Databricks capabilities
spark-itcase
Set of tools to run Spark programs remotely during development builds
vectara-skunk-client
My attempt at a Vectara SDK for Python
vectara-speech-helper
This project is a demonstration of the power combining Retrieval Augmented Generation (RAG) with custom prompts.
davidglevy's Repositories
davidglevy/db-logging-example
A small example setting Python's logging configuration using a module invoked from a notebook.
davidglevy/databricks-hail-installation
davidglevy/vectara-skunk-client
My attempt at a Vectara SDK for Python
davidglevy/public-sector-bootcamp
A set of introductory exercises to walk through some of Databricks capabilities
davidglevy/vectara-speech-helper
This project is a demonstration of the power combining Retrieval Augmented Generation (RAG) with custom prompts.
davidglevy/databricks-nlp-audio-azure
davidglevy/db-chromium
Public repository showing how it's possible to use Chromium with Databricks
davidglevy/db-file-ingestion
A set of tests for Databricks file ingestion
davidglevy/db-ide-fun
Run various operations from an IDE ... for fun
davidglevy/db-oauth-vs-uc
Compare OAuth data access vs Unity Catalog in Databricks
davidglevy/db-retention-demo
Shows various options for applying data retention policies within Databricks.
davidglevy/db-skip-change-commits
Walk through of the changes in DBR 13.0 with skipChangeCommits (replacing ignoreChanges)
davidglevy/db-value-at-risk
Refresh on the Value at Risk accelerator
davidglevy/duckdb-vs-databricks
Comparing performance of DuckDB vs Databricks with TPC-H
davidglevy/express-hello-world
Express Hello World Example on Render https://render.com
davidglevy/facial-expression-classification
Databricks notebooks for the Facial Expression Classification course
davidglevy/load-abs-census-data
Loads the ABS census data into Delta tables for use
davidglevy/mlp-classification-template
A likely crappy adaption of the mlp-regression-template built on the idea that to motivate someone to do a good job, you should do a bad job first for critique (straw man).
davidglevy/nlp-workspace
A terraform script to spin up a Databricks workspace which has NLP relevant content.
davidglevy/render-react-hosting-1
create-react-app deployed on Render
davidglevy/river-modelling-demo
River modelling demo using public sources of data
davidglevy/solr-and-nifi-test-example
Set of basic Solr and Nifi tests to demonstrate regression testing capabilities.
davidglevy/spark-ide-example
This will be an example IDE based project to develop code using dbx
davidglevy/spark-threading-fun
A small repository for doing various experiments with threading to solve "Embarrassingly Parallel" (IO Bound) concurrency.
davidglevy/traveller-demo
Example of a UC workspace simulating travellers with required documents.
davidglevy/variant-spark-examples
Example notebooks for terraform
davidglevy/VariantSpark
machine learning for genomic variants
davidglevy/vectara-ingest
An open source framework to crawl data sources and ingest into Vectara
davidglevy/vectara-skunk-examples
A set of examples showing the client being used to demonstrate Vectara features
davidglevy/web-crawler
Lightweight indexer/crawler to get web content into Vectara