dennyglee
data dork, scribe, geek, ultimate frisbee fan, mountain climber (barely!), wanna be cyclist... occasionally awake
@databricks
Seattle, WA
Pinned Repositories
azure-cosmosdb-spark
Apache Spark Connector for Azure Cosmos DB
Caprica
Web Social SDK based on Scala, Node.js, Hadoop, Hive, and Analysis Services. The kit is based on the Hadoop Summit presentation "How Klout changed the landscape of social media with Hadoop and BI".
ChateauKebob
The purpose of this project is to expand on the “Reaching Compliance: SQL Server 2008 Compliance Guide” to more easily handle larger volumes of structured and unstructured data. The end goal is to gain richer and deeper insight using the latest analytics. To achieve this, we are building a Big Data-to-BI project involving HDInsight (Hadoop on Windows or Azure), SQL Server 2012, SQL Server Analysis Services 2012 Tabular, Integration Services, PowerPivot, and Power View.
databricks
Repository of sample Databricks notebooks
dicom-to-png
A simple Python module to make it easy to batch convert DICOM files to PNG images.
fine-tuning
LLM fine-tuning experiments, practice, examples
dennyglee's Repositories
dennyglee/databricks
Repository of sample Databricks notebooks
dennyglee/fine-tuning
LLM fine-tuning experiments, practice, examples
dennyglee/pyspark-ai
English SDK for Apache Spark
dennyglee/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
dennyglee/delta-rs
A native Rust library for Delta Lake, with bindings into Python
dennyglee/dennyglee.github.io
about me! data dork, scribe, geek, ultimate frisbee fan, mountain climber (barely!), wanna be cyclist... occasionally awake
dennyglee/MochiDiffusion
Run Stable Diffusion on Mac natively
dennyglee/unitycatalog
Open, Multi-modal Catalog for Data & AI
dennyglee/viz-word-emb
Visualize your word embeddings
dennyglee/wee-slack
A WeeChat script for Slack.com. Supports threads and reactions, synchronizes read markers, provides typing notification, etc.
dennyglee/.github
dennyglee/arrow-rs
Official Rust implementation of Apache Arrow
dennyglee/delta-docker
Official Dockerfile for Delta Lake
dennyglee/delta-docs
Delta Lake Documentation
dennyglee/delta-dotnet
DeltaLake bindings for dotnet based on delta-rs
dennyglee/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
dennyglee/lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
dennyglee/LibreChat
Enhanced ChatGPT Clone: Features OpenAI, Bing, Anthropic, OpenRouter, PaLM 2, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development
dennyglee/llama.cpp
Port of Facebook's LLaMA model in C/C++
dennyglee/llama_index
LlamaIndex is a data framework for your LLM applications
dennyglee/llamafile
Distribute and run LLMs with a single file.
dennyglee/llm-foundry
LLM training code for MosaicML foundation models
dennyglee/megablocks
dennyglee/mlflow
Open source platform for the machine learning lifecycle
dennyglee/mlx-examples
Examples in the MLX framework
dennyglee/sniffer
CSV and flat-file sniffer built in Rust.
dennyglee/unitycatalog-ui
Unity Catalog UI
dennyglee/website-1
Delta Lake Website
dennyglee/website-pysparkai
pyspark-ai website
dennyglee/yet-another-delta-catalog-ts
An administration UI for Delta Sharing implemented using Next.js and TypeScript.