Pinned Repositories
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
arrow-datafusion-python
Apache Arrow DataFusion Python Bindings
arrow-rs
Official Rust implementation of Apache Arrow
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
aws-custom-credential-provider
A custom AWS credential provider that allows your Hadoop or Spark application access S3 file system by assuming a role
aws-sdk-pandas
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
deltatorch
nkarpov.github.io
nkarpov's Repositories
nkarpov/deltatorch
nkarpov/nkarpov.github.io
nkarpov/arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
nkarpov/arrow-datafusion-python
Apache Arrow DataFusion Python Bindings
nkarpov/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
nkarpov/aws-custom-credential-provider
A custom AWS credential provider that allows your Hadoop or Spark application access S3 file system by assuming a role
nkarpov/cria
Tiny inference-only implementation of LLaMA
nkarpov/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs for Scala, Java, Rust, Ruby, and Python.
nkarpov/delta-examples
Delta Lake examples
nkarpov/delta-rs
A native Rust library for Delta Lake, with bindings into Python
nkarpov/spark
Apache Spark - A unified analytics engine for large-scale data processing
nkarpov/Dataset
News: the 4k dataset is ready for download.
nkarpov/delta-go
nkarpov/dspy
Stanford DSPy: The framework for programming with foundation models
nkarpov/Flowise
Drag & drop UI to build your customized LLM flow using LangchainJS
nkarpov/ipython-gpt
nkarpov/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
nkarpov/lst-bench
LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as Delta Lake, Apache Hudi, and Apache Iceberg.
nkarpov/moondream
tiny vision language model
nkarpov/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
nkarpov/openplayground
An LLM playground you can run on your laptop
nkarpov/privateGPT
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
nkarpov/pyicloud
A Python + iCloud wrapper to access iPhone and Calendar data.
nkarpov/rawdog
Generate and auto-execute Python scripts in the cli
nkarpov/super-json-mode
Low latency JSON generation using LLMs ⚡️
nkarpov/tidb
TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at : https://tidbcloud.com/free-trial
nkarpov/tigerbeetle
The distributed financial transactions database designed for mission critical safety and performance.
nkarpov/unitycatalog
Open, Multi-modal Catalog for Data & AI
nkarpov/webgpu-torch
Tensor computation with WebGPU acceleration
nkarpov/WebODM
User-friendly, commercial-grade software for processing aerial imagery. 🛩