Pinned Repositories
Big-Data-Benchmark-for-Big-Bench
Big Bench Workload Development
flink-deployer
A tool that helps automate deployment to an Apache Flink cluster
gradoop
Distributed Graph Analytics with Apache Flink
HDF-Workshop
heron
Heron is a realtime, distributed, fault-tolerant stream processing engine from Twitter
nifi-flow-registry
nifi-soap
oryx
Simple real-time large-scale machine learning infrastructure.
StreamingData-Book-Examples
tile38
Tile38 is a fast geolocation data store, spatial index, and realtime geofence. It supports a variety of object types including lat/lon points, bounding boxes, XYZ tiles, Geohashes, and GeoJSON. 🌐
apsaltis's Repositories
apsaltis/AI-PredictiveMaintenance
apsaltis/analytics-zoo
Distributed TensorFlow, Keras, PyTorch and BigDL on Apache Spark
apsaltis/aws-sdk-rust
AWS SDK for the Rust Programming Language
apsaltis/bincode
A binary encoder / decoder implementation in Rust.
apsaltis/cloud-dataflow-nyc-taxi-tycoon
This is the support code and solutions for the NYC Taxi Tycoon Dataflow Codelab
apsaltis/colima
Container runtimes on macOS (and Linux) with minimal setup
apsaltis/Cymatic3D
Software for creating real-time Cymatics in three dimensions. For research, entertainment, and VJ-ing.
apsaltis/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
apsaltis/differential-datalog
An incremental programming language
apsaltis/flink-training-troubleshooting
apsaltis/flink-tutorials
apsaltis/flowbite-next-starter
apsaltis/fm_data_tasks
Foundation Models for Data Tasks
apsaltis/fraud-detection-demo
Repository for Advanced Flink Application Patterns series
apsaltis/genai-quickstart-pocs
This repository contains sample code demonstrating various use cases leveraging Amazon Bedrock and Generative AI. Each sample is a separate project with its own directory, and includes a basic Streamlit frontend to help users quickly set up a proof of concept.
apsaltis/incubator-horaedb
Apache HoraeDB (incubating) is a high-performance, distributed, cloud native time-series database.
apsaltis/llama2
This chatbot app is built using the Llama 2 open source LLM from Meta.
apsaltis/lopdf
A Rust library for PDF document manipulation.
apsaltis/marsyas
Marsyas - Music Analysis, Retrieval and Synthesis for Audio Signals
apsaltis/materialize
The data warehouse for operational workloads.
apsaltis/nifi
Mirror of Apache NiFi
apsaltis/Nominatim
Open Source search based on OpenStreetMap data
apsaltis/nominatim-docker
100% working container for Nominatim
apsaltis/perspective
Streaming pivot visualization via WebAssembly
apsaltis/predictive-maintenance-using-machine-learning
Set up end-to-end demo architecture for predictive maintenance issues with Machine Learning using Amazon SageMaker
apsaltis/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
apsaltis/sparklens
Qubole Sparklens tool for performance tuning Apache Spark
apsaltis/ThirdAILabs-Demos
Notebooks for ThirdAI demos
apsaltis/utoipa
Simple, Fast, Code first and Compile time generated OpenAPI documentation for Rust
apsaltis/xorfilter
Go library implementing xor filters