tspannhw
PrincipalDev Advocate, Vector Database, Milvus, Apache NiFi, Data Engineer, IoT, AI, ML, Python Developer, RP, FLANK, Apache Kafka, Java
Principal Developer AdvocatePrinceton, New Jersey, USA
Pinned Repositories
EverythingApacheNiFi
EverythingApacheNiFi
FLiP-Pi-DeltaLake-Thermal
Apache Pulsar -> Sink -> DeltaLake
FLiPStackWeekly
FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
meetups
Meetup Materials
nifi-attributecleaner-processor
Clean up attribute names for Apache NiFi to send to Apache Avro
nifi-convertjsontoddl-processor
Apache NiFi 1.5/1.6/1.9.2+ Processor to produce DDL
nifi-extracttext-processor
Apache NiFi Custom Processor Extracting Text From Files with Apache Tika
nifi-nlp-processor
Apache NiFi NLP Processor
phoenix
Apache Phoenix / Hbase Spring Boot Microservices
SpeakerProfile
My speaker profile for events and conferences based on codepo8/presenter-terms
tspannhw's Repositories
tspannhw/GetWebCamera
Apache NiFi 1.23 Custom Processor for WebCams
tspannhw/FLaNK-EveryTransitSystem
Every transit system
tspannhw/FLaNK-December2023
FLaNK December 2023
tspannhw/CFM-Monitoring
tspannhw/FLaNK-EdgeAI
FLaNK-EdgeAI
tspannhw/FLaNK-Ice
Apache Iceberg - Cloud Data Lakehouse
tspannhw/FLaNK-Py-Stocks
FLaNK Python Stocks to Apache Kafka - Cloudera
tspannhw/FLaNK-RPI5
FLaNK-RPI5, Raspberry Pi 5
tspannhw/FLaNK-VectorDB
NiFi and Vector Databases
tspannhw/randomdanceparty
tspannhw/1brc
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
tspannhw/advent-of-code-flink-paimon
tspannhw/CML_AMP_Solr_9
Deploy Solr 9 as a CML Application
tspannhw/DEMO-multimodal-search
tspannhw/Electric_and_Utilities_System_Demo
Using CDF, CDW, CML and Data Viz, this demo is a complete Electric and Utilities Company use case to broadly leverage the CDP Data Services platform
tspannhw/Ender3V2S1
This is optimized firmware for Ender3 V2/S1 3D printers.
tspannhw/fashion_vdb
fashion stuff
tspannhw/FLaNK-CDW
CDW, NiFi, DataFlow, Impala
tspannhw/FLaNK-ContinuousSQL
tspannhw/FLaNK-DailyMeds
Time to ingest your daily meds
tspannhw/flink-iceberg-minio-trino
This project demonstrates Real-Time streaming of CDC data from MySql to Apache Iceberg using Flink SQL Client for faster data analytics and machine learning workloads.
tspannhw/flink-iceberg-playground
minio as local storage and DynamoDB as catalog
tspannhw/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
tspannhw/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
tspannhw/MAmmoTH
This repo contains the code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning"
tspannhw/psc
PubSubClient (PSC)
tspannhw/semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
tspannhw/shoe-store
Shoe Store Loyalty Engine - Flink SQL Workshop
tspannhw/voyager
🛰️ Voyager is an approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
tspannhw/watsonxdata-python-sdk
This is used for wastonx.data Python SDK