Pinned Repositories
hadoop-cdh-pseudo-docker
ai-coach.js
JavaScript library for browser-side AI solutions such as AskTheDocs.
AskForKPI
commodore64_LLM_RAG
my-dockerfiles
Dockerfiles for Kafka and Flink images
NextAction
A more GTD-like workflow for Todoist. Uses the REST API to add and remove a @next_action label from tasks.
odi-groovy-sdk-prj
tpch_csv-2-json_awk
TPC-H to JSON conversion with awk
gszecsenyi's Repositories
gszecsenyi/commodore64_LLM_RAG
gszecsenyi/ai-coach.js
JavaScript library for browser-side AI solutions such as AskTheDocs.
gszecsenyi/AskForKPI
gszecsenyi/my-dockerfiles
Dockerfiles for Kafka and Flink images
gszecsenyi/NextAction
A more GTD-like workflow for Todoist. Uses the REST API to add and remove a @next_action label from tasks.
gszecsenyi/anomaly_detection_with_autoencoder
Anomaly detection sample code
gszecsenyi/articles-tutorials
Streaming Twitter to Kafka with Apache Nifi
gszecsenyi/Azure-Databricks
Azure Databricks - Advent of 2020 Blogposts
gszecsenyi/kudu-docker
Docker Image for Kudu
gszecsenyi/odi-groovy-sdk-prj
gszecsenyi/bigdata2go.github.io
Public repository for bigdata2go project
gszecsenyi/cca-175_preparation
Problems and solutions to CCA-175 exam
gszecsenyi/cobrix
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
gszecsenyi/CodecMapper
Build mapping files derived from Java Charsets which can be processed by Python's gencodec.py.
gszecsenyi/coding_challenge
gszecsenyi/docker-spark-cluster
A simple Spark standalone cluster for your testing environment
gszecsenyi/dockerfiles
Dockerfiles the Trivadis BDS manages
gszecsenyi/embeddings.js
Simple text embeddings library for Node.js (OpenAI, Mistral, Local)
gszecsenyi/hdinsight-kafka-java-get-started
Basic example of using Java to create a producer and consumer that work with Kafka on HDInsight. Also a demonstration of the streaming API.
gszecsenyi/llm_confluence_integration
LLM - Confluence integration
gszecsenyi/multi-agent-concierge
gszecsenyi/Ollama-in-GitHub-Codespaces
Learn how to run Ollama in GitHub Codespaces for free
gszecsenyi/oracle-bigdatalite-scripts
Scripts for Oracle BigDataLite VM tests
gszecsenyi/pyspark-cheatsheet
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
gszecsenyi/python-cli-template
A template for a Python CLI package that can be published to PyPI.
gszecsenyi/RDT
A library of Reversible Data Transforms
gszecsenyi/spark-ignite
Shared RDD Ignite environment and example application with Spark
gszecsenyi/spark-kubernetes
Spark on Kubernetes
gszecsenyi/spark-redis
A connector for Spark that allows reading and writing to/from Redis cluster
gszecsenyi/vectordb.js
Simple in-memory vector database for text similarity in Node.js