Neuw84
Senior Partner Solutions Architect – Data & Analytics at Amazon Web Services (AWS)
Amazon Web Services, Spain
Pinned Repositories
iceberg-streaming-examples
This repo contains examples of high-throughput ingestion using Apache Spark and Apache Iceberg. The examples cover IoT and CDC scenarios and follow best practices. The code can be deployed to any Spark-compatible engine, such as Amazon EMR Serverless or AWS Glue. A fully local developer environment is also provided.
bds2k17
Repository containing code for the Big Data Spain 2017 technical talk "Towards an Unified API for Spark and the IIoT"
CValue-TermExtraction
A free implementation of the C-Value algorithm
datahack
Datahack Spark live-coding demo code
debezium-kafka-connect-docker-s3tables
flink-iceberg-streaming
Repo containing a complete end-to-end example of using Flink with Iceberg in streaming fashion, with Zeppelin as the notebook engine
opensearch-workshop
Scripts and instructions for Amazon OpenSearch Migration Workshop (ES, Solr, Splunk)
RAKE-Java
A Java implementation of the Rapid Automatic Keyword Extraction (RAKE) framework
structured-streaming-avro-demo
Spark 3.0.0 Structured Streaming Kafka Avro Demo
Wikipedia2WordNet
Library for mapping from Wikipedia articles to WordNet synsets in Java 8
Neuw84's Repositories
Neuw84/datahack
Datahack Spark live-coding demo code
Neuw84/LiTe
A language-independent term extractor, linker, and disambiguator
Neuw84/wikipediaminer
An open source toolkit for mining Wikipedia
Neuw84/awesome-java
A curated list of awesome Java frameworks, libraries and software.
Neuw84/awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
Neuw84/cheatsheets-ai
Essential Cheat Sheets for deep learning and machine learning researchers
Neuw84/dcos-commons
Neuw84/deep-learning-links
Collection of Deep learning related papers / links / tutorials...
Neuw84/docker-emqtt
Docker container for eMQTT Broker by NerdNobs
Neuw84/docker-kafka
Kafka (and Zookeeper) in Docker
Neuw84/docker-mesos
Mesos, Marathon and Chronos using Docker Compose
Neuw84/durian
Guava's spikier (unofficial) cousin
Neuw84/EnterpriseNIFI
Enterprise NIFI Talks and Code
Neuw84/euskal2k17
Repo for storing Euskal Party 2017 Demo code
Neuw84/examples
DC/OS examples
Neuw84/FiloDB
Distributed. Columnar. Versioned. Streaming. SQL.
Neuw84/jwt-spring-security-demo
A small demo of using JWT (JSON Web Token) with Spring Security and Spring Boot
Neuw84/kafka-examples
Snippets and small examples demonstrating Kafka features and configs
Neuw84/kite
Kite SDK
Neuw84/language-detection
This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)
Neuw84/og-aws
📙 Amazon Web Services — a practical guide
Neuw84/oreilly-captions
Neuw84/sarama
Sarama is a Go library for Apache Kafka 0.8
Neuw84/SpanishInflectorStemmer
Spanish light stemmer that removes plurals using "Real Academia de la Lengua" recommendations
Neuw84/spark-structured-streaming
Spark Structured Streaming with a Kafka data source, writing to Cassandra
Neuw84/Spark-Structured-Streaming-Examples
Spark Streaming / Kafka / Cassandra Example
Neuw84/spark-timeseries
A library for time series analysis on Apache Spark
Neuw84/Wiki
Neuw84/Wikipedia353Spanish
The WikipediaSimilarity 353 Test Collection is a dataset for measuring semantic relatedness between articles in Wikipedia. It is an adaptation of an earlier dataset (the WordSimilarity 353 Test Collection) for measuring semantic relatedness between words.
Neuw84/zdd-lab
DC/OS Zero Downtime Deployments Lab