Neuw84
Senior Partner Solutions Architect – Data & Analytics at Amazon Web Services (AWS)
Amazon Web Services, Spain
Pinned Repositories
iceberg-streaming-examples
This repo contains examples of high-throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenarios using best practices. The code can be deployed into any Spark-compatible engine, such as Amazon EMR Serverless or AWS Glue. A fully local developer environment is also provided.
bds2k17
Repository containing code for the Big Data Spain 2017 technical talk "Towards an Unified API for Spark and the IIoT"
CValue-TermExtraction
A free implementation of the C-Value algorithm
datahack
Datahack Spark live-coding demo code
debezium-kafka-connect-docker-s3tables
flink-iceberg-streaming
Repo containing a complete end-to-end example of using Flink with Iceberg in a streaming fashion, with Zeppelin as the notebook engine
opensearch-workshop
Scripts and instructions for Amazon OpenSearch Migration Workshop (ES, SOLR, Splunk)
RAKE-Java
A Java implementation of the Rapid Automatic Keyword Extraction Framework (RAKE)
structured-streaming-avro-demo
Spark 3.0.0 Structured Streaming Kafka Avro Demo
Wikipedia2WordNet
Library for mapping from Wikipedia Articles to WordNet Synsets in Java 8
Neuw84's Repositories
Neuw84/Wikipedia2WordNet
Library for mapping from Wikipedia Articles to WordNet Synsets in Java 8
Neuw84/wikipediaminer
An open source toolkit for mining Wikipedia
Neuw84/AutomaticKeyphraseExtraction
Data for Automatic Keyphrase Extraction Task
Neuw84/c710_ubuntu_pis
Acer C710 Ubuntu 14.04 Post Installation Script.
Neuw84/hadoopecosystemtable.github.io
This page is a summary to keep track of Hadoop-related projects and other relevant projects around the Big Data scene, focused on the open source, free software environment.
Neuw84/ixa-pipe-chunk
ixa-pipe-chunk performs Chunking for English. This module is part of a multilingual NLP pipeline developed by the IXA NLP Group (ixa.si.ehu.es)
Neuw84/ixa-pipe-nerc
IXA pipes Named Entity Recognition tagger (http://ixa2.si.ehu.es/ixa-pipes).
Neuw84/ixa-pipe-pos
IXA pipes Part of Speech tagger (http://ixa2.si.ehu.es/ixa-pipes).
Neuw84/ixa-pipe-tok
IXA pipes sentence segmenter and tokenizer (http://ixa2.si.ehu.es/ixa-pipes).
Neuw84/kaflib
A library for managing KAF documents
Neuw84/SpanishInflectorStemmer
Spanish Light Stemmer that removes plurals using "Real Academia de la Lengua" recommendations
Neuw84/xjsf-servlet-framework
xjsf - Lightweight framework for XML & JSON web services. Forked from https://code.google.com/p/xjsf/