Pinned Repositories
aop
AMQP on Pulsar protocol handler
beam
Apache Beam
collector-core
Collector-related code shared between different collector implementations
committer-core
Norconex Committer is a java library and command line application used to route content to local or remote target repositories, such as a search engine index.
committer-solr
Solr implementation of Norconex Committer. Should also work with any Solr-based products, such as LucidWorks.
CoreNLP
Stanford CoreNLP: A Java suite of core NLP tools.
DataVec
ETL Library for Machine Learning
ddth-queue
Library to interact with various queue implementations
grakn
A Hyper-Relational Database for Knowledge-Oriented System
jsteggink's Repositories
jsteggink/beam
Apache Beam
jsteggink/aop
AMQP on Pulsar protocol handler
jsteggink/collector-core
Collector-related code shared between different collector implementations
jsteggink/committer-core
Norconex Committer is a java library and command line application used to route content to local or remote target repositories, such as a search engine index.
jsteggink/CoreNLP
Stanford CoreNLP: A Java suite of core NLP tools.
jsteggink/ddth-queue
Library to interact with various queue implementations
jsteggink/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
jsteggink/elasticsearch-cloud-kubernetes
jsteggink/elasticsearch-hadoop
:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop
jsteggink/faiss
A library for efficient similarity search and clustering of dense vectors.
jsteggink/hadoop-ceph
Implementation of Hadoop file system
jsteggink/language
Shared repository for open-sourced projects from the Google AI Language team.
jsteggink/lucene
Apache Lucene open-source search software
jsteggink/lucene-solr
Mirror of Apache Lucene + Solr
jsteggink/opennlp
Mirror of Apache OpenNLP
jsteggink/pulsar
Apache Pulsar - distributed pub-sub messaging system
jsteggink/pulsar-io-amqp-1-0
support sink/source for AMQP version 1.0.0
jsteggink/pulsar-spark
When Apache Pulsar meets Apache Spark
jsteggink/sigma.js
A JavaScript library aimed at visualizing graphs of thousands of nodes and edges
jsteggink/solarium
PHP Solr client library
jsteggink/solr
Apache Solr open-source search software
jsteggink/spark-corenlp
Stanford CoreNLP wrapper for Apache Spark
jsteggink/spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
jsteggink/spark-on-openshift
Spark operator deployment and usage on OpenShift
jsteggink/spark-solr
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
jsteggink/stickers-hackaton
jsteggink/streaming-amqp
AMQP data source for dstream (Spark Streaming)
jsteggink/trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
jsteggink/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
jsteggink/vcv-guitar-simulator
Template for a VCV Rack Blank module