Pinned Repositories
warcutils
Library with utility classes for working with the 2014 Common Crawl warc, wet and wat files.
jupyter-bigdata-notebooks
Notebooks for the big data group
mtc-hadoop
Many-task computing (https://en.wikipedia.org/wiki/Many-task_computing) for Hadoop
newsreader-hadoop
Port of the newsreader pipeline to hadoop
seticombine
SETI scripts and tools
sne-es-db
Material for the DB guest lecture at the SNE Essential Skills course
docker-jupyter
Modified Jupyter environment in a container.
dockerspawner
Spawns JupyterHub user servers in Docker containers
NBIC-BioAssist--hands-on-Hadoop
vmk's Repositories
vmk/NBIC-BioAssist--hands-on-Hadoop
vmk/docker-jupyter
Modified Jupyter environment in a container.
vmk/dockerspawner
Spawns JupyterHub user servers in Docker containers
vmk/hathi-client
Client software and configuration for the Hathi Hadoop cluster
vmk/imputestuff
vmk/initrelicdb
vmk/jads-nosql-mongodb
vmk/jupyter-bigdata-notebooks
Notebooks for the big data group
vmk/kc-eu-2023-k8s-wasm-microservices
vmk/ndw-bridgedata-spark
Structured streaming Lab
vmk/newsreader-hadoop
Port of the newsreader pipeline to hadoop
vmk/notebooks
My notebooks.
vmk/pigner
vmk/rite
A pilot job framework written in Java
vmk/schedulegwas
vmk/scheduleinterpro
vmk/schedulepsiblast
vmk/seticombine
SETI scripts and tools
vmk/shc
The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.
vmk/sne-es-db
Material for the DB guest lecture at the SNE Essential Skills course
vmk/warcexamples
Example programs that work with the March 2014 Common Crawl warc, wet and wat files on the SURFsara hadoop cluster environment.
vmk/warcutils
Library with utility classes for working with the 2014 Common Crawl warc, wet and wat files.
vmk/WordCountNotebook