Pinned Repositories
dvenabler
Adds DocValues to Solr index fields without full re-index
heatmap
A GitHub-inspired graph for visualising activity
heritrix3-wrapper
Small wrapper to start/stop and communicate with Heritrix 3.
jwat
Java Web Archive Toolkit
jwat-tools
JWAT Tools
netarchivesuite
Netarchivesuite 5.X development
netarchivesuite-svngit-migration
Git conversion of Subversion repository.
netsearch
Merged search-arctika and search-achon into a multi-module project
so-me
Social Media harvests
solrwayback
A search interface and wayback machine for the UKWA Solr-based warc-indexer framework.
NetarchiveSuite's Repositories
netarchivesuite/retro
Conversion of old data to WARC format, as of 2012
netarchivesuite/NAS-research
Research tools for Netarkivet.dk
netarchivesuite/batchJobs
Batchjobs for Netarchivesuite
netarchivesuite/compression
Scripts related to the workflow for compressing an arc/warc repository
netarchivesuite/language-detector
Language Detection Library for Java
netarchivesuite/webdanica-extractlinks
Tools for extracting links from ARC and WARC files
netarchivesuite/nas_ansible
Start of an Ansible deploy framework for NetarchiveSuite
netarchivesuite/shine
Prototype Solr-powered web archive exploration UI.
netarchivesuite/heritrix3-scripts
Some heritrix3 scripts, for use in the h3 console.
netarchivesuite/jenkins-jobs
Defines the NetarchiveSuite jobs to run on SBForge through the Jenkins DSL plugin
netarchivesuite/tika
Mirror of Apache Tika
netarchivesuite/bitrepository-rest-client
Java REST client to interact with the Bitrepository software.
netarchivesuite/common-datastructures
Library of common data structures implemented in Java.
netarchivesuite/netarkivet-4.4-openwayback
Netarkivet openwayback linked with NAS 4.4 jars.
netarchivesuite/netarchivesuite-svngit-migration
Git conversion of Subversion repository.
netarchivesuite/webarchive-commons
Common web archive utility code.
netarchivesuite/heritrix3-client-testbed
A little testbed/playground/prototype for exploring the h3 REST interface.
netarchivesuite/netarchivesuite.github.io
netarchivesuite/docker-netarchivesuite
netarchivesuite/xml-formatter
Fork of http://code.google.com/p/xml-formatter/ with enhancements
netarchivesuite/jbs
Builds Lucene/Solr indexes out of NutchWAX segments and revisit records via Hadoop.
netarchivesuite/hadoop-tools
netarchivesuite/bacon
Experimenting with Apache Pig.