Pinned Repositories
adam
Avro Datafile for Alignment/Map
ampx-docs
avro-parquet-spark-example
An example of using Avro and Parquet in Spark SQL
CausalImpact
An R package for causal inference in time series
docker-scripts
Dockerfiles and scripts for Spark and Shark Docker images
dockerfiles
Repo to hold my Docker files
Hadoop-BAM
Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework with the Picard SAM JDK, and command line tools similar to SAMtools.
htsjdk
A Java API for high-throughput sequencing data (HTS) formats.
incubator-parquet-mr
Mirror of Apache Parquet
maxmind-geoip2-scala
Simple Scala wrapper for MaxMind GeoIP2 webservice client and database reader http://maxmind.github.io/GeoIP2-java/
AndreSchumacher's Repositories
AndreSchumacher/avro-parquet-spark-example
An example of using Avro and Parquet in Spark SQL
AndreSchumacher/docker-scripts
Dockerfiles and scripts for Spark and Shark Docker images
AndreSchumacher/adam
Avro Datafile for Alignment/Map
AndreSchumacher/ampx-docs
AndreSchumacher/CausalImpact
An R package for causal inference in time series
AndreSchumacher/dockerfiles
Repo to hold my Docker files
AndreSchumacher/Hadoop-BAM
Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework with the Picard SAM JDK, and command line tools similar to SAMtools.
AndreSchumacher/htsjdk
A Java API for high-throughput sequencing data (HTS) formats.
AndreSchumacher/incubator-parquet-mr
Mirror of Apache Parquet
AndreSchumacher/maxmind-geoip2-scala
Simple Scala wrapper for MaxMind GeoIP2 webservice client and database reader http://maxmind.github.io/GeoIP2-java/
AndreSchumacher/parquet-mr
Java readers/writers for Parquet columnar file formats to use with Map-Reduce
AndreSchumacher/scalding
A Scala API for Cascading
AndreSchumacher/SeqPig
SeqPig is a library for Apache Pig for the distributed analysis of large sequencing datasets. It provides import and export functions for file formats commonly used for sequencing data, as well as a collection of Pig user-defined-functions (UDF’s) to help process aligned and unaligned sequence data.
AndreSchumacher/shark
Hive on Spark
AndreSchumacher/spark
Mirror of Apache Spark
AndreSchumacher/stable-diffusion-webui
Stable Diffusion web UI
AndreSchumacher/whirr
Mirror of Apache Whirr