oxhead
Ph.D. student @ NCSU | Research areas: Distributed Systems (Cloud and Storage), Performance Optimization, Applied Machine Learning
oxhead's Stars
julycoding/The-Art-Of-Programming-By-July-2nd
本项目曾冲到全球第一,干货集锦见本页面最底部,另完整精致的纸质版《编程之法:面试和算法心得》已在京东/当当上销售
jayinai/ml-interview
Preparing for machine learning interviews
entropyltd/spark-cloud
Spark-cloud is a set of scripts for starting spark clusters on ec2
vin0110/tecl
Unit test for ECL
vin0110/haas
HPCC as a Service
thuijskens/bayesian-optimization
Python code for bayesian optimization using Gaussian processes
HIPS/Spearmint
Spearmint Bayesian optimization codebase
BernhardWenzel/markdown-search
Search engine for markdown files with tagging
brendangregg/perf-tools
Performance analysis tools based on Linux perf_events (aka perf) and ftrace
SamyPesse/How-to-Make-a-Computer-Operating-System
How to Make a Computer Operating System in C++
linkedin/ambry
Distributed object store
httpie/http-prompt
An interactive command-line HTTP and API testing client built on top of HTTPie featuring autocomplete, syntax highlighting, and more. https://twitter.com/httpie
openucx/ucx
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
Alluxio/alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
dano/aioprocessing
A Python 3.5+ library that integrates the multiprocessing module with asyncio
tlhumphrey2/EasyFastHPCCoAWS
Cloud Formation template and scripts for easily configuring and deploying a fast HPCC System on AWS from browser
xolox/python-executor
Programmer friendly subprocess wrapper
iostackproject/Crystal-Controller
SDS Controller for Object Storage in the IOStack architecture
hpcc-systems/HPCC-Platform
HPCC Systems (High Performance Computing Cluster) is an open source, massive parallel-processing computing platform for big data processing and analytics.
lucidworks/spark-solr
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
p8952/bocker
Docker implemented in around 100 lines of bash
NetApp/NetApp-Hadoop-NFS-Connector
This projects provides a NFSv3 connector for Hadoop. Using the connector, Apache Hadoop and Apache Spark can use NFSv3 server as their storage backend.
cdapio/cdap
An open source framework for building data analytic applications.
ipython/ipython
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
quantcast/qfs
Quantcast File System
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
pulsarIO/realtime-analytics
Realtime analytics, this includes the core components of Pulsar pipeline.
openzipkin/zipkin
Zipkin is a distributed tracing system
harelba/hadoop-job-analyzer
nastra/hackerrank
Solutions to HackerRank and CodeChef problems