vicher37's Stars
shap/shap
A game theoretic approach to explain the output of any machine learning model.
jina-ai/clip-as-service
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
crownpku/text2vec
Easily generate document/paragraph/sentence vectors and calculate similarity.
taki0112/Vector_Similarity
Python, Java implementation of TS-SS called from "A Hybrid Geometric Approach for Measuring Similarity Level Among Documents and Document Clustering"
hetong007/higgsml
Repository for post higgs-competition model submission
REMitchell/apiscraper
killrweather/killrweather
KillrWeather is a reference application (work in progress) showing how to easily integrate streaming and batch data processing with Apache Spark Streaming, Apache Cassandra, Apache Kafka and Akka for fast, streaming computations on time series data in asynchronous event-driven environments.
pandas-dev/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
frostming/legit
Git for Humans, Inspired by GitHub for Mac™.
twitter/algebird
Abstract Algebra for Scala
PipelineAI/pipeline
PipelineAI
apache/zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
dmlc/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
databricks/spark-csv
CSV Data Source for Apache Spark 1.x
seahboonsiew/pyspark-csv
An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parses csv data into SchemaRDD. No installation required, simply include pyspark_csv.py via SparkContext.
gtagency/buzzmobile-old
ROS code for a self driving car
scopatz/nanorc
Improved Nano Syntax Highlighting Files
tfolkman/BDAH
Files for class EE 381V: Big Data Analytics for Healthcare
adshi/Raw-code
MSBA
scascketta/CapMetrics
Historical transit data for Austin including vehicle positions, ridership, and schedules.
fivethirtyeight/data
Data and code behind the articles and graphics at FiveThirtyEight
jgscott/STA380
STA 380: Predictive Modeling
Ironholds/pystr
Python String Methods in R.
tranhungt/okcupidjs
Automate your OKCupid Activity. This is an API Wrapper for OkCupid App, allowing you to automate processes and collect data for further analysis
dong-y/D3.js_tutorial
D3.js tutorial by d3Vienno Youtube
codesuki/react-d3-components
D3 Components for React
esbullington/react-d3
Modular React charts made with d3.js
visionmedia/move.js
CSS3 backed JavaScript animation framework
jrnold/ggthemes
Additional themes, scales, and geoms for ggplot2