seanv507's Stars
malteos/awesome-document-similarity
A curated list of resources on document similarity measures (papers, tutorials, code, ...)
eugenium/layerCNN
iosband/ts_tutorial
rushter/data-science-blogs
A curated list of data science blogs
ibayer/fastFM-core
A short paper describing the library is available on arXiv.
ibayer/fastFM
fastFM: A Library for Factorization Machines
andylolu2/cuda-mnist
Training MLP on MNIST in 1.5 seconds with pure CUDA
awslabs/aws-batch-helpers
AWS Batch helpers is a collection of scripts and tools that can be used with AWS Batch.
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
awslabs/aws-glue-libs
AWS Glue Libraries are additions and enhancements to Spark for ETL operations.
awslabs/athena-glue-service-logs
Glue scripts for converting AWS Service Logs for use in Athena
Netflix/metaflow
Open Source AI/ML Platform
awsdocs/aws-batch-user-guide
The open source version of the AWS Batch user guide. You can submit feedback & requests for changes by submitting issues in this repo or by making proposed changes & submitting a pull request.
jghoman/awesome-apache-airflow
Curated list of resources about Apache Airflow
matthewwardrop/formulaic
A high-performance implementation of Wilkinson formulas for Python.
pditommaso/awesome-pipeline
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
pbreheny/biglasso
biglasso: Extending Lasso Model Fitting to Big Data in R
yet-another-account/ubuntu-setup
A set of setup scripts for Ubuntu
delip/PyTorchNLPBook
Code and data accompanying Natural Language Processing with PyTorch published by O'Reilly Media https://amzn.to/3JUgR2L
eBay/tsv-utils
eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
ydataai/ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
bbalasub1/glmnet_python
briandailey/python-packages-license-check
Exactly what it says it does - check your installed packages and report licenses.
scikit-learn/scikit-learn
scikit-learn: machine learning in Python
ninia/jep
Embed Python in Java
sushant-hiray/scala-python-example
Example to demonstrate using keras library via jep in scala
IDSIA/sacred
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
jupyter/docker-stacks
Ready-to-run Docker images containing Jupyter applications
dataquestio/ds-containers
Containers for data science
docker/machine
Machine management for a container-centric world