andrewkho's Stars
awslabs/s3-connector-for-pytorch
The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.
pytorch/torchtune
A Native-PyTorch Library for LLM Fine-tuning
DAGWorks-Inc/hamilton
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
dmlc/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
bbc/bbplot
R package that helps create and export ggplot2 charts in the style used by the BBC News data team
hhursev/recipe-scrapers
Python package for scraping recipes data
stroxler/tdx
collection-Types, Data and decorators, and eXception handling tools
artisynth/artisynth_core
Core modules for ArtiSynth mechanical modeling system
kieferk/dfply
dplyr-style piping operations for pandas dataframes
uptake/uptasticsearch
An Elasticsearch client tailored to data science workflows.
EdwardRaff/JSATFX
GUI components for JSAT
EdwardRaff/JSAT
Java Statistical Analysis Tool, a Java library for Machine Learning
zertrin/duplicity-backup.sh
Bash wrapper script for automated backups with duplicity supporting Amazon's S3 online storage as well as other storage destinations (ftp, rsync, sftp, local storage...).