denmoroz's Stars
iarai/concurrent-dataloader
Profiling and Improving the PyTorch Dataloader for high-latency Storage
ML-SystemDesign/MLSystemDesign
exaloop/codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
google-deepmind/graphcast
ZFTurbo/timm_3d
PyTorch Volume Models for 3D data
microsoft/torchgeo
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
nsidc/earthaccess
Python Library for NASA Earthdata APIs
cogeotiff/rio-tiler
User friendly Rasterio plugin to read raster datasets.
ben1post/xarray-simlab-ode
Framework for building and solving ODE-based models, an extension of xarray-simlab
blaylockbk/Herbie
Download numerical weather prediction datasets (HRRR, RAP, GFS, IFS, etc.) from NOMADS, NODD partners (Amazon, Google, Microsoft), ECMWF open data, and the University of Utah Pando Archive System.
TheJacksonLaboratory/zarrdataset
A dataset for loading zarr files to be used in machine learning training pipelines
TileDB-Inc/TileDB
The Universal Storage Engine
google/tensorstore
Library for reading and writing large multi-dimensional arrays.
jaychempan/Awesome-LWMs
🌍 A Collection of Awesome Large Weather Models (LWMs) | AI for Earth (AI4Earth) | AI for Science (AI4Science)
PyTables/PyTables
A Python package to manage extremely large amounts of data
marsupialtail/quokka
Making data lake work for time series
delta-incubator/deltaray
Delta reader for the Ray open-source toolkit for building ML applications
NX-AI/xlstm
Official repository of the xLSTM.
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
fzi-forschungszentrum-informatik/Lanelet2
Map handling framework for automated driving
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
feast-dev/feast
The Open Source Feature Store for Machine Learning
rtqichen/torchdiffeq
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
onnx/onnxmltools
ONNXMLTools enables conversion of models to ONNX
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
lancedb/lancedb
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
alirezadir/Machine-Learning-Interviews
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.