rostandk

The Netherlands

rostandk's Stars

mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language:Jupyter Notebook40.3k 417 694.3k
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language:Python38k 763 10.3k14.5k
linexjlin/GPTs
leaked prompts of GPTs
29k 312 273.9k
conductor-oss/conductor
Conductor is an event driven orchestration platform
Language:Java18.1k 39 103510
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Language:Jupyter Notebook9.7k 143 4541.5k
microsoft/SynapseML
Simple and Distributed Machine Learning
Language:Scala5.1k 146 732833
PaddlePaddle/PaddleRec
Recommendation Algorithm大规模推荐算法库，包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM，DSIN，SIGN，IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESMM、ESCMM, MAML、xDeepFM、DeepFEFM、NFM、AFM、RALM、DMR、GateNet、NAML、DIFM、Deep Crossing、PNN、BST、AutoInt、FGCNN、FLEN、Fibinet、ListWise、DeepRec、ENSFM，TiSAS，AutoFIS等，包含经典推荐系统数据集criteo 、movielens等
Language:Python4.3k 194 217726
benfred/implicit
Fast Python Collaborative Filtering for Implicit Feedback Datasets
Language:Python3.6k 77 492611
microsoft/hummingbird
Hummingbird compiles trained ML models into tensor computation for faster inference.
Language:Python3.4k 51 318279
grantjenks/python-diskcache
Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.
Language:Python2.4k 22 260137
pykeen/pykeen
🤖 A Python library for learning and evaluating knowledge graph embeddings
Language:Python1.7k 27 555191
lucidrains/soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Language:Python1.4k 51 2290
jupyter-incubator/sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
Language:Python1.3k 48 436448
NVIDIA-Merlin/NVTabular
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Language:Python1.1k 33 788143
NVIDIA-Merlin/Merlin
NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference in production.
Language:Python789 34 443118
LucaCanali/sparkMeasure
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.
Language:Scala715 34 40146
facebookresearch/SONAR
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
Language:Python471 15 2248
grai-io/grai-core
Language:Python301 2 2320
NVIDIA-Merlin/models
Merlin Models is a collection of deep learning recommender system model reference implementations
Language:Python264 23 49250
nicholasmireles/DotDict
A simple Python library to make chained attributes possible.
Language:Python232 2 53
adidas/lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
Language:Python228 18 238
microsoft/MSMARCO-Question-Answering
MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension and question answering
Language:Python211 15 935
haxsaw/hikaru
Move smoothly between Kubernetes YAML and Python for creating/updating/componentizing configurations.
Language:Python207 6 3418
cerndb/spark-dashboard
Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using containers technology.
Language:Dockerfile118 11 522
NVIDIA-Merlin/systems
Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature stores, nearest neighbor search, and exploration strategies) into end-to-end recommendation pipelines that can be served with Triton Inference Server.
Language:Python90 17 10030
outlines-dev/functions
A collection of Outlines functions
Language:Python45 5 13
benchsci/tinsel
PySpark schema generator
Language:Python38 3 05
javiber/scrat
Persistent Caching of Expensive Function Results
Language:Python31 1 11
NVIDIA-Merlin/core
Core Utilities for NVIDIA Merlin
Language:Python19 15 4314
truskovskiyk/ml-in-production-webinars
10 3 02

rostandk

rostandk's Stars

mlabonne/llm-course

apache/airflow

linexjlin/GPTs

conductor-oss/conductor

NielsRogge/Transformers-Tutorials

microsoft/SynapseML

PaddlePaddle/PaddleRec

benfred/implicit

microsoft/hummingbird

grantjenks/python-diskcache

pykeen/pykeen

lucidrains/soundstorm-pytorch

jupyter-incubator/sparkmagic

NVIDIA-Merlin/NVTabular

NVIDIA-Merlin/Merlin

LucaCanali/sparkMeasure

facebookresearch/SONAR

grai-io/grai-core

NVIDIA-Merlin/models

nicholasmireles/DotDict

adidas/lakehouse-engine

microsoft/MSMARCO-Question-Answering

haxsaw/hikaru

cerndb/spark-dashboard

NVIDIA-Merlin/systems

outlines-dev/functions

benchsci/tinsel

javiber/scrat

NVIDIA-Merlin/core

truskovskiyk/ml-in-production-webinars