mshtelma's Stars
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
guidance-ai/guidance
A guidance language for controlling large language models.
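The typical usage pattern, as documented by the project, interleaves literal prompt text with constrained generation calls on a model object. A minimal sketch, assuming guidance 0.1+ and a small Hugging Face model (the model name is only an illustration):

```python
# Minimal guidance sketch (assumes guidance >= 0.1; model choice is illustrative).
from guidance import models, gen

# Load any Hugging Face causal LM through the Transformers backend.
lm = models.Transformers("gpt2")

# Interleave literal prompt text with a named, constrained generation call.
lm += "The capital of France is " + gen("capital", stop=".", max_tokens=10)

print(lm["capital"])  # captured generations are accessible by name
```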
huggingface/candle
Minimalist ML framework for Rust
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Also supports a number of inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
state-spaces/mamba
Mamba SSM architecture
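The Mamba block is a drop-in, shape-preserving PyTorch module; the sketch below follows the usage shown in the repo's README (hyperparameters as documented there; CUDA required):

```python
# Mamba block used as a standard sequence-to-sequence PyTorch module.
import torch
from mamba_ssm import Mamba

batch, length, dim = 2, 64, 16
x = torch.randn(batch, length, dim).to("cuda")
model = Mamba(
    d_model=dim,  # model dimension
    d_state=16,   # SSM state expansion factor
    d_conv=4,     # local convolution width
    expand=2,     # block expansion factor
).to("cuda")
y = model(x)
assert y.shape == x.shape  # output has the same (batch, length, dim) shape
```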
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Million-Token Context
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
amazon-science/chronos-forecasting
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
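Chronos exposes a simple predict interface over pretrained checkpoints. A minimal sketch following the README pattern, with a synthetic series standing in for real data:

```python
# Probabilistic forecasting with a pretrained Chronos checkpoint.
import torch
from chronos import ChronosPipeline

pipeline = ChronosPipeline.from_pretrained(
    "amazon/chronos-t5-small",
    device_map="cpu",
    torch_dtype=torch.bfloat16,
)

context = torch.sin(torch.arange(200) / 10.0)  # synthetic series for illustration
forecast = pipeline.predict(context=context, prediction_length=12, num_samples=20)
print(forecast.shape)  # [num_series, num_samples, prediction_length]
```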
NVIDIA/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
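A rough sketch of the pipeline-block idea, assuming the executor and block names from the project README (paths and the filter predicate are placeholders):

```python
# Sketch of a datatrove pipeline: read -> filter -> write, run locally.
from datatrove.executor import LocalPipelineExecutor
from datatrove.pipeline.readers import JsonlReader
from datatrove.pipeline.filters import LambdaFilter
from datatrove.pipeline.writers import JsonlWriter

executor = LocalPipelineExecutor(
    pipeline=[
        JsonlReader("data/input"),                    # read .jsonl documents
        LambdaFilter(lambda doc: "llm" in doc.text),  # keep matching docs
        JsonlWriter("data/output"),                   # write survivors back out
    ],
    tasks=4,  # number of parallel local tasks
)
executor.run()
```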
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
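The optimizer is meant as a drop-in replacement for AdamW with no learning-rate schedule; the one non-standard requirement, per the README, is switching the optimizer itself between train and eval modes. A minimal sketch:

```python
# Schedule-free AdamW: no LR schedule, but the optimizer itself has modes.
import torch
import schedulefree

model = torch.nn.Linear(10, 1)
optimizer = schedulefree.AdamWScheduleFree(model.parameters(), lr=2.5e-3)

optimizer.train()  # must be called before training steps
for _ in range(100):
    optimizer.zero_grad()
    loss = model(torch.randn(8, 10)).pow(2).mean()
    loss.backward()
    optimizer.step()

optimizer.eval()  # must be called before evaluation or checkpointing
```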
zjunlp/LLMAgentPapers
Must-read Papers on LLM Agents.
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
systemdesignfightclub/SDFC
Roadmap and Resource Compilation for System Design Fight Club
SqueezeAILab/LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
Lightning-AI/lightning-thunder
Make PyTorch models up to 40% faster! Thunder is a source-to-source compiler for PyTorch. It enables using different hardware executors at once, across one or thousands of GPUs.
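A minimal taste of the source-to-source idea, following the README: thunder.jit wraps a callable (or module) and returns a compiled drop-in replacement.

```python
# Compile a function with Thunder and call it like the original.
import torch
import thunder

def foo(a, b):
    return a + b

jfoo = thunder.jit(foo)  # compiled, drop-in replacement for foo

a = torch.randn(2, 2)
b = torch.randn(2, 2)
print(jfoo(a, b))  # same result as foo(a, b), via generated code
```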
Azure/GPT-RAG
Sharing the lessons we have been gathering along the way to enable Azure OpenAI at enterprise scale in a secure manner. GPT-RAG core is a Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
ContextualAI/gritlm
Generative Representational Instruction Tuning
ToluClassics/candle-tutorial
Tutorial for Porting PyTorch Transformer Models to Candle (Rust)
normster/llm_rules
RuLES: a benchmark for evaluating rule-following in language models
lucidrains/CALM-pytorch
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google DeepMind
corl-team/rebased
Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"
AI4Finance-Foundation/Deep-Reinforcement-Learning-for-Stock-Trading-DDPG-Algorithm-NIPS-2018
Practical Deep Reinforcement Learning Approach for Stock Trading. NeurIPS 2018 AI in Finance.
austin-starks/Deep-RL-Stocks
Reinforcement Learning for Stock Market Prediction
allisonwang-db/pyspark-data-sources
Custom PySpark Data Sources
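These build on the Python Data Source API added in recent PySpark. A minimal sketch of the shape of such a source, with illustrative names and toy data:

```python
# Skeleton of a custom batch data source (PySpark 4.0+ Python Data Source API).
from pyspark.sql.datasource import DataSource, DataSourceReader

class FakeDataSource(DataSource):
    """Toy source returning hard-coded rows; class and format names are illustrative."""

    @classmethod
    def name(cls):
        return "fake"  # format name used in spark.read.format("fake")

    def schema(self):
        return "name string, age int"

    def reader(self, schema):
        return FakeReader()

class FakeReader(DataSourceReader):
    def read(self, partition):
        # Yield tuples matching the declared schema.
        yield ("Alice", 30)
        yield ("Bob", 25)

# Usage, inside an active Spark session:
# spark.dataSource.register(FakeDataSource)
# spark.read.format("fake").load().show()
```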
rmosleydb/agent-studio
A Databricks framework for quickly building agent solutions
Data-drone/OpenMMLab-testing
Running OpenMMLab on Databricks