carmocca's Stars
syncthing/syncthing
Open Source Continuous File Synchronization
mckaywrigley/chatbot-ui
AI chat for every model.
refined-github/refined-github
:octocat: Browser extension that simplifies the GitHub interface and adds useful features
karpathy/llm.c
LLM training in simple, raw C/CUDA
benfred/py-spy
Sampling profiler for Python programs
state-spaces/mamba
Mamba SSM architecture
stas00/ml-engineering
Machine Learning Engineering Open Book
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Lightning-AI/litgpt
Load, pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
skypilot-org/skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
pythonprofilers/memory_profiler
Monitor memory usage of Python code
JoePenna/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Lightning-AI/lightning-thunder
Make PyTorch models up to 40% faster! Thunder is a source-to-source compiler for PyTorch. It enables using different hardware executors at once, across one or thousands of GPUs.
xl0/lovely-tensors
Tensors, ready for human consumption
mosaicml/streaming
A Data Streaming Library for Efficient Neural Network Training
bigscience-workshop/bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
repository-settings/app
Pull Requests for GitHub repository settings
pytorch/PiPPy
Pipeline Parallelism for PyTorch
penghao-wu/vstar
PyTorch implementation of "V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs"
Lightning-AI/litdata
Streamline data pipelines for AI. Process datasets across thousands of machines, and optimize data for blazing-fast model training.
llm-efficiency-challenge/neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
EleutherAI/cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
pytorch/torchsnapshot
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind.
pytorch/torchdistx
Torch Distributed Experimental
graphcore-research/unit-scaling
A library for unit scaling in PyTorch
rom1504/gpu-tester
GPU tester that detects broken and slow GPUs in a cluster
graphcore-research/out-of-the-box-fp8-training
Demo of the unit_scaling library, showing how a model can easily be adapted to train in FP8.