kdexd's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
modularml/mojo
The Mojo Programming Language
black-forest-labs/flux
Official inference repo for FLUX.1 models
ml-explore/mlx
MLX: An array framework for Apple silicon
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
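The core of the BPE algorithm that minbpe (and tiktoken's trained vocabularies) is built on can be sketched in a few lines: count adjacent token pairs, repeatedly merge the most frequent pair into a new token id. This is a minimal illustrative sketch, not minbpe's actual code; byte-level ids start at 0-255 and merged tokens at 256, as is conventional.

```python
from collections import Counter

def most_common_pair(ids):
    """Return the most frequent adjacent pair of token ids, or None."""
    pairs = Counter(zip(ids, ids[1:]))
    return pairs.most_common(1)[0][0] if pairs else None

def merge(ids, pair, new_id):
    """Replace every non-overlapping occurrence of `pair` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

def train_bpe(text, num_merges):
    """Learn up to `num_merges` BPE merges over the UTF-8 bytes of `text`."""
    ids = list(text.encode("utf-8"))
    merges = {}  # (id, id) -> new merged id
    for new_id in range(256, 256 + num_merges):
        pair = most_common_pair(ids)
        if pair is None:
            break
        ids = merge(ids, pair, new_id)
        merges[pair] = new_id
    return ids, merges
```

For example, training 3 merges on "aaabdaaabac" compresses the 11-byte input down to 5 tokens, first merging the frequent "aa" pair.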
meta-llama/llama-models
Utilities intended for use with Llama models.
lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector indexing, and data versioning. Compatible with Pandas, DuckDB, Polars, PyArrow, and PyTorch, with more integrations coming.
rom1504/img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
pytorch/torchtitan
A native PyTorch Library for large model training
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
visual-layer/fastdup
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.
facebookresearch/MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
pytorch-labs/segment-anything-fast
A batched offline inference oriented version of segment-anything
apple/ml-aim
This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.
google-research-datasets/wit
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
AdaptiveMotorControlLab/CEBRA
Learnable latent embeddings for joint behavioral and neural analysis - Official implementation of CEBRA
dangeng/visual_anagrams
Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"
mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
HazyResearch/aisys-building-blocks
Building blocks for foundation models.
rom1504/cc2dataset
Easily convert Common Crawl into a dataset of caption/document pairs: image/text, audio/text, video/text, ...
facebookresearch/lightplane
Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from images to 3D grids.
gautierdag/bpeasy
Fast bare-bones BPE for modern tokenizer training
facebookresearch/meru
Code for the paper "Hyperbolic Image-Text Representations", Desai et al., ICML 2023
masadcv/FastGeodis
Fast Implementation of Generalised Geodesic Distance Transform for CPU (OpenMP) and GPU (CUDA)
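The generalised geodesic distance that FastGeodis accelerates can be illustrated with a plain Dijkstra pass over the image grid, where each step's cost mixes spatial distance with intensity change. This is a slow, illustrative pure-Python sketch (4-connected neighbourhood, one blending weight `lam`), not FastGeodis's raster-scan/parallel implementation.

```python
import heapq

def geodesic_distance(image, seeds, lam=1.0):
    """Generalised geodesic distance on a 2D grid via Dijkstra.

    image: 2D list of intensities; seeds: list of (row, col) source pixels.
    Step cost = sqrt(1 + (lam * intensity_difference)**2), so lam=0 gives
    the plain Euclidean (city-block here) distance transform.
    """
    h, w = len(image), len(image[0])
    dist = [[float("inf")] * w for _ in range(h)]
    heap = []
    for r, c in seeds:
        dist[r][c] = 0.0
        heapq.heappush(heap, (0.0, r, c))
    while heap:
        d, r, c = heapq.heappop(heap)
        if d > dist[r][c]:
            continue  # stale heap entry
        for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < h and 0 <= nc < w:
                grad = lam * (image[nr][nc] - image[r][c])
                nd = d + (1.0 + grad * grad) ** 0.5
                if nd < dist[nr][nc]:
                    dist[nr][nc] = nd
                    heapq.heappush(heap, (nd, nr, nc))
    return dist
```

On a uniform image the result reduces to the ordinary grid distance from the seed set; high-contrast edges act as soft barriers that the geodesic path routes around.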
hammoudhasan/SynthCLIP
Codebase for SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.
juliusberner/ddn_tutorial
Tutorial on Deep Declarative Networks