cfoster0's Stars
notarussianteenager/srf-attention
Simplex Random Feature attention, in PyTorch
andyzoujm/representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
mlfoundations/open_lm
A repository for research on medium sized language models.
kernelmachine/silo-lm
SILO Language Models code repository
cure-lab/LTSF-Linear
[AAAI-23 Oral] Official implementation of the paper "Are Transformers Effective for Time Series Forecasting?"
IBM/ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.
FL33TW00D/whisper-turbo
Cross-Platform, GPU Accelerated Whisper 🏎️
persimmon-ai-labs/adept-inference
Inference code for Persimmon-8B
wilson-labs/cola
Compositional Linear Algebra
UKPLab/on-emergence
Code and files for the paper "Are Emergent Abilities in Large Language Models just In-Context Learning?"
abacusai/Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval capabilities with context expansion. We also include key experimental results and instructions for reproducing and building on them.
HazyResearch/spacetime
Code for SpaceTime 🌌⏱️. Proposed in Effectively Modeling Time Series with Simple Discrete State Spaces, ICLR 2023.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
maitrix-org/llm-reasoners
A library for advanced large language model reasoning
jina-ai/finetuner
🎯 Task-oriented embedding tuning for BERT, CLIP, etc.
cyrilou242/ftcc
Fast text classification with compressor dictionaries
karpathy/llama2.c
Inference Llama 2 in one file of pure C
Felix-Petersen/difflogic
A Library for Differentiable Logic Gate Networks
scaleapi/llm-engine
Scale LLM Engine public repository
BaseModelAI/cleora
Cleora AI is a general-purpose open-source model for efficient, scalable learning of stable and inductive entity embeddings for heterogeneous relational data. Created by Synerise.com team.
aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
explosion/curated-transformers
🤖 A PyTorch library of curated Transformer models and their composable components
lamini-ai/lamini
The Official Python Client for Lamini's API
FutureComputing4AI/Hrrformer
Hrrformer: A Neuro-symbolic Self-attention Model (ICML23)
sdan/vlite
Fast vector database made in NumPy
0hq/tinyvector
A tiny nearest-neighbor embedding database built with SQLite and PyTorch. (In development!)
bshall/knn-vc
Voice Conversion With Just Nearest Neighbors
bit-gpt/app
BitGPT: your personal AI in your pocket
acerbilab/pybads
PyBADS: Bayesian Adaptive Direct Search optimization algorithm for model fitting in Python