small-models

There are 9 repositories under small-models topic.

  • SqueezeAILab/SqueezeLLM

    [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

    Language:Python641182743
  • SqueezeAILab/KVQuant

    [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

    Language:Python302131525
  • aitomatic/openssa

    OpenSSA: Small Specialist Agents based on Domain-Aware Neurosymbolic Agent (DANA) architecture for industrial problem-solving

    Language:Python232171135
  • MCG-NJU/AMD

    [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models

    Language:Python12211
  • Decoder-Only-LLM

    logic-OT/Decoder-Only-LLM

    This repository features a custom-built decoder-only language model (LLM) with a total of 37 million parameters 🔥. I train the model to be able to ask question from a given context

    Language:Jupyter Notebook10103
  • zhangyifei01/Awesome-Self-supervised-Learning-of-Tiny-Models

    Overview of self-supervised learning of tiny models, including distillation-based methods (aks. self-supervised distillation) and non-distillation methods.

  • sfarhat/dapt

    Code for "On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models"

    Language:Python4210
  • ENSTA-U2IS-AI/optuMNIST

    Help us define the Pareto front of small models for MNIST classification. Frugal AI.

    Language:Python110
  • antonio-f/Phi-3-Vision

    Phi-3-Vision model test - running locally

    Language:Jupyter Notebook10