
Small Models

Big capability, small package

Category: models · Row 3: Deployment · Intermediate · 2 hours · Requires: Lg

Overview

Small Language Models (SLMs) offer efficient, deployable AI that can run on edge devices or with minimal resources.

What is it?

Compact AI models optimized for efficiency while maintaining useful capabilities.

Why it matters

Not every task needs GPT-4. SLMs offer faster inference, lower cost, and can run locally or on-device.

How it works

Compression techniques shrink large models: knowledge distillation trains a small model to mimic a larger one, quantization stores weights at lower numeric precision, and pruning removes weights that contribute little. Other SLMs skip compression entirely and are trained from scratch with efficiency-focused architectures.
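Quantization is the most widely used of these techniques. A minimal sketch of post-training symmetric int8 quantization, using NumPy (real toolchains use per-channel or per-block scales, but the core idea is the same):

```python
import numpy as np

def quantize_int8(w):
    # One symmetric scale for the whole tensor: map the largest
    # absolute weight to 127, round everything else onto the int8 grid.
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q, scale):
    # Reconstruct approximate float32 weights from int8 codes.
    return q.astype(np.float32) * scale

w = np.random.randn(4, 4).astype(np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize_int8(q, scale)
err = np.abs(w - w_hat).max()  # rounding error, at most ~scale/2
```

Storing int8 codes instead of float32 cuts weight memory by 4x, which is a large part of why quantized small models fit on edge devices.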

Real-World Examples

Phi-3

Microsoft's efficient small model

Llama 3.2 1B

Meta's smallest Llama

Gemma

Google's lightweight models

Tools & Libraries

Ollama (framework)

Run small models locally

llama.cpp (library)

Efficient CPU inference

GGUF (technique)

Quantized model format
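GGUF stores weights in block-quantized form: weights are split into fixed-size blocks, each kept as low-bit integers plus one per-block scale. A toy sketch in the spirit of a 4-bit type like Q4_0 (the on-disk layout of real GGUF files differs; the block size of 32 and the scale-per-block idea are what this illustrates):

```python
import numpy as np

BLOCK = 32  # GGUF's 4-bit types group weights into blocks of 32

def quantize_q4(w):
    # Split into blocks, compute one scale per block, store 4-bit codes.
    w = w.reshape(-1, BLOCK)
    scales = np.abs(w).max(axis=1, keepdims=True) / 7.0
    q = np.clip(np.round(w / scales), -8, 7).astype(np.int8)
    return q, scales

def dequantize_q4(q, scales):
    # Each block is rescaled by its own factor on the way back.
    return (q.astype(np.float32) * scales).reshape(-1)

w = np.random.randn(4 * BLOCK).astype(np.float32)
q, scales = quantize_q4(w)
w_hat = dequantize_q4(q, scales)
```

Per-block scales let outlier weights in one block inflate only that block's error, not the whole tensor's, which is why 4-bit GGUF models stay usable despite the aggressive compression.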