Skip to main content

Introduction

LastMile is the full-stack developer platform to debug, evaluate and improve LLM applications. We make it easy to fine-tune custom evaluators, set up guardrails & monitor app performance.

from lastmile import LastMile;
LastMile.eval("Hello world")

Meet alBERTa 🍁

alBERTa is a family of small language models designed for evaluation. They are optimized to be:

  • small -- 400M parameter entailment model
  • fast -- can run inference on CPU in < 300ms
  • customizable -- fine-tune for custom evaluation tasks

Out-of-the-box metrics

Design your own metric

Explore our guides