Skip to main content

AlBERTa

A 400M parameter state-of-the-art, encoder based model designed for evaluating. This models works well without few-shot prompting or fine-tuning. However, for further performance improvements, the AlBERTa fleet of models are available for fine-tuning.

Model Description

Performance

HaluEval86%
WikiEval98%
Latency< 2ms for 1000 tokens

Example Use

import json
import oså
from typing import Any, Generator
import pandas as pd
from lastmile_auto_eval import (
EvaluationMetric,
EvaluationResult,
evaluate as auto_evaluate,
stream_evaluate,
)
from IPython.display import display

result: EvaluationResult = auto_evaluate(
dataframe=evaluation_data,
metrics=[
EvaluationMetric.P_FAITHFUL,
],
lastmile_api_token=os.getenv("LASTMILE_API_TOKEN"),
)