Skip to main content

Example Datasets

Synthetic datasets representing typical AI applications

We have generated a few datasets that are representative of typical AI applications so you have test data to try out the platform. All datasets are available in the lastmile-docs repository.

info

The datasets were generated using gen_autoeval_datasets.ipynb notebook. You can use it to generate other kinds of synthetic datasets.

Dataset Index

SuperCard Dining Concierge

An AI dining recommendation bot that takes in a user query in natural language, and returns a list of restaurants (in natural language). Analogous to a RAG application with the restaurant DB being the source of truth.

The dataset contains example interactions, as well as a Relevance label -- 1 if the interaction is related to dining, and 0 if it isn't.

MediaCo Data Warehouse Agent

A multi-agent AI application, where each agent is able to query a data warehouse, and together answer questions spanning all parts of a media company's various properties (book publisher, music label, TV network, etc.)

The dataset contains example interactions, as well as a Faithfulness label -- 1 if the interaction invoked the correct agent and returned a relevant response, and 0 if it didn't.

StrikeAir Customer Support

A conversational AI system that answers customers' questions on behalf of an airline ("Strike Air").

The dataset contains example interactions, as well as a Clarity & Politeness label -- 1 if the interaction was accurate and cordial, and 0 if it wasn't.

WMB Financial Wealth Advisor

An AI retrieval system for financial advisors, helping them more efficiently provide financial advice to their clients.

The dataset contains example interactions, as well as a Correctness label -- 1 if the interaction was accurate and factual given some ground truth data, and 0 if it wasn't.