Literal is a an all in one observability, evaluation and analytics platform for building production-grade LLM apps.Literal covers a wide range of use cases, from conversational applications to task automation.
Observability: Monitor your LLM app (including conversations, intermediary steps, feedback, files, prompts, token consumption) in a few minutes with our SDKs. Literal provides a unified view of all your data in one place.
Dataset: Create datasets mixing production data and hand written examples to run non regression tests.
Online Evals: Evaluate your threads and runs in real time using off the shelf and custom evaluators.
Prompt Collaboration: Safely iterate, version and deploy prompts directly from Literal.