Skip to main content

Overview

The Ingestion skill uploads your agent logs into Docent by writing a script to convert them into Agent Run format. To learn more about how SDK ingestion works under the hood, you can read our Quickstart guide. We recommend ingesting one trajectory format at a time. If you have run an eval with multiple harnesses, first sort runs with different harnesses into separate folders and run the ingestion skill on each folder.

Quickstart

We strongly recommend installing our Claude Code plugin so that you receive automatic updates.

How the ingestion skill works

Our ingestion skill incorporates best practices to upload and organize your data within Docent. The skill investigates your file structure, examines your trajectory format, and maps each field in your schema to a Docent object. It produces:
  1. ingestion-plan.md: A mapping of your trajectory fields to Docent’s data model, including any fields that will be intentionally omitted. You can review this file to verify that the model intends to organize and display your data in a suitable format.
  2. ingest.py: A script that reads your logs and uploads them via the SDK. You can modify and rerun this as needed.
The skill asks clarifying questions if your data format is ambiguous or if it needs more context about how you want the data structured. We recommend ingesting one trajectory format at a time.

Best practices

  • Ingest one trajectory format at a time. If you’re ingesting multiple formats, ask your agent to sort your trajectories by the scaffold that generated them (e.g., openhands/, mini-swe-agent/, custom/) before activating the Docent skill.
  • Verify your uploaded data in the web interface. After upload, check that transcripts display correctly and metadata appears where you expect. It’s normal to iterate a few times on different organization structures.

What’s next

  • Ready to analyze? See Search and Clustering to start exploring your data
  • Want to tweak the ingestion script yourself? Read up on Docent’s underlying data models.