Logging - AC2

The log() function accepts typed dataclasses for structured logging. Each type creates a specific span in the trace.

from caribou import Message, ToolExecution, Grade, Event
caribou.log(item)

Message

Log conversation messages. Assistant messages can include LLM metadata.

caribou.log(Message(role="user", content="Hello"))

caribou.log(Message(
    role="assistant",
    content="Hi there!",
    tool_calls=[{"id": "call_1", "function": {"name": "search", "arguments": "{}"}}],
    model="gpt-4o",
    provider="openai",
    input_tokens=100,
    output_tokens=50,
    cost_usd=0.002,
    duration_ms=1500,
))

Grader LLM calls use the grader_name field:

caribou.log(Message(
    role="assistant",
    content="Score: 0.8",
    grader_name="helpfulness",
    model="gpt-4o",
    input_tokens=200,
    output_tokens=10,
))

Field	Type	Description
`role`	`str`	`"user"`, `"assistant"`, or `"system"`
`content`	`str \| list[dict] \| None`	Text or OpenAI-style multimodal content
`tool_calls`	`list[dict] \| None`	Tool calls made by assistant
`tool_call_id`	`str \| None`	Tool call ID (for tool responses)
`name`	`str \| None`	Function name (for tool responses)
`model`	`str \| None`	LLM model name
`provider`	`str \| None`	LLM provider
`input_tokens`	`int \| None`	Input token count
`output_tokens`	`int \| None`	Output token count
`cost_usd`	`float \| None`	Cost in USD
`duration_ms`	`int \| None`	Duration in milliseconds
`grader_name`	`str \| None`	Grader name (for grader LLM calls)
`prompt`	`str \| None`	Grader prompt summary

Span naming

Scenario	span_kind	span_name
Regular message	`message`	`message.{role}`
Message with `grader_name`	`grader.{grader_name}`	`grader.{grader_name}`

ToolExecution

Log tool calls with arguments, results, and error status.

caribou.log(ToolExecution(
    name="read_file",
    call_id="call_abc123",
    arguments='{"path": "/etc/hosts"}',
    result="127.0.0.1 localhost",
    error=None,
    requestor="agent",
    duration_ms=50,
))

Field	Type	Description
`name`	`str`	Tool name
`call_id`	`str \| None`	Tool call ID
`arguments`	`dict \| str \| None`	Tool input
`result`	`str \| list[dict] \| None`	Tool result (text or multimodal)
`error`	`str \| None`	Error message if failed
`requestor`	`str \| None`	`"agent"`, `"user"`, or `"system"`
`duration_ms`	`int \| None`	Duration in milliseconds

Creates a tool.{name} span.

Grade

Log grading results with scores, reasoning, and per-criterion breakdowns.

caribou.log(Grade(
    score=0.85,
    grader_type="MyGrader",
    reasoning="Passed 2 of 3 tests",
    criteria=[
        {"type": "test_pass", "passed": True, "score": 1.0},
        {"type": "test_pass", "passed": False, "score": 0.0},
    ],
    passed_count=2,
    total_count=3,
))

Field	Type	Description
`score`	`float`	Score (0.0 to 1.0)
`grader_type`	`str \| None`	Grader class name
`reasoning`	`str \| None`	Reasoning explanation
`criteria`	`list[dict] \| None`	Individual criteria results
`passed_count`	`int \| None`	Number of passed criteria
`total_count`	`int \| None`	Total criteria count

Creates a grader or grader.{grader_type} span. Each criterion in criteria creates a child grader.criterion span.

Event

Log lifecycle events with optional details.

caribou.log(Event(name="setup_start"))
caribou.log(Event(name="grading_complete", details={"duration_ms": 500}))

Field	Type	Description
`name`	`str`	Event name
`details`	`dict \| None`	Additional event details

Creates an event.{name} span.

Multimodal content

Caribou processes base64 images in message content, tool results, tool arguments, grader artifacts, and event details. Images are uploaded to S3 and replaced with caribou-image:// URLs, which Kestrel renders automatically. Use OpenAI-style content parts:

caribou.log(ToolExecution(
    name="render_3d",
    call_id="call_1",
    arguments={"region": "interface"},
    result=[
        {"type": "image_url", "image_url": {"url": "data:image/png;base64,iVBORw0KGgo..."}},
        {"type": "text", "text": "Rendered view of the interface region."},
    ],
))

caribou.log(Message(
    role="user",
    content=[
        {"type": "image_url", "image_url": {"url": "data:image/png;base64,..."}},
        {"type": "text", "text": "What does this structure show?"},
    ],
))

Caribou

Manta

​Message

​Span naming

​ToolExecution

​Grade

​Event

​Multimodal content