Documentation Index

Fetch the complete documentation index at: https://whyops.com/docs/llms.txt

Use this file to discover all available pages before exploring further.

While traces and decision graphs help you debug a single failure, Agent Knowledge Profiles help you understand your agent’s performance across thousands of runs.

What is an Agent Knowledge Profile?

A Knowledge Profile is an aggregated dashboard specific to a single Agent (identified by the X-Agent-Name header you send through the proxy). It combines:
  1. Usage Analytics: Volume, token consumption, and cost over time.
  2. Performance Analytics: P50/P90/P99 latency distributions.
  3. Evaluation Scores: The results of automated LLM-as-a-judge evaluations run by the WhyOps analyse service.
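The P50/P90/P99 figures above are just percentiles over per-trace latencies. A minimal sketch of the computation (the helper name and data shape are my own, not part of any WhyOps SDK):

```typescript
// Nearest-rank percentile over a list of latencies in milliseconds.
// Illustrative helper -- not a WhyOps API.
function percentile(latenciesMs: number[], p: number): number {
  if (latenciesMs.length === 0) throw new Error("no samples");
  const sorted = [...latenciesMs].sort((a, b) => a - b);
  // Nearest-rank: smallest value with at least p% of samples at or below it.
  const rank = Math.ceil((p / 100) * sorted.length);
  return sorted[Math.max(0, rank - 1)];
}

const samples = [120, 95, 300, 250, 110, 400, 130, 90, 105, 1200];
const p50 = percentile(samples, 50); // 120
const p90 = percentile(samples, 90); // 400
const p99 = percentile(samples, 99); // 1200
```

Note how a single slow outlier (the 1200 ms trace) dominates P99 while leaving P50 untouched, which is why the profile reports all three.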

LLM Evaluations

You can configure WhyOps to automatically run deep evaluations on a sampling of your agent’s traces. When you trigger an analysis run (e.g., mode: deep, judgeModel: gpt-4o), WhyOps evaluates the trace across specific dimensions that you select.
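Sampling matters because each judged trace costs a judge-model call. One common way to pick a stable subset is to hash the trace id, so repeated runs evaluate the same traces; this is my own illustration of the idea, not necessarily how the analyse service samples:

```typescript
// Deterministically decide whether a trace falls in the evaluation sample.
// Hashing the id keeps the subset stable across re-runs. Illustrative only.
function shouldSample(traceId: string, rate: number): boolean {
  let h = 0;
  for (const ch of traceId) {
    h = (h * 31 + ch.charCodeAt(0)) >>> 0; // simple rolling hash, kept in uint32
  }
  return (h % 1000) / 1000 < rate; // rate in [0, 1]
}

// e.g. send roughly 10% of traces to the judge:
const candidates = ["trace-a", "trace-b", "trace-c"];
const sampled = candidates.filter((id) => shouldSample(id, 0.1));
```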

Viewing the Data

The Knowledge Profile visualizes these scores using recharts to show trends over time.
  • Trend Lines: See if your agent’s intent_precision dropped after you deployed a new system prompt last Tuesday.
  • Radar Charts: Compare your agent’s strengths. Perhaps your agent is excellent at tool_invocation_quality (formatting JSON arguments correctly) but terrible at tool_routing_quality (picking the right tool in the first place).
  • Failure Modals: The profile highlights the most common static analysis findings for your agent. If 40% of your traces have an ORPHAN_TOOL_CALL_RESPONSE warning, you know there is a systemic bug in your agent framework’s event emission.
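A figure like "40% of traces have an ORPHAN_TOOL_CALL_RESPONSE warning" is just the share of traces carrying each finding. A sketch of that aggregation (the `Trace` shape and finding names other than `ORPHAN_TOOL_CALL_RESPONSE` are illustrative, not the WhyOps schema):

```typescript
// Share of traces that carry each static-analysis finding.
// Shapes are illustrative, not the WhyOps trace schema.
type Trace = { id: string; findings: string[] };

function findingRates(traces: Trace[]): Map<string, number> {
  const counts = new Map<string, number>();
  for (const t of traces) {
    // Dedupe so a trace with the same warning twice counts once.
    for (const f of new Set(t.findings)) {
      counts.set(f, (counts.get(f) ?? 0) + 1);
    }
  }
  const rates = new Map<string, number>();
  for (const [f, c] of counts) rates.set(f, c / traces.length);
  return rates;
}

const traces: Trace[] = [
  { id: "t1", findings: ["ORPHAN_TOOL_CALL_RESPONSE"] },
  { id: "t2", findings: [] },
  { id: "t3", findings: ["ORPHAN_TOOL_CALL_RESPONSE"] },
  { id: "t4", findings: [] },
  { id: "t5", findings: ["SOME_OTHER_WARNING"] },
];
// ORPHAN_TOOL_CALL_RESPONSE appears in 2 of 5 traces -> rate 0.4
```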

Configuring Evaluations

You can set up automated cron-based evaluations via the API. For example, to run a standard evaluation nightly at midnight UTC:
PUT /api/agent-analyses/:agentId/config
Authorization: Bearer <WHYOPS_API_KEY>

{
  "enabled": true,
  "cronExpr": "0 0 * * *",
  "timezone": "UTC",
  "lookbackDays": 1,
  "mode": "standard",
  "judgeModel": "gpt-4o",
  "dimensions": [
    "intent_precision",
    "tool_routing_quality",
    "answer_completeness_clarity"
  ]
}
This ensures you have a continuously updating baseline of your agent’s quality without writing manual evaluation scripts.
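A client-side sketch of preparing that request body. The field names mirror the documented body above; the `AnalysisConfig` type and the validation helper are my own additions, not a WhyOps SDK:

```typescript
// Shape of the analysis config body documented above.
// The validation rules below are illustrative client-side checks.
interface AnalysisConfig {
  enabled: boolean;
  cronExpr: string;
  timezone: string;
  lookbackDays: number;
  mode: "standard" | "deep";
  judgeModel: string;
  dimensions: string[];
}

function validateConfig(cfg: AnalysisConfig): string[] {
  const errors: string[] = [];
  // A standard cron expression has exactly five space-separated fields.
  if (cfg.cronExpr.trim().split(/\s+/).length !== 5) {
    errors.push("cronExpr must have 5 fields");
  }
  if (cfg.lookbackDays < 1) errors.push("lookbackDays must be >= 1");
  if (cfg.dimensions.length === 0) errors.push("select at least one dimension");
  return errors;
}

const nightly: AnalysisConfig = {
  enabled: true,
  cronExpr: "0 0 * * *", // nightly at midnight in the configured timezone
  timezone: "UTC",
  lookbackDays: 1,
  mode: "standard",
  judgeModel: "gpt-4o",
  dimensions: [
    "intent_precision",
    "tool_routing_quality",
    "answer_completeness_clarity",
  ],
};
// Then PUT the body, e.g. with:
// fetch(`/api/agent-analyses/${agentId}/config`, {
//   method: "PUT",
//   headers: { Authorization: `Bearer ${apiKey}`, "Content-Type": "application/json" },
//   body: JSON.stringify(nightly),
// });
```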