Test Types

AnalysisEntry

Analysis entry for chat-level analysis content.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| content | string | No | Analysis content text |

AnalyticsFacets

Resolved analytics facets — embeddable in any artifact response.

Contains filter field visibility, available options for dropdowns, and date range boundaries. Returned inline from artifact get/search responses so each page has its filter facets ready for SSR.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| fields | AnalyticsFilterFields | Yes | Filter field visibility configuration |
| department_options | AnalyticsFilterOption[] | No | Department dropdown options |
| cohort_options | AnalyticsFilterOption[] | No | Cohort dropdown options |
| role_options | AnalyticsRoleOption[] | No | Available role resource options |
| attempt_options | string[] | No | Available attempt options |
| date_range_earliest | string | No | Earliest available date for filtering |
| date_range_latest | string | No | Latest available date for filtering |

AnalyticsFilterField

Visibility/disabled state for a single filter field.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| visible | boolean | No | Whether the filter field is visible |
| disabled | boolean | No | Whether the filter field is disabled |

AnalyticsFilterFields

Per-page filter field visibility configuration.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| date_range | AnalyticsFilterField | No | Date range filter config |
| departments | AnalyticsFilterField | No | Department filter config |
| cohorts | AnalyticsFilterField | No | Cohort filter config |
| roles | AnalyticsFilterField | No | Role filter config |
| attempts | AnalyticsFilterField | No | Attempt filter config |

AnalyticsFilterOption

A single filter option for dropdown selectors.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| value | string | Yes | Option value for the filter |
| label | string | Yes | Human-readable option label |

AnalyticsRoleOption

Hydrated role resource option for analytics role filters.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| value | string | Yes | Option value for the filter |
| label | string | Yes | Human-readable option label |
| id | string | Yes | Role resource UUID |
| name | string | Yes | Role resource name |
| description | string | No | Role description |
| icon_id | string | No | Icon resource UUID |
| color_id | string | No | Color resource UUID |
| level | integer | Yes | Role privilege level |

BenchmarkDepartmentItem

Department resource for benchmark.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| department_id | string | Yes | Department identifier |
| name | string | No | Department display name |
| description | string | No | Department description |

BenchmarkEvalOperational

Eval card for the benchmark page — analogous to ChatSimulationOperational.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| eval_id | string | Yes | Eval identifier |
| eval_name | string | No | Eval display name |
| eval_description | string | No | Eval description |
| model_ids | string[] | No | Associated model IDs |
| total_tests | integer | No | Total number of tests |
| archived_tests | integer | No | Number of archived tests |
| total_invocations | integer | No | Total number of invocations |
| completed_invocations | integer | No | Number of completed invocations |
| highest_score | number | No | Highest score achieved |
| has_passed | boolean | No | Whether eval has been passed |
| status | string | No | Eval status |
| infinite_mode | boolean | No | Whether eval uses infinite mode |
| department_ids | string[] | No | Associated department IDs |
| rubric_ids | string[] | No | Associated rubric IDs |

BenchmarkHistoryItem

Single test row in benchmark history list — mirrors HistoryItem shape.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| test_id | string | Yes | Test identifier |
| date | string | No | Formatted date string of the test |
| profile_id | string | No | UUID of the profile who owns the test |
| profile_name | string | No | Display name of the profile |
| eval_id | string | No | Parent eval ID |
| eval_name | string | No | Parent eval name |
| rubric_id | string | No | Rubric ID for this test |
| rubric_name | string | No | Rubric display name |
| num_models | integer | No | Total number of models in the test |
| num_models_completed | integer | No | Number of models completed |
| model_ids | string[] | No | UUIDs of associated models |
| model_names | string[] | No | Display names of associated models |
| score | integer | No | Overall test score percent (0-100) |
| score_status | string | No | Score status label (e.g. high, medium, low) |
| pass_pct | integer | No | Pass percentage threshold from rubric |
| show_view | boolean | No | Whether the view action is available |
| show_continue | boolean | No | Whether the continue action is available |
| is_archived | boolean | No | Whether the test is archived |
| infinite_mode | boolean | No | Whether the test uses infinite mode |
| department_ids | string[] | No | Associated department IDs |

BenchmarkHistoryResponse

Paginated history response.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| data | BenchmarkHistoryItem[] | No | History items |
| total_count | integer | No | Total number of matching records |
| page | integer | No | Current page number |
| page_size | integer | No | Items per page |
| eval_options | FilterOption[] | No | Eval filter options |
| model_options | FilterOption[] | No | Model filter options |
| profile_options | FilterOption[] | No | Profile filter options |
| rubric_options | FilterOption[] | No | Rubric filter options |

CallerPermissions

Evaluated permissions for the current caller on this artifact type.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| can_create | boolean | Yes | Whether the caller can create new artifacts |
| can_draft | boolean | Yes | Whether the caller can create/update drafts |
| can_duplicate | boolean | Yes | Whether the caller can duplicate artifacts |
| has_access | boolean | No | Whether the caller can view this entity |
| can_edit | boolean | No | Whether the caller can edit this entity |
| can_delete | boolean | No | Whether the caller can delete this entity |
| disabled_reason | string | No | Human-readable reason if editing is disabled |

ColumnInfo

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| name | string | Yes | Column name |
| type | string | Yes | Column data type |
| nullable | boolean | Yes | Whether the column is nullable |

DocsApiResponse

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| list | PageMetaItem | Yes | |
| detail | PageMetaItem | Yes | |
| new | PageMetaItem | Yes | |

DocsResponse

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| name | string | Yes | Resource or entry name |
| type | string | Yes | Resource or entry type identifier |
| description | string | Yes | Human-readable description |
| materialized_view | MvInfo | No | Materialized view metadata |
| tables | TableInfo[] | Yes | Related database tables |
| operations | OperationInfo[] | Yes | Available operations |

EvalSetup

Run-level eval scaffold — first-class on the generate response.

Audit’s `**output` spread carries this onto `<artifact>.generate.completed`. Null when no rubric-bearing agent participated.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| test_id | string | Yes | |
| invocations | InvocationSlot[] | Yes | |

FeedbackEntry

Feedback by standard for grading state.

standard_group_id is derived from standards metadata lookup.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| id | string | No | UUID of the feedback entry |
| standard_id | string | No | UUID of the associated standard |
| standard_group_id | string | No | UUID of the standard group |
| total | number | No | Total score for this standard |
| feedback | string | No | Feedback text for this standard |

FilterOption

A single filter option for dropdown selectors.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| value | string | Yes | Internal value for the filter option |
| label | string | No | Display label for the filter option |
| count | integer | No | Number of matching records |

GenerateConfig

Developer configuration — all optional with sensible defaults.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| operations | string[] | No | |
| dangerous | boolean | No | |
| params | object | No | |
| group_id | string | No | |

GenerationsTestListItem

Single generation group in the test generations response.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| group_id | string | Yes | UUID of the generation group |
| session_id | string | No | UUID of the parent session |
| group_name | string | No | Name of the generation group |
| created_at | string | No | Timestamp of the generation |

GetInvocationDraftResponse

FieldTypeRequiredDescription
idstringYesUUID of the draft
created_atstringYesCreation timestamp
generatedbooleanYesWhether this was AI-generated
mcpbooleanYesWhether MCP tooling was used
activebooleanYesWhether this draft is active
session_idstringYesAssociated session UUID
namestringNoImmutable draft label set at create time
department_idsstring[]YesAssociated department UUIDs
description_idsstring[]YesAssociated description UUIDs
flag_idsstring[]YesAssociated flag UUIDs
key_idsstring[]YesAssociated key UUIDs
modality_idsstring[]NoAssociated modality UUIDs
quality_idsstring[]NoAssociated quality UUIDs
model_flag_idsstring[]YesAssociated model flag UUIDs
model_position_idsstring[]YesAssociated model position UUIDs
model_rubric_idsstring[]YesAssociated model rubric UUIDs
name_idsstring[]YesAssociated name UUIDs
profile_idsstring[]YesAssociated profile UUIDs
reasoning_level_idsstring[]YesAssociated reasoning level UUIDs
temperature_level_idsstring[]YesAssociated temperature level UUIDs
voice_idsstring[]YesAssociated voice UUIDs
value_idstringNoAssociated value UUID
pricing_idsstring[]YesAssociated pricing UUIDs
endpoint_idsstring[]YesAssociated endpoint UUIDs
pending_department_idsstring[]NoPending department UUIDs
pending_description_idsstring[]NoPending description UUIDs
pending_flag_idsstring[]NoPending flag UUIDs
pending_key_idsstring[]NoPending key UUIDs
pending_modality_idsstring[]NoPending modality UUIDs
pending_quality_idsstring[]NoPending quality UUIDs
pending_model_flag_idsstring[]NoPending model flag UUIDs
pending_model_position_idsstring[]NoPending model position UUIDs
pending_model_rubric_idsstring[]NoPending model rubric UUIDs
pending_name_idsstring[]NoPending name UUIDs
pending_reasoning_level_idsstring[]NoPending reasoning level UUIDs
pending_temperature_level_idsstring[]NoPending temperature level UUIDs
pending_voice_idsstring[]NoPending voice UUIDs
pending_value_idsstring[]NoPending value UUIDs
pending_pricing_idsstring[]NoPending pricing UUIDs
pending_endpoint_idsstring[]NoPending endpoint UUIDs

GetTestFeedbackResponse

FieldTypeRequiredDescription
feedback_idstringYes
grade_idstringYes
call_idstringYes
tool_call_idstringYes
standard_idstringNo
totalintegerYes
feedbackstringYes
total_pointsintegerYes
pass_pointsintegerYes
created_atstringYes

GetTestGradeResponse

FieldTypeRequiredDescription
idstringYes
invocation_idstringYes
created_atstringYes
updated_atstringYes
passedbooleanYes
scoreintegerYes
time_takenintegerYes
generatedbooleanYes
mcpbooleanYes
activebooleanYes
call_idstringYes

GetTestInvocationResponse

FieldTypeRequiredDescription
invocation_idstringYes
test_idstringYes
group_idstringYes
invocation_created_atstringYes
invocation_titlestringYes
use_custombooleanYes
positionintegerYes
invocation_completedbooleanYes
grade_idstringYes
grade_scorenumberYes
grade_passedbooleanYes
grade_time_takennumberYes
rubric_idstringYes
agent_idsstring[]No
quality_idstringNo
department_idsstring[]No
voice_idstringNo
temperature_level_idstringNo
reasoning_level_idstringNo
modality_idsstring[]No

GetTestInvocationRunsResponse

FieldTypeRequiredDescription
idstringYes
test_invocation_idstringYes
test_invocation_traces_idstringNo
run_idstringNo
created_atstringYes
updated_atstringYes
generatedbooleanYes
mcpbooleanYes
activebooleanYes

GetTestInvocationTracesResponse

FieldTypeRequiredDescription
idstringYes
test_invocation_idstringYes
run_idstringNo
created_atstringYes
updated_atstringYes
generatedbooleanYes
mcpbooleanYes
activebooleanYes
reasoning_level_idsstring[]No
temperature_level_idsstring[]No
voice_idsstring[]No
prompt_idsstring[]No
instruction_idsstring[]No
tool_idsstring[]No
quality_idsstring[]No
modality_idsstring[]No

GetTestResponse

FieldTypeRequiredDescription
test_idstringYes
call_idstringYes
eval_idstringYes
profile_idstringYes
department_idsstring[]Yes
test_namestringYes
test_descriptionstringYes
num_invocationsintegerYes
infinite_modebooleanYes
is_dynamicbooleanYes
archivedbooleanYes
test_created_atstringYes

GradeData

Grade information for a chat (no id - not a resource).

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| score | number | No | Grade score achieved |
| passed | boolean | No | Whether the grade is passing |
| description | string | No | Grade description text |
| time_taken | integer | No | Time taken in seconds |
| total_points | number | No | Total available points |
| pass_points | number | No | Points required to pass |

GradingStateData

Grading state for a chat in Record format.

All fields are Records keyed by standard_id strings. This is the exact format the client needs - no transformation required.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| achieved_standards | object | No | Map of standard_id to achieved status |
| passed_standards | object | No | Map of standard_id to passed status |
| feedback_by_standard_id | object | No | Map of standard_id to feedback text |

GroupCall

Tool call referenced by a message.

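| Field | Type | Required | Description |
| --- | --- | --- | --- |
| id | string | Yes | |
| tool_name | string | No | |
| template_name | string | No | |
| tool | object | No | |
| ledger_status | string | No | |
| ledger_operation | string | No | |
| ledger_artifact | string | No | |
| ledger_artifact_id | string | No | |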

GroupMessage

Message within a run.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| id | string | Yes | |
| role | string | Yes | |
| created_at | string | No | |
| text_ids | string[] | No | |
| audio_ids | string[] | No | |
| image_ids | string[] | No | |
| video_ids | string[] | No | |
| file_ids | string[] | No | |
| call_ids | string[] | No | |
| calls | GroupCall[] | No | |
| reasoning | boolean | No | True when this row is a chain-of-thought trace persisted alongside the assistant answer (rendered as a collapsed accordion). |
| in_context | boolean | No | Whether this message is included in the LLM context for the next generation. Mirrors the dedup pass that builds chat history (see in_context_reason). |
| in_context_reason | string | No | Why this message is in/out of LLM context. ‘kept’ = included; ‘deduped_read’ = older read-only call to a tool that has a fresher result later in the group; future values may include ‘trimmed_top_n’. |
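
The in_context and in_context_reason fields can be read on the client roughly as follows. This is a minimal sketch, not the client's actual code: GroupMessageLite, contextMessages, and countByReason are hypothetical names, and treating a missing in_context flag as "included" is an assumption the source does not confirm.

```typescript
// Hypothetical subset of GroupMessage: only the fields this sketch reads.
interface GroupMessageLite {
  id: string;
  role: string;
  in_context?: boolean;
  in_context_reason?: string;
}

// Assumption: a message with no in_context flag counts as included.
function contextMessages(messages: GroupMessageLite[]): GroupMessageLite[] {
  return messages.filter((m) => m.in_context !== false);
}

// Tally messages per in_context_reason; rows without a reason land under "unknown".
function countByReason(messages: GroupMessageLite[]): Record<string, number> {
  const counts: Record<string, number> = {};
  for (const m of messages) {
    const reason = m.in_context_reason ?? "unknown";
    counts[reason] = (counts[reason] ?? 0) + 1;
  }
  return counts;
}

// Illustrative data only: one deduplicated read followed by two kept messages.
const sampleMessages: GroupMessageLite[] = [
  { id: "m1", role: "assistant", in_context: false, in_context_reason: "deduped_read" },
  { id: "m2", role: "user", in_context: true, in_context_reason: "kept" },
  { id: "m3", role: "assistant", in_context: true, in_context_reason: "kept" },
];
```

Because future reason values such as ‘trimmed_top_n’ may appear, the tally keys on whatever string arrives rather than on a closed union.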

GroupResource

Lightweight {id, name} for cross-referencing run-level ids (model_id / agent_id / profile_id) against human-readable names on the analytics panel. Names come from the canonical get_models / get_agents / get_profiles black boxes.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| id | string | Yes | |
| name | string | No | |

GroupRun

Run within a group, with its messages.

Carries token / cost / model / agent / profile attribution so the analytics view can render per-run cost + actor info without a parallel detail shape. profile_id is the authoring profile (human user), agent_id is the LLM-side actor, model_id is the model used by that agent. All optional — runs predating these columns or with unresolved attributions surface None.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| id | string | Yes | |
| created_at | string | No | |
| input_tokens | integer | No | |
| output_tokens | integer | No | |
| cached_input_tokens | integer | No | |
| cost | number | No | |
| model_id | string | No | |
| agent_id | string | No | |
| profile_id | string | No | |
| previous_context_start_index | integer | No | Index in messages where the current run’s own messages begin; earlier rows are previous-context replay. None when the run has no previous context attached. |
| messages | GroupMessage[] | No | |
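
As an illustration of previous_context_start_index, the sketch below splits a run's messages into replayed context and the run's own messages. RunLite and splitRunMessages are hypothetical names; the only behavior taken from the table is that rows before the index are replay and that a missing index means no previous context.

```typescript
// Hypothetical subset of GroupRun, generic over the message type.
interface RunLite<M> {
  id: string;
  previous_context_start_index?: number | null;
  messages?: M[];
}

// Rows before the index are previous-context replay; the rest belong to this run.
function splitRunMessages<M>(run: RunLite<M>): { replayed: M[]; own: M[] } {
  const messages = run.messages ?? [];
  const start = run.previous_context_start_index;
  if (start == null) {
    // No previous context attached: every message is the run's own.
    return { replayed: [], own: messages };
  }
  return { replayed: messages.slice(0, start), own: messages.slice(start) };
}
```

A view could render `replayed` dimmed and `own` normally without needing a second detail shape.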

InvocationDepartmentResource

FieldTypeRequiredDescription
department_idstringNoUnique identifier
namestringNoDisplay name
descriptionstringNoDescription text
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationDescriptionResource

FieldTypeRequiredDescription
idstringNoUnique identifier
descriptionstringNoDescription text
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationDetail

Per-invocation graded-view payload.

Mirrors ChatData from attempt — one of these per invocation, each carrying its rubric structure and the list of runs that executed against it. primary_run_id is the default local selection (usually the most recent / row-summary grade).

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| invocation_id | string | Yes | UUID of the test_invocation_entry |
| rubric_id | string | No | UUID of the rubric used to grade this invocation |
| rubric_structure | RubricStructureData | No | Rubric structure for TableRubric (standards / groups / mappings) |
| primary_run_id | string | No | Default selected run for this invocation |
| agent_id | string | No | UUID of the agent under test |
| model_id | string | No | UUID of the model the agent is set up with |
| voice_id | string | No | UUID of the voice resource |
| temperature_level_id | string | No | UUID of the temperature level |
| reasoning_level_id | string | No | UUID of the reasoning level |
| quality_id | string | No | UUID of the quality level |
| modality_ids | string[] | No | Modality resource ids |
| runs | InvocationRunDetail[] | No | Per-execution detail |

InvocationEndpointResource

FieldTypeRequiredDescription
idstringNoUnique identifier
base_urlstringNoEndpoint base URL
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationFlagResource

Flag option row — one per (name, type, value) flags_resource entry.

FieldTypeRequiredDescription
idstringNoFlag resource identifier
namestringNoFlag display name
typestringNoFlag type
valuebooleanNoUnderlying bool value of this option
descriptionstringNoFlag description text
icon_idstringNoIcon identifier for the flag
iconstringNoResolved SVG markup for the icon (hydrated from icons_resource)
generatedbooleanNoWhether the flag was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationKeyResource

FieldTypeRequiredDescription
idstringNoUnique identifier
key_idstringNoResource identifier alias for picker compatibility
namestringNoDisplay name
descriptionstringNoDescription text
key_maskedstringNoMasked key preview
masked_keystringNoMasked key preview alias
activebooleanNoWhether this key is active
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationModalityResource

FieldTypeRequiredDescription
idstringNoUnique identifier
modality_idstringNoResource identifier alias for picker compatibility
modalitystringNoModality code
namestringNoHuman-readable modality name
descriptionstringNoDescription text
is_inputbooleanNoWhether this modality is input-facing
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationModelFlagOptionResource

Cross-product option row: one per (model_id, flag_type, value) tuple.

FieldTypeRequiredDescription
model_idstringNoModel identifier
flag_idstringNoFlag resource identifier
typestringNoFlag type, e.g. ‘model_active’
valuebooleanNoUnderlying flag value for this option
namestringNoDisplay name from the flags_resource row
descriptionstringNoDescription text from the flags_resource row
iconstringNoIcon SVG markup hydrated from icons_resource

InvocationModelFlagResource

Per-model flag junction row for an invocation.

FieldTypeRequiredDescription
idstringNoModel-flag resource identifier
model_idstringNoAssociated model identifier
flag_idstringNoAssociated flag identifier
typestringNoFlag type (e.g. ‘model_active’) of the linked flags_resource row
valuebooleanNoUnderlying bool value of the linked flags_resource row
namestringNoDisplay name
descriptionstringNoDescription text
iconstringNoIcon SVG markup
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationModelFlagValue

Denormalized per-(model, type) selection for draft requests.

FieldTypeRequiredDescription
model_idstringYesTarget model identifier
typestringYesFlag type, e.g. ‘model_active’
valuebooleanYesDesired flag value

InvocationNameResource

FieldTypeRequiredDescription
idstringNoUnique identifier
namestringNoDisplay name
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationPricingResource

FieldTypeRequiredDescription
idstringNoUnique identifier
pricing_idstringNoResource identifier alias for picker compatibility
pricing_typestringNoPricing type
pricenumberNoPrice amount
unit_namestringNoUnit name
unit_categorystringNoUnit category
unit_valueintegerNoUnit value
namestringNoDisplay label
descriptionstringNoDescription text
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationQualityResource

FieldTypeRequiredDescription
idstringNoUnique identifier
quality_idstringNoResource identifier alias for picker compatibility
qualitystringNoQuality label
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationReasoningLevelResource

FieldTypeRequiredDescription
idstringNoUnique identifier
reasoning_levelstringNoReasoning level label
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationRunDetail

Per-run grade + replay detail within an invocation.

One row per test_invocation_runs_entry binding. Carries the TableRubric-ready grading_state plus the message/call ids the client uses to slice entries.messages / entries.calls.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| run_id | string | Yes | UUID of the runs_entry row this binding executed |
| binding_id | string | Yes | UUID of the test_invocation_runs_entry binding row |
| grade_id | string | No | UUID of the test_grade_entry, if graded |
| created_at | string | No | When the binding was created |
| completed | boolean | No | Whether the binding has a completion record |
| grade | GradeData | No | Score / passed / time_taken summary |
| grading_state | GradingStateData | No | Achieved/passed/feedback maps keyed by standard_id |
| feedbacks | FeedbackEntry[] | No | Per-standard feedback rows |
| analyses | AnalysisEntry[] | No | Chat-level analysis content (currently unused for tests) |
| message_ids | string[] | No | Message ids belonging to this run |
| call_ids | string[] | No | Tool-call ids belonging to this run |
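
Slicing entries.messages by a run's message_ids might look like the sketch below. MessageEntry, RunDetailLite, and messagesForRun are hypothetical names, and preserving the order of entries.messages (rather than the order of message_ids) is an assumption.

```typescript
// Hypothetical subsets of SearchMessageResponse and InvocationRunDetail.
interface MessageEntry {
  message_id: string;
  run_id: string;
  role: string;
}

interface RunDetailLite {
  run_id: string;
  message_ids?: string[];
}

// Keep only the entries whose message_id is listed on the run detail.
function messagesForRun(run: RunDetailLite, all: MessageEntry[]): MessageEntry[] {
  const wanted: Record<string, boolean> = {};
  for (const id of run.message_ids ?? []) {
    wanted[id] = true;
  }
  // Order follows entries.messages, not message_ids (assumption).
  return all.filter((m) => wanted[m.message_id] === true);
}
```

The same lookup pattern would apply to call_ids against entries.calls.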

InvocationSlot

One agent’s slot in a multi-agent generation pool.

Populated by setup_generation_test when an agent carries a rubric. The client uses these IDs to drive the eval workflow: review the candidate’s output, optionally fire a grader against its invocation_id, and promote/reject by call_id via the existing idempotency_key + accept pattern.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| invocation_id | string | Yes | |
| agent_id | string | Yes | |
| rubric_id | string | No | |

InvocationTemperatureLevelResource

FieldTypeRequiredDescription
idstringNoUnique identifier
temperaturenumberNoTemperature numeric value
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationValueResource

FieldTypeRequiredDescription
idstringNoUnique identifier
valuestringNoValue text
typestringNoValue type
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

InvocationVoiceResource

FieldTypeRequiredDescription
idstringNoUnique identifier
voicestringNoVoice label
generatedbooleanNoWhether this was AI-generated
suggestedbooleanNoWhether this is a suggested option
selectedbooleanNoWhether this is currently selected
pendingbooleanNoWhether this selection is pending acceptance

ListFilterOption

Standardized option for list endpoint filter sections.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| id | string | No | Unique identifier for this filter option |
| name | string | No | Display name |
| count | integer | No | Number of matching records |
| hex_code | string | No | Hex color code for display |
| value | string | No | Internal value |
| type | string | No | Option type discriminator |

ListFilterSection

Filter section with options and echoed request state.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| options | ListFilterOption[] | No | Available filter options |
| selected_ids | string[] | No | Currently selected filter option IDs |
| search | string | No | Active search text for filtering |

MvInfo

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| name | string | Yes | Materialized view name |
| definition | string | Yes | SQL definition of the view |
| columns | ColumnInfo[] | Yes | List of columns in the view |

OperationInfo

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| name | string | Yes | Operation name |
| description | string | Yes | Human-readable description of the operation |
| params | ParamInfo[] | Yes | List of operation parameters |
| returns | object | No | Return type schema |

OperationPrompts

Starter prompts keyed by operation name.

Each key is an operation (e.g. “create”, “search”, “draft”, “export”) and the value is a list of starter prompts for that operation. The client picks from the operations the caller has permission for and rotates through them.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| prompts | object | No | Map of operation name to starter prompts |
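
The pick-and-rotate behavior described above could be sketched as follows. PromptMap, permittedPrompts, and rotate are hypothetical names, and the rotation scheme (a non-negative counter taken modulo the pool size) is an assumption; the source only says the client rotates through the prompts.

```typescript
// Map of operation name to starter prompts, as carried in OperationPrompts.prompts.
type PromptMap = Record<string, string[]>;

// Collect prompts for the operations the caller is allowed to perform.
function permittedPrompts(prompts: PromptMap, allowedOperations: string[]): string[] {
  const pool: string[] = [];
  for (const op of allowedOperations) {
    for (const prompt of prompts[op] ?? []) {
      pool.push(prompt);
    }
  }
  return pool;
}

// Assumption: rotation is a simple modulo over a non-negative tick counter.
function rotate(pool: string[], tick: number): string | undefined {
  if (pool.length === 0) return undefined;
  return pool[tick % pool.length];
}
```

Operations the caller lacks permission for never enter the pool, so the rotation cannot surface a prompt the caller cannot act on.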

PageMetaItem

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| title | string | Yes | |
| description | string | Yes | |

ParamInfo

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| name | string | Yes | Parameter name |
| type | string | Yes | Parameter data type |
| required | boolean | Yes | Whether the parameter is required |
| default | any | No | Default value if not required |

ProducedMedia

One asset produced by a generation run.

resource_id is the canonical id the per-artifact download tools accept (e.g. Scenario_Image_Download(image_id=resource_id) for modality="image"). It maps to images_resource.id / videos_resource.id / audios_resource.id depending on the modality.

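| Field | Type | Required | Description |
| --- | --- | --- | --- |
| modality | "image" \| "video" \| "audio" | Yes | |
| resource_id | string | Yes | |
| upload_id | string | Yes | |
| mime_type | string | No | |
| file_size | integer | No | |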
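
The modality-to-table mapping stated above can be written down as a lookup. This is only a sketch: ProducedMediaLite, RESOURCE_TABLE, and resourceRef are hypothetical names, and the string returned by resourceRef is an illustrative format, not something the API produces.

```typescript
type Modality = "image" | "video" | "audio";

// Hypothetical mirror of the ProducedMedia fields listed in the table.
interface ProducedMediaLite {
  modality: Modality;
  resource_id: string;
  upload_id: string;
  mime_type?: string;
  file_size?: number;
}

// Table names taken from the description: resource_id maps to <table>.id per modality.
const RESOURCE_TABLE: Record<Modality, string> = {
  image: "images_resource",
  video: "videos_resource",
  audio: "audios_resource",
};

// Illustrative helper: describe which row a produced asset points at.
function resourceRef(media: ProducedMediaLite): string {
  return RESOURCE_TABLE[media.modality] + ".id=" + media.resource_id;
}
```

Typing the lookup as Record<Modality, string> makes the compiler flag a missing entry if a new modality is ever added to the union.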

ProfileSummary

Caller identity derived from JWT — who you are on this page.

Superset of the old six-field version: now carries everything the client needs so that /{artifact}/context fully replaces /profiles/context and the extra getLayoutContextData round-trip can be dropped.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| name | string | Yes | Display name of the authenticated user |
| role | string | Yes | Role name (e.g. ‘Super Administrator’) |
| role_level | integer | Yes | Role hierarchy level (0 = highest privilege) |
| department_ids | string[] | Yes | Departments the user belongs to |
| artifact_access | string[] | Yes | Artifact types this role can access (sidebar visibility) |
| role_permissions | any[][] | Yes | Full (artifact, operation) permission tuples for granular page gating |
| is_active | boolean | Yes | Whether the user’s profile is active |
| id | string | Yes | Profile UUID (SocketProvider, ProfileProvider) |
| theme | ThemeBundle | No | Resolved theme: hex primitives + derived oklch tokens + score thresholds |
| session_id | string | No | Current session UUID |
| is_emulation | boolean | No | Whether user is in emulation mode (ProfileProvider) |
| role_resources | QGetProfileContextV4RoleResource[] | No | All role resources for emulation display (ProfileProvider) |
| scoped_roles | string[] | No | Roles the user can emulate (ProfileProvider) |
| active | boolean | No | Alias for is_active (ProfileProvider uses this name) |
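
Page gating against role_permissions could look like the sketch below. canPerform is a hypothetical helper, and it assumes each tuple is ordered [artifact, operation] as the field description suggests; the artifact name "test" used in examples is illustrative.

```typescript
// role_permissions arrives as any[][]; this sketch assumes each inner array is
// [artifact, operation], following the "(artifact, operation)" wording in the table.
function canPerform(
  rolePermissions: unknown[][],
  artifact: string,
  operation: string,
): boolean {
  return rolePermissions.some(
    (tuple) => tuple[0] === artifact && tuple[1] === operation,
  );
}

// Illustrative permission set for a caller who can create and search tests.
const samplePermissions: unknown[][] = [
  ["test", "create"],
  ["test", "search"],
];
```

artifact_access answers the coarser sidebar question (can this role see the artifact at all), while a check like this one covers individual operations on a page.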

QGetProfileContextV4RoleResource

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| role | string | No | |
| name | string | No | |
| description | string | No | |
| icon_value | string | No | |
| color_hex | string | No | |

RubricStructureData

Rubric structure data in Record format.

All fields are Records keyed by standard_group_id or standard_id strings. This is the exact format the client needs - no transformation required.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| standard_groups | object | No | Map of group_id to standard_id lists |
| standard_groups_mapping | object | No | Map of group_id to group metadata |
| standards_mapping | object | No | Map of standard_id to standard metadata |
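
Since both RubricStructureData and GradingStateData are Records keyed by id strings, they can be joined without transformation. The sketch below tallies achieved standards per group; the names are hypothetical, and it assumes achieved_standards holds boolean values, which the source does not spell out.

```typescript
// Hypothetical subsets of RubricStructureData and GradingStateData.
interface RubricStructureLite {
  standard_groups?: Record<string, string[]>; // group_id -> standard_id list
}

interface GradingStateLite {
  achieved_standards?: Record<string, boolean>; // assumption: boolean status
}

interface GroupTally {
  achieved: number;
  total: number;
}

// Count, per standard group, how many of its standards were achieved.
function achievedPerGroup(
  rubric: RubricStructureLite,
  grading: GradingStateLite,
): Record<string, GroupTally> {
  const groups = rubric.standard_groups ?? {};
  const achievedMap = grading.achieved_standards ?? {};
  const out: Record<string, GroupTally> = {};
  for (const groupId of Object.keys(groups)) {
    const standardIds = groups[groupId] ?? [];
    const achieved = standardIds.filter((id) => achievedMap[id] === true).length;
    out[groupId] = { achieved, total: standardIds.length };
  }
  return out;
}
```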

RunStatus

Outcome of one watched run.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| run_id | string | Yes | |
| status | "pending" \| "completed" \| "failed" | Yes | |
| modality | string | No | Modality of generated output (image, video, text…) |
| resource_ids | string[] | No | Resource UUIDs produced by this run (e.g. images_resource ids) |
| error | string | No | |

SearchMessageResponse

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| message_id | string | Yes | UUID of the message |
| run_id | string | Yes | UUID of the parent run |
| role | string | Yes | Message role (e.g. user, assistant) |
| message_created_at | string | Yes | Message creation timestamp |
| text_ids | string[] | Yes | UUIDs of text resources |
| audio_ids | string[] | Yes | UUIDs of audio resources |
| image_ids | string[] | Yes | UUIDs of image resources |
| video_ids | string[] | Yes | UUIDs of video resources |
| file_ids | string[] | Yes | UUIDs of file resources |
| call_ids | string[] | Yes | UUIDs of call resources |
| reasoning | boolean | No | Chain-of-thought trace row |

SearchTestItem

Single test row in search results.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| test_id | string | Yes | UUID of the test |
| eval_id | string | No | UUID of the eval |
| eval_name | string | No | Name of the eval |
| eval_description | string | No | Description of the eval |
| department_ids | string[] | No | UUIDs of associated departments |
| test_name | string | No | Name of the test |
| test_description | string | No | Description of the test |
| num_invocations | integer | No | Number of invocations |
| infinite_mode | boolean | No | Whether infinite mode is enabled |
| is_dynamic | boolean | No | Whether the test is dynamic |
| archived | boolean | No | Whether the test is archived |
| created_at | string | No | ISO timestamp when test was created |

TableInfo

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| name | string | Yes | Table name |
| columns | ColumnInfo[] | Yes | List of columns in the table |

TestConfigGroup

A group bucket for the picker — used as the section header.

Renders one accordion section per row. run_count is the total rows in the group (across the whole inner pagination universe, not just the current expanded window). last_run_at drives the outer ordering (most-recent-group first).

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| group_id | string | Yes | UUID of the group |
| name | string | No | Human-readable group name (or null if unnamed) |
| run_count | integer | No | Total run configs in this group |
| last_run_at | string | No | ISO timestamp of the most recent run in this group |
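
The most-recent-group-first ordering driven by last_run_at can be reproduced on the client as in the sketch below. Whether the server already returns groups in this order is not stated, so this is only an illustration; TestConfigGroupLite and orderGroups are hypothetical names, and placing groups without a timestamp last is an assumption.

```typescript
// Hypothetical subset of TestConfigGroup.
interface TestConfigGroupLite {
  group_id: string;
  name?: string | null;
  run_count?: number;
  last_run_at?: string | null;
}

// Convert the ISO timestamp to milliseconds; groups without one sort last (assumption).
function lastRunMs(group: TestConfigGroupLite): number {
  return group.last_run_at ? Date.parse(group.last_run_at) : 0;
}

// Return a new array ordered most-recent-group first, leaving the input untouched.
function orderGroups(groups: TestConfigGroupLite[]): TestConfigGroupLite[] {
  return groups.slice().sort((a, b) => lastRunMs(b) - lastRunMs(a));
}
```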

TestConfigItem

A reusable run configuration the picker can queue.

Sources from runs_entry rows. Each row is a distinct config (agent + model + bundle) that can be re-fired any number of times into fresh trace executions. The bundle ids carried here come from the historical run’s agent_resource — the picker passes them as RunPanelState to /test/invocation/trace so the new trace records the same prompt + tool + instruction set the original run executed against.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| run_id | string | Yes | UUID of the runs_entry config |
| group_id | string | No | UUID of the parent group (for grouping in the picker) |
| agent_name | string | No | Display name of the agent |
| model_name | string | No | Display name of the underlying model |
| label | string | Yes | Human-readable picker label |
| created_at | string | No | When this config was first created |
| prompt_ids | string[] | No | Prompt resource ids from the historical agent |
| tool_ids | string[] | No | Tool resource ids from the historical agent |
| instruction_ids | string[] | No | Instruction resource ids from the historical agent |
| model_id | string | No | Model id from the historical agent |
| temperature | number | No | Temperature value from the historical agent |
| reasoning | string | No | Reasoning level value from the historical agent |
| quality | string | No | Quality value from the historical agent |
| permissions | any[][] | No | Historical (artifact, operation) pairs this run executed |

TestEntries

Entry payloads grouped by type.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| tests | GetTestResponse[] | No | Test entry payloads |
| invocations | GetTestInvocationResponse[] | No | Invocation entry payloads |
| runs | GetTestInvocationRunsResponse[] | No | Run entry payloads |
| groups | GetTestInvocationTracesResponse[] | No | Group entry payloads |
| grades | GetTestGradeResponse[] | No | Grade entry payloads |
| feedback | GetTestFeedbackResponse[] | No | Feedback entry payloads |
| messages | SearchMessageResponse[] | No | Message entry payloads |
| calls | any[] | No | Tool call entries from original run |

TestResources

Resource maps keyed by ID string.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| evals | object | No | Eval resources keyed by ID |
| rubrics | object | No | Rubric resources keyed by ID |
| agents | object | No | Agent resources keyed by ID |
| models | object | No | Model resources keyed by ID |
| voices | object | No | Voice resources keyed by ID |
| temperature_levels | object | No | Temperature level resources keyed by ID |
| reasoning_levels | object | No | Reasoning level resources keyed by ID |
| modalities | object | No | Modality resources keyed by ID |
| prompts | object | No | Prompt resources keyed by ID |
| instructions | object | No | Instruction resources keyed by ID |
| tools | object | No | Tool resources keyed by ID |
| qualities | object | No | Quality resources keyed by ID |
| standard_groups | object | No | Standard group resources keyed by ID |
| standards | object | No | Standard resources keyed by ID |

TestRunItem

A single run row for the UI table, derived from a benchmark invocation.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| chat_id | string | Yes | ID of the chat |
| invocation_id | string | Yes | ID of the invocation |
| run_id | string | No | ID of the run |
| group_id | string | No | ID of the group |
| suite_entry_id | string | No | ID of the suite entry |
| model_name | string | No | Name of the model used |
| agent_name | string | No | Name of the agent used |
| status | string | No | Run status |
| grade_score | number | No | Grade score for the run |
| grade_passed | boolean | No | Whether the run passed grading |

TestStatusSummary

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| total | integer | No | Total number of invocations |
| completed | integer | No | Number of completed invocations |
| in_progress | integer | No | Number of in-progress invocations |
| not_started | integer | No | Number of not-started invocations |
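
A client might turn these counts into a progress figure. The following is a minimal sketch, assuming that missing counts should be read as zero; completionPercent is a hypothetical helper, not something the API provides.

```typescript
// Shape taken from the table above; every count is optional.
interface TestStatusSummary {
  total?: number;
  completed?: number;
  in_progress?: number;
  not_started?: number;
}

// Hypothetical helper: percentage of invocations completed, rounded to an integer.
// Assumption: a missing or non-positive total means there is nothing to report (0).
function completionPercent(summary: TestStatusSummary): number {
  const total = summary.total ?? 0;
  if (total <= 0) return 0;
  // Clamp so an inconsistent payload can never report more than 100%.
  const completed = Math.min(Math.max(summary.completed ?? 0, 0), total);
  return Math.round((completed / total) * 100);
}

console.log(completionPercent({ total: 8, completed: 2 })); // 25
```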

ThemeBundle

Full theme payload for a page bootstrap.

Rides along on every /{artifact}/context response via ProfileSummary.theme. Layers:

  • primitives / dark_primitives: hex inputs the settings editor reads and writes (light and dark palettes).
  • tokens / dark_tokens: oklch tokens the client paints with. ThemeStyle emits two <style> blocks: one scoped to :root:not(.dark) (light) and one to :root.dark (dark).
  • thresholds: numeric score thresholds for analytics components.

Tokens are empty-in, empty-out: a missing value falls through to the matching globals.css default.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| primitives | ThemePrimitives | No | Hex inputs from the setting (light palette, for the theme editor) |
| tokens | ThemeTokens | No | Derived oklch tokens for light mode (SSR CSS-var injection) |
| dark_primitives | ThemePrimitives | No | Hex inputs from the setting (dark palette, for the theme editor) |
| dark_tokens | ThemeTokens | No | Derived oklch tokens for dark mode (SSR CSS-var injection) |
| thresholds | Thresholds | No | Score thresholds resolved from the setting |
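
The scoping described above can be sketched as a function that turns the two token sets into two CSS rule sets and skips empty tokens so the globals.css defaults stay in effect. This is not the actual ThemeStyle implementation: themeCss, declarations, and toCssVar are hypothetical names, and the mapping from snake_case field names to CSS variable names (underscores replaced with hyphens, prefixed with two dashes) is an assumption.

```typescript
// Every token is optional, per the ThemeTokens table.
type ThemeTokens = Record<string, string | undefined>;

// Assumption: card_foreground maps to --card-foreground. The real variable
// naming may differ (for example for chart1..chart5).
function toCssVar(field: string): string {
  return "--" + field.replace(/_/g, "-");
}

// Build CSS declarations, dropping empty tokens so the stylesheet default applies.
function declarations(tokens: ThemeTokens): string {
  return Object.entries(tokens)
    .filter(([, value]) => typeof value === "string" && value.trim() !== "")
    .map(([field, value]) => `${toCssVar(field)}: ${value};`)
    .join(" ");
}

// Hypothetical stand-in for ThemeStyle: returns the light and dark rule sets
// as CSS text, omitting a rule set entirely when it has no declarations.
function themeCss(light: ThemeTokens = {}, dark: ThemeTokens = {}): string {
  const blocks: string[] = [];
  const lightDecls = declarations(light);
  const darkDecls = declarations(dark);
  if (lightDecls) blocks.push(`:root:not(.dark) { ${lightDecls} }`);
  if (darkDecls) blocks.push(`:root.dark { ${darkDecls} }`);
  return blocks.join("\n");
}

// Token values below are invented for illustration.
console.log(themeCss({ background: "oklch(1 0 0)" }, { background: "oklch(0.2 0 0)" }));
```

A production version would also need to validate token values before writing them into a style element, since they originate from editable settings.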

ThemePrimitives

40 optional fields. The 17 essentials drive the derivation of the rest; the other 23 are overrides for fine-tuning when the derived value isn’t what you want.

An empty primitive yields an empty token, and the client then falls back to the globals.css default.

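| Field | Type | Required | Description |
| --- | --- | --- | --- |
| background | string | No | |
| primary | string | No | |
| accent | string | No | |
| card | string | No | |
| sidebar | string | No | |
| muted_foreground | string | No | |
| ring | string | No | |
| border | string | No | |
| destructive | string | No | |
| success | string | No | |
| warning | string | No | |
| info | string | No | |
| chart1 | string | No | |
| chart2 | string | No | |
| chart3 | string | No | |
| chart4 | string | No | |
| chart5 | string | No | |
| foreground | string | No | |
| card_foreground | string | No | |
| popover | string | No | |
| popover_foreground | string | No | |
| primary_foreground | string | No | |
| secondary | string | No | |
| secondary_foreground | string | No | |
| muted | string | No | |
| accent_foreground | string | No | |
| destructive_foreground | string | No | |
| danger | string | No | |
| danger_foreground | string | No | |
| input | string | No | |
| success_foreground | string | No | |
| warning_foreground | string | No | |
| info_foreground | string | No | |
| sidebar_foreground | string | No | |
| sidebar_primary | string | No | |
| sidebar_primary_foreground | string | No | |
| sidebar_accent | string | No | |
| sidebar_accent_foreground | string | No | |
| sidebar_border | string | No | |
| sidebar_ring | string | No | |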

ThemeTokens

40 fully resolved CSS variable values. Field names are snake_case and map 1:1 to the CSS variables.

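| Field | Type | Required | Description |
| --- | --- | --- | --- |
| background | string | No | |
| foreground | string | No | |
| card | string | No | |
| card_foreground | string | No | |
| popover | string | No | |
| popover_foreground | string | No | |
| primary | string | No | |
| primary_foreground | string | No | |
| secondary | string | No | |
| secondary_foreground | string | No | |
| muted | string | No | |
| muted_foreground | string | No | |
| accent | string | No | |
| accent_foreground | string | No | |
| destructive | string | No | |
| destructive_foreground | string | No | |
| danger | string | No | |
| danger_foreground | string | No | |
| border | string | No | |
| input | string | No | |
| ring | string | No | |
| success | string | No | |
| success_foreground | string | No | |
| warning | string | No | |
| warning_foreground | string | No | |
| info | string | No | |
| info_foreground | string | No | |
| chart1 | string | No | |
| chart2 | string | No | |
| chart3 | string | No | |
| chart4 | string | No | |
| chart5 | string | No | |
| sidebar | string | No | |
| sidebar_foreground | string | No | |
| sidebar_primary | string | No | |
| sidebar_primary_foreground | string | No | |
| sidebar_accent | string | No | |
| sidebar_accent_foreground | string | No | |
| sidebar_border | string | No | |
| sidebar_ring | string | No | |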

Thresholds

Numeric score thresholds resolved from the active setting.

The server already pre-buckets dashboard metrics into success, warning, danger, or neutral, so most components don’t need these values. They are surfaced for chart reference lines, tooltips, and any client-side bucketing.

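| Field | Type | Required | Description |
| --- | --- | --- | --- |
| success | integer | Yes | Score >= this counts as success |
| warning | integer | Yes | Score below the success threshold but >= this counts as warning |
| danger | integer | Yes | Score below the warning threshold but >= this counts as danger; anything lower is neutral/no-data |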
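
For the client-side bucketing case, the three thresholds can be checked from highest to lowest. The sketch below assumes the thresholds are ordered success >= warning >= danger and that a missing score should read as neutral; bucketScore is a hypothetical helper and the sample threshold values are invented.

```typescript
// Shape taken from the Thresholds table above.
interface Thresholds {
  success: number;
  warning: number;
  danger: number;
}

type Bucket = "success" | "warning" | "danger" | "neutral";

// Hypothetical helper: first threshold the score reaches wins.
// Assumption: null, undefined, and NaN scores are treated as neutral/no-data.
function bucketScore(score: number | null | undefined, t: Thresholds): Bucket {
  if (score == null || Number.isNaN(score)) return "neutral";
  if (score >= t.success) return "success";
  if (score >= t.warning) return "warning";
  if (score >= t.danger) return "danger";
  return "neutral";
}

// Example values only; real thresholds come from the active setting.
const sampleThresholds: Thresholds = { success: 80, warning: 60, danger: 1 };
console.log(bucketScore(85, sampleThresholds)); // "success"
console.log(bucketScore(60, sampleThresholds)); // "warning"
console.log(bucketScore(0, sampleThresholds)); // "neutral"
```

Since the server already sends bucketed metrics, a helper like this is only worth using for values the client computes itself, such as an interpolated chart point.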

app__infra__invocation__types__DraftFormState

Server-authoritative form state returned after draft save.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| name_id | string | No | Saved name identifier |
| name | string | No | Saved name value |
| description_id | string | No | Saved description identifier |
| description | string | No | Saved description value |
| value_id | string | No | Saved value identifier |
| flag_ids | string[] | No | Saved flag identifiers |
| department_ids | string[] | No | Saved department identifiers |
| key_id | string | No | Saved key identifier |
| endpoint_id | string | No | Saved endpoint identifier |
| modality_ids | string[] | No | Saved modality identifiers |
| temperature_level_id | string | No | Saved temperature level identifier |
| pricing_id | string | No | Saved pricing identifier |
| reasoning_level_id | string | No | Saved reasoning level identifier |
| quality_ids | string[] | No | Saved quality identifiers |
| voice_ids | string[] | No | Saved voice identifiers |
| model_flag_ids | string[] | No | Saved model flag junction-row identifiers |
| model_flag_values | InvocationModelFlagValue[] | No | Denormalized (model_id, type, value) echo derived from model_flag_ids |
| model_position_ids | string[] | No | Saved model position identifiers |
| model_rubric_ids | string[] | No | Saved model rubric identifiers |
| pending_ids | string[] | No | Pending resource identifiers |

app__infra__invocation__types__SectionFilter

Per-section filter options for GET requests.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| search | string | No | Filter options by search text |
| limit | integer | No | Max options to return |
| selected | boolean | No | Only return selected items |
| suggested | boolean | No | Only return suggested items |
| include | boolean | No | Include this section in response (default true) |
| parameter_ids | string[] | No | Reserved for compatibility with shared filter parsing |
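
This reference does not say how a section filter is encoded on a GET request, so the following is only a sketch under an invented convention: each field is sent as a query parameter named section.field. The sectionFilterQuery helper and that naming scheme are hypothetical, and the real wire format should be taken from the endpoint documentation.

```typescript
// Shape taken from the table above.
interface SectionFilter {
  search?: string;
  limit?: number;
  selected?: boolean;
  suggested?: boolean;
  include?: boolean;
  parameter_ids?: string[];
}

// Hypothetical encoding: "<section>.<field>=<value>" pairs joined with "&".
// parameter_ids is left out because the table marks it as reserved.
function sectionFilterQuery(section: string, filter: SectionFilter): string {
  const pairs: string[] = [];
  const add = (field: string, value: string): void => {
    pairs.push(`${encodeURIComponent(`${section}.${field}`)}=${encodeURIComponent(value)}`);
  };
  if (filter.search) add("search", filter.search);
  if (filter.limit !== undefined) add("limit", String(filter.limit));
  if (filter.selected !== undefined) add("selected", String(filter.selected));
  if (filter.suggested !== undefined) add("suggested", String(filter.suggested));
  // include defaults to true per the table, so it is only sent when explicitly false.
  if (filter.include === false) add("include", "false");
  return pairs.join("&");
}

// The section name "models" is an invented example.
console.log(sectionFilterQuery("models", { search: "gpt 4", limit: 10 }));
// "models.search=gpt%204&models.limit=10"
```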
