Platform Updates

Advanced Analytics Enhancements

Introduced flexible analytics capabilities to slice and filter AI performance metrics across models, agents, datasets, and time intervals
Added a new Agent Span Count metric, enabling aggregation on agent-level spans, similar to the existing tool span count metrics
Improved persona-specific visibility for Product, Engineering, Compliance, and Leadership stakeholders
Enhanced metric exploration to support faster root cause analysis and performance monitoring
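
As a rough illustration of what an agent-level span count aggregates, the sketch below counts agent spans per trace. It is not the platform's implementation: the span record shape and the field names `trace_id` and `kind` are assumptions made for this example only.

```python
from collections import Counter

def agent_span_count(spans):
    """Count spans whose kind is "agent", grouped by trace.

    The "trace_id" and "kind" keys are hypothetical field names
    chosen for this sketch, not the platform's actual schema.
    """
    counts = Counter()
    for span in spans:
        if span.get("kind") == "agent":
            counts[span["trace_id"]] += 1
    return dict(counts)

# Hypothetical span records from two traces.
spans = [
    {"trace_id": "t1", "kind": "agent"},
    {"trace_id": "t1", "kind": "tool"},
    {"trace_id": "t1", "kind": "agent"},
    {"trace_id": "t2", "kind": "agent"},
]

print(agent_span_count(spans))  # {'t1': 2, 't2': 1}
```

Swapping the `"agent"` filter for `"tool"` would give the analogous tool span count under the same assumed schema.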

Refined global navigation structure to improve discoverability of core workflows
Maintained RBAC enforcement across updated navigation experiences
Reduced friction between experimentation, tracing, analytics, and governance workflows

Launched new Trace Overview dashboard with aggregated KPIs
Added Trace KPI summary card for faster performance visibility
Released redesigned Trace Viewer for improved debugging workflows
Enhanced trace table with span status badges and token counts
Surfaced cost metrics directly within trace workflows
Introduced enhanced filtering mechanisms for:
- Trace Viewer
- CE management and results pages
Introduced platform-wide Dark Mode

Introduced built-in synthetic dataset generators to support safe experimentation and evaluation workflows
Added dataset generators for:
- Binary classification: card fraud detection
- Binary classification: credit application approval
- Regression: loan amount prediction
- Regression: housing price prediction
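
To give a sense of what a synthetic generator for the card fraud case might produce, here is a minimal sketch using only the Python standard library. The function name `generate_card_fraud_rows`, the feature names, and the distributions are all assumptions made for illustration; the built-in generators may work quite differently.

```python
import random

def generate_card_fraud_rows(n, fraud_rate=0.05, seed=0):
    """Generate n synthetic card transactions with a binary fraud label.

    Feature names and distributions are illustrative assumptions,
    not the platform's actual generator.
    """
    rng = random.Random(seed)  # seeded so the output is reproducible
    rows = []
    for _ in range(n):
        is_fraud = rng.random() < fraud_rate
        # Fraudulent rows draw amounts from a higher-mean log-normal.
        amount = rng.lognormvariate(5.0 if is_fraud else 3.5, 1.0)
        rows.append({
            "amount": round(amount, 2),
            "hour_of_day": rng.randrange(24),
            # Foreign transactions are more likely among fraudulent rows.
            "is_foreign": rng.random() < (0.4 if is_fraud else 0.05),
            "label": int(is_fraud),
        })
    return rows

rows = generate_card_fraud_rows(1000)
print(len(rows), sum(r["label"] for r in rows))
```

Because no real customer records are involved, data generated this way can be shared freely across experiments, which is the sense in which synthetic data supports safe experimentation.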

Applied security patches and dependency upgrades
Improved evaluation stability
Enhanced reliability across experiments and trace workflows
Delivered UX refinements to improve overall platform responsiveness and consistency