Archive

2026

January 9 - Chat Sessions in Observability
January 13 - Playground UX Improvements
January 20 - Test Set Versioning and New Test Set UI
January 28 - Navigation Links from Traces to App/Environment/Variant
January 29 - Onboarding Widget and Guided Walkthroughs
February 4 - Folders for Prompt Organization
February 17 - Enterprise Compliance Features
February 25 - AI-Powered Prompt Refinement in the Playground
February 27 - Tool Integrations in the Playground
March 11 - Webhooks and GitHub Automations for Prompt Deployments
April 14 - Unified Invoke API
May 18 - Annotation Queues
June 5 - Dark Mode
June 9 - Evaluate While You Iterate in the Playground

2025

January 4 - New Onboarding Flow
January 15 - Agenta is SOC 2 Type 1 Certified
January 27 - Quality of life improvements
February 4 - New Playground
March 11 - OpenTelemetry Compliance and Custom workflows from API
March 19 - Improvements to the Playground and Custom Workflows
April 7 - New Feature: Prompt and Deployment Registry
April 15 - Structured Output Support in the Playground
April 18 - We are SOC 2 Type 2 Certified
May 2 - Documentation Overhaul, New Models, and Platform Improvements
May 10 - Tool Support in the Playground
May 15 - Annotate Your LLM Response (preview)
June 17 - LlamaIndex Integration
July 29 - Support for Images in the Playground
August 7 - Major Playground Improvements and Enhancements
August 12 - Open-sourcing our Product Roadmap
August 29 - DSPy Integration
September 9 - Multiple Metrics in Human Evaluation
September 19 - Speed Improvements in the Playground
September 24 - Deep URL Support for Sharable Links
September 26 - New Evaluation Results Dashboard
October 14 - Filtering Traces by Annotation
October 24 - Vertex AI Provider Support
November 3 - Documentation Architecture Overhaul
November 10 - Customize LLM-as-a-Judge Output Schemas
November 11 - Online Evaluation
November 12 - Evaluation SDK
November 13 - Agenta Core is Now Open Source
November 17 - Jinja2 Template Support in the Playground
November 18 - Reasoning Effort Support in the Playground
November 20 - Provider Built-in Tools in the Playground
December 4 - Projects within Organizations
December 14 - Agenta Documentation MCP Server
December 17 - PDF Support in the Playground
December 31 - JSON Multi-Field Match Evaluator

2024

January 12 - Adding Cost and Token Usage to the Playground
January 22 - Revamping evaluation
January 24 - Improved human evaluation workflow
January 25 - Bring your own API key
January 29 - Improved error handling in evaluation
January 30 - New JSON Evaluator
January 31 - Prompt Versioning
February 4 - Minor fixes
February 14 - Deployment Versioning and RBAC
March 4 - Highlight output difference when comparing evaluations
March 11 - Minor improvements
March 25 - New evaluators
March 31 - Minor improvements
April 1 - Compare latency and costs
April 14 - Observability (beta)
April 23 - Evaluation Speed Increase and Numerous Quality of Life Improvements
April 28 - Miscellaneous Improvements
May 1 - Prompt and Configuration Registry
May 24 - Playground Improvements
May 25 - New LLM Provider: Welcome Gemini!
June 4 - Evaluators can access all columns
July 5 - More Reliable Evaluations
July 9 - Migration from MongoDB to Postgres
August 12 - RAGAS Evaluators and Traces in the Playground
August 20 - New Alpha Version of the SDK for Creating Custom Applications
August 22 - UI Redesign, Configuration Management, and Overview View
September 22 - Evaluator Testing Playground and a New Evaluation View
October 22 - New Application Management View and Various Improvements
November 6 - Observability and Prompt Management
November 29 - Viewing Traces in the Playground and Authentication for Deployed Applications
December 11 - Add Spans to Test Sets

2023

January 1 - Changes to the SDK
October 23 - Launch of SDK Version 2 and Cloud-hosted Version
November 2 - Cypress Tests and UI Improvements
November 12 - Sentry Integration and User Communication Improvements
November 17 - Enhanced Self-hosting and Mistral Model Tutorial
December 1 - Multiple UI and CSV Reader Fixes
December 1 - Introduction of Chat-based Applications
December 7 - Minor Adjustments for Better Performance
December 7 - Bug Fix for Application Saving
December 12 - Integrated File Input and UI Enhancements
December 12 - Comprehensive Updates and Bug Fixes
December 18 - Resolved Batch Logic Issue in Evaluation
December 19 - Improving Side-by-side Comparison in the Playground