2025
- January 4 - New Onboarding Flow
- January 15 - Agenta is SOC 2 Type 1 Certified
- January 27 - Quality of life improvements
- February 4 - New Playground
- March 11 - OpenTelemetry Compliance and Custom workflows from API
- March 19 - Improvements to the Playground and Custom Workflows
- April 7 - New Feature: Prompt and Deployment Registry
- April 15 - Structured Output Support in the Playground
- April 18 - We are SOC 2 Type 2 Certified
- May 2 - Documentation Overhaul, New Models, and Platform Improvements
- May 10 - Tool Support in the Playground
- May 15 - Annotate Your LLM Response (preview)
- June 17 - LlamaIndex Integration
- July 29 - Support for Images in the Playground
- August 7 - Major Playground Improvements and Enhancements
- September 9 - Multiple Metrics in Human Evaluation
- September 19 - Speed Improvements in the Playground
- September 24 - Deep URL Support for Sharable Links
- September 26 - New Evaluation Results Dashboard
- October 14 - Filtering Traces by Annotation
- October 24 - Vertex AI Provider Support
- November 3 - Documentation Architecture Overhaul
- November 10 - Customize LLM-as-a-Judge Output Schemas
- November 11 - Online Evaluation
- November 12 - Evaluation SDK
- November 13 - Agenta Core is Now Open Source
- November 14 - Changelog
2024
- January 12 - Adding Cost and Token Usage to the Playground
- January 22 - Revamping evaluation
- January 24 - Improved human evaluation workflow
- January 25 - Bring your own API key
- January 29 - Improved error handling in evaluation
- January 30 - New JSON Evaluator
- January 31 - Prompt Versioning
- February 4 - Minor fixes
- February 14 - Deployment Versioning and RBAC
- March 4 - Highlight output difference when comparing evaluations
- March 11 - Minor improvements
- March 25 - New evaluators
- March 31 - Minor improvements
- April 1 - Compare latency and costs
- April 14 - Observability (beta)
- April 23 - Evaluation Speed Increase and Numerous Quality of Life Improvements
- April 28 - Miscellaneous Improvements
- May 1 - Prompt and Configuration Registry
- May 24 - Playground Improvements
- May 25 - New LLM Provider: Welcome Gemini!
- June 4 - Evaluators can access all columns
- July 5 - More Reliable Evaluations
- July 9 - Migration from MongoDB to Postgres
- August 12 - RAGAS Evaluators and Traces in the Playground
- August 20 - New Alpha Version of the SDK for Creating Custom Applications
- August 22 - UI Redesign, Configuration Management, and Overview View
- September 22 - Evaluator Testing Playground and a New Evaluation View
- October 22 - New Application Management View and Various Improvements
- November 6 - Observability and Prompt Management
- November 29 - Viewing Traces in the Playground and Authentication for Deployed Applications
- December 11 - Add Spans to Test Sets
2023
- January 1 - Changes to the SDK
- October 23 - Launch of SDK Version 2 and Cloud-hosted Version
- November 2 - Cypress Tests and UI Improvements
- November 12 - Sentry Integration and User Communication Improvements
- November 17 - Enhanced Self-hosting and Mistral Model Tutorial
- December 1 - Multiple UI and CSV Reader Fixes
- December 1 - Introduction of Chat-based Applications
- December 7 - Minor Adjustments for Better Performance
- December 7 - Bug Fix for Application Saving
- December 12 - Integrated File Input and UI Enhancements
- December 12 - Comprehensive Updates and Bug Fixes
- December 18 - Resolved Batch Logic Issue in Evaluation
- December 19 - Improving Side-by-side Comparison in the Playground