Learn how multi-agent forecasting systems coordinate specialized time-series models with AI routing for accurate predictions and reduced latency.
Multi-agent forecasting represents a fundamental shift in how organizations approach time-series prediction. Instead of relying on a single monolithic model to handle all forecasting scenarios, a multi-agent system deploys multiple specialized forecasting agents—each optimized for specific data patterns, seasonality characteristics, or business domains—and uses an intelligent router or supervisor agent to coordinate their efforts.
The core insight is simple: no single forecasting algorithm excels at every problem. ARIMA (AutoRegressive Integrated Moving Average) performs exceptionally well on stationary series with clear autocorrelation structure. Prophet, developed by Meta, shines when you have strong seasonal patterns and holiday effects. Machine learning-based approaches like XGBoost or neural networks capture complex nonlinear relationships. By deploying these as independent agents and routing each prediction task to the most appropriate specialist, you achieve both accuracy gains and operational resilience.
This architecture mirrors real-world organizational structures. A data science team doesn't assign every forecasting problem to one person; instead, they route demand forecasting to the supply chain specialist, financial projections to the FP&A analyst, and infrastructure capacity planning to the platform engineer. Multi-agent systems codify this specialization into software.
At D23, we've seen this pattern emerge consistently across mid-market and scale-up organizations building embedded analytics and self-serve BI platforms. When teams integrate AI-powered forecasting into dashboards or products, the ability to route queries intelligently—rather than force-fitting all time-series data through one model—becomes the difference between a system that scales and one that fails on edge cases.
The routing supervisor (or orchestrator) is the decision-making brain of a multi-agent forecasting system. Its job is to examine incoming data and determine which specialist agent should handle the prediction task. This routing decision directly impacts accuracy, latency, and computational cost.
A well-designed supervisor typically evaluates:
Supervisors can be implemented as rule-based systems (if ACF > 0.7, route to ARIMA) or as learned models themselves. The most sophisticated implementations use large language models (LLMs) as supervisors. An LLM can read a time-series description in natural language—"quarterly SaaS revenue with increasing growth rate and a Q4 spike"—and route intelligently without explicit rules.
Google's approach to multi-agent forecasting, detailed in their multi-agent system for superior business forecasting, demonstrates this principle at scale. They combine data agents that prepare and analyze series characteristics with prediction agents that execute forecasts, coordinated by a supervisor that learns which agent performs best for each data pattern.
Each specialist agent in a multi-agent forecasting system is a focused, optimized implementation of a forecasting algorithm. Let's examine the three most common specialists and when they excel.
AutoRegressive Integrated Moving Average (ARIMA) has been the workhorse of time-series forecasting for decades. It models a series as a linear combination of its own past values (the autoregressive component), differences of the series to achieve stationarity (the integrated component), and past forecast errors (the moving average component).
ARIMA excels when:
ARIMA struggles with:
In a multi-agent system, ARIMA typically handles baseline forecasts for well-behaved, stationary series. It's fast to train, interpretable, and provides a reliable fallback.
Prophet, released by Meta (formerly Facebook), was specifically designed for business time-series forecasting at scale. It decomposes a series into trend, seasonality, and holiday effects, making it particularly effective for retail, e-commerce, and SaaS metrics.
Prophet excels when:
Prophet struggles with:
In a multi-agent system, Prophet is your go-to agent for consumer-facing metrics: website traffic, app downloads, subscription churn, and seasonal product demand. It's also valuable when business users need to understand why a forecast looks the way it does.
Machine learning approaches—XGBoost, LightGBM, neural networks, and ensemble methods—excel at capturing complex, nonlinear patterns that statistical models miss. They can incorporate exogenous variables (like marketing spend, weather, or competitor pricing) directly into the model.
ML agents excel when:
ML agents struggle with:
In a multi-agent system, ML agents handle complex, multivariate forecasting. They're particularly valuable when you have rich feature sets: e-commerce platforms routing to ML agents for product-level demand forecasting, SaaS companies using ML agents for logo churn with account health signals, and financial services using ML agents for credit risk or fraud prediction.
The effectiveness of a multi-agent forecasting system hinges entirely on routing logic. Poor routing—sending a series to the wrong agent—undermines all the benefits of specialization.
A practical routing supervisor begins by extracting statistical features from the incoming time series:
These features create a decision tree or classifier that routes each series to the appropriate agent. A simple rule might be:
If seasonality_strength > 0.7 AND has_exogenous_features == False:
Route to Prophet
Else if ACF[1] > 0.8 AND stationarity == True:
Route to ARIMA
Else:
Route to ML_ensemble
More sophisticated systems use language models as supervisors. An LLM can read metadata about a time series—"quarterly revenue for a SaaS company with strong Q4 seasonality and accelerating growth"—and reason about which agent to use without explicit rules.
Research on agentic frameworks like MoiraiAgent, which integrates contextual signals and uses LLM for expert selection in time-series forecasting, demonstrates that LLMs can effectively route between specialized forecasting agents. The LLM acts as a reasoning layer, weighing the characteristics of the data against the known strengths and weaknesses of each agent.
LLM-based routing offers several advantages:
Once each specialist agent generates a forecast, the supervisor must decide how to combine them. Simple averaging often underperforms; instead, practical systems use weighted averaging or meta-learning.
Weighted averaging assigns higher weights to agents with proven track records. If Prophet has historically forecasted retail revenue with 15% MAPE (Mean Absolute Percentage Error) and ARIMA with 22% MAPE, the final forecast weights Prophet at 60% and ARIMA at 40%.
Meta-learning trains a secondary model to learn optimal weights for each agent based on the characteristics of the current series. This is more sophisticated but requires historical forecast accuracy data.
Confidence intervals from each agent can be combined using Bayesian methods. If one agent produces a very narrow interval (high confidence) and another produces a wide interval (low confidence), the final interval reflects this uncertainty distribution.
New products, new markets, or sparse data present a challenge: you don't have enough history to train complex models reliably. A well-designed multi-agent system routes these cases strategically.
When data is sparse (< 50 observations):
When data is completely new (< 10 observations):
Time-series forecasting is never "done." As new data arrives, models degrade, and agents must adapt. A production multi-agent system includes mechanisms for incremental learning:
Multi-agent forecasting systems are most valuable when integrated into analytics and BI platforms where business users can access predictions without building models themselves. This is where platforms like D23, built on Apache Superset, become critical.
A self-serve BI platform can expose multi-agent forecasting as a dashboard feature. Users select a metric, choose a forecast horizon, and the platform automatically:
This requires tight integration between the forecasting system and the BI platform's query engine. The D23 platform architecture supports this through API-first design and MCP (Model Context Protocol) server integration, allowing forecasting agents to be called directly from dashboard queries.
Advanced platforms use text-to-SQL capabilities combined with multi-agent routing. A user asks: "Forecast next quarter's revenue with uncertainty bounds." The system:
This pattern aligns with research on TimeSeriesScientist, an LLM-driven agentic framework for time-series forecasting with tool-augmented reasoning. By combining language understanding with forecasting agents, platforms enable non-technical users to access sophisticated predictions.
For embedded analytics use cases—forecasts embedded directly in product UIs—a multi-agent system must expose forecasting as an API. A SaaS platform might call:
POST /api/forecast
{
"metric": "churn_rate",
"series": [0.02, 0.019, 0.021, ...],
"horizon": 30,
"confidence_level": 0.95
}
The API routes internally to the appropriate agent, returns a forecast with confidence intervals, and logs performance for future routing optimization. This enables embedded analytics—forecasts shown directly to customers in your product—without exposing the complexity of the underlying system.
The frontier of multi-agent forecasting incorporates foundation models—large, pre-trained models that have learned patterns across millions of time series.
TimesFM, Google's decoder-only foundation model for time-series forecasting, represents a new class of specialist agent. Unlike ARIMA or Prophet, which are trained on your specific data, foundation models are pre-trained on diverse time-series data and can generalize to new domains with minimal fine-tuning.
In a multi-agent system, TimesFM (or similar models like Chronos or NeuralForecast) serve as a universal "catch-all" agent that can handle any time series reasonably well. The routing logic becomes:
Foundation models excel at:
The most sophisticated multi-agent systems incorporate contextual signals beyond the time series itself. Research on FinVision, a multi-modal multi-agent framework for financial prediction, demonstrates this approach.
Contextual signals might include:
A multimodal agent system routes not just on time-series characteristics but on the full context. For example:
A multi-agent system is only as good as its accuracy. Measuring performance and continuously optimizing routing and agent selection is essential.
Standard forecasting accuracy metrics include:
For multi-agent systems, also track:
Proper evaluation requires walk-forward validation: simulate real-world usage by forecasting historical periods, comparing predictions to actual outcomes, and measuring accuracy.
A robust backtesting process:
Once deployed, monitor forecasting performance continuously:
Running multiple agents and aggregating forecasts costs more than a single model. In a production system:
For a dashboard with 1000 metrics forecasted daily, this compounds. A practical system might:
When forecasts drive business decisions (inventory, hiring, budgeting), stakeholders need to understand why. A multi-agent system can provide interpretability at multiple levels:
LLM-based supervisors excel here because they can explain reasoning in natural language.
Time series often experience structural breaks: a product pivot, market disruption, or business model change that fundamentally alters the pattern. Multi-agent systems can detect and handle these:
Multi-agent forecasting represents the maturation of time-series prediction from a specialized skill to an operational capability. By deploying specialized agents and routing intelligently, organizations achieve accuracy that no single model can match, while maintaining interpretability and operational resilience.
The convergence of three trends accelerates adoption:
Organizations building self-serve BI systems—whether through platforms like D23 or custom implementations—can now expose forecasting as a first-class citizen. Users ask questions in natural language, the system routes to the appropriate agent, and predictions appear in dashboards with confidence intervals and explanations.
For data leaders evaluating managed analytics solutions, the ability to deploy multi-agent forecasting without building custom infrastructure is a significant advantage. Instead of hiring specialists to implement ARIMA, Prophet, and ML models separately, you can configure a system that coordinates them automatically.
The future of forecasting isn't a single powerful model; it's an ensemble of specialists, intelligently routed, continuously learning, and deeply integrated into the analytics workflows where decisions are made.