Skip to content

[FEATURE] Expose execution lifecycle hooks and metadata for agent reliability tooling #2980

@ishita-0301

Description

@ishita-0301

Expose execution lifecycle hooks and metadata for agent reliability tooling

Long running agents and workflows can encounter issues such as repeated retries, circular execution paths, stalled progress, and other forms of non productive execution.

Today, many of these behaviors are difficult to observe or intercept without embedding reliability logic directly into application code.

Proposal

Expose lightweight execution lifecycle hooks and metadata around agent and tool execution.

Examples could include

BeforeToolCall(...)
AfterToolCall(...)
OnRetry(...)
OnFailure(...)
OnLoopDetected(...)

Along with metadata such as

ExecutionID
ParentExecutionID
ToolName
AttemptCount
StepCount
Duration
CorrelationID

Benefits

This would make it possible to build reliability features as separate components rather than coupling them to business logic.

Potential use cases include

  • Loop detection
  • Retry policies
  • Circuit breakers
  • Execution auditing
  • Progress tracking
  • Resource monitoring
  • Custom recovery workflows

The goal would not be to prescribe a particular implementation, but to expose enough information for the community to experiment with different approaches.

I have seen similar ideas emerging in agent runtimes where reliability is treated as a separate concern from orchestration. Some projects are beginning to build runtime policies around execution patterns such as repeated tool calls, excessive retries, and stalled workflows. Having standardized lifecycle events in Go Micro would make it much easier to integrate or experiment with those kinds of reliability layers.
One example is https://github.com/FailproofAI/failproofai, which focuses on runtime detection of loops and execution failures rather than orchestration itself.

Go Micro already provides strong primitives for communication and orchestration. Exposing execution lifecycle events feels like a natural next step for supporting more autonomous and long running agent workloads.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions