What is durable browser execution?

Most browser-automation work fits inside a single short-lived run: minutes, not hours. The work that doesn't fit — multi-step agent workflows, long-tail data pulls across hundreds of pages, KYC flows that wait days for human verification — has a different problem. The process doing the work might restart, the cloud browser might cycle, the worker pod might get evicted, and the workflow needs to survive all of that without re-doing the work. Durable execution is the pattern that makes that survivable: every step's state gets checkpointed so the workflow can resume from any point, not just the beginning.

Where the idea comes from

Durable execution as a concept didn't start with browsers. It comes from the orchestration world:

Temporal popularized the pattern at scale: workflows as deterministic functions, every step's input and output recorded, replays restore state on resume.
Inngest brought it to function-as-a-service shape: durable steps inside a serverless handler.
Vercel Workflow brought the model to Next.js / Vercel deployments.

All three share the same essential property: each step in the workflow is recorded, and a workflow can be resumed by replaying those records up to the point of failure. Applied to browser automation, this turns "the agent ran for 45 minutes and the worker crashed at minute 30 — start over" into "resume from minute 30."

What gets checkpointed

For a browser workflow, four classes of state need to survive a restart:

Workflow input. Whatever arguments triggered the run.
Step outputs. Each step's return value, recorded so the next replay sees the same data.
Browser state. Cookies, storage, the live session — usually persisted to a browser profile so a new session can pick up where the old one left off.
Side-effect markers. "Did we already send the email?" "Did we already submit the form?" — boolean flags so a resumed workflow doesn't repeat side effects.

The discipline is making every step idempotent or marker-checked. Anything that isn't will misbehave on resume.

When durable execution is the right primitive

Three patterns where it earns the engineering cost:

Long-running agent runs. An agent that takes 30+ minutes to complete a complex multi-step workflow. A worker restart shouldn't restart the run.
Workflows that wait on humans. KYC, manual review, customer-side approval. The workflow pauses for hours-to-days; the runtime should sleep durably, not keep a worker alive.
Multi-step orchestrations across services. Browser action → API call → database write → another browser action. Each step is a durable boundary.

For short workflows that complete in seconds-to-minutes inside one process, durable execution is overkill. The complexity tax doesn't earn back. Reach for it when the runtime starts to span the lifecycle of the worker itself.

Durable execution vs. retry vs. orchestration

Three concepts that overlap. The clean split:

Retry is "re-run the step on failure." Same input, same expected output. Covered in how do browser agents recover from errors.
Durable execution is "the workflow's position survives a restart." Each step's output is recorded; replay rebuilds state.
Browser orchestration is "coordinate many sessions and steps." Often uses durable execution as a primitive but adds the parallelism, fanout, and state-machine layers on top.

Production stacks tend to use all three at different layers.

How it interacts with Notte

Notte itself doesn't ship a durable-execution runtime — it ships browser sessions, agents, profiles, and Functions. Durable orchestration sits one layer up: you'd use Temporal / Inngest / Vercel Workflow / Trigger.dev to drive Notte primitives, with each client.Function.run(...) call (or each agent action) becoming a durable step in the orchestration. The Notte side handles "run a step"; the durable layer handles "track which steps ran and resume on failure."

A typical integration shape: each Notte Function call is one step in a Temporal workflow. The Temporal workflow records the step's output. If the Temporal worker restarts mid-workflow, replay reaches the step that already ran, skips it (the output is recorded), and continues from the next step.

Common pitfalls

Adding durable execution before you need it. A 30-second workflow doesn't need it. The complexity is real.
Non-idempotent steps without markers. A step that sends an email on each run will send the email twice if the workflow resumes through it. Add a "did we send?" check.
Treating browser state as ephemeral when it should be durable. A workflow that depends on session state needs that state in a profile, not in memory.
Ignoring the LLM determinism issue. A replayed step that re-invokes an LLM may produce different output. Either pin model + temperature + seed, or record the LLM output as part of the durable state.

Key takeaways

Durable execution checkpoints workflow state at every step, so a long-running browser workflow can survive restarts and resume from where it stopped.
The pattern was popularized outside the browser world (Temporal, Inngest, Vercel Workflow) and now applies to long-running browser-driven workflows.
Reach for it when runs span 30+ minutes, wait on humans, or orchestrate across services. Skip it for short workflows.
Notte ships the browser primitives; durable orchestration sits a layer above and uses Notte as a step. Pair with browser orchestration for coordinating multiple sessions.

What is durable browser execution?