NATS JetStream Streams, Consumers, and Replay

Overview
Stream Basics
Consumer Types
Ack and Redelivery
Replay and Backfill
Go Producer Example
Go Pull Consumer Example
Idempotency and Ordering
Operational Patterns
Conclusion

NATS JetStream Streams, Consumers, and Replay

Overview

This article focuses on how to use JetStream once it is deployed:

Create streams with subjects and retention rules.
Build consumers that acknowledge work explicitly.
Replay old data for recovery or backfill.
Write Go code that treats message delivery as stateful work.

If the deployment article is about keeping the broker healthy, this one is about using the durable message log correctly.

Stream Basics

A JetStream stream is a named log of messages that match one or more subjects.

Typical use cases:

Durable event fan-out.
Queue-like job distribution.
Audit and replay.
Backfill after a consumer bug.

Create a stream that captures a subject hierarchy:

nats stream add EVENTS \
  --subjects "events.>" \
  --storage file \
  --retention limits \
  --max-msgs=-1 \
  --max-age=72h

That configuration means:

Store all events.* traffic.
Keep data on disk.
Retain messages for 72 hours.

Design your subjects so the stream can absorb a logical family of events:

events.orders.created
events.orders.paid
events.orders.shipped

Avoid overloading a stream with unrelated subject trees.

Consumer Types

JetStream consumers come in two broad forms:

Push consumers deliver messages to a subscription.
Pull consumers let the application fetch messages when ready.

Push consumers are convenient when you want the broker to drive delivery. Pull consumers are better when you want the application to control batching and backpressure.

Example durable push consumer:

nats consumer add EVENTS orders-worker \
  --deliver all \
  --ack explicit \
  --replay instant \
  --filter "events.orders.>"

Example pull consumer:

nats consumer add EVENTS orders-batch \
  --deliver all \
  --ack explicit \
  --replay instant \
  --pull

Use pull consumers when:

You need batch processing.
The downstream system has tight rate limits.
You want the worker to control concurrency.

Ack and Redelivery

Consumer acknowledgements are the core reliability mechanism.

The workflow is simple:

Consumer receives a message.
Consumer processes it.
Consumer acks only after durable side effects complete.
If the ack never arrives, JetStream redelivers.

This makes consumers safe to restart, but it also means handlers must be idempotent.

Ack handling in Go:

package main

import (
  "context"
  "log"
  "time"

  "github.com/nats-io/nats.go"
)

func consumeOrders(ctx context.Context, nc *nats.Conn) error {
  js, err := nc.JetStream()
  if err != nil {
    return err
  }

  msgs, err := js.PullSubscribe(
    "events.orders.>",
    "orders-batch",
    nats.BindStream("EVENTS"),
  )
  if err != nil {
    return err
  }

  for {
    batch, err := msgs.Fetch(10, nats.Context(ctx), nats.MaxWait(5*time.Second))
    if err != nil {
      return err
    }

    for _, msg := range batch {
      if err := processOrder(msg.Data); err != nil {
        _ = msg.Nak()
        continue
      }
      if err := msg.Ack(); err != nil {
        return err
      }
    }
  }
}

func processOrder(data []byte) error {
  log.Printf("process order event: %s", string(data))
  return nil
}

Notice the ordering:

Process first.
Ack second.
Nak on recoverable failures.

If the worker crashes after processing but before acking, the message can come back. That is expected.

Replay and Backfill

JetStream replay is what makes a stream useful after a defect or deployment mistake.

Common replay modes:

Replay everything from the beginning.
Replay only from a time window.
Replay only for a specific subject filter.

That makes recovery workflows practical:

Deploy a fix.
Rewind the consumer or create a new one.
Reprocess from the stream.
Compare output counts and spot gaps.

Backfill is especially useful when:

A consumer wrote bad data downstream.
A new index or materialized view must be rebuilt.
A service needs to rebuild a cache from historical events.

If you need a one-off backfill, create a separate consumer instead of rewinding the production consumer blindly.

Go Producer Example

Producers only need to know the subject and the payload.

package main

import (
  "context"
  "encoding/json"

  "github.com/nats-io/nats.go"
)

type OrderCreated struct {
  OrderID string `json:"order_id"`
  UserID  string `json:"user_id"`
  Total   int64  `json:"total_cents"`
  Currency string `json:"currency"`
}

func publishOrderCreated(ctx context.Context, nc *nats.Conn, event OrderCreated) error {
  _ = ctx

  js, err := nc.JetStream()
  if err != nil {
    return err
  }

  payload, err := json.Marshal(event)
  if err != nil {
    return err
  }

  _, err = js.Publish("events.orders.created", payload)
  return err
}

The producer should not decide how the message is consumed. It should only publish a clear event to a stable subject.

Go Pull Consumer Example

Pull consumers are a good default for batch-oriented processing.

package main

import (
  "context"
  "time"

  "github.com/nats-io/nats.go"
)

func runBatchWorker(ctx context.Context, nc *nats.Conn) error {
  js, err := nc.JetStream()
  if err != nil {
    return err
  }

  sub, err := js.PullSubscribe(
    "events.orders.>",
    "orders-batch",
    nats.BindStream("EVENTS"),
    nats.AckExplicit(),
    nats.MaxAckPending(1000),
  )
  if err != nil {
    return err
  }

  for {
    msgs, err := sub.Fetch(50, nats.Context(ctx), nats.MaxWait(10*time.Second))
    if err != nil {
      return err
    }

    for _, msg := range msgs {
      if err := handleOrderEvent(msg.Data); err != nil {
        _ = msg.Nak()
        continue
      }
      if err := msg.Ack(); err != nil {
        return err
      }
    }
  }
}

func handleOrderEvent(data []byte) error {
  _ = data
  return nil
}

Operationally, the important knobs are:

Fetch batch size.
MaxAckPending.
Ack timeout.
Retry and backoff behavior.

Tune those based on downstream latency, not just broker capacity.

Idempotency and Ordering

JetStream gives you delivery guarantees, not business-level exactly-once semantics.

Design consumers so repeated delivery is harmless:

Use event IDs.
Track processed IDs in a durable store.
Make writes idempotent.
Use upserts when appropriate.

Ordering is also subtle:

Stream order is preserved within the log.
Parallel consumers can process out of order.
A redelivery may arrive after newer messages are already handled.

That means consumers should rely on event timestamps or sequence numbers when strict ordering matters.

Operational Patterns

Useful patterns for production:

Use one stream per domain or lifecycle boundary.
Keep dead-letter subjects separate from mainline traffic.
Create durable consumers for long-lived workers.
Use ephemeral consumers for ad hoc inspection.
Add replay scripts for recovery jobs.

Examples:

orders.events.v1
orders.audit.v1
orders.deadletter.v1

Avoid:

One catch-all stream for unrelated systems.
Ad hoc consumers created by every pod startup.
No retention policy.
No accounting for duplicate deliveries.

Conclusion

JetStream is straightforward when you keep the model honest:

Streams are durable logs.
Consumers track acknowledgement state.
Replay is a recovery tool.
Idempotency is mandatory.

If you want durable messaging with operationally simple semantics, JetStream is a strong fit. If you want workflow orchestration or transactional processing, use something else.

NATS JetStream Streams, Consumers, and Replay

Table of Contents

NATS JetStream Streams, Consumers, and Replay

Overview

Stream Basics

Consumer Types

Ack and Redelivery

Replay and Backfill

Go Producer Example

Go Pull Consumer Example

Idempotency and Ordering

Operational Patterns

Conclusion