The event model
===============

Generated parsers do not materialize a parse tree. They emit a flat sequence of **events**, and a program that consumes events can reconstruct whatever tree (or no tree at all) it needs. This page specifies the event stream — the contract every backend implements.

The four events
---------------

Every event is one of:

``Enter``
    Opens the subtree of a rule. Carries the rule's ``RuleKind`` and a ``Pos`` marking the start of the subtree (the position of its first child, or the position of its matching ``Exit`` for an empty rule). Only non-fragment rules produce this event — fragment rules are inlined without any ``Enter``/``Exit`` markers.

``Exit``
    Closes the matching ``Enter``. Carries the same ``RuleKind`` and a ``Pos`` marking the end of the subtree (the position just past the last consumed token of the rule's content, equal to the enter position if nothing was consumed).

``Token``
    A lexed token. Carries:

    * ``kind`` — a ``TokenKind`` value identifying which token declaration this matches. The reserved sentinels ``EOF`` and ``ERROR`` are also possible; see below.
    * ``span`` — a ``Span`` covering the matched input.
    * ``text`` — the matched source text, exactly as it appeared. Un-escaping, numeric conversion, and other transforms are not performed by the parser.

``Error``
    A recoverable diagnostic. Carries a human-readable message and a ``Span`` pointing at the offending lookahead. The parser continues emitting events after an error, so a file with many errors still yields a useful stream.

Every backend names these four cases the same way in its idiomatic tagged-union form — in TypeScript they are ``{tag: "enter" | "exit" | "token" | "error", ...}``; in Rust they are ``Event::Enter { .. }`` and friends; in Python they are ``Event`` objects with a ``.tag`` string attribute; in Go they are distinguished by an ``EventTag`` constant.
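As a concrete illustration of the Python shape, here is a minimal sketch of the three types. The dataclass layout and the use of optional fields for per-tag payloads are assumptions for illustration; they are not the generated backend's exact API.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical sketch: field names follow the text above, but the
# exact class layout is an assumption, not the generated API.

@dataclass(frozen=True)
class Pos:
    offset: int   # 0-based byte offset into the source
    line: int     # 1-based line number
    column: int   # 1-based column, counted in Unicode codepoints

@dataclass(frozen=True)
class Span:
    start: Pos
    end: Pos      # half-open: [start, end)

@dataclass(frozen=True)
class Event:
    tag: str                       # "enter" | "exit" | "token" | "error"
    rule: Optional[str] = None     # RuleKind, present on enter/exit
    kind: Optional[int] = None     # TokenKind id, present on token
    span: Optional[Span] = None    # present on token and error
    text: Optional[str] = None     # matched source text, present on token
    message: Optional[str] = None  # diagnostic, present on error

start = Pos(offset=0, line=1, column=1)
end = Pos(offset=3, line=1, column=4)
ev = Event(tag="token", kind=1, span=Span(start, end), text="let")
assert ev.span.end.offset - ev.span.start.offset == len(ev.text)
```

A real backend would likely use a proper union (one class per tag), but a single record with a ``tag`` discriminant is enough to follow the rest of this page.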
Ordering guarantees
-------------------

* **Source order.** Events are emitted in the order their source bytes appear. Skip tokens (see below) are interleaved with structural events accordingly.
* **Balanced structure.** Every ``Enter`` is matched by exactly one ``Exit`` for the same ``RuleKind``. Errors or recovery do not cause unmatched ``Enter``/``Exit`` pairs — if the parser commits to a rule, it finishes the rule.
* **Finality.** Events are never retracted or reordered. A consumer can commit to a side effect on each event as it arrives.
* **Termination.** The stream ends when the parser reaches the end of input. If there are trailing bytes after the start rule completes, the parser emits an "expected end of input" error and consumes the remaining tokens before terminating.

Building a tree from events
---------------------------

The canonical consumer keeps a stack: push a new node on ``Enter``, attach tokens as children of the top-of-stack node, and pop on ``Exit``. In pseudocode::

    stack = [root]
    for ev in parser:
        match ev.tag:
            case "enter":
                node = make_node(ev.rule)
                stack[-1].children.append(node)
                stack.append(node)
            case "token":
                stack[-1].children.append(ev.token)
            case "exit":
                stack.pop()
            case "error":
                errors.append(ev.error)

This is the direct, mechanical translation — consumers that want a typed AST typically switch on ``ev.rule`` inside ``enter`` to pick the right node type, and switch on ``ev.token.kind`` inside ``token`` to decode the leaf.

Skip tokens
-----------

Tokens declared with the ``?`` prefix (whitespace, comments) are **skips**. The parser's state machine does not see them — they are never consumed by ``Expect`` or examined by lookahead.
The runtime re-inserts them into the event stream just before the next structural event, so consumers that want trivia (formatters, highlighters) see skips in their correct source position, while consumers that only care about structure can filter them out by kind or by the fact that they appear outside any rule scope.

The ``Pos`` and ``Span`` types
------------------------------

Every backend exposes the same two shapes:

``Pos``
    ``{offset, line, column}``. ``offset`` is a 0-based byte offset into the source. ``line`` is 1-based. ``column`` is 1-based and counted in Unicode codepoints within the line (not bytes, not grapheme clusters).

``Span``
    A half-open ``[start, end)`` pair of ``Pos`` values. ``span.start == span.end`` denotes a zero-width span at a point — used, for example, for the ``Enter`` of an empty rule.

Reserved token kinds
--------------------

Two token kinds are reserved and never collide with a grammar token:

``EOF`` (kind id ``0``)
    Emitted once by the lexer when the input is exhausted. The parser consumes it internally; consumers typically do not see an ``EOF`` token, but may see one inside ``Token`` events during error recovery in pathological cases.

``ERROR`` (kind id ``-1``)
    Emitted by the lexer when no token pattern matches at the current position. The lexer still advances by one codepoint so the parser can keep making progress. You will see an ``ERROR`` token in the event stream at the offending position, accompanied by a nearby ``Error`` event explaining what was expected.

Error recovery, observably
--------------------------

Two things happen on an unexpected token:

1. An ``Error`` event is emitted with a message like ``"expected X"`` and a span over the current lookahead.
2. The parser runs recovery — it consumes tokens until the lookahead matches a token in the enclosing rule's synchronization set (essentially that rule's ``FOLLOW`` plus ``EOF``), then retries the expectation once.
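The two steps above can be sketched as a small loop. This is a hypothetical model, not the generated code: tokens are ``(kind, text)`` pairs, events are ``(tag, payload)`` pairs, and ``sync_set`` stands in for the enclosing rule's synchronization set.

```python
# Hypothetical sketch of recovery. EOF uses the reserved kind id 0
# from above; all other names here are illustrative only.
EOF = 0

def recover(tokens, i, expected, sync_set, events):
    """Emit an Error event, then consume tokens until the lookahead is
    in sync_set. Skipped tokens are still emitted as Token events, so
    no input is silently lost."""
    events.append(("error", f"expected {expected}"))
    while i < len(tokens) and tokens[i][0] not in sync_set:
        events.append(("token", tokens[i]))  # skipped, but still emitted
        i += 1
    return i  # the caller retries the expectation once at this position

events = []
tokens = [(7, "@"), (8, "#"), (2, ";"), (EOF, "")]
i = recover(tokens, 0, "identifier", {2, EOF}, events)
assert i == 2 and tokens[i] == (2, ";")        # stopped at the sync token
assert events[0] == ("error", "expected identifier")
assert len(events) == 3                        # one error, two skipped tokens
```

Note that the sync token itself is not consumed by recovery; it is left as the lookahead so the retried expectation (or the enclosing rule) can consume it normally.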
Tokens skipped during recovery are still emitted as ``Token`` events so consumers do not silently lose input. This means a parse of a broken file produces a stream where every input byte is accounted for: some as well-formed tokens, some as errors plus the tokens recovery skipped over. An editor or linter consuming the stream can highlight error spans without losing track of the surrounding structure.
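One way to exercise this guarantee is to check that concatenating the text of every ``Token`` event reproduces the input, errors and all. A minimal sketch, assuming the same hypothetical ``(tag, payload)`` event pairs as above, with token payloads carrying the matched text:

```python
# Hypothetical check: because every input byte reaches the stream as
# some Token event's text (well-formed, ERROR, or skipped-by-recovery),
# concatenating token texts in order must rebuild the source.

def reconstruct(events):
    return "".join(payload for tag, payload in events if tag == "token")

# A broken file "le@ x": a partial keyword, an ERROR token ("@"), a
# whitespace skip token re-inserted in source order, and an identifier.
stream = [
    ("enter", "Decl"),
    ("token", "le"),
    ("token", "@"),       # ERROR token at the offending position
    ("error", "expected identifier"),
    ("token", " "),       # skip token, interleaved in source order
    ("token", "x"),
    ("exit", "Decl"),
]
assert reconstruct(stream) == "le@ x"
```

A formatter or linter can use exactly this property as a self-check: if the reconstruction ever diverges from the source, the consumer has dropped or reordered events.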