MIP developer guide

How to write, test, and debug Memory Intent Protocol routing rules. MIP routes incoming content to the right bank with the right policy using declarative rules — with LLM intent escalation when rules can’t decide.

1. How MIP works

MIP evaluates every retain() call through a two-layer pipeline:

Mechanical rules — priority-ordered, first-match, zero LLM cost.
Escalation check — if no rule matched or confidence is low, check the escalation policy.
LLM intent layer — classifies the content and picks a bank, only when mechanical rules can’t decide.

Override rules (override: true) are checked first and short-circuit the pipeline. Use them for compliance locks that the intent layer must never override.

Config file: mip.yaml, referenced from astrocyte.yaml via mip_config_path. See the Memory Intent Protocol design doc for the full architecture.

2. File structure

version: "1.0"

banks:
  - id: default
    description: General-purpose bank
    access: ["*"]
    compliance: gdpr          # optional
  - id: "student-{student_id}"
    access: [agent:tutor, agent:grader]
    compliance: pdpa
  - id: private-encrypted
    compliance: pdpa

rules:
  - name: example-rule
    priority: 10
    override: false           # optional, default false
    match:
      all:
        - content_type: student_answer
        - metadata.student_id: present
    action:
      bank: "student-{metadata.student_id}"
      tags: ["{metadata.topic}"]
      retain_policy: default  # default | redact_before_store | encrypt | reject

intent_policy:
  escalate_when:
    - matched_rules: 0
    - confidence: { lt: 0.8 }
  model_context: |
    Route content to the correct bank. Banks: {banks}. Tags: {tags}.
  constraints:
    cannot_override: [pii-lockdown]
    must_justify: true
    max_tokens: 200

3. Writing rules

Field	Type	Required	Description
`name`	string	yes	Unique rule identifier
`priority`	int	yes	Lower number = higher priority
`match`	block	yes	Conditions to evaluate
`action`	block	yes	What to do when matched
`override`	bool	no	Compliance lock (checked before normal rules)

Match DSL

Operators: present, absent, eq, in, gte, lte, gt, lt

Fields: content_type, source, pii_detected, tags, metadata.*, signals.*

Composition: all: (AND), any: (OR), none: (NOT)

# Shorthand (implicit all)        # Operator form
match:                             match:
  pii_detected: true                 signals.word_count: { gte: 50 }

# Explicit composition             # in operator
match:                             match:
  all:                               all:
    - content_type: student_answer     - content_type: { in: [lesson, quiz] }
    - metadata.student_id: present
  none:
    - source: internal_test

Action DSL

Field	Type	Description
`bank`	string	Target bank (supports `{metadata.key}` interpolation)
`tags`	list	Tags to attach (supports templates)
`retain_policy`	string	`default`, `redact_before_store`, `encrypt`, `reject`
`escalate`	string	`"mip"` to force LLM escalation
`confidence`	float	Rule confidence score (default 1.0)

Practical examples

# 1. PII to encrypted bank (override -- always wins)
- name: pii-lockdown
  priority: 1
  override: true
  match: { pii_detected: true }
  action: { bank: private-encrypted, tags: [pii, compliance], retain_policy: redact_before_store }

# 2. Route by content type
- name: conversation-to-dialogue
  priority: 10
  match: { all: [{ content_type: conversation }] }
  action: { bank: dialogue-bank, tags: [conversation] }

# 3. Route by metadata (template interpolation)
- name: student-answer
  priority: 10
  match:
    all:
      - content_type: student_answer
      - metadata.student_id: present
  action:
    bank: "student-{metadata.student_id}"
    tags: ["{metadata.topic}", "{metadata.difficulty}"]

# 4. Route by tag
- name: flagged-content
  priority: 20
  match: { all: [{ tags: { eq: "review-needed" } }] }
  action: { bank: review-queue, tags: [flagged] }

# 5. Reject noise (short content, no source agent)
- name: reject-noise
  priority: 50
  match:
    all:
      - signals.word_count: { lt: 10 }
      - metadata.source_agent: absent
  action: { retain_policy: reject }

# 6. Fallback -- escalate to intent layer
- name: unmatched-fallback
  priority: 999
  match: { all: [] }
  action: { escalate: mip }

4. Intent policy (LLM escalation)

intent_policy:
  escalate_when:
    - matched_rules: 0          # no mechanical match
    - confidence: { lt: 0.8 }   # uncertain match
  model_context: |
    Route content to the correct bank. Banks: {banks}. Tags: {tags}.
  constraints:
    cannot_override: [pii-lockdown]
    must_justify: true
    max_tokens: 200

Escalation conditions: matched_rules (number of matches, e.g. 0 = none), confidence (top match score below threshold), conflicting_rules (set true to escalate when multiple rules match). constraints.cannot_override protects override rules from LLM override. must_justify: true requires LLM reasoning.

5. Template interpolation

{field.path} placeholders in bank and tags resolve at routing time:

bank: "student-{metadata.student_id}"   # -> "student-stu-42"
tags: ["{metadata.topic}"]              # -> ["algebra"]

Paths: {metadata.key}, {signals.word_count}, {content_type}, {source}. Unresolved placeholders stay literal.

6. Testing rules locally

Unit testing with pytest

from astrocyte.mip.schema import RoutingRule, MatchBlock, MatchSpec, ActionSpec
from astrocyte.mip.rule_engine import RuleEngineInput, evaluate_rules

rule = RoutingRule(
    name="pii-to-secure", priority=100,
    match=MatchBlock(all_conditions=[
        MatchSpec(field="pii_detected", operator="eq", value=True)
    ]),
    action=ActionSpec(bank="secure-vault", retain_policy="encrypt"),
)
input_data = RuleEngineInput(
    content="User email is test@example.com",
    metadata={}, tags=[], pii_detected=True,
    signals={"word_count": 6.0},
)

matches = evaluate_rules([rule], input_data)
assert len(matches) == 1
assert matches[0].rule.action.bank == "secure-vault"

Testing the full router

from astrocyte.mip.router import MipRouter
from astrocyte.mip.rule_engine import RuleEngineInput
from astrocyte.mip.schema import *

config = MipConfig(version="1.0",
    banks=[BankDefinition(id="secure-vault", compliance="pdpa")],
    rules=[RoutingRule(
        name="pii-lockdown", priority=1, override=True,
        match=MatchBlock(all_conditions=[
            MatchSpec(field="pii_detected", operator="eq", value=True)]),
        action=ActionSpec(bank="private-encrypted", tags=["pii"],
                          retain_policy="redact_before_store"))])

router = MipRouter(config)
decision = router.route_sync(RuleEngineInput(content="PII here", pii_detected=True))
assert decision.bank_id == "private-encrypted"
assert decision.resolved_by == "mechanical"

Testing from YAML

from astrocyte.mip.loader import load_mip_config
config = load_mip_config("mip.yaml")
router = MipRouter(config)

decision = router.route_sync(RuleEngineInput(
    content="2x+3=7", content_type="student_answer",
    metadata={"student_id": "stu-42", "topic": "algebra"},
))
assert decision.bank_id == "student-stu-42"

Testing escalation

route_sync() returns None when escalation would fire:

decision = router.route_sync(RuleEngineInput(
    content="Unexpected content", content_type="unknown",
))
assert decision is None  # would escalate to intent layer in production

For async escalation with LLM, use await router.route(input_data) with a mock LLMProvider.

7. Debugging tips

route_sync() for deterministic debugging — no LLM, pure mechanical rules.

evaluate_rules() directly to see all matching rules, not just the winner:

matches = evaluate_rules(rules, input_data)
for m in matches:
    print(m.rule.name, m.confidence)

Priority: lower number = higher priority (1 beats 10).
Log RuleEngineInput to verify what fields are available. Missing metadata is the most common silent failure.
Common mistakes: wrong field path (studentId vs student_id), signals.* values are floats not ints, operator typos (equals vs eq), forgetting override: true on compliance rules.

8. Loading and env vars

from astrocyte.mip.loader import load_mip_config
config = load_mip_config("mip.yaml")  # YAML parse -> ${ENV_VAR} substitution -> validation

Validation errors raise ConfigError at load time (duplicate rule names, invalid override+escalate combos). Use ${ENV_VAR} in any string value: id: "${TENANT_BANK_ID}".

9. Pipeline shaping (advanced, opt-in)

Most rules should stay routing-only. Sections 1–8 cover the entire authoring surface for routing. Pipeline shaping is a separate, opt-in capability: it lets a rule control how memories are chunked, deduplicated, reranked, and synthesized — not just where they live. Reach for it only when routing alone leaves observable quality on the table (e.g., conversational content needs dialogue chunking; legal docs need strict-evidence reflect).

When a rule’s action.pipeline: block is present, MIP propagates a resolved PipelineSpec through RoutingDecision.pipeline and persists _mip.rule and _mip.pipeline_version on each retained chunk. At recall time the orchestrator looks up the bank’s dominant pipeline rule and applies the rerank / reflect overrides.

9.1 Use a preset (recommended)

- name: conversation
  priority: 10
  match: { all: [{ content_type: conversation }] }
  action:
    bank: dialogue-bank
    pipeline:
      version: 1
      preset: conversational

Built-in presets (defined in astrocyte/mip/presets.py):

Preset	Chunker	Dedup	Rerank	Reflect
`conversational`	dialogue, max 800	0.92 / skip_chunk	keyword 0.08, proper-noun 0.15	`temporal_aware`, promotes `[speaker, occurred_at]`
`document`	paragraph, max 1200, overlap 100	0.95 / skip	keyword 0.10, proper-noun 0.05	`default`
`code`	fixed, 1500, overlap 200	0.98 / skip	keyword 0.12, no proper-noun boost	`evidence_strict`
`evidence_strict`	inherits caller	0.98 / skip	keyword 0.10, proper-noun 0.05	`evidence_strict`, promotes `[source, occurred_at]`

9.2 Raw overrides (advanced)

Explicit fields layer on top of (or replace) the preset. Document why in a YAML comment — mip lint flags raw overrides without one.

- name: long-form-essays
  priority: 20
  match: { all: [{ content_type: essay }] }
  action:
    bank: essays
    pipeline:
      version: 2
      preset: document
      # essay reranker tuned during 2026-Q1 retrieval audit, see PR #482
      rerank: { keyword_weight: 0.15 }
      reflect: { promote_metadata: [author, occurred_at, citation] }

9.3 Action vocabulary

Field	Type	Purpose
`pipeline.version`	int (required when block is set)	Persisted on every chunk; recall warns on drift
`pipeline.preset`	string	One of the names above; expanded at load time
`pipeline.chunker.strategy`	`sentence` \| `dialogue` \| `paragraph` \| `fixed`	Overrides extraction profile
`pipeline.chunker.max_size`	int	Max chars per chunk
`pipeline.chunker.overlap`	int	Char overlap between chunks
`pipeline.dedup.threshold`	float (0.0–1.0)	Cosine similarity to flag a chunk as duplicate
`pipeline.dedup.action`	`skip_chunk` \| `skip` \| `warn` \| `update`	What to do with detected dupes
`pipeline.rerank.keyword_weight`	float	Bonus per query-term match in reranker
`pipeline.rerank.proper_noun_weight`	float	Bonus per capitalized query word match
`pipeline.reflect.prompt`	`default` \| `temporal_aware` \| `evidence_strict`	Synthesis prompt variant
`pipeline.reflect.promote_metadata`	list[str], max 5	Metadata fields rendered inline in the memory block

9.4 Guardrails enforced at load time

P2 — pipeline.version is required when any pipeline field is set. Persisted onto each chunk so recall can warn on drift.
P4 — reflect.promote_metadata is hard-capped at 5 fields (raises ConfigError).
P5 — Pipeline blocks require content_type in the match block (raises ConfigError). Pipeline overrides without a content-type gate are almost always a mistake.

9.5 Per-bank reranker scope (P3)

Reranker weights resolve per bank, not per record. The first rule (in priority order) whose action.bank matches the recall bank wins. Templates like student-{id} match concrete student-42 for resolution purposes. If two rules write to the same bank with conflicting reranker weights, that’s a config error — mip lint surfaces it.

9.6 Version drift at recall

Each chunk carries _mip.pipeline_version. When the current rule’s version differs, recall logs a WARNING on the astrocyte.mip logger naming the bank, current version, and the drifted versions present in the result. Operators decide whether to re-index the bank or accept the drift. No hits are dropped.

9.7 CLI tooling

astrocyte mip lint mip.yaml         # validates schema, presets, P2/P4/P5 guardrails
astrocyte mip explain my-rule       # prints the resolved pipeline shape for a sample input

10. Further reading

Memory Intent Protocol — full architecture and rationale
Configuration reference — mip_config_path in astrocyte.yaml
Architecture — how MIP fits into the Astrocyte retain pipeline
Memory API reference — retain/recall/reflect/forget signatures