The Skill That Onboards Other Skills

The Problem

In a modern AI project, skills are markdown files with frontmatter and instructions. Three skills — elegant. Thirty — and strange things start happening: the assistant writes in one style today, a different style tomorrow. A context file disappears. A command fires, but not quite right. No stack trace, no exception, no log line — just behavior that changed, and no idea when or why.

Skills don't formally collide at the type level — they collide in behavior, hours or days after they were installed.

What happens:

Two skills claim the same territory. Both write LinkedIn posts — the assistant picks one non-deterministically. Output style drifts.
Two skills write to the same file. Last-write-wins gradually destroys shared context.
A new skill quietly walks into a neighbor's domain. You find out when downstream stops receiving the data it expected.
A skill freely calls four others. One change in a neighbor propagates through the chain, breaks determinism, and drives five times the token cost.
Two "finalizers" both claim the last word. The artifact oscillates between their styles.

These are not bugs in any individual skill — each one can be well-covered by evals and work perfectly in isolation. The conflict appears where a skill meets its neighbors. Before that meeting, it's invisible.

How It Works

The answer is a skill itself. Type /onboard fb-post-writer and onboard runs an 8-step workflow: reads the new skill's SKILL.md, fires the detectors, runs a sandbox session, collects findings, shows them to you, and — only after confirmation — writes the card to .onboarding-registry/. Nothing to do manually.

The model is borrowed from HR: a new skill is not npm install — it's a hire. Hiring has a stable vocabulary that maps directly:

Hiring	Skill onboarding
Résumé	`SKILL.md` + frontmatter
Interview	Sandbox run on probe prompts
Security clearance	L1–L4 tier
Probation period	First N invocations under heightened scrutiny
Reporting line	Declared domain finalizer
Exit interview	Archive record with reason for removal

Eight steps, and the critical one is five:

Parse SKILL.md — reads the frontmatter and body: trigger, reads, writes, calls, finalizer.
Conflict detection — six parallel detectors: domain usurpation, tool collision, trigger overlap, output-path collision, spaghetti interactions, authority conflict.
Pre-clearance — preliminary L1–L4 classification based on declared permissions.
Sandbox setup — creates an isolated directory, spins up a sub-agent observer.
Sandbox interview — runs the skill on 2–3 synthetic probe prompts; the observer records facts.
Observed vs declared — compares declared behavior to observed behavior, recalculates clearance.
Findings report — aggregates all findings with severity (block / warn / info) and presents them to you.
Registry card — only after your confirmation, writes the card to .onboarding-registry/.

The critical one is five, sandbox interview. The skill runs on 2–3 synthetic probe prompts in an isolated directory under a separate sub-agent that observes and records facts only. Declared behavior is compared to observed behavior:

Declared	Observed	Finding
`reads: [A, B]`	Reads A, B, C	Undeclared read
`writes: [outputs/X/]`	Writes to `state/Y`	Clearance escalated to L3
`calls: [skill-A]`	Calls A and B	Undeclared dependency

Final clearance is assigned based on observed behavior, not declared. A skill claiming L2 in its description that quietly edits shared state gets reclassified to L3 — because the harness will give it whatever settings.json allows regardless.

Six conflict detectors run in parallel: domain usurpation, tool collision, trigger overlap, output-path collision, spaghetti interactions, authority conflict. Severity has exactly three levels — block / warn / info: more granularity doesn't calibrate, it becomes vibes.

Core design decision: advisory, not enforcement. The registry writes a card, recommends patches to the description or to settings.json, but blocks nothing at runtime. Hard-blocking a skill from a vendor package turns the registry into a hostile gatekeeper, and a misclassified quarantine costs more than an ignored warning. Escalation to hard enforcement is a separate explicit step — the user decides.

The registry (.onboarding-registry/) lives in the project root, not under .claude/. Skills come from sources the user doesn't fully control; the registry is the project's view of them, versioned independently of the harness.

Artifact

github.com/dobryakov/ai-agent-onboarding — the onboard skill, registry schema, six detectors, L1–L4 rubric. Just published — no traction yet. The value is in the mechanics, not the star count.

Series signature

Where It Breaks

Advisory doesn't block. The team can ignore the report and get the same conflict anyway. There's no defense against indifference — a deliberate trade for reversibility.
Detectors produce false positives on legitimate domain overlaps (e.g., two writer skills targeting different platforms). policies.md fixes this, but it's manual work.
Lazy intake is a compromise. Skills installed before onboard existed enter the registry as lazy_intake: true — incomplete data, no sandbox interview. Without explicit re-onboarding they stay half-blind.
Sandbox ≠ production. A skill requiring live MCP connectivity to a private service degrades to a "paper interview" — behavior predicted from close reading. Honest fallback, but findings carry lower confidence.
L4 is a formality without a reviewer. The "modifies harness" tier escalates severity, but without a named owner for the changes, the escalation turns to noise.

Who It's For and Why

Risk: the team spins up an agent stack, drops in third-party skill packs (vendor, shared library, personal toolbox) — and two months later spends hours debugging non-deterministic behavior that can't be reproduced by a unit test.

Solution: a skill enters the project the same way a person is hired — with a résumé, interview, probation period, and a record on file. Discipline of introduction, not prohibition.

Metric: conflict found before the skill merges into daily flow — not a week later when the regression has already spread through downstream.

Direct link to #7 Cursor Rules as Governance: that howto governs the code AI writes; this one governs the assistant itself when a new behavioral component is introduced. For a CTO scaling vibe-coding across a team, it's the next layer of the same solution.

Want to build governance for a growing agent stack?

Past ten skills, a system without onboarding discipline starts behaving unpredictably. I help build processes that keep behavioral control as the ecosystem scales.

Email me

Other breakdowns

Engineering breakdown series: real task → methodology → working artifact → honest breakdown of where it fails.

To the series →