Skills and Meta-Tools | Agentic Engineering

Skills blur the boundary between tools and prompts. They're not actions an agent can take, but temporary behavioral modifications that change how the agent reasons about specialized domains.

Skills as Meta-Tools

[2025-12-09]: Skills blur the boundary between tools and prompts. They're not actions an agent can take, but temporary behavioral modifications that change how the agent reasons about specialized domains.

The Meta-Tool Pattern: The "Skill" tool (capital S) acts as a dispatcher to individual skills. When invoked, a skill modifies two contexts simultaneously:

Reasoning context: Injects temporary behavioral instructions
Execution context: Applies tool permission overrides

This creates specialized agent modes without forking the entire agent configuration.

Three Key Differences from Traditional Tools:

Aspect	Traditional Tools	Skills
Mechanism	Direct actions (read file, call API)	Injected instructions + permission changes
Selection	Algorithmic matching (name/parameters)	LLM reasoning about descriptions
Token Cost	~100 tokens per invocation	~1,500+ tokens per invocation
Purpose	Execute specific operations	Specialize agent behavior

The Trade: Skills trade token overhead for contextual specialization. Instead of front-loading every possible skill instruction into the base prompt, skills use progressive disclosure—the agent discovers capabilities through metadata, then loads full instructions only when needed.

Generalization Beyond Claude: The pattern isn't Claude-specific. Any agentic system can implement progressive disclosure:

Discovery layer: Short metadata about available capabilities
Activation layer: Full instructions loaded on-demand
Resource layer: Detailed references and examples (optional)

This balances discoverability (agent knows what's possible) with context efficiency (doesn't pay token cost until invoked).

Example Flow:

1. Agent sees skill metadata: "Python debugging skill available"
2. Agent encounters Python bug, invokes Skill tool with "python-debugging"
3. System injects debugging instructions + enables relevant tools (Bash, Read)
4. Agent now reasons with debugging strategies in context
5. After task completion, injected instructions expire

Why This Matters: Skills represent a third category beyond "tools" and "prompts":

Tools: What the agent can do (read files, make API calls)
Prompts: How the agent thinks (general reasoning patterns)
Skills: Domain-specific thinking modes (temporary reasoning specialization)

In Practice: Claude Code implements this with a skills library—data analysis, git operations, debugging, etc. Each skill packages domain expertise as injectable context rather than requiring separate specialized agents.

See Also:

Context: Progressive Disclosure — Managing context window through layered information
Claude Code: Skills System — Concrete implementation in production
Prompt — Model-invoked vs user-invoked patterns for skill activation

Sources: Claude Code Skills Docs, Skills Deep Dive, Skills Explained

Context Contracts for Agent Capability Declaration

[2026-02-05]: Advanced .claude/ implementations use declarative JSON schemas to declare agent input requirements and output scopes. This enables pre-spawn validation and scope enforcement, extending the progressive disclosure pattern from runtime to orchestration design.

The Pattern

Agents declare context contracts in structured metadata:

{
  "inputs": {
    "spec_file": { "type": "path", "required": true },
    "expertise": { "type": "path", "required": false },
    "memory": { "type": "context", "required": false }
  },
  "outputs": {
    "allowed_modifications": ["src/**/*.ts", "docs/**/*.md"],
    "forbidden_patterns": ["**/node_modules/**", "**/.git/**"]
  }
}

Three Validation Gates

Pre-Spawn Validation:

Verify required context (spec_file, expertise) exists before spawning agent
Prevents agent failures from missing inputs
Reduces wasted tokens on doomed agent invocations

Scope Enforcement:

Hooks validate file modifications against allowed_modifications globs
Block writes outside declared scope
Creates machine-readable capability boundaries

Registry Generation:

Auto-generate agent capability catalog from contracts
Enables programmatic agent discovery ("which agents can modify TypeScript files?")
Powers routing logic without manual maintenance

When to Use Context Contracts

Good fit:

Multi-agent systems with 10+ specialized agents
Production environments requiring audit trails
Systems where agents coordinate through orchestrator routing
Domains with strict scope boundaries (microservices, security zones)

Poor fit:

Simple single-agent workflows
Exploratory development without clear boundaries
Systems where flexibility > enforcement

Connections

To Skills: Context contracts extend progressive disclosure from runtime (Skills load on-demand) to orchestration design (agents declare requirements)
To Tool Restrictions: Contracts provide schema-based validation where tool restrictions provide runtime enforcement
To Self-Improving Experts: Registry generation from contracts enables discovery layer for expert systems

Sources: Advanced external .claude/ implementation patterns documented in KotaDB scouting analysis.

Leading Questions

When should you use a skill vs. a traditional tool?
How do you decide what to put in a skill's discovery metadata vs. activation payload?
Can skills invoke other skills? Should they?
How do you measure whether a skill is worth the token overhead?
What domains benefit most from the skill pattern?
How do you version skills in production?
What happens when multiple skills are active simultaneously?

Connections

To Context: Progressive disclosure pattern
To Prompt: Skills as prompt injection mechanism
To Tool Selection: Skills affect tool availability dynamically
To Scaling Tools: Skills as a scaling pattern for specialized capabilities
To Claude Code: Production implementation
To Cost and Latency: Token cost comparison across feature types