Explicitly document that these capabilities are typically provided by orchestration frameworks (like LangChain, LangGraph) rather than the Toolbox SDK itself, but that Toolbox tools are designed to leverage them.
2.9 KiB
title, type, weight, description
| title | type | weight | description |
|---|---|---|---|
| Pre and Post processing | docs | 1 | Pre and Post processing in GenAI applications. |
Pre and post processing allow developers to intercept and modify interactions between the agent and its tools or the user.
Note
: These capabilities are typically features of orchestration frameworks (like LangChain, LangGraph, or Agent Builder) rather than the Toolbox SDK itself. However, Toolbox tools are designed to fully leverage these framework capabilities to support robust, secure, and compliant agent architectures.
Types of Processing
Pre-processing
Pre-processing occurs before a tool is executed or an agent processes a message. Key types include:
- Input Sanitization & Redaction: Detecting and masking sensitive information (like PII) in user queries or tool arguments to prevent it from being logged or sent to unauthorized systems.
- Business Logic Validation: Verifying that the proposed action complies with business rules (e.g., ensuring a requested hotel stay does not exceed 14 days, or checking if a user has sufficient permission).
- Security Guardrails: Analyzing inputs for potential prompt injection attacks or malicious payloads.
Post-processing
Post-processing occurs after a tool has executed or the model has generated a response. Key types include:
- Response Enrichment: Injecting additional data into the tool output that wasn't part of the raw API response (e.g., calculating loyalty points earned based on the booking value).
- Output Formatting: Transforming raw data (like JSON or XML) into a more human-readable or model-friendly format to improve the agent's understanding.
- Compliance Auditing: Logging the final outcome of transactions, including the original request and the result, to a secure audit trail.
Processing Scopes
While processing logic can be applied at various levels (Agent, Model, Tool), this guide primarily focuses on Tool Level processing, which is most relevant for granular control over tool execution.
Tool Level (Primary Focus)
Wraps individual tool executions. This is best for logic specific to a single tool or a set of tools.
- Scope: Intercepts the raw inputs (arguments) to a tool and its outputs.
- Use Cases: Argument validation, output formatting, specific privacy rules for sensitive tools.
Comparison with Other Levels
It is helpful to understand how tool-level processing differs from other scopes:
- Model Level: Intercepts individual calls to the LLM (prompts and responses). Unlike tool-level, this applies globally to all text sent/received, making it better for global PII redaction or token tracking.
- Agent Level: Wraps the high-level execution loop (e.g., a "turn" in the conversation). Unlike tool-level, this envelopes the entire turn (user input to final response), making it suitable for session management or end-to-end auditing.