- Add generate_block_docs.py script that introspects block code to
generate markdown
- Support manual content preservation via <!-- MANUAL: --> markers
- Add migrate_block_docs.py to preserve existing manual content from git
HEAD
- Add CI workflow (docs-block-sync.yml) to fail if docs drift from code
- Add Claude PR review workflow (docs-claude-review.yml) for doc changes
- Add manual LLM enhancement workflow (docs-enhance.yml)
- Add GitBook configuration (.gitbook.yaml, SUMMARY.md)
- Fix non-deterministic category ordering (categories is a set)
- Add comprehensive test suite (32 tests)
- Generate docs for 444 blocks with 66 preserved manual sections
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
<!-- Clearly explain the need for these changes: -->
### Changes 🏗️
<!-- Concisely describe all of the changes made in this pull request:
-->
### Checklist 📋
#### For code changes:
- [x] I have clearly listed my changes in the PR description
- [x] I have made a test plan
- [x] I have tested my changes according to the test plan:
<!-- Put your test plan here: -->
- [x] Extensively test code generation for the docs pages
<!-- CURSOR_SUMMARY -->
---
> [!NOTE]
> Introduces an automated documentation pipeline for blocks and
integrates it into CI.
>
> - Adds `scripts/generate_block_docs.py` (+ tests) to introspect blocks
and generate `docs/integrations/**`, preserving `<!-- MANUAL: -->`
sections
> - New CI workflows: **docs-block-sync** (fails if docs drift),
**docs-claude-review** (AI review for block/docs PRs), and
**docs-enhance** (optional LLM improvements)
> - Updates existing Claude workflows to use `CLAUDE_CODE_OAUTH_TOKEN`
instead of `ANTHROPIC_API_KEY`
> - Improves numerous block descriptions/typos and links across backend
blocks to standardize docs output
> - Commits initial generated docs including
`docs/integrations/README.md` and many provider/category pages
>
> <sup>Written by [Cursor
Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit
631e53e0f6. This will update automatically
on new commits. Configure
[here](https://cursor.com/dashboard?tab=bugbot).</sup>
<!-- /CURSOR_SUMMARY -->
---------
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
8.0 KiB
Exa Websets Search
Blocks for running and managing searches within Exa websets.
Exa Cancel Webset Search
What it is
Cancel a running webset search
How it works
This block stops a running search operation on a webset. Items already found before cancellation are retained in the webset.
Use this when a search is taking too long, returning unexpected results, or is no longer needed. The block returns the number of items found before cancellation.
Inputs
| Input | Description | Type | Required |
|---|---|---|---|
| webset_id | The ID or external ID of the Webset | str | Yes |
| search_id | The ID of the search to cancel | str | Yes |
Outputs
| Output | Description | Type |
|---|---|---|
| error | Error message if the operation failed | str |
| search_id | The ID of the canceled search | str |
| status | Status after cancellation | str |
| items_found_before_cancel | Number of items found before cancellation | int |
| success | Whether the cancellation was successful | str |
Possible use case
Resource Control: Stop searches that are taking longer than expected.
Query Refinement: Cancel searches to adjust query and restart with better parameters.
Partial Results: Stop searches early when you have enough items for your needs.
Exa Create Webset Search
What it is
Add a new search to an existing webset to find more items
How it works
This block adds a new search to an existing webset to expand its contents. You define the search query, target count, and how results should integrate with existing items (append, override, or merge).
Searches support scoped and exclusion sources, criteria validation, and relationship-based "hop" searches to find related entities.
Inputs
| Input | Description | Type | Required |
|---|---|---|---|
| webset_id | The ID or external ID of the Webset | str | Yes |
| query | Search query describing what to find | str | Yes |
| count | Number of items to find | int | No |
| entity_type | Type of entity to search for | "company" | "person" | "article" | "research_paper" | "custom" | "auto" | No |
| entity_description | Description for custom entity type | str | No |
| criteria | List of criteria that items must meet. If not provided, auto-detected from query. | List[str] | No |
| behavior | How new results interact with existing items | "override" | "append" | "merge" | No |
| recall | Enable recall estimation for expected results | bool | No |
| exclude_source_ids | IDs of imports/websets to exclude from results | List[str] | No |
| exclude_source_types | Types of sources to exclude ('import' or 'webset') | List[str] | No |
| scope_source_ids | IDs of imports/websets to limit search scope to | List[str] | No |
| scope_source_types | Types of scope sources ('import' or 'webset') | List[str] | No |
| scope_relationships | Relationship definitions for hop searches | List[str] | No |
| scope_relationship_limits | Limits on related entities to find | List[int] | No |
| metadata | Metadata to attach to the search | Dict[str, Any] | No |
| wait_for_completion | Wait for the search to complete before returning | bool | No |
| polling_timeout | Maximum time to wait for completion in seconds | int | No |
Outputs
| Output | Description | Type |
|---|---|---|
| error | Error message if the operation failed | str |
| search_id | The unique identifier for the created search | str |
| webset_id | The webset this search belongs to | str |
| status | Current status of the search | str |
| query | The search query | str |
| expected_results | Recall estimation of expected results | Dict[str, Any] |
| items_found | Number of items found (if wait_for_completion was True) | int |
| completion_time | Time taken to complete in seconds (if wait_for_completion was True) | float |
Possible use case
Webset Expansion: Add more items to existing websets with new or refined queries.
Multi-Criteria Collection: Run multiple searches with different criteria to build comprehensive datasets.
Iterative Building: Progressively expand websets based on analysis of initial results.
Exa Find Or Create Search
What it is
Find existing search by query or create new - prevents duplicate searches in workflows
How it works
This block implements idempotent search creation. If a search with the same query already exists in the webset, it returns that search. Otherwise, it creates a new one.
Use this pattern to prevent duplicate searches when workflows retry or run multiple times with the same parameters.
Inputs
| Input | Description | Type | Required |
|---|---|---|---|
| webset_id | The ID or external ID of the Webset | str | Yes |
| query | Search query to find or create | str | Yes |
| count | Number of items to find (only used if creating new search) | int | No |
| entity_type | Entity type (only used if creating) | "company" | "person" | "article" | "research_paper" | "custom" | "auto" | No |
| behavior | Search behavior (only used if creating) | "override" | "append" | "merge" | No |
Outputs
| Output | Description | Type |
|---|---|---|
| error | Error message if the operation failed | str |
| search_id | The search ID (existing or new) | str |
| webset_id | The webset ID | str |
| status | Current search status | str |
| query | The search query | str |
| was_created | True if search was newly created, False if already existed | bool |
| items_found | Number of items found (0 if still running) | int |
Possible use case
Retry-Safe Workflows: Safely handle workflow retries without creating duplicate searches.
Deduplication: Avoid running the same search multiple times when called from different workflow branches.
Efficient Operations: Skip search creation when results from identical queries already exist.
Exa Get Webset Search
What it is
Get the status and details of a webset search
How it works
This block retrieves detailed information about a webset search including its query, criteria, progress, and recall estimation.
Use this to monitor search progress, verify search configuration, or investigate search behavior when results don't match expectations.
Inputs
| Input | Description | Type | Required |
|---|---|---|---|
| webset_id | The ID or external ID of the Webset | str | Yes |
| search_id | The ID of the search to retrieve | str | Yes |
Outputs
| Output | Description | Type |
|---|---|---|
| error | Error message if the operation failed | str |
| search_id | The unique identifier for the search | str |
| status | Current status of the search | str |
| query | The search query | str |
| entity_type | Type of entity being searched | str |
| criteria | Criteria used for verification | List[Dict[str, Any]] |
| progress | Search progress information | Dict[str, Any] |
| recall | Recall estimation information | Dict[str, Any] |
| created_at | When the search was created | str |
| updated_at | When the search was last updated | str |
| canceled_at | When the search was canceled (if applicable) | str |
| canceled_reason | Reason for cancellation (if applicable) | str |
| metadata | Metadata attached to the search | Dict[str, Any] |
Possible use case
Progress Tracking: Monitor search completion and items found during long-running operations.
Configuration Review: Retrieve search details to verify criteria and settings are correct.
Debugging: Investigate search configuration when results don't match expectations.