Files
AutoGPT/docs/integrations/block-integrations/video/edit_by_text.md
Nicholas Tindle 5797afd28b docs(blocks): improve video block descriptions and regenerate docs
Better "What it is" descriptions so the generated docs are
self-explanatory without a separate "What it does" section.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-07 16:45:20 -06:00

2.3 KiB

Video Edit By Text

This block edits a video by modifying its transcript — segments absent from the supplied transcript are cut from the output video, powered by the Replicate API.

Edit Video By Text

What it is

Edit a video by modifying its transcript — segments you remove from the transcript are cut from the output video

How it works

The block sends the input video and the desired transcript to the Replicate API using the jd7h/edit-video-by-editing-text model in "edit" mode. The model aligns the provided transcript against the original speech in the video and removes any video segments whose speech is not present in the supplied transcript. The split_at parameter controls alignment granularity: word (default) aligns cuts at word boundaries for natural-sounding edits, while character allows finer sub-word alignment for more precise control. The block returns the edited video (stored via the workspace file system) along with the transcript that was used.

Inputs

Input Description Type Required
video_in Input video file to edit (URL, data URI, or local path) str (file) Yes
transcription Modified transcript of the input video — segments absent from this text will be cut from the output video str Yes
split_at Alignment granularity for transcript matching: 'word' aligns cuts at word boundaries, 'character' allows finer sub-word alignment "word" | "character" No

Outputs

Output Description Type
error Error message if the operation failed str
video_out Edited video file (path or data URI) str (file)
transcription Transcription used for editing str

Possible use case

Interview Cleanup: Remove filler words, false starts, or off-topic tangents from recorded interviews by editing the transcript and regenerating the video.

Content Highlights: Extract key segments from long-form video content by keeping only the relevant portions of the transcript.

Automated Moderation: Remove flagged or inappropriate speech segments from user-generated video content by stripping those lines from the transcript.