mirror of https://github.com/Significant-Gravitas/AutoGPT.git synced 2026-04-08 03:00:28 -04:00

Files

Nicholas Tindle 5797afd28b docs(blocks): improve video block descriptions and regenerate docs

Better "What it is" descriptions so the generated docs are
self-explanatory without a separate "What it does" section.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-03-07 16:45:20 -06:00

2.3 KiB

Raw Blame History

Video Edit By Text

This block edits a video by modifying its transcript — segments absent from the supplied transcript are cut from the output video, powered by the Replicate API.

Edit Video By Text

What it is

Edit a video by modifying its transcript — segments you remove from the transcript are cut from the output video

How it works

The block sends the input video and the desired transcript to the Replicate API using the jd7h/edit-video-by-editing-text model in "edit" mode. The model aligns the provided transcript against the original speech in the video and removes any video segments whose speech is not present in the supplied transcript. The split_at parameter controls alignment granularity: word (default) aligns cuts at word boundaries for natural-sounding edits, while character allows finer sub-word alignment for more precise control. The block returns the edited video (stored via the workspace file system) along with the transcript that was used.

Inputs

Input	Description	Type	Required
video_in	Input video file to edit (URL, data URI, or local path)	str (file)	Yes
transcription	Modified transcript of the input video — segments absent from this text will be cut from the output video	str	Yes
split_at	Alignment granularity for transcript matching: 'word' aligns cuts at word boundaries, 'character' allows finer sub-word alignment	"word" \| "character"	No

Outputs

Output	Description	Type
error	Error message if the operation failed	str
video_out	Edited video file (path or data URI)	str (file)
transcription	Transcription used for editing	str

Possible use case

Interview Cleanup: Remove filler words, false starts, or off-topic tangents from recorded interviews by editing the transcript and regenerating the video.

Content Highlights: Extract key segments from long-form video content by keeping only the relevant portions of the transcript.

Automated Moderation: Remove flagged or inappropriate speech segments from user-generated video content by stripping those lines from the transcript.

2.3 KiB Raw Blame History