mirror of https://github.com/Significant-Gravitas/AutoGPT.git synced 2026-04-08 03:00:28 -04:00


Nicholas Tindle 5797afd28b docs(blocks): improve video block descriptions and regenerate docs

Better "What it is" descriptions so the generated docs are
self-explanatory without a separate "What it does" section.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-03-07 16:45:20 -06:00

1.5 KiB


Video Transcribe

This block transcribes speech from a video file to text using the Replicate API.

Transcribe Video

What it is

Extract spoken words from a video and return them as a text transcription.

How it works

The block sends the input video to the Replicate API using the jd7h/edit-video-by-editing-text model in "transcribe" mode. This model analyzes the audio track of the video, performs speech recognition, and returns the detected speech as text. The block handles multiple API response formats (dictionary, list, string, and file output) to reliably extract the transcript text.
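The response handling described above can be sketched as a small normalizer. This is only an illustration, not the block's actual code: the function name `extract_transcript`, the dictionary key names it probes, and the "anything with a `read()` method" test for file output are all assumptions.

```python
from typing import Any


def extract_transcript(output: Any) -> str:
    """Normalize a model response into plain transcript text.

    Hypothetical helper: the key names and the file-like check below are
    assumptions made for illustration, not the block's real logic.
    """
    # String response: already the transcript.
    if isinstance(output, str):
        return output.strip()

    # Dictionary response: probe a few plausible (assumed) field names.
    if isinstance(output, dict):
        for key in ("transcription", "transcript", "text"):
            value = output.get(key)
            if value is not None:
                return extract_transcript(value)
        raise ValueError("no transcript field found in dictionary response")

    # List response: normalize each item and join the non-empty parts.
    if isinstance(output, (list, tuple)):
        parts = [extract_transcript(item) for item in output]
        return " ".join(part for part in parts if part)

    # File output: assumed here to be any object exposing read().
    read = getattr(output, "read", None)
    if callable(read):
        data = read()
        if isinstance(data, bytes):
            data = data.decode("utf-8", errors="replace")
        return str(data).strip()

    raise TypeError(f"unsupported response type: {type(output).__name__}")
```

Funnelling every shape through one function means the rest of a workflow only ever deals with a plain string, and an unexpected response fails loudly instead of yielding an empty transcript.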

Inputs

Input	Description	Type	Required
video_in	Input video file to transcribe (URL, data URI, or local path)	str (file)	Yes
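
Because `video_in` accepts three forms, a caller may want to tell them apart before passing a value in. The sketch below is a hypothetical helper (`classify_video_input` is not part of the block) and treats anything that is neither a data URI nor an http(s) URL as a local path, which is an assumption.

```python
from urllib.parse import urlparse


def classify_video_input(video_in: str) -> str:
    """Return "data_uri", "url", or "local_path" for a video_in value.

    Hypothetical helper for illustration; the block's own detection
    logic may differ.
    """
    # Data URIs always begin with the "data:" scheme.
    if video_in.startswith("data:"):
        return "data_uri"

    # Only http and https are treated as remote URLs in this sketch.
    scheme = urlparse(video_in).scheme.lower()
    if scheme in ("http", "https"):
        return "url"

    # Everything else (relative paths, absolute paths, drive letters)
    # is assumed to be a local file path.
    return "local_path"
```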

Outputs

Output	Description	Type
error	Error message if the operation failed	str
transcription	Text transcription extracted from the video	str

Possible use cases

Subtitle Generation: Transcribe video dialogue to create subtitle or caption files for accessibility and localization.

Searchable Video Archives: Convert speech in recorded meetings, interviews, or lectures into searchable text for indexing and retrieval.

LLM Content Pipeline: Feed video transcripts into language models for summarization, analysis, or content repurposing workflows.
