94
edits
(→IMAGE) |
(→Tasks) |
||
| Line 191: | Line 191: | ||
====Tasks==== | ====Tasks==== | ||
=====identify_audio_transitions===== | =====identify_audio_transitions===== | ||
Identify moments of distinct changes in audio content (e.g. speaker changes). | Identify moments of distinct changes in audio content (e.g. speaker changes). | ||
| Line 283: | Line 269: | ||
- '''ocr_results''':''array'' // list of [start HH:MM:SS, end HH:MM:SS, [x, y], string]] tuples | - '''ocr_results''':''array'' // list of [start HH:MM:SS, end HH:MM:SS, [x, y], string]] tuples | ||
=====transcribe===== | |||
Attempt to create a timestamped transcript for the video. The transcript will either be ripped from CC data or estimated using speech to text algorithms. | |||
Powered by [???] | |||
<u>'''Inputs'''</u> | |||
None | |||
<u>'''Outputs'''</u> | |||
- '''transcript''':''array'' // list of [HH:MM:SS, transcript] tuples | |||
- '''transcription_method''':''enum('cc','stt')'' // method used to generate the transcript | |||
===Audio=== | ===Audio=== | ||
edits