Drumbeat/MoJo/hackfest/berlin/projects/MetaProject
Revision as of 13:20, 26 September 2011
The Meta Project is a tool that provides a simple service: take in any piece of media and return all the metadata that can be extracted from it.
Meta Standards Resources
(Add links and summaries to documents discussing metadata)
Known APIs and Tools
(Add links and summaries of toolkits and APIs which can help generate data!)
Desired Functionality
TEXT MEDIA
Valid Inputs: URL, Plain Text, HTML
Optional Inputs: Known Metadata
Returned Metadata:
- Primary Themes (Document-wide)
- Primary Themes (Per-paragraph)
- Suggested Tags
- Entities (Names, Locations) and their locations in text
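The "entities and their locations in text" item could be represented as character offsets plus a paragraph index. The following is only a minimal sketch under stated assumptions: `EntityMention` and `locate_entities` are hypothetical names, paragraphs are assumed to be separated by blank lines, and exact string matching against a caller-supplied list stands in for whatever entity recognizer the tool would actually use.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class EntityMention:
    """One occurrence of an entity (hypothetical result shape)."""
    name: str
    paragraph: int  # 0-based index of the paragraph containing the mention
    start: int      # offset of the first character within the whole text
    end: int        # offset one past the last character


def locate_entities(text: str, known_entities: list[str]) -> list[EntityMention]:
    """Find every exact occurrence of each known entity name in `text`.

    Assumption: paragraphs are separated by a blank line ("\n\n").
    Exact matching is a placeholder for a real entity recognizer.
    """
    mentions = []
    offset = 0  # offset of the current paragraph within the whole text
    for index, paragraph in enumerate(text.split("\n\n")):
        for name in known_entities:
            pos = paragraph.find(name)
            while pos != -1:
                mentions.append(
                    EntityMention(name, index, offset + pos, offset + pos + len(name))
                )
                pos = paragraph.find(name, pos + 1)
        offset += len(paragraph) + 2  # skip the "\n\n" separator
    return sorted(mentions, key=lambda m: m.start)
```

Offsets are relative to the whole document, so a caller can recover each mention with `text[m.start:m.end]` regardless of which paragraph it came from.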
VIDEO MEDIA
Valid Inputs: URL, Video (format?)
Optional Inputs: Transcript, Faces, Known Metadata
Returned Metadata:
- Transcript
- Moments of audio transition (new speaker)
- Moments of video transition (new scene)
- OCR data (any text that appears on image) and their timestamps
- Entities (Names, Locations) and their timestamps
- Suggested Tags
- Face identification and their timestamp ranges [only done if faces are provided]
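One way to produce "timestamp ranges" for face identification is to collapse individual detection times into contiguous ranges. This is a sketch, not a specified design: the function names, the `max_gap` parameter, and the shape of `detections` (a label mapped to the seconds at which that face was seen) are all assumptions made for illustration.

```python
def merge_into_ranges(timestamps, max_gap=1.0):
    """Collapse detection times (seconds) into (start, end) ranges.

    Two detections belong to the same range when they are at most
    `max_gap` seconds apart (a hypothetical tuning parameter).
    """
    ranges = []
    for t in sorted(timestamps):
        if ranges and t - ranges[-1][1] <= max_gap:
            # Close enough to the previous detection: extend that range.
            ranges[-1] = (ranges[-1][0], t)
        else:
            # Too far from the previous detection: start a new range.
            ranges.append((t, t))
    return ranges


def face_ranges(detections, faces=None, max_gap=1.0):
    """Return per-person timestamp ranges, or {} when no faces were supplied.

    `detections` maps a person label to the times that face was detected.
    Only labels present in the optional `faces` input are reported,
    mirroring "only done if faces are provided".
    """
    if not faces:
        return {}
    return {
        name: merge_into_ranges(times, max_gap)
        for name, times in detections.items()
        if name in faces
    }
```

For example, detections at 0.0, 0.5, 1.0, 5.0 and 5.5 seconds with the default gap yield two ranges, `(0.0, 1.0)` and `(5.0, 5.5)`.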
AUDIO MEDIA
Valid Inputs: URL, Audio (mp3, wav)
Optional Inputs: Transcript, Voice Samples, Known Metadata
Returned Metadata:
- Transcript
- Moments of audio transition (new speaker)
- Entities (Names, Locations) and their timestamps
- Suggested Tags
- Voice identification and their timestamp ranges [only done if voice samples are provided]
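The "moments of audio transition (new speaker)" could be derived from speaker-labelled segments, assuming some upstream step has already split the audio by speaker. That upstream step is not described on this page; `Segment` and `speaker_transitions` below are hypothetical names used only to illustrate the idea.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    """A stretch of audio attributed to one speaker (hypothetical shape)."""
    start: float  # seconds
    end: float    # seconds
    speaker: str


def speaker_transitions(segments):
    """Return the start times at which the speaker differs from the previous segment."""
    ordered = sorted(segments, key=lambda s: s.start)
    transitions = []
    for previous, current in zip(ordered, ordered[1:]):
        if current.speaker != previous.speaker:
            transitions.append(current.start)
    return transitions
```

Consecutive segments from the same speaker produce no transition, so a long monologue split into several segments is reported as a single uninterrupted turn.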
IMAGE MEDIA
Valid Inputs: URL, Image (jpg, gif, bmp, png)
Optional Inputs: Faces, Known Metadata
Returned Metadata:
- OCR data and its coordinate location
- Object identification
- Face identification [only done if faces are provided]
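The input rules for images can be sketched as a small validation step that checks the extension and decides which metadata fields to compute. Everything here is an illustrative assumption: the function name, the field identifiers, and the choice to infer the format from the file extension of the path or URL rather than from the file contents.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Formats listed under "Valid Inputs" for image media.
IMAGE_EXTENSIONS = {"jpg", "gif", "bmp", "png"}


def plan_image_metadata(source, faces=None):
    """Return the metadata fields to compute for an image path or URL.

    Raises ValueError when the extension is not a listed image format.
    Face identification is planned only when `faces` is supplied.
    """
    # urlparse drops any query string; plain file names pass through unchanged.
    path = urlparse(source).path
    extension = PurePosixPath(path).suffix.lower().lstrip(".")
    if extension not in IMAGE_EXTENSIONS:
        raise ValueError(f"unsupported image format: {extension!r}")

    fields = ["ocr_with_coordinates", "object_identification"]
    if faces:
        fields.append("face_identification")
    return fields
```

Returning a plan instead of running the analysis keeps the optional-input rule ("only done if faces are provided") testable without any image-processing dependency.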