OMwiki:Tech

(Difference between revisions)
(Known issues:: ffmpeg on Internet Archive fixed :-))
(updates)
Line 1: Line 1:
__NOTOC__
=='''Known issues:'''==
-
(Listed in order of severity.)
 
*Find a way to add <code>style="width:352px;"</code> to the embed code for 352×240 streams, and fix the aspect ratio of 352×240 thumbnails in category view.
Line 14: Line 13:
-
*[http://cinelerra.org/ Cinelerra-CV] is buggy (compilation from source helps). Document rendering options on other video editors (e.g., [http://www.xiph.org/quicktime/ XiphQT]) and help develop [http://lumiera.org Lumiera].  
+
=='''Scripts needed:'''==
-
 
+
*Retool [http://svn.wikimedia.org/viewvc/mediawiki/trunk/extensions/MetavidWiki/maintenance/download_from_archive_org.php?view=markup download_from_archive_org.php] to work with different source videos and meta tags from the [http://archive.org Internet Archive], using [[OMwiki:Tech/OMwiki-import.php|this wikipage]] as a working draft.
-
*Fix severe problems with firewire_ohci/firewire_core talking to DV controller chips.  [https://bugzilla.redhat.com/show_bug.cgi?id=449252 Example ticket]
+
*Draft a script to import MetaVidWiki clips from apps that can produce [[wikipedia:SubRip|.srt]] or [[wikipedia:Continuous_Media_Markup_Language|.cmml]] captions (''e.g.,'' [http://gnome-subtitles.sourceforge.net/ Gnome Subtitles])
 +
*Rework [http://modcopy.sourceforge.net/ modcopy] to avoid mmap calls on JFFS2 filesystems (''e.g.'', the OLPC-XO).  [http://sourceforge.net/apps/trac/modcopy/ticket/9 Ticket]
<br>
<br>
-
=='''Scripts needed:'''==
+
 
-
*'''Retool [http://svn.wikimedia.org/viewvc/mediawiki/trunk/extensions/MetavidWiki/maintenance/download_from_archive_org.php?view=markup download_from_archive_org.php] to work with different source videos and meta tags from the [http://archive.org Internet Archive], using [[OMwiki:Tech/OMwiki-import.php|this wikipage]] as a working draft.'''
+
==''Things that make life difficult:''==
-
*Draft a script to import MetaVidWiki clips from apps that can produce [[wikipedia:SubRip|.srt]] or [[wikipedia:Continuous_Media_Markup_Language|.cmml]] captions (''e.g.,'' [http://gnome-subtitles.sourceforge.net/ Gnome Subtitles])
+
 
-
*Rework [http://modcopy.sourceforge.net/ modcopy] to avoid mmap calls on JFFS2 filesystems (''e.g.'', the OLPC-XO).  [http://sourceforge.net/apps/trac/modcopy/ticket/9 Ticket]
+
*ffmpeg/mplayer seem to handle duplicate Theora frames poorly, breaking A/V sync and thus limiting video reuse.
 +
 
 +
*[http://cinelerra.org/ Cinelerra-CV] is buggy and somewhat dated. Document rendering options on other video editors (''e.g.'', [http://www.xiph.org/quicktime/ XiphQT]) and help develop [http://lumiera.org Lumiera].  
 +
 
 +
*Fix or forgo non-compliant DV hardware.  [https://bugzilla.redhat.com/show_bug.cgi?id=449252 Example ticket]
 +
 
 +
*USB headsets crash spontaneously (error message <code>usb_set_interface failed</code>, at least in Debian Squeeze).
<br>
<br>
-
==New Features==
+
 
-
*[http://lists.wikimedia.org/pipermail/metavid-l/2009-August/000055.html Email thread on MetaVid-l]
+
 
 +
==New features==
 +
*[http://lists.wikimedia.org/pipermail/metavid-l/2009-August/000055.html Ideas on MetaVid-l]
*Add a Vorbis-only option on video streams for low-bandwidth connections.
-
*Turn on [http://www.mediawiki.org/wiki/Extension:OpenID OpenID sign-ins.]  Need to figure out how to link non-PEAR libraries to non-re-compilable PHP.
+
*Turn on [http://www.mediawiki.org/wiki/Extension:OpenID OpenID logins.]  Need to figure out how to link non-PEAR libraries to non-recompilable PHP.
-
*Replace animated GIFs with low-FPS, enlargeable, jump-to-timecode Theora.  Alternatively, use [http://pad.ma/ Pad.ma]-style mouse-overs (see [https://wiki.pad.ma/browser/padma.dev/padma/static/javascript/info.js?rev=padma.dev%2C354&order=size&desc=1 <nowiki>$(imagePoster).load(function() {...})</nowiki>]).  The thumbnails should cycle through the meeting in large time chunks when non-mouseover'd, and then display clickable frames according to how far the mouse is from the left edge when mouseover'd.
+
*Replace animated GIFs with low-FPS, enlargeable, jump-to-timecode Theora.  Alternatively, use [http://pad.ma/ Pad.ma]-style mouse-overs (see [https://wiki.pad.ma/browser/padma.dev/padma/static/javascript/info.js?rev=padma.dev%2C354&order=size&desc=1 <nowiki>$(imagePoster).load(function() {...})</nowiki>]).  The thumbnails should cycle through the meeting in large time chunks when non-mouseover'd, and then display clickable frames according to how far the mouse is from the left edge when mouseover'd.  Include the ability to pause all animations and step through them in lock-step upon mouse clicks.
*Add YUV4MPEG support in [http://www.blender.org/ Blender] to enable direct output to [http://v2v.cc/~j/ffmpeg2theora/ ffmpeg2theora] ([http://lists.mplayerhq.hu/pipermail/libav-user/2009-March/002639.html email thread]), building upon [[File:Henningsen_7oct2009_blender-svn.tar.bz2|Stephan Henningsen's lead.]]  Also build in yuv4ogg support as per [http://web.mit.edu/xiphmont/Public/theora/demo8.html xiphmont's lead.]
Line 44: Line 52:
*Identify a way to output time ranges from multi-speaker audio files, with each time range corresponding to how long a person spoke ([http://cmusphinx.sourceforge.net/ CMU Sphinx?]).  Hopefully add speaker identification based on the vocal profile, and maybe even facial recognition and emotional categorization (''e.g.'', laughter).
-
*Figure out a content delivery network and target areas.
+
*Figure out a content delivery network and targeted areas.
 +
 
====Dreamlist====
*Build open-firmware cameras that record directly to Ogg (2-pass Theora plus Vorbis); also include simplified on-camera editing.  Elphel has completely-open [http://www3.elphel.com/353_turnkey network cameras.]
-
*Have those same cameras set to blur-out individuals broadcasting a 'do not record my voice, person, or both' flag.  Via iPhones, RFID, or colored sticker as mentioned in Yochai Benkler's keynote at OVC 2009.
+
*Accelerate Theora encoding in hardware (TI C64x+ DSP or an Elphel card connected ''via'' PCI?).
 +
*Have those same cameras blur out individuals broadcasting a 'do not record my voice, person, or both' flag ''via'' iPhones, RFID, or colored stickers, as mentioned in Yochai Benkler's keynote at OVC 2009.
*Have individuals appear as virtual avatars in-video (''i.e.'', Second Life-esque), according to user preference.

Revision as of 01:49, 25 November 2009

Known issues:

  • Find a way to add style="width:352px;" to the embed code for 352×240 streams, and fix the aspect ratio of 352×240 thumbnails in category view.
  • Video playback might take ~6 seconds to start for timecodes towards the end of the meeting (pending seeking support in oggz-chop). Also, Google Chrome won't start playback if oggz-chop is used.
  • Speaker labels work for annotations but not transcripts. Also need to figure out AJAX auto-complete magic.
  • Search works but has a few issues: punctuation is stripped, 'play inline' plays audio even though the video is paused, and search-by-date is not yet enabled for all videos.
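
For the first item above, one possible approach is to post-process the generated embed code. The following is only a minimal sketch in Python, assuming the embed code is an HTML snippet whose <video> tag carries width="352" and height="240" attributes; the function name add_fixed_width and the sample snippet are hypothetical and are not taken from the actual MetaVidWiki output.

```python
import re

def add_fixed_width(embed_html, width=352, height=240):
    """Add style="width:<width>px;" to <video> tags of the given dimensions.

    Hypothetical helper: assumes the embed code uses a <video> tag with
    quoted width/height attributes, which may not match the real output.
    """
    def patch(match):
        tag = match.group(0)
        # Only touch tags that declare the target dimensions.
        if not re.search(r'\bwidth="%d"' % width, tag):
            return tag
        if not re.search(r'\bheight="%d"' % height, tag):
            return tag
        # Leave tags that already carry a style attribute untouched.
        if re.search(r'\bstyle\s*=', tag):
            return tag
        # Preserve a self-closing "/>" ending if present.
        end = '/>' if tag.endswith('/>') else '>'
        body = tag[:-len(end)].rstrip()
        return '%s style="width:%dpx;"%s' % (body, width, end)

    # Match opening <video ...> tags only (closing </video> is not matched).
    return re.sub(r'<video\b[^>]*>', patch, embed_html)

if __name__ == '__main__':
    sample = '<video width="352" height="240" src="a.ogv"></video>'
    print(add_fixed_width(sample))
```

Streams of other sizes pass through unchanged, so the same filter could in principle be applied to every embed snippet.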


Scripts needed:




Things that make life difficult:

  • ffmpeg/mplayer seem to handle duplicate Theora frames poorly, breaking A/V sync and thus limiting video reuse.
  • Cinelerra-CV is buggy and somewhat dated. Document rendering options on other video editors (e.g., XiphQT) and help develop Lumiera.
  • USB headsets crash spontaneously (error message usb_set_interface failed, at least in Debian Squeeze).




New features

  • Add a Vorbis-only option on video streams for low-bandwidth connections.
  • Turn on OpenID logins. Need to figure out how to link non-PEAR libraries to non-recompilable PHP.
  • Replace animated GIFs with low-FPS, enlargeable, jump-to-timecode Theora. Alternatively, use Pad.ma-style mouse-overs (see $(imagePoster).load(function() {...})). The thumbnails should cycle through the meeting in large time chunks when non-mouseover'd, and then display clickable frames according to how far the mouse is from the left edge when mouseover'd. Include the ability to pause all animations and step through them in lock-step upon mouse clicks.
  • Squash bugs (esp., video non-playback) in mwEmbed's libSequencer.
  • Identify a way to output time ranges from multi-speaker audio files, with each time range corresponding to how long a person spoke (CMU Sphinx?). Hopefully add speaker identification based on the vocal profile, and maybe even facial recognition and emotional categorization (e.g., laughter).
  • Figure out a content delivery network and targeted areas.
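
The mouse-over thumbnail behaviour described above comes down to a small mapping from mouse position to frame. The sketch below, in Python for illustration only, assumes each meeting has a fixed number of pre-rendered preview frames covering equal time chunks; the function names and parameters are hypothetical, and a real implementation would live in the player's JavaScript.

```python
def frame_for_mouse(mouse_x, thumb_width, frame_count):
    """Pick the preview frame for a mouse offset from the thumbnail's left edge."""
    if thumb_width <= 0 or frame_count <= 0:
        raise ValueError("thumb_width and frame_count must be positive")
    # Clamp so positions outside the thumbnail map to the first or last frame.
    x = min(max(mouse_x, 0), thumb_width - 1)
    return int(x * frame_count // thumb_width)

def timecode_for_frame(index, frame_count, duration_s):
    """Start time (seconds) of the chunk a frame represents; used on click."""
    return duration_s * index / frame_count

def idle_frame(tick, frame_count):
    """Frame to show when not moused over: cycle through all chunks in order."""
    return tick % frame_count

if __name__ == '__main__':
    # A 352-pixel-wide thumbnail with 8 preview frames of a one-hour meeting.
    frame = frame_for_mouse(176, 352, 8)
    print(frame, timecode_for_frame(frame, 8, 3600))
```

Driving every thumbnail's idle_frame from one shared tick counter would keep the animations in lock-step, so pausing or stepping that counter on a mouse click would pause or step all of them together.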


Dreamlist

  • Build open-firmware cameras that record directly to Ogg (2-pass Theora plus Vorbis); also include simplified on-camera editing. Elphel has completely-open network cameras.
  • Accelerate Theora encoding in hardware (TI C64x+ DSP or an Elphel card connected via PCI?).
  • Have those same cameras blur out individuals broadcasting a 'do not record my voice, person, or both' flag via iPhones, RFID, or colored stickers, as mentioned in Yochai Benkler's keynote at OVC 2009.
  • Have individuals appear as virtual avatars in-video (i.e., Second Life-esque), according to user preference.


Communicate:



Non-technical development

All videos and text are published under the CC-BY 3.0 U.S. or CC-BY-SA 3.0 copyright licenses. Details.