k j

Video Lens: Rapid Playback and Exploration of Large Video Collections and Associated Metadata

Justin Matejka, Tovi Grossman, George Fitzmaurice
January 2014 · Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology (UIST)

Abstract

We present Video Lens, a framework which allows users to visualize and interactively explore large collections of videos and associated metadata. The primary goal of the framework is to let users quickly find relevant sections within the videos and play them back in rapid succession. The individual UI elements are linked and highly interactive, supporting a faceted search paradigm and encouraging exploration of the data set. We demonstrate the capabilities and specific scenarios of Video Lens within the domain of professional baseball videos. A user study with 12 participants indicates that Video Lens efficiently supports a diverse range of powerful yet desirable video query tasks, while a series of interviews with professionals in the field demonstrates the framework's benefits and future potential.

Figures

Figure 1. Overview of the Video Lens interface consisting of a video player in the top left surrounded by the Multi-Attribute Grid, Single Attribute Controllers, and Video Timelines which support the
Figure 2. Two example Single-Attribute Controllers. On top, “Speed” is a continuous attribute, and on the bottom, “Type” is a discreet attribute.
Figure 3. A collection of SACs before and during a hover operation.
Figure 4. Example selections made in continuous and dis-creet Single-attribute Controllers.
Figure 5. Workflow of the search/highlight feature.
Figure 6. Elements within the Multi-Attribute Grid (MAG) area of the interface.
Figure 7. Examples of Multi-Attribute Grid with different combinations of attributes mapped to the dimensions.
Figure 8. Color and Size dimension appearances for each of the attribute variable types.
Figure 9. A section of the video timelines. The red dot high-lights the event which is currently being played.
Figure 10. Video Playback section of the interface.
Figure 11. The next video clip in a sequence is always being preloaded by a second background video player.
Figure 12. Object classes and relationships from the base-ball dataset.
Figure 13. Audio waveform from a 2:15 long section of a baseball game showing the spikes during pitch events.
Figure 14. Graph of fitness values for a single game over a range of possible metadata offset times, with the maximum fitness value and time highlighted.
Figure 15.  Frame 0.5 sec into clip for using the one second auto-advance mode for three pitches before and after the au-dio-based timing micro adjustments are applied. (Note the varied position of th
Figure 16. Watching the lowest pitches hit for a homerun.
Figure 17. Watching all strikeouts in a particular game.
Figure 18. Analyzing the pitches thrown by Jon Lester.
Figure 19. Completion times for each of the study tasks. Me-dian time shown with black bar.
Figure 20. Subjective results from participant survey.

BibTeX

@inproceedings{10.1145/2642918.2647366,
 abstract = {We present Video Lens, a framework which allows users to visualize and interactively explore large collections of videos and associated metadata. The primary goal of the framework is to let users quickly find relevant sections within the videos and play them back in rapid succession. The individual UI elements are linked and highly interactive, supporting a faceted search paradigm and encouraging exploration of the data set. We demonstrate the capabilities and specific scenarios of Video Lens within the domain of professional baseball videos. A user study with 12 participants indicates that Video Lens efficiently supports a diverse range of powerful yet desirable video query tasks, while a series of interviews with professionals in the field demonstrates the framework's benefits and future potential.},
 address = {New York, NY, USA},
 author = {Matejka, Justin and Grossman, Tovi and Fitzmaurice, George},
 booktitle = {Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology},
 doi = {10.1145/2642918.2647366},
 isbn = {9781450330695},
 keywords = {visualization, interface, video},
 location = {Honolulu, Hawaii, USA},
 numpages = {10},
 pages = {541–550},
 publisher = {Association for Computing Machinery},
 series = {UIST '14},
 title = {Video Lens: Rapid Playback and Exploration of Large Video Collections and Associated Metadata},
 url = {https://doi.org/10.1145/2642918.2647366},
 year = {2014}
}