Abstract | ||
---|---|---|
This paper presents a middle-level video representation named video primal sketch (VPS), which integrates two regimes of models: (i) a sparse coding model that uses static or moving primitives to explicitly represent moving corners, lines, feature points, etc., and (ii) a FRAME/MRF model that reproduces feature statistics extracted from the input video to implicitly represent textured motion, such as water and fire. The feature statistics include histograms of spatio-temporal filter responses and velocity distributions. This paper makes three contributions to the literature: (i) learning a dictionary of video primitives using parametric generative models; (ii) proposing the spatio-temporal FRAME and motion-appearance FRAME models for modeling and synthesizing textured motion; and (iii) developing a parsimonious hybrid model for generic video representation. Given an input video, VPS automatically selects the proper model for each motion pattern and is compatible with high-level action representations. In the experiments, we synthesize a number of textured motions; reconstruct real videos using the VPS; report a series of human perception experiments to verify the quality of the reconstructed videos; demonstrate how the VPS changes over the scale transition in videos; and present the close connection between VPS and high-level action models. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1007/s10851-015-0563-2 | Journal of Mathematical Imaging and Vision |
Keywords | Field | DocType |
Middle-level vision,Video representation,Textured motion,Dynamic texture synthesis,Primal sketch | Computer vision,Histogram,Block-matching algorithm,Neural coding,Computer science,Motion compensation,Parametric statistics,Artificial intelligence,Generative grammar,Sketch | Journal |
Volume | Issue | ISSN |
abs/1502.02965 | 2 | 0924-9907 |
Citations | PageRank | References |
4 | 0.39 | 29 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Zhi Han | 1 | 225 | 18.58 |
Zongben Xu | 2 | 3203 | 198.88 |
Song-Chun Zhu | 3 | 6580 | 741.75 |