Video coding using a self-adaptive redundant dictionary consisting of spatial and temporal prediction candidates - Citegraph

Paper Info

Title
Video coding using a self-adaptive redundant dictionary consisting of spatial and temporal prediction candidates

Abstract
All standard video coders are based on the prediction plus transform representation of an image block, which predicts the current block using various intra- and inter-prediction modes and then represents the prediction error using a fixed orthonormal transform. We propose to directly represent a mean-removed block using a redundant dictionary consisting of all possible inter-prediction candidates with integer motion vectors (mean-removed). In general the dictionary may also contain some intra-prediction candidates and some pre-designed fixed dictionary atoms. However, simulation results reported in this papers are obtained by using the inter-prediction candidates only. We determine the coefficients by minimizing the L0 norm of the coefficients subject to a constraint on the sparse approximation error. We show that using such a self-adaptive dictionary can lead to a very sparse representation, with significantly fewer non-zero coefficients than using the DCT transform on the prediction error. We further propose a modified orthogonal matching pursuit (OMP) algorithm which othonormalizes each new chosen atom with respect to all previously chosen and orthonormalized atoms. Each image block is represented by the quantized coefficients corresponding to the othonormalized atoms, to overcome the inefficiency associated with using non-orthonormal atoms. Each image block is represented by its mean, which is predictively coded, the indices of the chosen atoms, and the quantized coefficients. Each variable is coded based on its unconditional distribution. Simulation results show that the proposed coder can achieve significant gain over the H.264 coder (implemented using x264) and achieve similar performance comparing to the HEVC reference encoder (HM).

Year	DOI	Venue
2014	10.1109/ICME.2014.6890314	ICME
Keywords	Field	DocType
temporal prediction,othonormalized atoms,video coders,image matching,approximation theory,spatial prediction,dct transform,integer motion vectors,modified orthogonal matching pursuit algorithm,sparse approximation error,discrete cosine transforms,image block,video coding,mean-removed block,self-adaptive redundant dictionary,sparse representation,h.264 coder,fixed orthonormal transform,image motion analysis,encoding,dictionaries,vectors	Matching pursuit,Integer,Lapped transform,K-SVD,Pattern recognition,Computer science,Sparse approximation,Orthonormal basis,Encoder,Artificial intelligence,Encoding (memory)	Conference
ISSN	Citations	PageRank
1945-7871	1	0.41
References	Authors
0	2

Authors (2 rows)

Cited by (1 rows)

References (0 rows)

Name	Order	Citations	PageRank
Yuanyi Xue	1	55	5.37
Yao Wang	2	19	4.00

1