Abstract | ||
---|---|---|
The H.264/AVC Fractional Motion Estimation (FME) with rate-distortion constrained mode decision can improve the rate-distortion efficiency by 2---6 dB in peak signal-to-noise ratio. However, it comes with considerable computation complexity. Acceleration by dedicated hardware is a must for real-time applications. The main difficulty for FME hardware implementation is parallel processing under the constraint of the sequential flow and data dependency. We analyze seven inter-correlative loops extracted from FME procedure and provide decomposing methodologies to obtain efficient projection in hardware implementation. Two techniques, 4脳4 block decomposition and efficiently vertical scheduling, are proposed to reuse data among the variable block size and to improve the hardware utilization. Besides, advanced architectures are designed to efficiently integrate the 6-taps 2D finite impulse response, residue generation, and 4脳4 Hadamard transform into a fully pipelined architecture. This design is finally implemented and integrated into an H.264/AVC single chip encoder that supports realtime encoding of 720脳480 30fps video with four reference frames at 81 MHz operation frequency with 405 K logic gates (41.9% area of the encoder). |
Year | DOI | Venue |
---|---|---|
2008 | 10.1007/s11265-008-0213-7 | Signal Processing Systems |
Keywords | Field | DocType |
H.264/AVC,Motion estimation,VLSI architecture,Video coding | Block size,Reference frame,Data dependency,Logic gate,Computer science,Parallel computing,Real-time computing,Encoder,Motion estimation,Finite impulse response,Hadamard transform | Journal |
Volume | Issue | ISSN |
53 | 3 | 1939-8018 |
Citations | PageRank | References |
6 | 0.45 | 12 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yi-Hau Chen | 1 | 154 | 16.98 |
Tung-Chien Chen | 2 | 791 | 78.22 |
Shao-Yi Chien | 3 | 1603 | 154.48 |
Yu-Wen Huang | 4 | 1116 | 114.02 |
Liang-Gee Chen | 5 | 3637 | 383.22 |