A Row-Parallel 8 x 8 2-D DCT Architecture Using Algebraic Integer-Based Exact Computation

作者:Madanayake Arjuna*; Cintra Renato J; Onen Denis; Dimitrov Vassil S; Rajapaksha Nilanka; Bruton L T; Edirisuriya Amila
来源:IEEE Transactions on Circuits and Systems for Video Technology, 2012, 22(6): 915-929.
DOI:10.1109/TCSVT.2011.2181232

摘要

An algebraic integer (AI)-based time-multiplexed row-parallel architecture and two final reconstruction step (FRS) algorithms are proposed for the implementation of bivariate AI-encoded 2-D discrete cosine transform (DCT). The architecture directly realizes an error-free 2-D DCT without using FRSs between row-column transforms, leading to an 8 x 8 2-D DCT that is entirely free of quantization errors in AI basis. As a result, the user-selectable accuracy for each of the coefficients in the FRS facilitates each of the 64 coefficients to have its precision set independently of others, avoiding the leakage of quantization noise between channels as is the case for published DCT designs. The proposed FRS uses two approaches based on: 1) optimized Dempster-Macleod multipliers, and 2) expansion factor scaling. This architecture enables low-noise high-dynamic range applications in digital video processing that requires full control of the finite-precision computation of the 2-D DCT. The proposed architectures and FRS techniques are experimentally verified and validated using hardware implementations that are physically realized and verified on field-programmable gate array (FPGA) chip. Six designs, for 4-bit and 8-bit input word sizes, using the two proposed FRS schemes, have been designed, simulated, physically implemented, and measured. The maximum clock rate and block rate achieved among 8-bit input designs are 307.787 MHz and 38.47 MHz, respectively, implying a pixel rate of 8 x 307.787 approximate to 2.462 GHz if eventually embedded in a real-time video-processing system. The equivalent frame rate is about 1187.35 Hz for the image size of 1920 x 1080. All implementations are functional on a Xilinx Virtex-6 XC6VLX240T FPGA device.

  • 出版日期2012-6