Paper Type |
: |
Research Paper |
Title |
: |
Optimization of ECAT through DA-DCT |
Country |
: |
India |
Authors |
: |
S. Indumathi1 and Dr. M. Sailaja |
 |
: |
10.9790/2834-0313950  |
Abstract: Discrete cosine transform (DCT) is a widely used tool in image and video compression applications. Recently, the high-throughput DCT designs have been adopted to fit the requirements of real-time application. Operating the shifting and addition in parallel, an error-compensated adder-tree (ECAT) is proposed to deal with the truncation errors and to achieve low-error and high-speed discrete cosine transform (DCT) design. Instead of the 12 bits used in previous works, 9-bit Distributed Arithmetic was proposed. DA-based DCT design with an error-compensated adder-tree (ECAT) is the proposed architecture in which, ECAT operates shifting and addition in parallel by unrolling all the words required to be computed. Furthermore, the Error-Compensated Circuit alleviates the truncation error for high accuracy design. Based on low-error ECAT, the DA-precision in this work is chosen to be 9 bits instead of the traditional 12 bits. Therefore, the hardware size and cost is reduced, and the speed is improved using the proposed ECAT.
Keywords- Adders, DCT- Discrete Cosine Transform, DA- Distributed Arithmetic, ECAT- Error-Compensated Adder-Tree.
[1] Yuan-Ho Chen; Tsin-Yuan Chang; Chung-Yi Li;, "High Throughput DA-Based DCT With High Accuracy Error-Compensated Adder Tree," Very Large Scale Integration (VLSI) Systems, IEEE Transactions on, vol.19, no.4, pp.709-714, April 2011 doi: 10.1109 / TVLSI.2009.2037968.
[2] Y. Chang and C.Wang, "New systolic array implementation of the 2-D discrete cosine transform and its inverse," IEEE Trans. Circuits Syst.Video Technol., vol. 5, no. 2, pp. 150–157, Apr. 1995.
[3] S. Ghosh, S. Venigalla, and M. Bayoumi, "Design and implementation of a 2D-DCT architecture using coefficient distributed arithmetic," in Proc. IEEE Compute. Soc. Ann. Symp. VLSI, 2005, pp. 162– 166.
[4] S. Uramoto, Y. Inoue, A. Takabatake, J. Takeda, Y. Yamashita, H.Yerane, and M. Yoshimoto, "A 100-MHz 2-D discrete cosine transform core processor," IEEE J. Solid-State Circuits, vol. 27, no. 4, pp492–499, Apr. 1992.
[5] S. Yu and E. E. S. , Jr., "DCT implementation with distributed arithmetic,"IEEE Trans. Comput., vol. 50, no. 9, pp. 985–991, Sep. 2001.
[6] P. K. Meher, "Unified systolic-like architecture for DCT and DST using distributed arithmetic," IEEE Trans. Circuits Syst. I, Reg. Papers, vol.53, no. 12, pp. 2656–2663, Dec. 2006.
[7] A. M. Shams, A. Chidanandan, W. Pan, and M. A. Bayoumi, "NEDA:A low-power high-performance DCT architecture," IEEE Trans. SignalProcess., vol. 54, no. 3, pp. 955–964, Mar. 2006.
[8] M. R. M. Rizk and M. Ammar, "Low power small area high performance 2D-DCT architecture," in Proc. Int. Design Test Workshop,2007, pp. 120–125.
[9] Y. Chen, X. Cao, Q. Xie, and C. Peng, "An area efficient high performanceDCT distributed architecture for video compression," in Proc.Int. Conf. Adv. Comm. Technol., 2007, pp. 238–241.
[10] C. Peng, X. Cao, D. Yu, and X. Zhang, "A 250 MHz optimized distributed architecture of 2D 8×8 DCT," in Proc. Int. Conf. ASIC, 2007,pp. 189–192.