1*dfc6aa5cSAndroid Build Coastguard Worker /* 2*dfc6aa5cSAndroid Build Coastguard Worker * jfdctfst.c 3*dfc6aa5cSAndroid Build Coastguard Worker * 4*dfc6aa5cSAndroid Build Coastguard Worker * This file was part of the Independent JPEG Group's software: 5*dfc6aa5cSAndroid Build Coastguard Worker * Copyright (C) 1994-1996, Thomas G. Lane. 6*dfc6aa5cSAndroid Build Coastguard Worker * libjpeg-turbo Modifications: 7*dfc6aa5cSAndroid Build Coastguard Worker * Copyright (C) 2015, D. R. Commander. 8*dfc6aa5cSAndroid Build Coastguard Worker * For conditions of distribution and use, see the accompanying README.ijg 9*dfc6aa5cSAndroid Build Coastguard Worker * file. 10*dfc6aa5cSAndroid Build Coastguard Worker * 11*dfc6aa5cSAndroid Build Coastguard Worker * This file contains a fast, not so accurate integer implementation of the 12*dfc6aa5cSAndroid Build Coastguard Worker * forward DCT (Discrete Cosine Transform). 13*dfc6aa5cSAndroid Build Coastguard Worker * 14*dfc6aa5cSAndroid Build Coastguard Worker * A 2-D DCT can be done by 1-D DCT on each row followed by 1-D DCT 15*dfc6aa5cSAndroid Build Coastguard Worker * on each column. Direct algorithms are also available, but they are 16*dfc6aa5cSAndroid Build Coastguard Worker * much more complex and seem not to be any faster when reduced to code. 17*dfc6aa5cSAndroid Build Coastguard Worker * 18*dfc6aa5cSAndroid Build Coastguard Worker * This implementation is based on Arai, Agui, and Nakajima's algorithm for 19*dfc6aa5cSAndroid Build Coastguard Worker * scaled DCT. Their original paper (Trans. IEICE E-71(11):1095) is in 20*dfc6aa5cSAndroid Build Coastguard Worker * Japanese, but the algorithm is described in the Pennebaker & Mitchell 21*dfc6aa5cSAndroid Build Coastguard Worker * JPEG textbook (see REFERENCES section in file README.ijg). The following 22*dfc6aa5cSAndroid Build Coastguard Worker * code is based directly on figure 4-8 in P&M. 23*dfc6aa5cSAndroid Build Coastguard Worker * While an 8-point DCT cannot be done in less than 11 multiplies, it is 24*dfc6aa5cSAndroid Build Coastguard Worker * possible to arrange the computation so that many of the multiplies are 25*dfc6aa5cSAndroid Build Coastguard Worker * simple scalings of the final outputs. These multiplies can then be 26*dfc6aa5cSAndroid Build Coastguard Worker * folded into the multiplications or divisions by the JPEG quantization 27*dfc6aa5cSAndroid Build Coastguard Worker * table entries. The AA&N method leaves only 5 multiplies and 29 adds 28*dfc6aa5cSAndroid Build Coastguard Worker * to be done in the DCT itself. 29*dfc6aa5cSAndroid Build Coastguard Worker * The primary disadvantage of this method is that with fixed-point math, 30*dfc6aa5cSAndroid Build Coastguard Worker * accuracy is lost due to imprecise representation of the scaled 31*dfc6aa5cSAndroid Build Coastguard Worker * quantization values. The smaller the quantization table entry, the less 32*dfc6aa5cSAndroid Build Coastguard Worker * precise the scaled value, so this implementation does worse with high- 33*dfc6aa5cSAndroid Build Coastguard Worker * quality-setting files than with low-quality ones. 34*dfc6aa5cSAndroid Build Coastguard Worker */ 35*dfc6aa5cSAndroid Build Coastguard Worker 36*dfc6aa5cSAndroid Build Coastguard Worker #define JPEG_INTERNALS 37*dfc6aa5cSAndroid Build Coastguard Worker #include "jinclude.h" 38*dfc6aa5cSAndroid Build Coastguard Worker #include "jpeglib.h" 39*dfc6aa5cSAndroid Build Coastguard Worker #include "jdct.h" /* Private declarations for DCT subsystem */ 40*dfc6aa5cSAndroid Build Coastguard Worker 41*dfc6aa5cSAndroid Build Coastguard Worker #ifdef DCT_IFAST_SUPPORTED 42*dfc6aa5cSAndroid Build Coastguard Worker 43*dfc6aa5cSAndroid Build Coastguard Worker 44*dfc6aa5cSAndroid Build Coastguard Worker /* 45*dfc6aa5cSAndroid Build Coastguard Worker * This module is specialized to the case DCTSIZE = 8. 46*dfc6aa5cSAndroid Build Coastguard Worker */ 47*dfc6aa5cSAndroid Build Coastguard Worker 48*dfc6aa5cSAndroid Build Coastguard Worker #if DCTSIZE != 8 49*dfc6aa5cSAndroid Build Coastguard Worker Sorry, this code only copes with 8x8 DCTs. /* deliberate syntax err */ 50*dfc6aa5cSAndroid Build Coastguard Worker #endif 51*dfc6aa5cSAndroid Build Coastguard Worker 52*dfc6aa5cSAndroid Build Coastguard Worker 53*dfc6aa5cSAndroid Build Coastguard Worker /* Scaling decisions are generally the same as in the LL&M algorithm; 54*dfc6aa5cSAndroid Build Coastguard Worker * see jfdctint.c for more details. However, we choose to descale 55*dfc6aa5cSAndroid Build Coastguard Worker * (right shift) multiplication products as soon as they are formed, 56*dfc6aa5cSAndroid Build Coastguard Worker * rather than carrying additional fractional bits into subsequent additions. 57*dfc6aa5cSAndroid Build Coastguard Worker * This compromises accuracy slightly, but it lets us save a few shifts. 58*dfc6aa5cSAndroid Build Coastguard Worker * More importantly, 16-bit arithmetic is then adequate (for 8-bit samples) 59*dfc6aa5cSAndroid Build Coastguard Worker * everywhere except in the multiplications proper; this saves a good deal 60*dfc6aa5cSAndroid Build Coastguard Worker * of work on 16-bit-int machines. 61*dfc6aa5cSAndroid Build Coastguard Worker * 62*dfc6aa5cSAndroid Build Coastguard Worker * Again to save a few shifts, the intermediate results between pass 1 and 63*dfc6aa5cSAndroid Build Coastguard Worker * pass 2 are not upscaled, but are represented only to integral precision. 64*dfc6aa5cSAndroid Build Coastguard Worker * 65*dfc6aa5cSAndroid Build Coastguard Worker * A final compromise is to represent the multiplicative constants to only 66*dfc6aa5cSAndroid Build Coastguard Worker * 8 fractional bits, rather than 13. This saves some shifting work on some 67*dfc6aa5cSAndroid Build Coastguard Worker * machines, and may also reduce the cost of multiplication (since there 68*dfc6aa5cSAndroid Build Coastguard Worker * are fewer one-bits in the constants). 69*dfc6aa5cSAndroid Build Coastguard Worker */ 70*dfc6aa5cSAndroid Build Coastguard Worker 71*dfc6aa5cSAndroid Build Coastguard Worker #define CONST_BITS 8 72*dfc6aa5cSAndroid Build Coastguard Worker 73*dfc6aa5cSAndroid Build Coastguard Worker 74*dfc6aa5cSAndroid Build Coastguard Worker /* Some C compilers fail to reduce "FIX(constant)" at compile time, thus 75*dfc6aa5cSAndroid Build Coastguard Worker * causing a lot of useless floating-point operations at run time. 76*dfc6aa5cSAndroid Build Coastguard Worker * To get around this we use the following pre-calculated constants. 77*dfc6aa5cSAndroid Build Coastguard Worker * If you change CONST_BITS you may want to add appropriate values. 78*dfc6aa5cSAndroid Build Coastguard Worker * (With a reasonable C compiler, you can just rely on the FIX() macro...) 79*dfc6aa5cSAndroid Build Coastguard Worker */ 80*dfc6aa5cSAndroid Build Coastguard Worker 81*dfc6aa5cSAndroid Build Coastguard Worker #if CONST_BITS == 8 82*dfc6aa5cSAndroid Build Coastguard Worker #define FIX_0_382683433 ((JLONG)98) /* FIX(0.382683433) */ 83*dfc6aa5cSAndroid Build Coastguard Worker #define FIX_0_541196100 ((JLONG)139) /* FIX(0.541196100) */ 84*dfc6aa5cSAndroid Build Coastguard Worker #define FIX_0_707106781 ((JLONG)181) /* FIX(0.707106781) */ 85*dfc6aa5cSAndroid Build Coastguard Worker #define FIX_1_306562965 ((JLONG)334) /* FIX(1.306562965) */ 86*dfc6aa5cSAndroid Build Coastguard Worker #else 87*dfc6aa5cSAndroid Build Coastguard Worker #define FIX_0_382683433 FIX(0.382683433) 88*dfc6aa5cSAndroid Build Coastguard Worker #define FIX_0_541196100 FIX(0.541196100) 89*dfc6aa5cSAndroid Build Coastguard Worker #define FIX_0_707106781 FIX(0.707106781) 90*dfc6aa5cSAndroid Build Coastguard Worker #define FIX_1_306562965 FIX(1.306562965) 91*dfc6aa5cSAndroid Build Coastguard Worker #endif 92*dfc6aa5cSAndroid Build Coastguard Worker 93*dfc6aa5cSAndroid Build Coastguard Worker 94*dfc6aa5cSAndroid Build Coastguard Worker /* We can gain a little more speed, with a further compromise in accuracy, 95*dfc6aa5cSAndroid Build Coastguard Worker * by omitting the addition in a descaling shift. This yields an incorrectly 96*dfc6aa5cSAndroid Build Coastguard Worker * rounded result half the time... 97*dfc6aa5cSAndroid Build Coastguard Worker */ 98*dfc6aa5cSAndroid Build Coastguard Worker 99*dfc6aa5cSAndroid Build Coastguard Worker #ifndef USE_ACCURATE_ROUNDING 100*dfc6aa5cSAndroid Build Coastguard Worker #undef DESCALE 101*dfc6aa5cSAndroid Build Coastguard Worker #define DESCALE(x, n) RIGHT_SHIFT(x, n) 102*dfc6aa5cSAndroid Build Coastguard Worker #endif 103*dfc6aa5cSAndroid Build Coastguard Worker 104*dfc6aa5cSAndroid Build Coastguard Worker 105*dfc6aa5cSAndroid Build Coastguard Worker /* Multiply a DCTELEM variable by an JLONG constant, and immediately 106*dfc6aa5cSAndroid Build Coastguard Worker * descale to yield a DCTELEM result. 107*dfc6aa5cSAndroid Build Coastguard Worker */ 108*dfc6aa5cSAndroid Build Coastguard Worker 109*dfc6aa5cSAndroid Build Coastguard Worker #define MULTIPLY(var, const) ((DCTELEM)DESCALE((var) * (const), CONST_BITS)) 110*dfc6aa5cSAndroid Build Coastguard Worker 111*dfc6aa5cSAndroid Build Coastguard Worker 112*dfc6aa5cSAndroid Build Coastguard Worker /* 113*dfc6aa5cSAndroid Build Coastguard Worker * Perform the forward DCT on one block of samples. 114*dfc6aa5cSAndroid Build Coastguard Worker */ 115*dfc6aa5cSAndroid Build Coastguard Worker 116*dfc6aa5cSAndroid Build Coastguard Worker GLOBAL(void) 117*dfc6aa5cSAndroid Build Coastguard Worker jpeg_fdct_ifast(DCTELEM *data) 118*dfc6aa5cSAndroid Build Coastguard Worker { 119*dfc6aa5cSAndroid Build Coastguard Worker DCTELEM tmp0, tmp1, tmp2, tmp3, tmp4, tmp5, tmp6, tmp7; 120*dfc6aa5cSAndroid Build Coastguard Worker DCTELEM tmp10, tmp11, tmp12, tmp13; 121*dfc6aa5cSAndroid Build Coastguard Worker DCTELEM z1, z2, z3, z4, z5, z11, z13; 122*dfc6aa5cSAndroid Build Coastguard Worker DCTELEM *dataptr; 123*dfc6aa5cSAndroid Build Coastguard Worker int ctr; 124*dfc6aa5cSAndroid Build Coastguard Worker SHIFT_TEMPS 125*dfc6aa5cSAndroid Build Coastguard Worker 126*dfc6aa5cSAndroid Build Coastguard Worker /* Pass 1: process rows. */ 127*dfc6aa5cSAndroid Build Coastguard Worker 128*dfc6aa5cSAndroid Build Coastguard Worker dataptr = data; 129*dfc6aa5cSAndroid Build Coastguard Worker for (ctr = DCTSIZE - 1; ctr >= 0; ctr--) { 130*dfc6aa5cSAndroid Build Coastguard Worker tmp0 = dataptr[0] + dataptr[7]; 131*dfc6aa5cSAndroid Build Coastguard Worker tmp7 = dataptr[0] - dataptr[7]; 132*dfc6aa5cSAndroid Build Coastguard Worker tmp1 = dataptr[1] + dataptr[6]; 133*dfc6aa5cSAndroid Build Coastguard Worker tmp6 = dataptr[1] - dataptr[6]; 134*dfc6aa5cSAndroid Build Coastguard Worker tmp2 = dataptr[2] + dataptr[5]; 135*dfc6aa5cSAndroid Build Coastguard Worker tmp5 = dataptr[2] - dataptr[5]; 136*dfc6aa5cSAndroid Build Coastguard Worker tmp3 = dataptr[3] + dataptr[4]; 137*dfc6aa5cSAndroid Build Coastguard Worker tmp4 = dataptr[3] - dataptr[4]; 138*dfc6aa5cSAndroid Build Coastguard Worker 139*dfc6aa5cSAndroid Build Coastguard Worker /* Even part */ 140*dfc6aa5cSAndroid Build Coastguard Worker 141*dfc6aa5cSAndroid Build Coastguard Worker tmp10 = tmp0 + tmp3; /* phase 2 */ 142*dfc6aa5cSAndroid Build Coastguard Worker tmp13 = tmp0 - tmp3; 143*dfc6aa5cSAndroid Build Coastguard Worker tmp11 = tmp1 + tmp2; 144*dfc6aa5cSAndroid Build Coastguard Worker tmp12 = tmp1 - tmp2; 145*dfc6aa5cSAndroid Build Coastguard Worker 146*dfc6aa5cSAndroid Build Coastguard Worker dataptr[0] = tmp10 + tmp11; /* phase 3 */ 147*dfc6aa5cSAndroid Build Coastguard Worker dataptr[4] = tmp10 - tmp11; 148*dfc6aa5cSAndroid Build Coastguard Worker 149*dfc6aa5cSAndroid Build Coastguard Worker z1 = MULTIPLY(tmp12 + tmp13, FIX_0_707106781); /* c4 */ 150*dfc6aa5cSAndroid Build Coastguard Worker dataptr[2] = tmp13 + z1; /* phase 5 */ 151*dfc6aa5cSAndroid Build Coastguard Worker dataptr[6] = tmp13 - z1; 152*dfc6aa5cSAndroid Build Coastguard Worker 153*dfc6aa5cSAndroid Build Coastguard Worker /* Odd part */ 154*dfc6aa5cSAndroid Build Coastguard Worker 155*dfc6aa5cSAndroid Build Coastguard Worker tmp10 = tmp4 + tmp5; /* phase 2 */ 156*dfc6aa5cSAndroid Build Coastguard Worker tmp11 = tmp5 + tmp6; 157*dfc6aa5cSAndroid Build Coastguard Worker tmp12 = tmp6 + tmp7; 158*dfc6aa5cSAndroid Build Coastguard Worker 159*dfc6aa5cSAndroid Build Coastguard Worker /* The rotator is modified from fig 4-8 to avoid extra negations. */ 160*dfc6aa5cSAndroid Build Coastguard Worker z5 = MULTIPLY(tmp10 - tmp12, FIX_0_382683433); /* c6 */ 161*dfc6aa5cSAndroid Build Coastguard Worker z2 = MULTIPLY(tmp10, FIX_0_541196100) + z5; /* c2-c6 */ 162*dfc6aa5cSAndroid Build Coastguard Worker z4 = MULTIPLY(tmp12, FIX_1_306562965) + z5; /* c2+c6 */ 163*dfc6aa5cSAndroid Build Coastguard Worker z3 = MULTIPLY(tmp11, FIX_0_707106781); /* c4 */ 164*dfc6aa5cSAndroid Build Coastguard Worker 165*dfc6aa5cSAndroid Build Coastguard Worker z11 = tmp7 + z3; /* phase 5 */ 166*dfc6aa5cSAndroid Build Coastguard Worker z13 = tmp7 - z3; 167*dfc6aa5cSAndroid Build Coastguard Worker 168*dfc6aa5cSAndroid Build Coastguard Worker dataptr[5] = z13 + z2; /* phase 6 */ 169*dfc6aa5cSAndroid Build Coastguard Worker dataptr[3] = z13 - z2; 170*dfc6aa5cSAndroid Build Coastguard Worker dataptr[1] = z11 + z4; 171*dfc6aa5cSAndroid Build Coastguard Worker dataptr[7] = z11 - z4; 172*dfc6aa5cSAndroid Build Coastguard Worker 173*dfc6aa5cSAndroid Build Coastguard Worker dataptr += DCTSIZE; /* advance pointer to next row */ 174*dfc6aa5cSAndroid Build Coastguard Worker } 175*dfc6aa5cSAndroid Build Coastguard Worker 176*dfc6aa5cSAndroid Build Coastguard Worker /* Pass 2: process columns. */ 177*dfc6aa5cSAndroid Build Coastguard Worker 178*dfc6aa5cSAndroid Build Coastguard Worker dataptr = data; 179*dfc6aa5cSAndroid Build Coastguard Worker for (ctr = DCTSIZE - 1; ctr >= 0; ctr--) { 180*dfc6aa5cSAndroid Build Coastguard Worker tmp0 = dataptr[DCTSIZE * 0] + dataptr[DCTSIZE * 7]; 181*dfc6aa5cSAndroid Build Coastguard Worker tmp7 = dataptr[DCTSIZE * 0] - dataptr[DCTSIZE * 7]; 182*dfc6aa5cSAndroid Build Coastguard Worker tmp1 = dataptr[DCTSIZE * 1] + dataptr[DCTSIZE * 6]; 183*dfc6aa5cSAndroid Build Coastguard Worker tmp6 = dataptr[DCTSIZE * 1] - dataptr[DCTSIZE * 6]; 184*dfc6aa5cSAndroid Build Coastguard Worker tmp2 = dataptr[DCTSIZE * 2] + dataptr[DCTSIZE * 5]; 185*dfc6aa5cSAndroid Build Coastguard Worker tmp5 = dataptr[DCTSIZE * 2] - dataptr[DCTSIZE * 5]; 186*dfc6aa5cSAndroid Build Coastguard Worker tmp3 = dataptr[DCTSIZE * 3] + dataptr[DCTSIZE * 4]; 187*dfc6aa5cSAndroid Build Coastguard Worker tmp4 = dataptr[DCTSIZE * 3] - dataptr[DCTSIZE * 4]; 188*dfc6aa5cSAndroid Build Coastguard Worker 189*dfc6aa5cSAndroid Build Coastguard Worker /* Even part */ 190*dfc6aa5cSAndroid Build Coastguard Worker 191*dfc6aa5cSAndroid Build Coastguard Worker tmp10 = tmp0 + tmp3; /* phase 2 */ 192*dfc6aa5cSAndroid Build Coastguard Worker tmp13 = tmp0 - tmp3; 193*dfc6aa5cSAndroid Build Coastguard Worker tmp11 = tmp1 + tmp2; 194*dfc6aa5cSAndroid Build Coastguard Worker tmp12 = tmp1 - tmp2; 195*dfc6aa5cSAndroid Build Coastguard Worker 196*dfc6aa5cSAndroid Build Coastguard Worker dataptr[DCTSIZE * 0] = tmp10 + tmp11; /* phase 3 */ 197*dfc6aa5cSAndroid Build Coastguard Worker dataptr[DCTSIZE * 4] = tmp10 - tmp11; 198*dfc6aa5cSAndroid Build Coastguard Worker 199*dfc6aa5cSAndroid Build Coastguard Worker z1 = MULTIPLY(tmp12 + tmp13, FIX_0_707106781); /* c4 */ 200*dfc6aa5cSAndroid Build Coastguard Worker dataptr[DCTSIZE * 2] = tmp13 + z1; /* phase 5 */ 201*dfc6aa5cSAndroid Build Coastguard Worker dataptr[DCTSIZE * 6] = tmp13 - z1; 202*dfc6aa5cSAndroid Build Coastguard Worker 203*dfc6aa5cSAndroid Build Coastguard Worker /* Odd part */ 204*dfc6aa5cSAndroid Build Coastguard Worker 205*dfc6aa5cSAndroid Build Coastguard Worker tmp10 = tmp4 + tmp5; /* phase 2 */ 206*dfc6aa5cSAndroid Build Coastguard Worker tmp11 = tmp5 + tmp6; 207*dfc6aa5cSAndroid Build Coastguard Worker tmp12 = tmp6 + tmp7; 208*dfc6aa5cSAndroid Build Coastguard Worker 209*dfc6aa5cSAndroid Build Coastguard Worker /* The rotator is modified from fig 4-8 to avoid extra negations. */ 210*dfc6aa5cSAndroid Build Coastguard Worker z5 = MULTIPLY(tmp10 - tmp12, FIX_0_382683433); /* c6 */ 211*dfc6aa5cSAndroid Build Coastguard Worker z2 = MULTIPLY(tmp10, FIX_0_541196100) + z5; /* c2-c6 */ 212*dfc6aa5cSAndroid Build Coastguard Worker z4 = MULTIPLY(tmp12, FIX_1_306562965) + z5; /* c2+c6 */ 213*dfc6aa5cSAndroid Build Coastguard Worker z3 = MULTIPLY(tmp11, FIX_0_707106781); /* c4 */ 214*dfc6aa5cSAndroid Build Coastguard Worker 215*dfc6aa5cSAndroid Build Coastguard Worker z11 = tmp7 + z3; /* phase 5 */ 216*dfc6aa5cSAndroid Build Coastguard Worker z13 = tmp7 - z3; 217*dfc6aa5cSAndroid Build Coastguard Worker 218*dfc6aa5cSAndroid Build Coastguard Worker dataptr[DCTSIZE * 5] = z13 + z2; /* phase 6 */ 219*dfc6aa5cSAndroid Build Coastguard Worker dataptr[DCTSIZE * 3] = z13 - z2; 220*dfc6aa5cSAndroid Build Coastguard Worker dataptr[DCTSIZE * 1] = z11 + z4; 221*dfc6aa5cSAndroid Build Coastguard Worker dataptr[DCTSIZE * 7] = z11 - z4; 222*dfc6aa5cSAndroid Build Coastguard Worker 223*dfc6aa5cSAndroid Build Coastguard Worker dataptr++; /* advance pointer to next column */ 224*dfc6aa5cSAndroid Build Coastguard Worker } 225*dfc6aa5cSAndroid Build Coastguard Worker } 226*dfc6aa5cSAndroid Build Coastguard Worker 227*dfc6aa5cSAndroid Build Coastguard Worker #endif /* DCT_IFAST_SUPPORTED */ 228