Revision as of 21:57, 18 March 2013

Back to Psych 221 Projects 2013

Background

Camera Basics

A digital camera captures color images by recording data in different channels of the visible spectrum (red, green, blue). Most cameras accomplish this by using a single type of CCD or CMOS sensor at each pixel. This sensor measures the intensity of light that is focused on it, but cannot distinguish between colors. Therefore, a color filter is placed over each sensor, and the sensor only records the intensity of a specific color at that pixel. The arrangement of these color filters over an image is known as a Color Filter Array. Figure 1 shows a very common CFA known as the Bayer Array.

In the Bayer Array, green color filters are arranged in a checkerboard pattern, whereas the red and blue color filters are arranged by alternating rows. The original reasoning for using this distribution of colors was to mimic the physiology of the human eye. During daylight vision, the luminance perception of the retina uses both L and M cones, which are more sensitive to green light. The red and blue filters control the sensitivity of the eye to chrominance.

CFA Interpolation

Since each color channel is only tallied at specific coordinates, the remaining pixels in that channel must be estimated in some way. There are many interpolation schemes - a few simple examples are the bilinear,bicubic, and smooth hue transition interpolations shown below. Other interpolation schemes, such as median filter, gradient-based, and adaptive color plane, become increasingly complex. A wide survey of interpolation methods is available, along with an analysis of the advantages and disadvantages of each.

Bilinear/Bicubic

A linear combination of the nearest N neighbors in any direction for N = 1 (bilinear) or N = 3 (bicubic). For a given matrix of known values R(x,y), the interpolated set of values r(x,y) can be computed:

r (x, y) = \sum_{u, v = - N}^{N} h_{r} (u, v) R (x - u, y - v)

where h_r is the 2-D kernel containing interpolation weights based on relative neighbors of a given pixel.

h_{1} = \frac{1}{4} [\begin{matrix} 0 & 1 & 0 \\ 1 & 4 & 1 \\ 0 & 1 & 0 \end{matrix}], h_{3} = \frac{1}{256} [\begin{matrix} 0 & 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & - 9 & 0 & - 9 & 0 & 0 \\ 0 & - 9 & 0 & 81 & 0 & - 9 & 0 \\ 1 & 0 & 81 & 256 & 81 & 0 & 1 \\ 0 & - 9 & 0 & 81 & 0 & - 9 & 0 \\ 0 & 0 & - 9 & 0 & - 9 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 & 0 \end{matrix}]

Smooth Hue Transition

Operates on the assumption that the hue of a natural image (chrominance/luminance, or red/green and blue/green) changes smoothly in local regions of that image. In order to calculate this, the green channel is first interpolated using bilinear interpolation. A missing red point r(x,y) is calculated from known red values R(x,y) and green values G(x,y).

r (x, y) = G (x, y) * \frac{1}{2} (\frac{R (x - 1, y)}{G (x - 1, y)} + \frac{R (x + 1, y)}{G (x + 1, y)})

The sum shown above is valid for when the interpolation pixel has two adjacent pixels in the same row. The equation is altered slightly for two adjacent pixels in the same column, or when there are four adjacent pixels in the corners. The missing blue pixels are calculated in the same way.

Detecting Forgeries

Regardless of the type of interpolation used in a given image, the estimated pixels should exhibit a strong correlation, or dependence, on their surrounding pixels. If one can categorize each pixel as either correlated or independent with respect to other pixels, one should see a periodic pattern that mimics the CFA used to construct the image. Even if an altered image does not have any visual cues that point to its forgery, inspecting the correlation of pixels to one another can potentially expose which parts of an image were altered. It is important to note that, even if an image has been altered, the mentioned correlations may still be kept intact. In this way, detecting forgeries through CFA interpolation is one tool out of many used to detect forgeries.

Methods

Finding Correlations

The first step in detecting forgeries is categorizing the pixels in an image into two groups: those that are correlated with neighboring pixels and those that independent. Since the camera, CFA, an interpolation method are unknown for a given image, a robust algorithm must be used to iteratively deduce this categorization.

Expectation/Maximization

The EM algorithm has a wide variety of uses for working on a statistical set of data. In the context of this problem, each step of the algorithm has a specific purpose. In the expectation step, the probability of each pixel in a given image being correlated to its neighbors is calculated based on the latest guess of interpolation coefficients. In the maximization step, new interpolation coefficients are calculated based on the probability map that reveals which pixels are indeed interpolated. Iterating through this process converges to a stable estimate of both the probability map and interpolation coefficients. If a pixel is correlated to neighboring pixels, it takes the form:

f (x, y) = \sum_{u, v = - N}^{N} α_{u, v} f (x + u, y + v) + n (x, y)

where $α_{u, v}$ are the interpolation coefficients and n(x,y) is the residual error of the linear model and ideally a Gaussian distribution with zero mean. The probability that a sample f(x,y) exists given that it is correlated to its neighbors is given by the equation:

P (x, y) = \frac{1}{σ \sqrt{2 π}} e x p [- \frac{1}{2 σ^{2}} {(f (x, y) - \sum_{u, v = - N}^{N} α_{u, v} f (x + u, y + v))}^{2}]

where $σ$ is the standard deviation of this Gaussian distribution and is updated in the M-step. The new estimate of the interpolation coefficients is found by minimizing the quadratic error function:

E (α) = \sum_{x, y} w (x, y) {(f (x, y) - \sum_{u, v = - N}^{N} α_{u, v} f (x + u, y + v))}^{2}

By setting the derivative of this equation equal to zero and rearranging terms, the following equation exists for each relative index (s,t) of the set interpolation coefficients $α (u, v)$ :

\sum_{u, v = - N}^{N} α_{u, v} (\sum_{x, y} w (x, y) f (x + s, y + t) f (x + u, y + v)) = \sum_{x, y} w (x, y) f (x + s, y + t) f (x, y)

where w(x,y) = P(x,y). By performing this for each coefficient, a system of M equations and M unknowns, M being the number of interpolation coefficients, is set up. In this way, a new series of coefficients can be computed.
In order to make the computation optimized for MATLAB, the summations were converted into matrix multiplications. For example, for a given interpolation coefficient $α_{s, t}$ , the w(x,y) , f(x+s,y+t) , and f(x,y) matrices can be written as row vectors of size 1 x N where N is the number of pixels under analysis. Thus the right hand side of the equation can be represented with matrix multiplations:

$\sum_{x, y} w (x, y) f (x + s, y + t) f (x, y) = [\begin{matrix} w (x, y) \end{matrix}] . * [\begin{matrix} f (x + s, y + t) \end{matrix}] * {[\begin{matrix} f (x, y) \end{matrix}]}^{T}$

Similarly, the left hand side of the equation can be re-written. If all interpolation coefficients are taken into account, the entire system of equations can be represented:

$[\begin{matrix} w (x, y) \\ . \\ . \\ w (x, y) \end{matrix}] . * [\begin{matrix} f (x + 1, y + 0) \\ f (x + 0, y + 1) \\ . \\ f (x + s, y + t) \end{matrix}] * {[\begin{matrix} f (x + 1, y + 0) \\ f (x + 0, y + 1) \\ . \\ f (x + u, y + v) \end{matrix}]}^{T} * [\begin{matrix} α_{1, 0} \\ α_{0, 1} \\ . \\ α_{u, v} \end{matrix}] = [\begin{matrix} w (x, y) \\ . \\ . \\ w (x, y) \end{matrix}] . * [\begin{matrix} f (x + 1, y + 0) \\ f (x + 0, y + 1) \\ . \\ f (x + s, y + t) \end{matrix}] * {[\begin{matrix} f (x, y) \\ f (x, y) \\ . \\ f (x, y) \end{matrix}]}^{T}$

For clarity, the resulting sizes of these matrices is contained below.

$[\begin{matrix} M x M \end{matrix}] * [\begin{matrix} α_{1} \\ α_{2} \\ . \\ α_{M} \end{matrix}] = [\begin{matrix} M x 1 \end{matrix}]$

In this way, a new set of coefficients can then be easily computed. This rearrangement of the data provided much faster computations, and it was very easy to add more coefficients to the system. The EM algorithm was allowed to run until the coefficients generated in successive iterations differed by an amount less than some arbitrarily small threshold.

2-D FFT

Once a probability map was generated from the EM algorithm, the next step was to determine if the periodic pattern of the CFA was present in the map. To do this, a two-dimensional FFT was performed on the probability map. The default 2-D FFT magnitude plot in MATLAB contains all of the low frequency information in the upper left corner, and the high frequency components along the left and bottom edges. In order to make the desired periodicity easier to observe, the FFT was up-sampled, high-pass filtered, and shifted so that the low frequency components are at the center of the image.

FFT BEFORE/AFTER PICTURE HERE

Similarity Measure

In order to build a classifier, a reliable quantity must be measured that is different by some threshold for real and fake images. The method chosen utilizes the similarities between the FFT of the probability map of an image and the FFT of an ideal synthetic grid. To generate a synthetic grid for a given color channel, any pixel that corresponds to that channel's color is marked with a 1 at that location; otherwise, all other locations contain a zero. In order to avoid the event that the probability map and synthetic grid are out of phase, which would yield a low similarity when compared directly, the magnitudes of the FFT results are compared. If $P (ω_{x}, ω_{y})$ is the FFT of the probability map, and $S (ω_{x}, ω_{y})$ is the FFT of the synthetic grid, then the similarity measurement M is:

M = \sum_{ω_{x}, ω_{y}} | P (ω_{x}, ω_{y}) | \cdot | S (ω_{x}, ω_{y}) |

For windows in which the probability map is close to that expected, this similarity measure should remain constant. If the probability map distribution varies significantly, the similarity measure will deviate from the average value. This can be true for any channel.

Results

EM Algorithm Performance

Talk about window sizes, number of iterations, what scenes are harder to match, etc.

Comparing Real and Fake Image Pairs

What we saw when comparing the real and fake image pairs, threshold values, etc.

Setting a Threshold

When we set a threshold, what % of images pass the test. False errors, etc. Talk about robustness.

Conclusions

We'll talk about, if given an image, the chances we can detect if its fake without a real image pair. Talk about future work - whatever that means.

References

1. Popescu, A.C.; Farid, H.,Exposing digital forgeries in color filter array interpolated images, Signal Processing, IEEE Transactions on , vol.53, no.10, pp.3948,3959, Oct. 2005

2. R. Ramanath, W. E. Snyder, G. L. Bilbro, and W. A. Sander III, Demosaicking methods for Bayer color arrays, Journal of Electronic Imaging, vol. 11, no. 3, pp. 306315, July 2002.

3. A. Dempster, N. Laird, and D. Rubin,Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society, vol. 99, no. 1, pp. 138, 1977.

Appendix I - MATLAB Code

Code

File:CodeFile.zip

Appendix II - Work partition (if a group project)

We held hands.

@@ Line 11: / Line 11: @@
 <br>
 === CFA Interpolation ===
-Since each color channel is only tallied at specific coordinates, the remaining pixels in that channel must be estimated in some way.  There are many interpolation schemes - a few simple examples are the bilinear,bicubic, and smooth hue transition interpolations shown below.  Other interpolation schemes, such as median filter, gradient-based, and adaptive color plane, become increasingly complex.  A wide survey of interpolation methods is available at (REFERENCE), along with an analysis of the advantages and disadvantages of each.
+Since each color channel is only tallied at specific coordinates, the remaining pixels in that channel must be estimated in some way.  There are many interpolation schemes - a few simple examples are the bilinear,bicubic, and smooth hue transition interpolations shown below.  Other interpolation schemes, such as median filter, gradient-based, and adaptive color plane, become increasingly complex.  A wide survey of interpolation methods is available, along with an analysis of the advantages and disadvantages of each.
 ==== Bilinear/Bicubic ====
 A linear combination of the nearest N neighbors in any direction for N = 1 (bilinear) or N = 3 (bicubic).  For a given matrix of known values ''R(x,y)'', the interpolated set of values ''r(x,y)'' can be computed:
@@ Line 35: / Line 35: @@
 The first step in detecting forgeries is categorizing the pixels in an image into two groups:  those that are correlated with neighboring pixels and those that independent.  Since the camera, CFA, an interpolation method are unknown for a given image, a robust algorithm must be used to iteratively deduce this categorization.
 ===Expectation/Maximization===
-The EM algorithm has a wide variety of uses for working on a statistical set of data (REFERENCE).  In the context of this problem, each step of the algorithm has a specific purpose.  In the expectation step, the probability of each pixel in a given image being correlated to its neighbors is calculated based on the latest guess of interpolation coefficients.  In the maximization step, new interpolation coefficients are calculated based on the probability map that reveals which pixels are indeed interpolated.  Iterating through this process converges to a stable estimate of both the probability map and interpolation coefficients.  If a pixel is correlated to neighboring pixels, it takes the form:
+The EM algorithm has a wide variety of uses for working on a statistical set of data.  In the context of this problem, each step of the algorithm has a specific purpose.  In the expectation step, the probability of each pixel in a given image being correlated to its neighbors is calculated based on the latest guess of interpolation coefficients.  In the maximization step, new interpolation coefficients are calculated based on the probability map that reveals which pixels are indeed interpolated.  Iterating through this process converges to a stable estimate of both the probability map and interpolation coefficients.  If a pixel is correlated to neighboring pixels, it takes the form:
 :<math>f(x,y) = \sum_{u,v=-N}^{N} \alpha_{u,v}f(x+u,y+v)+n(x,y)</math>
@@ Line 87: / Line 87: @@
 We'll talk about, if given an image, the chances we can detect if its fake without a real image pair.  Talk about future work - whatever that means.
-= References - Resources and related work =
+= References =
+. Popescu, A.C.; Farid, H.,''Exposing digital forgeries in color filter array interpolated images'', Signal Processing, IEEE Transactions on , vol.53, no.10, pp.3948,3959, Oct. 2005
-Didn't keep track of these.
+. R. Ramanath, W. E. Snyder, G. L. Bilbro, and W. A. Sander III, ''Demosaicking methods for Bayer color arrays'', Journal of Electronic Imaging, vol. 11, no. 3, pp. 306315, July 2002.
+. A. Dempster, N. Laird, and D. Rubin,''Maximum likelihood from incomplete data via the EM algorithm'', Journal of the Royal Statistical Society, vol. 99, no. 1, pp. 138, 1977.
 = Appendix I - MATLAB Code =

IsmailPeters: Difference between revisions

Revision as of 21:57, 18 March 2013

Contents

Background

Camera Basics

CFA Interpolation

Bilinear/Bicubic

Smooth Hue Transition

Detecting Forgeries

Methods

Finding Correlations

Expectation/Maximization

2-D FFT

Similarity Measure

Results

EM Algorithm Performance

Comparing Real and Fake Image Pairs

Setting a Threshold

Conclusions

References

Appendix I - MATLAB Code

Code

Appendix II - Work partition (if a group project)

Navigation menu

IsmailPeters: Difference between revisions

Revision as of 21:57, 18 March 2013

Background

Camera Basics

CFA Interpolation

Bilinear/Bicubic

Smooth Hue Transition

Detecting Forgeries

Methods

Finding Correlations

Expectation/Maximization

2-D FFT

Similarity Measure

Results

EM Algorithm Performance

Comparing Real and Fake Image Pairs

Setting a Threshold

Conclusions

References

Appendix I - MATLAB Code

Code

Appendix II - Work partition (if a group project)

Navigation menu

Search