Chapter 2: OBJECT LIGHT FIELD RECONSTRUCTION FROM
A. A shift-variant BRDF model and analytical solution
Previous authors in the computer graphics and computer vision fields have made a variety of simplifying assumptions to BRDFs [12,13]. For example, the BRDF has been approximated as shift-invariant to compute the forward scattering problem in the frequency domain [16] and the diffuse component of the BRDF has been approximated as a constant (commonly known as the Lambertian term) to obtain solutions to the inverse scattering problem [4]. In addition, the BRDF has been modeled as separable, such as for data compression purposes [17,18]. These assumptions, although convenient for computer graphic related purposes, are not suitable for analysis of the reconstruction.
Fig. 2-6 illustrates two types of BRDFs expected in real scenes. Fig. 2-6 (a) shows a large and slowly varying diffuse component superimposed on a small quasi-shift-invariant specular component. Fig. 2-6 (b) shows a BRDF that contains only a quasi-shift-invariant specular component, but is restricted to a limited angular range.
Since the specular component is considerably wider than the measurement range, the boxed area can be considered
17
as a diffuse component. In the following, we introduce a BRDF model in eq. (2-14) that covers these common shift-variant BRDFs, enabling an analytical solution to eq. (2-9). Such analytical solutions are required to fully understand the errors in reconstruction and the effect of regularization methods. Note that our BRDF model combines Durandβs model [16] for the forward scattering analysis and Shirley's model [19] for matte/specular
materials such as polished woods and tiles. Our model is thus expected to have wide applicability to real-world scattering surfaces.
Fig. 2-6. Analytically modeling two candidate BRDF functions. (a) Large diffuse component and small specular component. (b) Wide specular component with limited measurable range.
Fig. 2-6 (a) is modeled mathematically by including a shift-invariant specular term ππ πππ that is multiplied by a slowly varying window function ππ€πππ to allow for commonly observed changes in specular peak-values as a function of object angle ππ [16]. A symmetric diffuse component πππππ(ππ) Γ πππππ(ππ) is then added that is separable and reciprocal in terms of the input and output angles [19]:
ππ΅π π·πΉ(ππ, ππ) = ππ€πππ(ππ)ππ πππ(ππβ ππ) + πππππ(ππ)πππππ(ππ)
In the computer graphics and computer vision fields, the diffuse component is usually expressed as a constant Lambertian term. In our treatment, we allow the diffuse component to take on any functional form, as long as the function is separable with respect to the input and output angles. This generalization allows for a far more complex diffuse term to be expressed. However, the separability constraint implies that no object information is (2-14)
18
conveyed to the measured signal as explained in the subsequent sections. Moreover, if ππ πππ is sufficiently narrow compared to ππ€πππ (i.e. ππ πππ is considered to be delta-function like), then ππ€πππ(ππ) Γ ππ πππ(ππβ ππ) β ππ€πππ(ππ) Γ ππ πππ(ππβ ππ) meaning the model has Helmholtz reciprocity considering the symmetricity of the specular and diffuse component.
We first introduce a weighted inner product [29] (denoted by a double bracket in eq. (2-15)) and weighted basis (denoted by a tilde in eq. (2-16)) to simplify the frequency-domain expression of eq. (2-9). This basis is orthonormal with respect to the weighted inner product, so πΏπππ can be expressed as a linear combination of πΜπ as shown in eq. (2-17), where πΜπ is a coefficient of πΏπππ with respect to πΜπ. (The relation between ππ
By substituting eq. (2-9) and (2-17) into (2-11) and referring to eq. (2-12), a new frequency domain expression can be obtained in eq. (2-18). Note that the cosine obliquity factor has been cancelled and π΅Μπ,π is simply the 2-D Fourier coefficient of the BRDF. In the following, the system matrix [π΅Μ] contains the elements π΅Μπ,π.
{
By substituting the BRDF model of eq. (2-14) into eq. (2-18), the following expression for π΅Μπ,π is obtained:
(2-15)
(2-16)
(2-17)
(2-18)
19
Because we have assumed that the diffuse component is separable, the second term of eq. (2-19) can be written as eq. (2-23), where π·π is the Fourier coefficient of πππππ defined in eq. (2-24).
Finally, by substituting eq. (2-20) and (2-23) into (2-19), π΅Μπ,π is expressed in the following simple form:
π΅Μπ,π= ππ,πππβπ+ π·ππ·πβ
Recall that we have assumed that ππ€πππ is a slowly varying function. To make further progress in the analytical solution of eq. (2-18), we approximate it as ππ€πππ(ππ) β 1. We then have
20
By substituting eq. (2-27) into (2-18), and dividing both sides by ππ,π, we have
πΜπ= ππ
ππ,πβ β π·π
ππ,ππ·πβπΜπ
β
π=ββ
Eq. (2-28) is a discrete form of the Fredholm integral equation of the second kind [30], and it has a closed-form solution because of the separable kernel in the discrete sum. By expressing eq. (2-28) with a constant πΌ defined in eq. (2-29) and multiplying both sides by π·πβ, eq. (2-30) is obtained.
By calculating the discrete sum of p for both sides of eq. (2-30), the following is obtained:
β π·πβπΜπ
The left-hand-side of eq. (2-31) is equal to πΌ, and therefore this equation can be solved in terms of πΌ as expressed in eq. (2-32). The expression for πΌ does not include unknown πΜπ, and therefore, an analytical closed-form
21
B. Understanding the structure of the system matrix [π©]
From the analytical expressions in the previous section, it is possible to interpret the structure and eigenvalues of the system matrix [π΅] in eq. (2-13). From the relation between ππ and πΜπ in eq. (a3) in appendix A1, π΅π,π in eq. (2-13) and π΅Μπ,π in eq. (2-18) have the following relation: π΅π,πβ π΅Μπ,πβ , where π½π½0 0 is a constant related to the cosine obliquity factor and is given by eq. (a2) in appendix A1, and therefore, the structure of [π΅] can be directly seen from [π΅Μ].
Let us first consider the simplest case where ππ€πππ is unity. Fig. 2-7 (a) illustrates π΅Μπ,π in eq. (2-27) where it can be seen that the diagonal elements of [π΅Μ] are ππ,π, or the Fourier coefficients of ππ πππ in eq. (2-21), and the off-diagonal elements are π·ππ·πβ or the outer product of the Fourier coefficients of πππππ in eq. (2-24).
Moreover, it can be understood from eq. (2-33) that ππ,π can be considered as the approximate eigenvalues of [π΅Μ] because the right-hand-side of eq. (2-33) includes division by ππ,π. As is commonly known, many actual BRFDs contain specular components whose shape can be approximated by a Gaussian function and whose Fourier coefficients monotonically decrease as a function of frequency [12,13], and therefore, the diagonal elements tend to have near zero values at higher frequencies. The extent of the non-zero portion of these diagonal elements of the matrix indicates the range of angular frequencies that can be recovered without being adversely affected by measurement noise. The length of this recoverable region is Fourier-transform related to the sharpness of the specular component (see fig. 2-7 (b)).
(2-33) (2-32)
22
Second, let us consider a more general BRDF where ππ€πππ is not unity but rather a slowly varying function. From eq. (2-25), the diagonal elements ππ,π are seen to be convolved with ππβπ, leading to a blur in the diagonal elements. Fig. 2-7 (b) shows an experimentally measured system matrix [π΅] from a coated white paper. The width of the blur in this figure is a measure of the shift-variance of the specular component. In many common scattering materials where ππ€πππ is slowly varying and the width of this blur is small, the ππ,π terms are still good approximations to the actual eigenvalues.
Fig. 2-7. Understanding the structure of the system matrix [π©] . (a) Schematic structure of [π©Μ]. (b) Experimentally obtained [π©].
C. Influence of measurement noise on reconstruction error
In many inverse problem solutions, the relative reconstruction error can be expressed as the product of the system matrix condition number and the inverse of the measurement system signal-to-noise ratio (SNR) [25,32]. In this section, we will derive a similar expression by making use of the analytical solution of the previous section together with a newly defined BRDF parameter which we call the degree of specularity.
We assume the linear noise model shown in eq. (2-34) as a description of the signal πΏππππ detected by the camera. The measurement noise πππ is assumed to be an additive zero-mean wide-sense stationary (WSS) random process [33].
23
πΏππππ (ππ) = β« ππ΅π π·πΉ(ππ, ππ) cos πππΏπππ(ππ)πππ
π π
+ πππ
Since the process is WSS, the noise can be characterized by random variables π΄π that are the Fourier coefficients (or more generally the Karhunen-LoΓ©ve coefficients [33,35]) of πππ with respect to eq. (2-7) and (2-8). When taking the measurement noise into consideration, eq. (2-11) becomes
πΏππππ (ππ) β πππ= β {ππβ π΄π}ππ(ππ)
β
π=ββ
Therefore, an analytical solution considering the measurement noise, denoted πΜπππππ π, can be obtained in eq. (2-35) simply by replacing ππ with ππβ π΄π from eq. (2-33), where πΜπ is an error caused by the noise. shown below, where the pth coefficient of the power spectral density of the measurement noise is denoted as πππ·π
(note: for definition, see eq. (a15) in appendix A7. In addition, πΈ[π΄π] = 0 and πΈ [|π΄π|2] = πππ·π from
24
By substituting the above equation, eq. (2-32) and (2-33) into eq. (2-36), the relative noise error can be obtained as:
where the constant πΎ is defined in eq. (2-38) as the degree of specularity:
πΎ β‘
The approximation in eq. (2-38) comes from the fact that value of the second term in the numerator can often be approximated as near zero (for further details, see appendix A2). The approximated πΎ is totally determined by the BRDF, and has the following properties; it becomes 1 for perfectly specular BRDF i.e. π·π = 0 for all π, and becomes 0 for perfectly diffusive BRDF i.e. ππ,π= 0 for all π. In section 2.6, it will be experimentally shown that eq. (2-38) can accurately represent the characteristic of BRDFs from a variety of scattering surfaces.
If the noise πππ is white and Poissonian and the mean of the measured signal is sufficiently large, its statistics can be approximated by a white-Gaussian noise with a standard deviation which is a function (generally, the square root) of the signal mean. By defining the standard deviation of πππ as π(π0), where π0 (average value of πΏππππ from eq. (2-11)) represents the signal mean, we can make the following approximation:
βπππ·πβ π(π0). (Note: For satisfying Parsevalβs law, π(π0) needs to be divided by βπ when an π-point DFT is taken for computation.)
D. Intuitive understanding of the reconstruction error
An inspection of eq. (2-37) reveals the following;
(2-37)
(2-38)
25
1) The first term |π0,0βππ,π| in eq. (2-37) can be interpreted as the condition number [25,32] of a matrix obtained by totally removing the diffuse component from the system matrix [π΅], assuming ππ,π is a
monotonically decreasing function of π and truncating the matrix to include all terms less than or equal to π. This matches our intuition that BRDFs having sharper and/or narrower specular peaks will retain higher frequency components in the object reconstruction. By contrast, BRDFs having more rounded and/or wider specular peaks will lose higher object frequency information. A similar effect was noted in fig. 2-7. In addition, the above-mentioned specular width is measured relative to the domain range [π, π]. For example, as the range becomes wider, the relative specular width can be seen to be narrower.
2) Instead of the more common definition of the signal-to-noise ratio (or SNR) in eq. (2-39), the second term in eq. (2-37) containing πΎ acts as the effective signal-to-noise ratio. We define this in eq. (2-40) as πππ πππ.
πππ = π0
βπππ·π
πππ πππβ‘ π0πΎ
βπππ·π
The numerator π0πΎ represents the part of πΏππππ originating from the specular component of the BRDF.
This indicates that BRDFs having larger diffuse components result in smaller values of πΎ and πππ πππ, making the object reconstruction more difficult. In the case of a perfectly diffusive BRDF (πΎ = 0), the πππ πππ becomes zero, meaning no object information can be retrieved. (Note: We defined the diffuse component as a separable function, and in general, any non-separable component of the BRDFs can convey object information.) In addition, the πΎ coefficient depends on the domain range [π, π]. For example, if the measurable range in fig. 2-6 (b) becomes wider, πΎ becomes larger.
(2-39)
(2-40)
26
For categorizing different types of BRDFs from the viewpoint of solving the inverse problem, the two parameters,
|π0,0βππ,π| and πππ πππ, are informative because they directly reflect the reconstruction error which determines the quality of the reconstructed light field πΏπππ.
E. Examples of noise analysis on wavelength and polarization
Observing the scene with longer wavelength light, such as infrared and terahertz wavelengths, can sometimes lead to a BRDF with a narrower specular peak [37]. This reduces |π0,0βππ,π| in eq. (2-37) meaning the error is suppressed and higher frequency object information can be preserved.
Moreover, the specular and diffuse reflections each have polarization properties governed by the characteristics of the scattering surface. The πππ πππ can be taken to determine suitable usage of polarizers for better light field reconstruction. If we assume the simplest situation, such as P-polarized specular reflection and un-polarized Lambertian diffuse, it can be shown that the highest πππ πππ can be achieved simply by using a P-polarizer during measurement. Some imaging systems attempt to remove the diffuse component by subtracting a second measurement with S-polarization from that with P-polarization [38]. However, this procedure does not improve the πππ πππ because the subtraction adds more noise. See appendix A3 for details.
2.5 Two fundamental regularization methods derived from the analytical solution
A. Fourier truncation regularization
As explained in section 2.4.B, the Fourier coefficients ππ,π of the specular component serve as approximate eigenvalues of the system matrix [π΅] in eq. (2-13). Thus, removing small values of ππ,π by truncating the higher frequency components of the system matrix [π΅] can serve as a simple regularizer. Note that
27
this is similar to the commonly known truncation procedure for Fourier series when performing deconvolution (or inverse filtering) and truncation of the SVD when performing general inversions [22,26]. We previously derived an expression for the relative noise error in eq. (2-37). However, this expression is not appropriate for the present purpose since it lacks the error inherent in a truncated Fourier series [29]. Instead, we will derive a general error expression for the mean-squared error (MSE) to include both errors, and then use the first derivative of this MSE with respect to Fourier terms p as a criterion for selecting the best truncation point for regularization.
- Total MSE expressed as a sum of the truncation error and the noise error
πΏπππ is the original object and πΏππππππ is the reconstructed object defined in eq. (2-42) to include both
By expanding eq. (2-41) (see appendix A4 for details), the total MSE can be obtained in eq. (2-43) as a function of the truncation term ππππ₯. The first and second terms represent the truncation error, and the third term represents the measurement noise error.
- The first derivative of MSE as a criterion to determine the best truncation term for regularization (2-41)
(2-42)
(2-43)
28
As can be seen from eq. (2-43) and fig. 2-8 (a), the behavior of the MSE is governed by the two error sources. The truncation error monotonically decreases as ππππ₯ increases and eventually converges to zero, whereas the noise error monotonically increases as ππππ₯ increases and eventually diverges. This indicates that the MSE can have a minimum at a particular value ππππ₯ and the first derivative of the MSE can be used to find the best truncation point. By approximating a derivative as a finite difference and an integration as a discrete summation, the discrete Leibniz integral rule is expressed as shown below, where π is a constant.
π
By applying the above rule to eq. (2-43), the following equation can be obtained (note: the first line assumes |πΜπ| and |πΜπ| are symmetric with respect to π = 0 because they are computed from real functions.):
By substituting eq. (2-33) and (2-35) into the above equation, the analytical first derivative of the MSE can be obtained in eq. (2-44), which is completely determined by the known information. Consequently, the best truncation term can be determined by finding the πΜπππ₯ which makes eq. (2-44) near zero. If the measurement noise is white and approximated as Gaussian with a standard deviation π(π0), βπππ·ππππ₯β π(π0).
Note that eq. (2-43) does not guarantee an unique global minimum and eq. (2-44) may have some oscillations and multiple zeros. In this case, it is reasonable to fit eq. (2-44) with some slow function and take its zero-crossing point as πΜπππ₯.
(2-44)
29
- Relation between the first derivative of MSE and the relative noise error
Let us assume a single delta-function object (i.e. a point source) expressed as πΏπππ(ππ) = πΏ(ππ), which leads to πΜπ= πΜ0 for any π from eq. 33) and (a3) in appendix A1. Noting that the first term of eq. (2-44) is πΜππππ₯, we can replace this term with πΜ0. By considering the relation: π₯2β π¦2= (π₯ β π¦) Γ (π₯ + π¦), we have
|βπππ·πΜπππ₯
ππΜπππ₯,πΜπππ₯| β |π0β πΌπ·0
π0,0 | = 0
By referring to eq. (2-37), the following can be obtained from the above equation:
| π0,0
ππΜπππ₯,πΜπππ₯| |βπππ·πΜπππ₯
π0β πΌπ·0| = 1 = βΜπΜπππ₯
Finally, we see that the Fourier term πΜπππ₯ (the MSE minimizer) corresponds to a Fourier term which makes the relative noise error βΜπ in eq. (2-37) equal to one, as illustrated in fig. 2-8 (b). Therefore, it can be said that the πΜπππ₯ directly reflects the BRDF characteristics, such as ππ,π, πΎ and πππ πππ explained in section 2.4.C and
2.4.D. Although this requires the object to be a delta function, it is reasonable to expect the similar tendency of πΜπππ₯ for more general objects.
Fig. 2-8. (a) Mean-squared error (MSE) as a sum of two error sources. (b) Relation between πΜπππ and βΜπ assuming a single delta object.
30
B. Wiener filter regularization
By assuming a perfectly shift-invariant BRDF, a Wiener filter similar to those applied to image restoration can be derived [2,26,27]. In this work, however, we require a more general form of the Wiener filter that can be applied to the shift-variant BRDF of section 2.4. The derivation of the Wiener filter based on the Bayesian theorem [22,33] and the minimization principle [29] is outlined in appendix A5 ~ A7 for completeness (see also [23,24,25]), and the resulting Wiener filter is shown in eq. (2-45), where π΅π,π β is the complex-conjugate of π΅π,π defined in eq. (2-13) (note: π and π correspond to π and π in eq. (2-13) respectively), and πΜπ is the Fourier coefficient of the Wiener filter estimate πΏΜ . (Definitions of other coefficients: ππππ π from eq. (2-11), π΅π,π from eq. (2-13), ππ ,π from eq. (2-21), π·π from eq. (2-24), πΌ from eq. (2-32), π½0 from eq. (a2) in
If the measurement noise is white and approximated as Gaussian with a standard deviation π(π0), eq. (2-45) is simplified to eq. (2-46). The second term is a regularizer defined as the inverse of πππ π ππππππ in eq.
(2-47). We will use these expressions for the experimental verifications.
β {π΅π,π β β π΅π,ππΜπ
31
2.6 Experimental verification
In this section, the relative noise error and the two regularization methods derived in the previous sections will be experimentally verified with a variety of scattering materials.
A. Experimental setup and BRDF measurements
Fig. 2-9. (a) Experimental set-up (top view). (b) One-dimensional radiometric intensity from LCD used as π³πππ. (c) and (d) Actual photographs of the set-up in the enclosure.
Fig. 2-9 (a) shows a schematic of the experimental set-up which reproduces the imaging scenario of fig.
1-1 assuming one-dimensional objects, and fig. 2-9 (c) and (d) are actual photographs of the set-up. It consists of shot-noise limited CMOS camera in the visible wavelength (Photometrics Prime) mounted on a computer controlled angular stage, and an LCD monitor. Though the methods and analyses of the previous sections are applicable to the entire light field, we measure only the angular component from a single spatial location in the
32
current experiment since it is sufficient for verifying the above conclusions. For this reason, the camera captures one single point on the scattering surface π¦π = ππ= 0 (restricted by eq. (2-4)) from a variety of different camera angles (29 ~ 61 degree range with 0.5 degree resolution) resulting in the measured radiometric intensity πΏππππ (ππ). In addition, rather than use real objects as shown in fig. 1-1, we employ a well-calibrated LCD monitor to generate the object light field distribution πΏπππ(ππ) (shown in fig. 2-9 (b)) appropriate for the point on the scattering surface restricted by eq. (2-5). For suppressing ambient and stray light, the enclosure is designed with an interior entirely covered with black felt, and baffles are employed to reduce unwanted reflections.
Since our experiments are designed for one-dimensional objects, the measurement of the BRDF is greatly simplified. The measurement consists of illuminating the scattering surface with a single narrow slit of light from the LCD screen and recording the scattered light with the camera as a function of output angle of the BRDF ππ. This measurement is repeated many times, each for a different slit location corresponding to various input angles of the BRDF ππ. Fig. 2-10 shows measured BRDFs at three input angles with their corresponding degree of specularity πΎ as defined by eq. (2-38) of four scattering materials.
Fig. 2-10 (c) shows how the measurements are fit to the BRDF model of eq. (2-14) to compute our analytical expressions. The window function ππ€πππ is sufficiently slowly varying in all samples that it can be ignored in the analytical expressions and the regularization methods derived in the previous sections. The shift-invariant specular component ππ πππ is approximated as the area above the dotted line connecting both edges of the solid curve having a peak around the center of the angular domain, and the area below the dotted line represents the diffuse component. As the simplest expression, the separable function of the diffuse component is approximated as πππππ(ππ) Γ πππππ(ππ) β (π1ππ+
π2) Γ (π1ππ+ π2), and the constants π1 and π2 are determined so the expression can fit the dotted line in a least square sense. In addition, the camera's measurement noise was previously measured, and its standard deviation was obtained as a function of signal mean.
33
Fig. 2-10. Experimentally measured BRDF samples: (a) Brushed metal, (b) coated white-paper with highlights #1, (c) coated white-paper with highlights #2, and (d) white paint with a pearl finish applied to glass.
B. Experimental verification of the relative noise error
The relative noise error βπ defined in eq. (2-48) (see eq. (2-10) for ππ) was computed from experimentally measured data through the following steps; 1) a sinusoidal intensity pattern of the pth Fourier component on a bias was reproduced on the LCD screen, 2) the camera data πΏππππ (including measurement noise) was recorded as a function of camera angle, and 3) eq. (2-13) was then solved for ππππππ π with a truncated system matrix [π΅] that included only Fourier terms from zero to π. This procedure was repeated 500 times to generate the standard deviation statistic for ππππππ π and eq. (2-48) was computed for a specific value of p.
βπβ‘
βπππ[ππππππ π]
|π0| , where ππππππ πβ‘ ππ+ ππππ π πππππ (2-48)
34
Fig 2-11 shows a comparison between the experimentally obtained values of βπ from eq. (2-48) and the theoretically calculated values of βΜπ from eq. (2-37) for the four different BRDF samples in fig. 2-10. (Note:
The theoretical βΜπ can be directly compared with the experimental βπ, because βπβ βΜπ from eq. (a3) in appendix A1.) Good agreements can be seen in all of the samples, which shows that the analytical BRDF model
The theoretical βΜπ can be directly compared with the experimental βπ, because βπβ βΜπ from eq. (a3) in appendix A1.) Good agreements can be seen in all of the samples, which shows that the analytical BRDF model