Bayesian Model Selection - Structural Dissonance in Galaxy Decomposition: The red sequence and

An important consideration for any study involving the fitting of multiple candidate models is the selection of a ‘best fit’ model. For example, pure S´ersic systems can be erroneously classified as possessing a disk due to weak ripples in their surface brightness profiles. Thus, it is necessary to apply a statistical test to identify true 2-component galaxies (e.g. an F-test, as performed by Simard et al., 2011). In this thesis, I use the Bayesian Information Criterion (BIC, Schwarz, 1978) to remove galaxies which would be overfit by a 2-component (S´ersic +

exponential) model. BIC modifies the standardχ2_{assessment of goodness-of-fit to penalise}

the addition of unnecessary free parameters. Thus, this statistic can be used to identify galaxies for which the addition of a (disk) component does not significantly improve the model fit. The general form of BIC is

BIC = χ2_{+ k· ln(n)} _(3.1)

wherek is the number of model free parameters, and n is the number of independent data points. When comparing two fitting models, A and B, the fit which results in the lowest BIC is considered the preferred model. Thus, if∆BIC = BICA− BICB > 0, model B provides

a better fit than model A, regardless of any difference in number of free parameters.

A BIC test is selected over the similar Akaike Information Criterion (AIC, Akaike, 1974) as it more strongly penalises unnecessary model parameters. Thus, the BIC is a more thor- ough statistic for the identification of overfit models (e.g. bulge + disk fits to 1-component galaxies). As this work aims to provide a clean sample of 2-component galaxies, ‘false- negative’ detections of overfitting (i.e. 2-component galaxies categorised as 1-component) is preferable to ‘false positive’ detections (i.e. 1-component galaxies categorised as 2- component). Furthermore, in contrast to an F-test, the BIC allows ‘non-nested’ model comparison even where simpler models cannot be expressed in the form of more complex models (i.e. comparison of boxy S´ersic and S´ersic + exponential models).

Independence of data points is a key assumption of the BIC, however the individual pixels in the image thumbnails cannot be considered statistically independent. Instead, model selection must be evaluated from all independent resolution elements. Following the pre- scription of Simard et al. (2011), the number of pixels,npix, in Equation 3.1 is substituted for

the number of resolution elements,nres = npix/Npsf, whereNpsf is the size of the resolution

element in pixels. As a practical method of determining the resolution element size,Npsf is

calculated as the area within the psf half-width at half maximum (fwhm/2, as determined via fitting).

For consistency, the fittingχ2 _{must also be evaluated across independent resolution ele-}

ments. However, the identification of which pixels contribute to each resolution element is non-trivial. Instead, we approximate the resolution-chi-squared as,χ2

approach is equivalent to evaluation ofχ2

res via summation of the average contributions to

χ2_{of each resolution element. The resolution-modified BIC,}_BIC

res, is thus: BICres= χ2 Npsf + k· ln npix Npsf . (3.2)

Since measurement ofNpsf has an associated error (. 1%, determined from the individ-

ual star images used to produce the master psf), the use ofBICresintroduces uncertainty to

the value of∆BIC. This error, σres, is estimated via Monte Carlo simulation. Thus, when

comparing models we select the simpler model unless∆BICres > 3σres. This is illustrated in

Figure 3.3, where the distribution of∆BICresis plotted inσresbins. The majority of galaxies

possess a∆BICressignificantly higher than 0, with a long tail towards negative values. Set-

ting the acceptance limit for 2-component galaxies at3σresthus only affects a small minority

of galaxies, but ensures an analysis sample uncontaminated by misclassified 1-component galaxies.

Model selection tests via BICres and pixel-wise BIC (BICpix; i.e. Equation 3.1) were

compared to visual model selection via inspection of 1D surface brightness profiles. For the majority (∼83%) of (uncontaminated, symmetric) galaxies, all three tests select the same ‘best-fit’ model (e.g. upper panel of Figure 3.4). Where the test results do not agree, BICpix

selects models which are overfit (relative to the visually-selected ‘best-fit’; e.g. middle panel of Figure 3.4), while BICresselects an underfit model (∼10% and ∼ 3% of cases respectively;

e.g. lower panel of Figure 3.4). The remaining ∼4% of cases are ambiguous.

As identification of a clean sample of well-fit 2-component (bulge + disk) galaxies is the primary goal of model selection in this work, the incorrect rejection of a small number of bulge + disk galaxies is preferable to contamination by overfit 1-component galaxies. Thus, I use BICresto distinguish between the goodness-of-fit of the S´ersic (k = 7), S´ersic +

exponential (k = 11), and and boxy S´ersic (k = 8) models in this work.

A 1D BIC test is also used during multi-band fitting to select between component gradient models (see Section 3.7). Radial colour profiles (for the image and models) are measured in elliptical annuli (in theu and i bands) within the unmasked target ellipse. BIC is evaluated as per Equation 3.1, wheren is the number of annuli, for k = 8 (no gradient), k = 10 (×2; disk or bulge gradient), andk = 12 (bulge and disk gradients).

Figure 3.4: 1D surface brightness profiles and galaxy thumbnails (both g band) for three example galaxies. Upper: SDSS DR8 ObjID 1237667323261026354, for which BICres,

BICpix, and visual classification all select a 2-component best fit. Middle: SDSS DR8 Ob-

jID 1237667444585857184, incorrectly assigned a 2-component best fit by BICpix (over-

fit). Lower: SDSS DR8 ObjID 1237665440442155197, incorrectly assigned a 1-component best fit by BICres(underfit). Black data points indicate the galaxy surface brightnesses mea-

sured in elliptical annuli. The bulge/disk/total model surface brightnesses from the best 2-component fit are included as red/blue/black lines respectively. Values for∆BICpix and

∆BICres(S´ersic model − S´ersic + exponential model) are indicated for each example. A red

scale bar corresponding to1000

In document Structural Dissonance in Galaxy Decomposition: The red sequence and evolutionary pathways in the Coma Cluster (Page 72-76)