• No results found

An important consideration for any study involving the fitting of multiple candidate models is the selection of a ‘best fit’ model. For example, pure S´ersic systems can be erroneously classified as possessing a disk due to weak ripples in their surface brightness profiles. Thus, it is necessary to apply a statistical test to identify true 2-component galaxies (e.g. an F-test, as performed by Simard et al., 2011). In this thesis, I use the Bayesian Information Criterion (BIC, Schwarz, 1978) to remove galaxies which would be overfit by a 2-component (S´ersic +

exponential) model. BIC modifies the standardχ2assessment of goodness-of-fit to penalise

the addition of unnecessary free parameters. Thus, this statistic can be used to identify galaxies for which the addition of a (disk) component does not significantly improve the model fit. The general form of BIC is

BIC = χ2+ k· ln(n) (3.1)

wherek is the number of model free parameters, and n is the number of independent data points. When comparing two fitting models, A and B, the fit which results in the lowest BIC is considered the preferred model. Thus, if∆BIC = BICA− BICB > 0, model B provides

a better fit than model A, regardless of any difference in number of free parameters.

A BIC test is selected over the similar Akaike Information Criterion (AIC, Akaike, 1974) as it more strongly penalises unnecessary model parameters. Thus, the BIC is a more thor- ough statistic for the identification of overfit models (e.g. bulge + disk fits to 1-component galaxies). As this work aims to provide a clean sample of 2-component galaxies, ‘false- negative’ detections of overfitting (i.e. 2-component galaxies categorised as 1-component) is preferable to ‘false positive’ detections (i.e. 1-component galaxies categorised as 2- component). Furthermore, in contrast to an F-test, the BIC allows ‘non-nested’ model com- parison even where simpler models cannot be expressed in the form of more complex models (i.e. comparison of boxy S´ersic and S´ersic + exponential models).

Independence of data points is a key assumption of the BIC, however the individual pix- els in the image thumbnails cannot be considered statistically independent. Instead, model selection must be evaluated from all independent resolution elements. Following the pre- scription of Simard et al. (2011), the number of pixels,npix, in Equation 3.1 is substituted for

the number of resolution elements,nres = npix/Npsf, whereNpsf is the size of the resolution

element in pixels. As a practical method of determining the resolution element size,Npsf is

calculated as the area within the psf half-width at half maximum (fwhm/2, as determined via fitting).

For consistency, the fittingχ2 must also be evaluated across independent resolution ele-

ments. However, the identification of which pixels contribute to each resolution element is non-trivial. Instead, we approximate the resolution-chi-squared as,χ2

approach is equivalent to evaluation ofχ2

res via summation of the average contributions to

χ2of each resolution element. The resolution-modified BIC,BIC

res, is thus: BICres= χ2 Npsf + k· ln npix Npsf  . (3.2)

Since measurement ofNpsf has an associated error (. 1%, determined from the individ-

ual star images used to produce the master psf), the use ofBICresintroduces uncertainty to

the value of∆BIC. This error, σres, is estimated via Monte Carlo simulation. Thus, when

comparing models we select the simpler model unless∆BICres > 3σres. This is illustrated in

Figure 3.3, where the distribution of∆BICresis plotted inσresbins. The majority of galaxies

possess a∆BICressignificantly higher than 0, with a long tail towards negative values. Set-

ting the acceptance limit for 2-component galaxies at3σresthus only affects a small minority

of galaxies, but ensures an analysis sample uncontaminated by misclassified 1-component galaxies.

Model selection tests via BICres and pixel-wise BIC (BICpix; i.e. Equation 3.1) were

compared to visual model selection via inspection of 1D surface brightness profiles. For the majority (∼83%) of (uncontaminated, symmetric) galaxies, all three tests select the same ‘best-fit’ model (e.g. upper panel of Figure 3.4). Where the test results do not agree, BICpix

selects models which are overfit (relative to the visually-selected ‘best-fit’; e.g. middle panel of Figure 3.4), while BICresselects an underfit model (∼10% and ∼ 3% of cases respectively;

e.g. lower panel of Figure 3.4). The remaining ∼4% of cases are ambiguous.

As identification of a clean sample of well-fit 2-component (bulge + disk) galaxies is the primary goal of model selection in this work, the incorrect rejection of a small number of bulge + disk galaxies is preferable to contamination by overfit 1-component galaxies. Thus, I use BICresto distinguish between the goodness-of-fit of the S´ersic (k = 7), S´ersic +

exponential (k = 11), and and boxy S´ersic (k = 8) models in this work.

A 1D BIC test is also used during multi-band fitting to select between component gradient models (see Section 3.7). Radial colour profiles (for the image and models) are measured in elliptical annuli (in theu and i bands) within the unmasked target ellipse. BIC is evaluated as per Equation 3.1, wheren is the number of annuli, for k = 8 (no gradient), k = 10 (×2; disk or bulge gradient), andk = 12 (bulge and disk gradients).

Figure 3.4: 1D surface brightness profiles and galaxy thumbnails (both g band) for three example galaxies. Upper: SDSS DR8 ObjID 1237667323261026354, for which BICres,

BICpix, and visual classification all select a 2-component best fit. Middle: SDSS DR8 Ob-

jID 1237667444585857184, incorrectly assigned a 2-component best fit by BICpix (over-

fit). Lower: SDSS DR8 ObjID 1237665440442155197, incorrectly assigned a 1-component best fit by BICres(underfit). Black data points indicate the galaxy surface brightnesses mea-

sured in elliptical annuli. The bulge/disk/total model surface brightnesses from the best 2-component fit are included as red/blue/black lines respectively. Values for∆BICpix and

∆BICres(S´ersic model − S´ersic + exponential model) are indicated for each example. A red

scale bar corresponding to1000