2.4 Typing techniques
2.4.5 Multilocus sequence typing
MLST was first developed in 1991, and the technique uses comparative DNA sequencing of conserved housekeeping genes to characterise organisms (Maiden et al. 1998). House- keeping genes are essential in the process of cellular metabolism of any life form. They are present in the core genome of all strains and encode proteins that are under stable selection for conservation of metabolic function (Maiden et al. 1998). In MLST, stretches of nucleotide sequences of approximately 400 – 600 bp from seven loci from a complete genome are chosen for analysis. The length of nucleotide sequences are chosen to give a reliable single run on automated sequencing instruments. MLST employs a universal nomenclature scheme for storing and interpreting nucleotide sequence data. Each allele
fragment is assigned a unique number in the order of discovery. For example aspA-1
would be the first unique MLST allele identified for aspA locus (Maiden 2006). For
each locus, distinct allelic sequences are assigned with allelic numbers and each isolate is therefore designated with seven numbers constituting an allelic profile which, in turn, is
2.4 Typing techniques 35
given a sequence type (ST) or genotypic number. The isolates that share at least four alle- les in common are grouped under a common central genotype, referred to as the founder ST or the known central ancestor, the clonal complex genotype.
MLST is reported to provide discrimination equivalent to 15 to 20 loci as examined by
other techniques, such as MLEE (Dingle et al. 2005). An MLST system for C. jejuni
was developed by Dingle et al. (2001) and is increasingly used in epidemiological studies
(Manning et al. 2003, Clark et al. 2005) and population structure analysis ofCampylobac-
terspp. The housekeeping genes forCampyloabcterspp. were chosen based on criteria
such as chromosomal location, where the minimum distance was 70 kb between each gene, suitability to primer design and sequence diversity in the pilot studies employed by Dingle et al. (2001). A key advantage of MLST is that it can be used for population ge- netic studies as well as a typing tool for molecular epidemiological investigations (Maiden 2006).
Table 2.3 shows the housekeeping genes that were selected from the whole genome se-
quence available in the Genbank database (Parkhill et al. 2000) for theC. jejuni MLST
scheme. Figure 2.2 shows the location of the seven housekeeping genes used for MLST
on a circular genome ofC. jejuni. The genome size of C. jejuniis 1.6 mega base pairs
(mbp) and the locations of genes are dispersed around the genome.
Table 2.3:Genes and gene positions used in a MLST typing scheme forC. jejuni.
Genes Name Function Gene positionsa
aspA Aspartase Amino acid metabolism 96074. . .97480
glnA Glutamine synthetase Amino acid metabolism 658331. . .656901
gltA Citrate synthase Tri carboxylic acid cycle 1605251. . .1603983
glyA Serine hydroxy methyl transferase Energy metabolism 367219. . .368463
glmM∗ Phospho glucosamine mutase Amino acid metabolism 327143. . .328480
tkt Transketolase Energy metabolism 1569190. . .1571088
atpA/uncA ATP synthase a subunit Energy metabolism 111488. . .112993
aAdapted fromC. jejuniNCTC 11168 (Parkhill et al. 2000).
Figure 2.2:Circular genome ofC. jejunithat shows the location of seven housekeeping genes on the chromosome
2.4 Typing techniques 37
The detection of outbreaks of gastrointestinal disease caused byC. jejunihas been facili-
tated by the use of MLST. Several reports are available on the use of MLST as an investi- gation tool (Dingle et al. 2002, Sails et al. 2003b, Urwin & Maiden 2003, McTavish et al. 2008). An important component of the MLST approach is the availability of databases (e.g. PubMLST) for use by public health and research communities. In turn researchers can submit the results of their findings to these databases (Maiden 2006). The following
reports are examples that have utilisedC. jejuniMLST for epidemiological investigation.
Sails et al. (2003b) utilised MLST, PFGE and flagellin A gene typing in the investigation of a human campylobacteriosis outbreak in the United Sates of America. They investi- gated 47 isolates from 12 different outbreaks and analysed the STs that were associated with more than one outbreak. Similarly, MLST has been used to resolve a controversy re- lated to the source of infection in Lancashire, England (Wilson et al. 2008). In this study
Wilson et al. (2008) compared 1,145 animal and environmental C. jejuni isolates with
1,231 human C. jejuni isolates. They found that 20% of the animal STs accounted for
80% of human disease with ST-21 and ST-61 being the most frequent isolated genotypes. Sheppard et al. (2009) surveyed 5,247 clinical isolates from 28 diagnostic laboratories in 15 health boards in Scotland, from July 2005 to September 2006 using MLST to deter- mine sources of human infection. The authors carried out a population structure analysis and found that ST-257 and ST-61 were more common in chickens and cattle, respectively. In New Zealand, French (2008) compared the epidemiology of ruminant and poultry as-
sociated human cases of C. jejuni by performing a case-case comparison. Spatial and
temporal attributes were compared among the two case groups, that is humans infected
with poultry isolatesversushumans infected with cattle isolated. In this study of 56 STs
from 521 human samples they found ST-474 to be the dominant strain which was strongly associated with poultry. Similarly, Mullner et al. (2009) combined MLST and a modified Hald mathematical model to quantify the relative contribution of potential sources (poul- try, cattle, sheep and environmental water) to human campybacteriosis identified in the Manawatu region of New Zealand. They inferred that the ruminant associated cases were most likely from environmental and occupational exposures rather than foodborne expo- sures (Mullner et al. 2009).
McCarthy & Giesecke (2001), McCarthy et al. (2007) employed MLST to analyse the
and found that host association is stronger than temporal or geographic effects. Similarly,
the natural populations ofC. jejuniwere examined by French et al. (2005) in a farmland
ecosystem in the United Kingdom. They examined 172 isolates, and obtained 65 different sequence types (ST). There was an over representation of the ST-61 complex in the cattle isolates and the isolates from wildlife and water mostly belonged to the ST-45 complex.
A cross-sectional study ofC. jejunipopulations in the same 100 km2area in Cheshire in
the United Kingdom, characterised 327C. jejuniisolates from cattle, wildlife and envi-
ronmental sources using MLST (Kwan et al. 2008). Kwan et al. (2008) identified 91 STs and 18 clonal complexes (CC), with most of them belonging to ST-21, ST-45 and ST-61 complexes. These CCs have been shown to be frequently associated with human disease worldwide (Kwan et al. 2008, French et al. 2009a). In addition, Kwan et al. (2008) found that ST-21 and ST-61 were significantly associated with cattle and ST-45, ST-952 and ST-677 were associated with wild birds, wild rabbits and environmental water.
The population structure ofC. jejuni in wild birds was studied by Colles et al. (2008) in
Oxfordshire in the United Kingdom. This study was carried out between August 2002 and February 2003 within a mixed population of geese, starlings, lambs and free range chick- ens in the same farm ecosystem. A total of 331 faecal samples from geese, 954 samples from starlings and 975 samples from free range chickens were collected and compared
with 540C. jejuni human clinical isolates using MLST. Colles et al. (2008) found that
CC-21 and CC-45 were the most abundant clonal complexes present in geese. These CCs were also shared by starlings and free range chickens. On a clonal frame tree (Prim 1957), isolates from geese and starlings formed separate clusters, and isolates from geese and chickens were closely related. This, in turn, was interpreted by the authors to mean that geese were a source of infection for poultry. In addition Colles et al. (2008) found that the geese genotypes were not monopyletic (not phylogenetically isolated) and shared common ancestors with genotypes from chickens and starlings. The authors, in addition, suggested that sequence data from more loci would need to be examined to provide a greater level of discrimination. Another serial cross-sectional survey was carried out in a wild bird population comprised of 2,084 individual birds (Hughes et al. 2009). Hughes et al. (2009) found that wild birds can carry both livestock and poultry associated geno- types (ST-42, ST-48 and ST-45) as well as novel genotypes (ST-3001, ST-3002, ST-3003,
2.4 Typing techniques 39
STs of wild birds in the livestock included in this study, Hughes et al. (2009) suggested that the direction of infection is from livestock to wild birds. In addition, the identifica-
tion of unique STs in wild birds in this study was indicative of genetic recombinationin
vivo(Hughes et al. 2009). A longitudinal study was conducted in New Zealand to assess
the potential risk of wild bird faecal contamination in children play areas using MLST (French et al. 2009a). This study was conducted between November 2004 and February 2005 where, one-half of the isolates recovered from wild bird faecal material belonged to ST-45, a genotype associated with many species of animals and human disease. The authors raised a possibility that the genotypes ST-177 and ST-682 isolated in this study might have originated from European birds during their introduction to New Zealand in the 19th century from the United Kingdom.