• No results found

3.3 Summary

5.2.2 Contribution of sbcC to genome fidelity and fitness

5.2.2.4 Mutation analysis

5.2.2.4.2 Mutations of interest

Amongst the sequenced lineages, there were instances of the same mutations being detected in independent lineages. Such instances of parallelism suggest that either these mutations confer a selective advantage or serve as a mutation hotspot. Examples of such mutations include BPS, indels and GCRs, as described below.

5.2.2.4.2.1REL4536 ΔsbcC::cat-1K-1 and REL4536 ΔsbcC::cat-1K-8

REL4536 ΔsbcC::cat-1K-1 and REL4536 ΔsbcC::cat-1K-8, for instance, exhibited two instances of parallelism; a roughly 33 kb deletion and a 5 bp indel was detected in clones from both lineages. As these lineages had always been maintained on separate plates during the course of evolution, we are confident that these mutations arose independently in the lineages (Table A.7). It seems likely that the 33 kb deletion was

mediated via homologous recombination of IS3 elements flanking the region. A total of

30 genes were deleted and are listed in Table A.8. Many of the deleted genes encode

hypothetical proteins of unknown function and are likely to be associated with cryptic prophage P22 activity. Genes involved in transcription regulation were also deleted. For example, appY, which encodes the transcriptional regulator AppY, is induced by the

global regulator ArcAB. In turn, ArcAB induces the expression of genes involved in anaerobic energy metabolism, survival at stationary phase and phosphate starvation (294). Another deleted gene of interest is envY, which encodes the DNA-binding

transcriptional regulator EnvY. EnvY is involved in the regulation of genes encoding cellular envelope proteins that are required for survival at low temperatures and during stationary phase (295). Loss of this region of the genome has previously been reported (296) and was also observed in E. coli REL4536 clones evolved under anaerobic

conditions for 2,000 generations (205) by Finn (2015). However, the selective advantage conferred by this deletion is not immediately apparent.

The 5 bp indel, found in both the REL4536 ΔsbcC::cat-1K-1 and REL4536 ΔsbcC::cat- 1K-8 sequenced clones, restored the original coding frame of the pseudogene dcuS.

161 DcuS, along with DcuR, is part of a two component regulatory system; DcuS is a dicarboxylate-sensing histidine kinase that reports the external dicarboxylate concentration to DcuR, while DcuR regulates genes involved in the anaerobic fumarate respiratory system (297). Hence, it is possible that this mutation may have reactivated an anaerobic respiratory pathway (297) and subsequently, been selected for. The same mutation was also observed in all sequenced E. coli REL4536 clones evolved under

anaerobic conditions for 2,000 generations by Finn (2015) (205).

5.2.2.4.2.2The adhE gene

Amongst the 10 sequenced clones, four different non-synonymous mutations in adhE

were detected (Table 5.6). The adhE gene encodes AdhE, which can function as an

alcohol dehydrogenase and a coenzyme A-dependent acetaldehyde dehydrogenase. Under conditions where fermentation is possible, AdhE can catalyse the reduction of acetyl-CoA to ethanol. Alternatively, during aerobic growth, AdhE can catalyse the oxidation of acetaldehyde to acetyl-CoA (298). In summary, AdhE is involved in the maintenance of the redox balance under anaerobic conditions and previous studies have reported that E. coli cells were not able to grow under anaerobic growth conditions

without the gene. The same study also observed that additional mutations in the pta

gene rescued the cells and allowed them to grow (298).

Table 5.6. Details of adhE mutations detected in this study.

Lineage Reference

Position* Change Mutation type Amino Acid Change

REL4536 ΔsbcC::cat-1K-12 1,439,509 A Æ G Transition Glu (568) Gly REL4536 ΔsbcC::cat-1K-14 1,439,821 A Æ C Transversion Asp (672) Ala REL4536 AN-1K-3 1,438,030 A Æ G Transition Tyr (75) Cys REL4536 AN-1K-4 1,439,673 G Æ A Transition Ala (623) Thr REL4536 AN-1K-6 1,438,030 A Æ G Transition Tyr (75) Cys

*Position of mutation on reference

E. coli REL4536 genome.

As different mutations in adhE were found in independent lineages, with or without an

active sbcC gene, it is likely that mutations in adhE generally confer a selective

advantage to growth under anaerobic conditions. Lending support to this theory are findings by Finn (2015), where BPSs in adhE were also observed in all sequenced

162

E. coli REL4536 clones evolved under anaerobic conditions for 2,000 generations

(205). All of the BPSs listed in Table 5.6 resulted in the substitution of an amino acid

residue for another, and so, were likely to confer a conformational change of the protein. Thus, it is possible that AdhE protein function was disrupted, which may result in the diversion of the fermentation pathway to increased succinate or acetate production to serve as additional energy sources for growth (299, 300).

5.2.2.4.2.3IS insertions

Other instances of the same mutations being detected in independent lineages involved IS elements. For example, IS150 insertion was detected in cycA at the same position

(4,381,583 in the REL4536 reference genome) in both REL4536 ΔsbcC::cat-1K-8 and REL4536 ΔsbcC::cat-1K-14 clones (Table A.7). Encoded by cycA¸ the CycA

transporter is involved in the uptake of glycine, serine and alanine. It has previously been demonstrated that cells with inactivated cycA exhibited decreased transport of the

three amino acids and were resistant to d-cycloserine inhibition (301). While it is possible that these mutations could have derived from cross contamination between lineages, as both REL4536 ΔsbcC::cat-1K-8 and REL4536 ΔsbcC::cat-1K-14 lineages had been maintained on the same plates, IS150 insertion was found at exactly the same

position in sequenced REL4536 AN clones (for lineages 1, 3, 4 and 6). Therefore, it is probable that this position is a hotspot for IS150 insertion, which may aid in adaptation

to either the anaerobic environment or the 24-well growth vessels.

IS150 insertion was also detected in ydiQ at the same position (974,185 in the REL4536

reference genome) in the REL4536 ΔsbcC::cat-1K-6, REL4536 ΔsbcC::cat-1K-7 and REL4536 ΔsbcC::cat-1K-14 clones (Table A.7), suggesting the possibility of this

position being another hotspot for IS150 insertion. YdiQ, encoded for by ydiQ, is a

putative electron transport flavoprotein subunit that may be involved in the electron transport chain between the fatty acid oxidation and respiration pathways under anaerobic conditions (302). Additionally, IS150 insertion was also detected in ydiU at

the same position (963,716 in the REL4536 reference genome) in the REL4536

ΔsbcC::cat-1K-1, REL4536 ΔsbcC::cat-1K-6, REL4536 ΔsbcC::cat-1K-7 and

REL4536 ΔsbcC::cat-1K-14 clones, suggesting this position may be another hotspot for IS150 insertion. YdiU, encoded for by ydiU, is a conserved protein with unknown

163 function. However, the advantages of inactivating both ydiQ and ydiU via insertion of

IS elements remain unclear.

Another instance of parallelism was detected in clones from REL4536 ΔsbcC::cat-1K-7 and REL4536 ΔsbcC::cat-1K-14, where an IS150 element was deleted from the pflB

gene, resulting in the reactivation of the gene (Table A.7). The pflB gene encodes

pyruvate formate lyase I, an enzyme involved in the conversion of pyruvate to formate during anaerobic fermentation. Therefore, the presence of a functional pflB gene under

anaerobic conditions should be beneficial and advantageous under anaerobic growth conditions. Moreover, reactivation of the pflB gene was also reported by Finn (2015) for

some sequenced REL4536 clones evolved under anaerobic and fluctuating conditions for 2,000 generations (205). Additionally, a C Æ T transition in the pflB gene at the

same position (1,766,332 in the REL4536 reference genome) was detected in the REL4536 ΔsbcC::cat-1K-14 and REL4536 AN-1K-4 clones. As this mutation was synonymous, and thus did not result in the substitution of amino acids, the beneficial effect of this mutation is not immediately apparent.