University of Warwick institutional repository:
http://go.warwick.ac.uk/wrap
A Thesis Submitted for the Degree of PhD at the University of Warwick
http://go.warwick.ac.uk/wrap/60669
This thesis is made available online and is protected by original copyright.
Please scroll down to view the document itself.
Library Declaration and Deposit Agreement
1.
STUDENT DETAILS
Please complete the following:FXOOQDPH««««««««««««««««««««««««««««««««««« 8QLYHUVLW\,'QXPEHU««««««««««««««««««««««««««««««
2.
THESIS DEPOSIT
2.1 I understand that under my registration at the University, I am required to deposit my thesis with the University in BOTH hard copy and in digital format. The digital version should normally be saved as a single pdf file.
2.2 The hard copy will be housed in the University Library. The digital version will be deposited in the 8QLYHUVLW\¶V,QVWLWXWLRQDO5HSRVLWRU\(WRAP). Unless otherwise indicated (see 2.3 below) this will be made openly accessible on the Internet and will be supplied to the British Library to be made available online via its Electronic Theses Online Service (EThOS) service.
[At present, theVHVVXEPLWWHGIRUD0DVWHU¶VGHJUHHE\5HVHDUFK0$06F//006RU00HG6FLare not being deposited in WRAP and not being made available via EthOS. This may change in future.] 2.3 In exceptional circumstances, the Chair of the Board of Graduate Studies may grant permission for an embargo to be placed on public access to the hard copy thesis for a limited period. It is also possible to apply separately for an embargo on the digital version. (Further information is available in the Guide to Examinations for Higher Degrees by Research.)
2.4 ,I\RXDUHGHSRVLWLQJDWKHVLVIRUD0DVWHU¶VGHJUHHE\5HVHDUFKSOHDVHFRPSOHWHVHFWLRQDEHORZ For all other research degrees, please complete both sections (a) and (b) below:
(a) Hard Copy
I hereby deposit a hard copy of my thesis in the University Library to be made publicly available to readers (please delete as appropriate) EITHER immediately OR after an embargo period of «««... months/years as agreed by the Chair of the Board of Graduate Studies. I agree that my thesis may be photocopied. YES / NO (Please delete as appropriate) (b) Digital Copy
I hereby deposit a digital copy of my thesis to be held in WRAP and made available via EThOS. Please choose one of the following options:
EITHER My thesis can be made publicly available online. YES / NO (Please delete as appropriate) OR My thesis can be made publicly available only after«>date] (Please give date)
YES / NO (Please delete as appropriate) OR My full thesis cannot be made publicly available online but I am submitting a separately identified additional, abridged version that can be made available online.
YES / NO (Please delete as appropriate) OR My thesis cannot be made publicly available online. YES / NO (Please delete as appropriate)
Aleksey Jironkin
3.
GRANTING OF NON-EXCLUSIVE RIGHTS
Whether I deposit my Work personally or through an assistant or other agent, I agree to the following: Rights granted to the University of Warwick and the British Library and the user of the thesis through this agreement are non-exclusive. I retain all rights in the thesis in its present version or future versions. I agree that the institutional repository administrators and the British Library or their agents may, without changing content, digitise and migrate the thesis to any medium or format for the purpose of future preservation and accessibility.
4.
DECLARATIONS
(a) I DECLARE THAT:I am the author and owner of the copyright in the thesis and/or I have the authority of the authors and owners of the copyright in the thesis to make this agreement. Reproduction of any part of this thesis for teaching or in academic or other forms of publication is subject to the normal limitations on the use of copyrighted materials and to the proper and full acknowledgement of its source.
The digital version of the thesis I am supplying is the same version as the final, hard-bound copy submitted in completion of my degree, once any minor corrections have been completed.
I have exercised reasonable care to ensure that the thesis is original, and does not to the best of my knowledge break any UK law or other Intellectual Property Right, or contain any confidential material.
I understand that, through the medium of the Internet, files will be available to automated agents, and may be searched and copied by, for example, text mining and plagiarism detection software.
(b) IF I HAVE AGREED (in Section 2 above) TO MAKE MY THESIS PUBLICLY AVAILABLE DIGITALLY, I ALSO DECLARE THAT:
I grant the University of Warwick and the British Library a licence to make available on the Internet the thesis in digitised format through the Institutional Repository and through the British Library via the EThOS service.
If my thesis does include any substantial subsidiary material owned by third-party copyright holders, I have sought and obtained permission to include it in any version of my thesis available in digital format and that this permission encompasses the rights that I have granted to the University of Warwick and to the British Library.
5.
LEGAL INFRINGEMENTS
I understand that neither the University of Warwick nor the British Library have any obligation to take legal action on behalf of myself, or other rights holders, in the event of infringement of intellectual property rights, breach of contract or of any other right, in the thesis.
Please sign this agreement and return it to the Graduate School Office when you submit your thesis.
Computational And Experimental Analysis Of
Plant Promoters: Identifying Functional Elements
by
Aleksey Jironkin
Thesis
Submitted to the University of Warwick
for the degree of
Doctor of Philosophy
Contents
List of Tables
vi
List of Figures
viii
Acknowledgments
xii
Declarations
xiii
Abstract
xiv
Abbreviations
xvi
Chapter 1 Introduction
1
1.1
Motivation
. . . .
2
1.2
Regulation of gene expression . . . .
3
1.2.1
TF families in Arabidopsis
. . . .
7
1.3
Gene regulatory networks . . . .
10
1.3.1
Types of gene regulatory networks . . . .
11
1.3.2
Strategies for uncovering gene regulatory networks . . . .
12
1.4
Stress response in Arabidopsis . . . .
14
1.4.1
PAMP Triggered Immunity . . . .
14
1.4.2
Effector Triggered Immunity . . . .
15
1.4.3
The role of hormones in stress response . . . .
16
1.4.4
Hormone crosstalk fine-tunes the defence response in
Ara-bidopsis . . . .
20
1.5
Infection by Botrytis . . . .
22
1.5.1
Botrytis Infection process . . . .
22
1.5.2
Changes in Arabidopsis transcriptome in response to Botrytis
22
1.6
Aims and objectives . . . .
23
Chapter 2 Conserved Noncoding Sequences Highlight Shared
Com-ponents of Regulatory Networks in Dicotyledonous Plants
26
2.1
Introduction . . . .
26
2.1.1
Phylogenetic Footprinting . . . .
27
2.1.2
Selection of Compatible Plant Species for Phylogenetic
Foot-printing . . . .
27
2.1.3
Previous studies . . . .
28
2.2
Methods . . . .
31
2.2.1
Databases . . . .
31
2.2.2
APPLES Framework . . . .
31
2.2.3
Ortholog and Paralog Identification . . . .
31
2.2.4
Sequence Alignments . . . .
32
2.2.5
Converting Alignment Scores to Conservation Scores . . . . .
33
2.2.6
Filtering Out Potential Protein Coding Regions . . . .
35
2.2.7
PSSM Clustering . . . .
37
2.2.8
Motif Overpresentation . . . .
37
2.2.9
GO Term Analysis . . . .
38
2.2.10 Prediction of Nucleosome Positioning
. . . .
38
2.2.11 DNase-Seq Analysis . . . .
39
2.3
Results . . . .
40
2.3.1
Multispecies Analysis Yields Hundreds of CNSs . . . .
40
2.3.2
CNSs Show Positional Bias toward TSSs . . . .
44
2.3.3
CNSs Are Highly Enriched in TFBS Motifs . . . .
46
2.3.4
Identification of Previously Experimentally Validated Promoter
Binding Elements . . . .
48
2.3.5
Prediction of Nucleosome Positioning
. . . .
48
2.3.6
GO Term Overrepresentation Unveils Key Biological and
Molec-ular Functions of Genes Associated with CNSs . . . .
50
2.3.7
Predicted CNSs Occur in Open Chromatin Areas . . . .
51
2.4
Discussion . . . .
53
2.4.1
Neo-/Non-/Sub-functionalisation of Conserved Genes
. . . .
53
2.4.2
Genome Availability and Annotation Quality Impact on CNS
Predictions . . . .
56
2.4.3
CNSs Are Likely To Be Functional Regions Of ncDNA . . . .
57
Chapter 3 Elucidating Functional Elements and Gene Regulatory
Network Using Yeast One-Hybrid Screens
63
3.1
Introduction . . . .
63
3.1.1
Available Experimental Techniques For Probing Protein-DNA
interactions . . . .
64
3.1.2
Biological Context . . . .
68
3.2
Methods . . . .
71
3.2.1
Gene Selection . . . .
71
3.2.2
Yeast One-Hybrid . . . .
71
3.2.3
Image Based Positive Result Inference . . . .
79
3.2.4
pairwise mini-library Y1H screen . . . .
80
3.3
Results . . . .
83
3.3.1
Initial Gene Selection using WIGWAMS tool . . . .
83
3.3.2
Clock Regulated Stress Response . . . .
86
3.4
High-throughput identification of direct protein-DNA interactions
us-ing Y1H library screen . . . .
88
3.4.1
Subscreen of TCP Transcription Factors using Y1H library
screen . . . .
89
3.4.2
Confirmation of Observed Y1H Interactions Using Pairwise
Screens . . . .
90
3.4.3
WRKY TF Subscreen . . . .
94
3.4.4
Summary of the Y1H Screen Results . . . .
97
3.5
Discussion . . . 106
3.5.1
Image Based Method For Identification of Positive Results In
Y1H . . . 106
3.5.2
Y1H Screen Reliability and Reproducibility . . . 110
3.5.3
Importance of Correct 3AT Concentration . . . 111
3.5.4
Combined Y1H Screen Can Uncover Common Regulators . . 118
3.5.5
Y1H Library Screen Can Uncover Subgroups of Co-Regulated
Genes . . . 119
3.5.6
Circadian Clock Timing Stress Response . . . 121
3.6
Conclusions . . . 123
Chapter 4 Computational Approaches To Identify Regulatory
Ele-ments In Arabidopsis
126
4.1
Introduction . . . 126
4.1.2
Verification in plants . . . 128
4.1.3
DNase Assays . . . 129
4.2
Methods . . . 133
4.2.1
Promoter DNA analysis . . . 133
4.2.2
DNase Analysis . . . 133
4.2.3
Plant growth . . . 134
4.2.4
Fungal growth . . . 134
4.2.5
Phenotype Analysis . . . 134
4.2.6
Microarray analysis
. . . 135
4.2.7
Microarray scanning . . . 137
4.2.8
Expression Analysis . . . 137
4.2.9
Gene Ontology (GO) analysis . . . 138
4.2.10 Pathway Analysis with MapMan . . . 138
4.3
Results . . . 139
4.3.1
New TF specific motifs conserved within the promoter
frag-ments of gene tested in Y1H screen . . . 139
4.3.2
De novo
motifs show DNase I footprint in genome-wide
loca-tions in Arabidopsis leaves and buds tissue . . . 142
4.3.3
Mutations of the new binding sites alter protein-DNA
inter-actions of associated TFs . . . 146
4.3.4
Phenotype screen of
erf14,
pif7
and
athb25
KO plants show
increased susceptibility to infection with Botrytis . . . 150
4.3.5
Microarray analysis of
erf14
KO plants reveal new targets of
the TF . . . 151
4.3.6
Reported Y1H interactions for
AtERF14
and
ESE1
are
con-firmed in protoplasts . . . 158
4.4
Discussion . . . 161
4.4.1
Characterisation of novel
cis-acting elements in the promoters
of gene screened in the Y1H experiments
. . . 161
4.4.2
Sequence specificity of AP2 domain proteins . . . 163
4.4.3
DNase I analysis . . . 164
4.4.4
The
de novo
motifs interact with specific TFs . . . 165
4.4.5
Summary of new motif interaction patterns . . . 167
4.4.6
Role of
PIF7
and
AtHB25
in Botrytis infection . . . 176
4.4.7
Role of AtERF14 in Botrytis infection . . . 176
4.4.9
Context of regulation by
AtERF14
and
ESE1
TFs and their
respected motifs . . . 185
4.5
Conclusions . . . 186
Chapter 5 General Discussion
187
Appendix A Conserved Noncoding Sequences Highlight Shared
Com-ponents of Regulatory Networks in Dicotyledonous Plants
191
Appendix B Elucidating Functional Elements and Gene Regulatory
Network Using Yeast One-Hybrid Screens
196
Appendix C Computational Approaches To Identify Regulatory
List of Tables
2.1
Summary of aligned regions and associated genes from orthologous
promoters . . . .
36
2.2
Summary of aligned regions and associated genes from orthologous
promoters . . . .
43
2.3
Expected true positive CNSs . . . .
43
2.4
GO Term analysis of the genes in the CNS set . . . .
51
3.1
Y1H amplification primers . . . .
72
3.2
SOC media . . . .
74
3.3
Cloning summary of fragments that were screened at Warwick
labo-ratory. Green represents successful transformation into the specified
product. . . .
75
3.4
Cloning summary for the first cluster destined for the USA. Green
represents successful transformation into the specified product. . . .
76
3.5
Cloning for the second cluster destined for the USA. Green represents
successful transformation into the specified product. . . .
77
3.6
TRAFCO mix . . . .
82
3.7
Genes in the “JAZ” cluster . . . .
83
3.8
Genes in the “TCP” cluster . . . .
85
3.9
Gene in the “WRKY” cluster . . . .
87
3.10 TCP mini-library Y1H results . . . .
91
3.11 Summary of the sequence verification of TFs for the mini-library . .
92
3.12 Summary of the overall Y1H results . . . .
93
3.13 Summary of the WRKY mini-screen . . . .
94
3.14 FPR and FNR definitions . . . 113
3.15 Summary of FPR and FNR for Y1H library screen . . . 114
4.2
AP2/ERF motifs derived using MEME and Y1H results . . . 141
4.3
Motifs present derived using MEME and Y1H results for PIF7, bZIP52,
AtHB25 and NAC098 . . . 143
4.4
Library arrangement used in mutagenesis . . . 148
4.5
Differentially expressed genes in
erf14
24 hpi with Botrytis . . . 154
4.6
Most overrepresented GO terms
erf14
differentially expressed genes.
155
4.7
GO Term analysis of genes differentially expressed in
erf14
. . . 155
4.8
GO Term analysis of differentially expressed genes in protoplasts
over-expressing
AtERF14
. . . 157
4.9
JAZ1 and MYB15, that were found to be interacting with
AtERF14
in Y1H, are also significantly differentially expressed in
erf14
KO
lines compared to WT and in protoplasts overexpressing
AtERF14. . 158
4.10 Direct targets of
AtERF14
in protoplasts and
erf14
plants . . . 160
A.1 Manually curated list of plant “Master Regulators” . . . 192
A.2 Numbers of aligned regions and associated genes from paralogous
promoters . . . 193
B.1 Summary of Y1H library results. . . 197
C.1 All of the samples are derived from the normal distribution as
List of Figures
1.1
Overview of gene regulation and transcription mechanism. . . .
5
1.2
Arabidopsis circadian clock model . . . .
12
1.3
An overview of plant defence response mechanisms . . . .
15
1.4
An overview of JA signalling mechanism . . . .
19
1.5
Networking by phytohormones in the plant immune response . . . .
21
2.1
Arabidopsis, papaya, poplar and grape phylogenetic tree . . . .
29
2.2
The alignment score histograms between Arabidopsis and 3 other
species . . . .
34
2.3
Venn diagrams of distribution of Arabidopsis orthologs across
com-parator species . . . .
40
2.4
Positional Bias of the CNSs towards the TSSs . . . .
45
2.5
Arabidopsis CNSs enrichment for TF binding sites . . . .
47
2.6
Predictions of nucleosome occupancy in the CNSs . . . .
49
2.7
Histogram of DNase sequencing read from Arabidopsis leaf tissue in
the CNSs . . . .
52
2.8
Subfunctionalisation in regulatory regions of Arabidopsis paralogs
.
55
3.1
Diagram of the steps in the Y1H library screen . . . .
69
3.2
Expression of genes in “JAZ” cluster . . . .
84
3.3
Expression of genes in “TCP” cluster . . . .
85
3.4
mRNA expression levels of
WRKY11
and
WRKY40
during infection
with Botrytis . . . .
87
3.5
mRNA expression levels of the genes in the “WRKY” cluster . . . .
88
3.6
Cumulative distributions of histograms of pixel intensities for Y1H-161 95
3.7
Cumulative histograms of intensities for spots predicted to be positive
interactions by the automatic algorithm for Y1H-161 fragment. . . .
98
3.7
Cumulative histograms of intensities for spots predicted to be positive
3.8
Final Y1H screen results after library and pairwise screens above each
promoter fragment. Green fragment location and identifier, blue
-CNS location and score. . . 101
3.8
Final Y1H screen results after library and pairwise screens above each
promoter fragment. Green fragment location and identifier, blue
-CNS location and score. . . 102
3.8
Final Y1H screen results after library and pairwise screens above each
promoter fragment. Green fragment location and identifier, blue
-CNS location and score. . . 103
3.8
Final Y1H screen results after library and pairwise screens above each
promoter fragment. Green fragment location and identifier, blue
-CNS location and score. . . 104
3.8
Final Y1H screen results after library and pairwise screens above each
promoter fragment. Green fragment location and identifier, blue
-CNS location and score. . . 105
3.9
Typical image of Y1H pairwise plate . . . 107
3.10 Image of Y1H-177 SD-LTH plate . . . 109
3.11 mRNA expression levels in response to infection with
B. cinerea
of
common Y1H regulators (blue) and corresponding target genes (red). 120
3.12 mRNA expression levels during infection with Botrytis of two TFs
(blue) predicted to be regulating a small subgroup of genes (red) in
Y1H screens. . . 121
3.13 Gene regulatory network from Y1H interactions . . . 125
4.1
An overview of DNase-Seq . . . 131
4.2
DNase footprint for SEP3 binding site . . . 132
4.3
DNAse footprint profiles of ORA59, AtERF14, ESE1 and AtERF7
binding motifs
. . . 144
4.3
DNase footprint profiles for newly discovered motifs . . . 145
4.4
Mutagenesis setup for Y1H-174 and AtERF14, ESE1 and AtERF7 . 147
4.5
Images of WT and mutagenised promoters on SD-LTH agar plates vs
mini-library arrangements (Table 4.4). Bright spots indicate growing
yeast colonies. . . 149
4.6
Pairwise of serial dilutions of Y1H-174 WT and
m5
vs select TFs . . 150
4.8
qPCR confirming AtERF14 KO in Arabidopsis . . . 153
4.9
mRNA expression levels of
AtERF14
(red) and associated targets
(blue) during infection with Botrytis. . . 160
4.10 DNase I cutting profile of published and
de novo
motifs in Arabidopsis
leafs for ORA59 TF. . . 165
4.11 DNase profile of new and unknown motif . . . 166
4.12
At1g19180
intergenic region with predicted motifs and DNase profile 168
4.13
At1g80840
intergenic region with predicted motifs and DNase profile 169
4.14
At2g35930
intergenic region with predicted motifs and DNase profile 170
4.15
At2g44840
intergenic region with predicted motifs and DNase profile 171
4.16
At3g23250
intergenic region with predicted motifs and DNase profile 172
4.17
At3g25760
intergenic region with predicted motifs and DNase profile 173
4.18
At3g25780
intergenic region with predicted motifs and DNase profile 174
4.19
At4g31550
intergenic region with predicted motifs and DNase profile 175
4.20 mRNA expression profiles of
AtERF14
(red) and associated targets
(blue), found to be differentially expressed in
erf14
KO line, and have
correlated expression during the infection with Botrytis supporting
regulatory link. . . 178
4.20 mRNA expression profiles of
AtERF14
(red) and associated targets
(blue), found to be differentially expressed in
erf14
KO line, and have
correlated expression during the infection with Botrytis supporting
regulatory link. . . 179
4.20 mRNA expression profiles of
AtERF14
(red) and associated targets
(blue), found to be differentially expressed in
erf14
KO line, and have
correlated expression during the infection with Botrytis supporting
regulatory link. . . 180
4.21 mRNA expression profiles of
ESE1
(red) and associated targets (blue),
found to be differentially expressed in protoplasts overexpressing
ESE1
TF, and have correlated expression during the infection with Botrytis
supporting regulatory link.
. . . 182
4.21 mRNA expression profiles of
ESE1
(red) and associated targets (blue),
Acknowledgments
First of all I would like to thank my supervisors, especially Katherine for providing
support and guidance throughout my PhD. At the same time I would like to thank
Systems Biology DTC and Vicky Buchanan-Wollaston for the support provided by
the department.
I would also like to thank all the people I had pleasure of meeting and
working with, including but not limited to: Johanna Rhodes, Miguel, Peter Krusche,
Adam Talbot, Sarah Harvey, Nigel Dyer, Steve Kiddle, Richard Hickman, Ana
Mendes, Ben Wareham, Damon Daniels, Ben Hamilton, Leslie, all the members of
the Systems Biology DTC, PRESTA group. Special thanks go out to Peijun Zhang,
Claire Hills, Alison Jackson, Justyna, Emily Breeze, who have provided immense
amount of help and support in the lab and writing. Without Alison Jackson’s help
in the lab many of the results presented here would not have happened. I would like
to extend a separate thanks to all my close friends: Jo, Phil, Ali, John, Chris, Max
and Mato: we had great time off work with“Game Of Thrones”. Thanks to all of
my housemates over the years for putting up with me! Last but not list I would like
to thank my main man Cai Wingfield for his helpful comments and discussions.
Finally, I would like to thank my mum, grandmother, aunt, uncle Nastia and
Zlata for all the years of support they have given me, including the offer to write
this thesis for me. Zoe for being the biggest joy of my life.
This thesis was typeset with L
ATEX 2
ε
1by the author.
1LA
Declarations
Abstract
Understanding the regulatory DNA sequences are becoming increasingly
im-portant in understanding the way plants integrate signalling cues mediated through
the actions of the transcription factors (TFs). This thesis presents an
interdisci-plinary investigations into regulatory elements found in the promoter regions of a
model organism
Arabidopsis thaliana.
The intergenic DNA sequences are studied between sets of orthologous genes
in
A. thaliana
and 3 other related species to uncover hundreds of evolutionary
con-served noncoding sequences (CNSs). The CNSs are found to be more skewed towards
the annotated transcription start sites (TSSs) and enriched in previously identified
transcription factors binding motifs. Furthermore, the nucleosomes are predicted to
have strong presence in the uncovered CNS than random intergenic sequences alone.
Altogether the evidence presented in the thesis points to the functional nature of
the CNSs.
Then, the promoters of genes thought to be co-regulated together and
tran-scriptionally active during infection with fungal pathogen
Botrytis cinerea
are
ex-perimentally tested for direct protein-DNA interaction using high-throughput Yeast
One-Hybrid (Y1H) library screens against the TFs found in
A. thaliana. The
result-ing predictions were further validated usresult-ing pairwise Y1H screen to suggest potential
common regulation by
ORA59,
PIF7,
ESE1,
At4g38900
and
ERF14, and uncovering
a complex gene regulatory network (GRN) associated with the tested genes.
Resulting publications and those in preparation
L. Baxter
1,
A. Jironkin
1, R. Hickman
1, J. Moore, C. Barrington, P. Krusche,
N.P. Dyer, V. Buchanan-Wollaston, A. Tiskin, J. Beynon, K. Denby, S. Ott,
“Conserved noncoding sequences highlight shared components of regulatory
networks in dicotyledonous plants”,
Plant Cell, vol. 24, no. 10, pp. 3949–65,
Oct. 2012.
K. Polanski, J. Rhodes, C. Hill, P. Zhang, D. Jenkins, S. Kiddle,
A. Jironkin,
J. Beynon, V. Buchanan-Wollaston, S. Ott, K. Denby, “Wigwams: identifying
gene modules co-regulated across multiple biological conditions”. Submitted
to
Bioinformatics.
APPLES software
The APPLES framework can be downloaded, with the installation instructions,
from:
Abbreviations
ABA Absiscic Acid
AD Activation domain
BRE IIB recognition element
bHLH basin Helix Loop Helix
bZIP bsic Zipper
CBE Cold binding element
CNS Conserved noncoding sequence
ChIP Chromatin Immunoprecipitation
DBD DNA binding domain
DB DNA binding
DPE Downstream promoter element
DREB Dehydration responsive element binding
ETI Effector triggered immunity
EMSA Electrophoresis mobility shift assay
ERF Ethylene response factor
ET Ethylene
eY1H enhanced Yeast One-Hybrid
FGF Fibroblast response factor
FNR False negative rate
FPR False positive rate
hpi hours post infection
GM Genetic modification
GRN Gene regulatory network
IFPRI International food policy research institute
LRR Leucine rich region
mRNA messenger RNA
miRNA micro RNA
ncDNA noncoding DNA
PAMP Pathogen associated molecular patterns
PSSM Position specific scoring matrix
PTI PAMP triggered immunity
SA Salicilic acid
siRNA short interfering RNA
TAIR The Arabidopsis information resource
TBP TATA binding protein
TF Transcription factor
TSS Transcription start site
T-DNA Transfer-DNA
T3SS Type 3 secretion system
UTR untranslated region
WGD Whole genome duplication
WM Weight matrix
Chapter 1
Introduction
This conversion took nearly 20 years. If genetic engineering techniques had been
available then, the genes that controlled high Trp levels could have been inserted
into high-yielding hard-endosperm phenotypes, saving decades of labour intensive
plant breeding.
1.1
Motivation
Food security remains one of the biggest challenges facing mankind. The problem of
food security is two-fold- firstly, sustaining food production for a growing population
and secondly, distributing food high in nutrients to the places where it is needed
the most. The world population is predicted to grow by 2.5 billion to reach a
stag-gering figure of 9.5 billion by the year 2050, even by conservative estimates (United
Nations, 2012). According to the data from the IFPRI, hunger is at an “alarming”
to “extremely alarming” state already in large parts of central and southern Africa
(International Food Policy Research Institute, 2011). Therefore, the “Millennium
Development Goals” set out as the number one target to halve world poverty and
hunger by 2015 (United Nations, 2013). Among the key challenges in securing
sus-tainable crops is improving their tolerance to a large variety of biotic and abiotic
stresses including, but not limited to infection with
Botrytis cinerea.
B. cinerea
is a necrotrophic fungus that affects over 200 plant varieties worldwide (van Kan,
2006; Williamson et al., 2007). Infection with the fungus leads to devastating
pre-and post-harvest losses pre-and severe financial losses to farmers (Williamson et al.,
2007). Hence, improving our understanding of the mechanisms of infection and the
defence response will provide possible future solutions in improving crops’ resilience
to biotic stress. As with the example of
opaque-2
mutant in maize, improving
plant responses to biotic and abiotic stresses often coincides with decreased biomass
(Herms and Mattson, 1992). Each stress elicits a complex cellular and molecular
response system implemented by the plant in order to prevent damage and ensure
survival, but often at the detriment of growth and yield (Herms and Mattson, 1992).
Therefore, manipulating plant responses to internal and external changes has the
potential to significantly impact on the survivability and longevity of the plant.
life cycle of approximately 6 weeks from germination to seed maturation and can
be easily cultivated in restricted spaces, such as growth rooms and chambers.
De-velopments in the “floral dip” technique by Clough and Bent (1998) means that
Arabidopsis is subject to efficient transformations utilising
Agrobacterium
tumefa-ciens
leading to a large number of mutant lines and genomic resources becoming
available from “The Arabidopsis Information Resource” (TAIR). Arabidopsis is a
member of the
Brassicaceae
family, which also includes cabbage and radish and is
therefore closely related to other crop species, suggesting that Arabidopsis may serve
as a good proxy for uncovering biological and molecular workings of the plant. The
knowledge gained in the studies of Arabidopsis can potentially be directly applied
to other related crops, provided that functionally equivalent genes exist and can be
identified in other plant species.
1.2
Regulation of gene expression
The gene is one of the key units in molecular biology. A gene is defined as a
se-quence of nucleotides which when converted into mRNA (messenger RNA) serves
as a template for building the associated protein. Introns and exons are also
con-tained within the coding region of a gene, splicing of the introns out of the protein
coding sequence from the mRNA molecules gives rise to multiplicity of different
pro-teins from the same template coding sequence. The process of generating mRNA
from DNA is performed by RNA polymerase (Lehman et al., 1958; Bessman et al.,
1958) and is called transcription. Gene expression/transcription can be enhanced
or silenced through transcription factors (TFs). TFs contain DNA binding domains
(DBD) which can bind directly to the DNA sequence and can also interact with
other proteins and TFs.
discovered in humans (Hoffman et al., 1990; Kao et al., 1990; Peterson et al., 1990).
Arabidopsis TBPs have been shown to be functionally equivalent to those found in
animals (Mukumoto et al., 1993), and function as a subunit of Transcription Factor
II D (TFIID) along with other TBP associated factors (TAFs) (Albright and Tjian,
2000). The TATA-box acts as a starting point for the assembly of the transcription
machinery, which TFIID itself is part of, whereby RNA Polymerase II, Mediator,
general TFs, transcription factor II A (TFIIA), transcription factor II B (TFIIB),
transcription factor II E (TFIIE), transcriotion factor II F (TFIIF) and
transcrip-tion factor II H (TFIIH) assemble in a specific order (reviewed in Hahn (2004)).
Other polymerases include Polymerase I, which transcribes genes encoding
the 45S precursor into large ribosomal RNA (reviewed in ) and Polymerase III, which
transcribes many small RNA genes including those for tRNA and 5S ribosomal RNA
(reviewed in ). Plants also contain additional RNA polymerases: Polymerase IV is
involved in the transcription of microRNAs (miRNAs) (Herr et al., 2005; Onodera
et al., 2005) whereas some organelles, such as chloroplasts, use plastid encoded
polymerase (PEP) (Lysenko and Kuznetsov, 2005). In turn, mRNA is translocated
outside of the nucleus into the cytoplasm and turned into proteins by the process
of translation (reviewed in (Malys and McCarthy, 2011)). Together, the process of
conversion from DNA into proteins through transcription then translation forms the
central tenet of the “Central Dogma” (Crick, 1970).
(a) Pathway of transcription initiation and re-initiation for RNA Polymerase II. Pol II, Mediator and TFIIA, B, D, E, F and H subunits assemble around the TATA-box sequence in the core promoter region, immediately upstream from the TSS, to form the Pre initiation complex (PIC). With help from ATP, the complex opens and becomes unstable, initiating transcription. After the transcription process has been initiated, Pol II, TFIIB and TFIIF are released from the DNA, whilst Med TFIIA, D, E and H remain bound to the promoter sequence allowing for re-recruitment of the missing factors. Once all subunits have assembled again into PIC transcription can be initiated once more. With permission from Macmillan Publishers Ltd, from Hahn (2004).
(b) General promoter structure consists of upstream enhancers and proximal promoter where TFs can bind followed by the core promoter region surrounding the transcription start site (TSS) with 5’ UTR and protein coding region immediately downstream of the TSS. The core promoter contains IIB recognition element (BRE) adjacent to the TATA-box, initiator (Inr) site and downstream promoter element (DPE) (Adapted with permission from Smale and Kadonaga (2003)).