• No results found

Computational approaches to identify regulators of plant stress response using high-throughput gene expression data

N/A
N/A
Protected

Academic year: 2021

Share "Computational approaches to identify regulators of plant stress response using high-throughput gene expression data"

Copied!
10
0
0

Loading.... (view fulltext now)

Full text

(1)

ContentslistsavailableatScienceDirect

Current

Plant

Biology

j ourn a l h o m e pa g e :w w w . e l s e v i e r . c o m / l o c a t e / c p b

Computational

approaches

to

identify

regulators

of

plant

stress

response

using

high-throughput

gene

expression

data

Alexandr

Koryachko

a

,

Anna

Matthiadis

b

,

Joel

J.

Ducoste

c

,

James

Tuck

a

,

Terri

A.

Long

b,∗

,

Cranos

Williams

a,∗∗

aElectricalandComputerEngineering,NorthCarolinaStateUniversity,Raleigh,NC,USA bPlantandMicrobialBiology,NorthCarolinaStateUniversity,Raleigh,NC,USA

cCivil,Construction,andEnvironmentalEngineering,NorthCarolinaStateUniversity,Raleigh,NC,USA

a

r

t

i

c

l

e

i

n

f

o

Articlehistory: Received4April2015 Accepted28April2015 Keywords: Stressresponse Transcriptionfactors Generegulatorynetworks Algorithms

Arabidopsisthaliana

a

b

s

t

r

a

c

t

Insightintobiologicalstressregulatorypathwayscanbederivedfromhigh-throughputtranscriptomic datausingcomputationalalgorithms.Thesealgorithmscanbeintegratedintoacomputationalapproach toprovidespecifictestablepredictionsthatanswerbiologicalquestionsofinterest.Thisreview conceptu-allyorganizesawidevarietyofdevelopedalgorithmsintoaclassificationsystembasedondesiredtypeof outputpredictions.Thisclassificationisthenusedasastructuretodescribecompletedapproachesinthe literature,withafocusonprojectgoals,overallpathofimplementedalgorithms,andbiologicalinsight gained.Thesealgorithmsandapproachesareintroducedmainlyinthecontextofresearchonthemodel plantspeciesArabidopsisthalianaunderstressconditions,thoughthenatureofcomputationaltechniques makestheseapproacheseasilyapplicabletoawiderangeofspecies,datatypes,andconditions.

©2015TheAuthors.PublishedbyElsevierB.V.ThisisanopenaccessarticleundertheCCBY-NC-ND license(http://creativecommons.org/licenses/by-nc-nd/4.0/).

1. Introduction

Plantsaresessileorganismssubjecttoconstantlychanging envi-ronments.Theabilitytorespondtotheseenvironmentalchanges, therefore,iskeytoplantadaptationandsurvival.Anoverallgoal of plantabiotic stressresearch is todevelop anunderstanding ofthemolecularcomponentsofasingleorcombinatorialstress response and show how these components interact, enabling directedgeneticmanipulationsthatcanenhancestresstolerance

[1].Transcriptionfactorsareoneofthefirstcategoriesofgenes activatedinresponsetoastress[2].Transcriptionfactoractivity canleadtoalterationsinactivityandaccumulationofdownstream transcriptionfactorsandproteinsthatmodulateplantmorphology andmolecularcomposition.Inthisway,manipulationofthe activ-ityofjustonetranscriptionfactororasmallfamilyoftranscription factorscanlead toalterationsin a transcriptionalcascadewith dramaticoutcomes.Thisstrategyisthebasisforboth evolution-aryadaptationaswellasgeneticmanipulationofstressresponses

[3,2,4].Despiteawidespreadfocusonstress-inducedtranscription

factorsandarecentbreadthofhighthroughputdata,successful

Correspondingauthor.Tel.:+19195150478. ∗∗Co-correspondingauthor.

E-mailaddresses:[email protected](T.A.Long),[email protected] (C.Williams).

geneticmanipulationsoftranscriptionfactorsincropplantsthat improvestresstolerancearelimited.Identificationof transcriptio-nalregulatorsinthemodelspeciesArabidopsisthalianaisafirst steptothesearchforcandidatesforgeneticmodificationincrop species,yetanumberoflimitationsexistinthisidentification.First, with5–10%ofplantgenomesreportedtocodefortranscription factors,thenumberofcandidategenestostudyisextensive[5–7]. Second,itisdifficulttopredicttheeffectsofonetranscription fac-torin isolationandevenmoredifficulttopredictcombinatorial effectsoftranscriptionfactorsactingonthesametargetsor act-ingincomplexes.Finally,thewaysinwhichtranscriptionfactor activityismodulatedinresponsetoonestressoracombination ofstressesarecomplex.Inotherwords,ahugenumberofpossible experimentsexisttotesttheeffectofcombinationsoftranscription factorsundercombinationsofstresses.Computationalapproaches playacriticalroleintheresearchprocessbyproducinga setof testablepredictions,thuslimitingthespaceofexperimentsneeded toyieldabetterunderstandingofthecascadingresponses result-ingfromstress.These predictionsrange frominvolvementof a transcriptionfactorin astressresponsetodetaileddescriptions oftranscriptionfactorand targetgeneinteractiondynamics.An increasingabundanceofcomputationalapproachesnecessitatesa carefulevaluationoftheutilityandapplicationofthesetools.

Inthisreview,wesummarizeandorganizealgorithmsinvolved incurrentandpromisingcomputationalapproachesinto5 cate-gories(“Types”)basedonthetypeofinferenceanalgorithmaims http://dx.doi.org/10.1016/j.cpb.2015.04.001

2214-6628/©2015TheAuthors.PublishedbyElsevierB.V.ThisisanopenaccessarticleundertheCCBY-NC-NDlicense(http://creativecommons.org/licenses/by-nc-nd/4. 0/).

(2)

toobtainfromabiologicaldataset.Wefocusonalgorithmsthat canoperateongeneexpressiondata,asthisiscurrentlythemost availableformofhighthroughputdata.Wethendemonstratehow algorithmsfromdifferentcategorieshavebeencombinedinthe scopeofcomputationalapproachestoachievespecificobjectives associatedwithplantstressresponse.Additionalexamplesof algo-rithmsthatfallintotheproposedclassificationsystemcanbefound eitherinplantrelated[8–10]orgeneralcomputationalapproach

[11–15]reviewarticles.Webuildonthesereviewsbyorganizing

algorithmsbasedontheirutilityandhighlightinghowinferences achievedbyalgorithmsinmultiplecategoriescanbesystematically combinedtoachieveabetterunderstandingofthetranscriptional cascadeinvolvedinstressresponse.We thendiscusshowthese algorithmshavebeenutilizedinrecentstudiestoachievea cer-tainobjective.Inthisway,researchersaimingtoacquirespecific predictionsfromanexpressiondatasetcanmoreefficientlychoose appropriatealgorithms.

2. Background

Environmental conditions in agricultural settings are highly variable, leading to suboptimal crop yields and survival rates

[16].Thefrequencyandintensityofenvironmentalextremes, par-ticular drought, heat, and pests are expected to increase with climatechange[1,17,16].Alargenumberofstressresponse stud-iesarefocusedonelucidatingtranscriptionalcascadesregulating responsestoindividualandcombinedstresses.Transcription fac-tors thatplay important rolesin modulating suchcascadesare candidatesforgeneticengineeringapproachesandareworthyof intensivestudy.

Geneexpressionanalysisisawidelyproposedmeansofbringing a greater understandingto allabiotic stress responsesfor sev-eralreasons.Alargenumberofgeneshavealteredexpressionin responsetostressandthesealterationsplayanimportantrolein adaptation[18,19].Expressiondataisalsorelativelycheap.Because ofthis,high-throughputgeneexpressiondatasetshavebeen gen-eratedandarepubliclyavailableforamultitudeofstresses,both bioticandabiotic,withexamplesinA.thalianaincludingbutnot limitedtopathogeninfection[20–22],cold[23],pH[24],salt[25], light [26],and nutrient [25,27–30]stress.Though thesestudies arecomparableintheory,afewlargestudieshaveattemptedto mitigatethe effects ofvariations in experimentalsetupby col-lecting expression data under different stresses imposed with otherwiseidenticalgrowthconditions[18]orundercombinations ofstresses[31–34].Analysesoftheseconcurrentand combinato-rialexperimentsinparticularhave revealeddistinctpatternsof differentstressresponsesalongwithsomecommonfeatures, infer-ringthatbothgeneralandspecificstressresponsepathwaysexist. For example, analysisof theAtGenExpress databaseof concur-rentstressapplicationindicatesthatsomeabioticstressesresult insustainedgeneexpressionalterationsand othersintransient alterations[18].Asetofearly-andcommonlyinducedgenes, rep-resentingthesocalledPlantCoreEnvironmentalStressResponse (PCESR),includestranscriptionfactors,indicating thata general stressresponsemaybetranscription factormediatedand likely occurs earlyin stress response cascades [18,35]. Combinatorial studiesindicatethatgenesrespondingtocombinedstressesare oftendistinctfromthoserespondingtoindividualstresses, high-lighting a need for both more studies of this type as well as computational methods to attempt to predict these emergent behaviors[31–34].Despitetheseextensiveanalyses,limiteddirect predictionsconcerningstresspathwayshavebeenmadeand val-idated.Themajorityofdetailedcharacterizationsoftranscription factors,includingdirect promoterbindingandinfluenceon tar-getgene expression,are theresultof traditional studies.These

studiesaretimeandcostintensive.Furthermore,sincemanykey regulatorshavebeenfoundthroughphenotypicmutantscreens, subtleyetimportantphenotypesandgenescaneasilybemissed. Redundancyisexpectedincriticalregulatorymechanisms[36],and predictionsconcerningwhichregulators ormutantstocombine in a geneticengineering strategywould beextremely valuable. Arecent increaseinalgorithm developmentand utilizationwill helptoincreasethepredictivepowerinavailabledatasetssothat regulatorsandcombinatorialregulatorymechanismsbeyondthe “lowhanging fruit” canbeidentified.In thefollowing sections, wedescribeandorganizesetsofalgorithmsandimplementations thereofinexperimentalapproaches,aimingtobringattentionto thebenefitoftheseapproachesandfacilitatefutureincreasesin frequencyandstrengthofcomputationalbiologystudies.

3. Classificationofinferencealgorithms

Manycomputationalalgorithmshavebeendevelopedfor ana-lyzinggeneexpressiondata.Wefocushereonalgorithmscapable of identifyingstress related genes,groupinggenes byfunction, inferringconnectionsbetweengenes,estimatinggeneinteraction direction and type, and predicting gene expression states and values ininterconnectedregulatorynetworks.Thesealgorithms differincomplexityandimpliedassumptions,butcanbeclassified basedonfunctionality.Wecategorizethesealgorithmsin5distinct groupsbasedonthetypeofinsighttheyprovidetoabiological pro-cessofinterest.Dependingonresearchobjectives,thesealgorithms caneitherbeusedseparatelyorasapartofasystematic compu-tationalapproachwhereinferencesfromalgorithmsofonetype canbeusedasinputforalgorithmsofanothertype.Forexample,a computationalapproachdesignedtopredictgeneinteractionsand theirtypebasedontimecoursemicroarraydatacanbecomprised of3algorithmsofdifferenttypesthatsequentiallyprocessinput datatoobtainadesiredoutput(Fig.1).

Thealgorithmsdescribedcanbeappliedtotranscriptomicdata obtainedatMtimepointsortreatments(tj,j=1,...,M)foraset ofNgenes(gi,i=1,...,N).Examplesofsuchdatasetsincludethe globalabioticstressexpressiondatabaseAtGenExpress[18].This databaseincludesdatasetsformultipleabioticstresstreatments thatareobtainedforN≈24,000genesatM=7timepointsusing AffymetrixATH1GeneChipmicroarrayanalysis.Hence,theactivity ofeachgenecanberepresentedbyasetofnumbersgi(t1),gi(t2),

...,gi(tM),formingapatternthatisusedbyalgorithmstomake inferences.

Type1algorithmsattempttocapturegenesthatarerelevantto a particular condition. Techniques for determining differentially expressedgenes are an example of algorithmsfalling into this category[37].Differentialexpressiontechniquesworkby assum-ing that significant changein transcript levelsof a given gene understressconditionrelativetoitsactivityundernormal condi-tionsindicatesthatthegeneplaysaroleinthestressresponse. Thisassumption disregardsposttranscriptional modificationsas alternatemeansofgeneproductregulation.Sincetranscript mea-surementprecisioncanvaryfromoneexperimentalapproachto another,statisticaltestsareoftenappliedtodeterminethe sig-nificanceofthechangeintranscriptlevels.Student’st-testfor2 treatmentsorANOVAforasetoftreatmentsarecommonlyapplied to deduce statistical significance. Other differential expression inferencealgorithmsweredevelopedforlargescaleexperimental techniquessuchasmicroarrays,forwhichthecorrelationbetween within-arrayreplicatescan betakeninto consideration [38],or RNA-Seq, forwhichcountbasedstatistics aremoreappropriate

[39,40].

Type 2algorithmsaimto identifyrelationshipsbetween genes. These algorithms work by assuming that genes with “similar”

(3)

Fig.1.Conceptualviewoftheinformationflowinacomputationalapproach.Biologicaldataisusedtoidentifygenesofinterest(Type1algorithm),inferconnections betweenthesegenes(Type2algorithm),andpredicttypesoftheseconnections(Type3algorithm).

expressionpatternsareco-regulatedorarepartofthesame regula-torypathway[13].Techniqueslikeco-expressionanalysis[41–45]

fallintothiscategory.

Common metrics that have been used to assess similarities betweengenesbasedontheirexpressionpatternsincludePearson correlation coefficient [42,46–48], Spearman correlation coeffi-cient [49–51], partial correlation coefficient [52–54], Euclidean distance[55,56],andmutualinformation[57–59].Thesemetrics typicallyrepresenta quantifiedmeasurethatestablishesa pair-wisecomparisonbetweentheexpressionlevelsoftwogenes,g1 and g2, acrosstime points or experimentaltreatments. Kumari etal.[60]presentedastudythatevaluatedtheutilityofSpearman rankcorrelation,WeightedRankCorrelation,Kendall,HoeffdingsD measure,Theil-Sen,RankTheil-Sen,DistanceCovariance,and Pear-soncorrelationcoefficientontranscriptionaldatafordetermining geneassociation.TheauthorsfoundthatSpearman,Hoeffding,and Kendallcorrelationcoefficients weremore effectivein identify-ingrelatedpathwaygenesthanothers.Incontrast,Maetal.[61]

claimthatbasedonmanualinspectionoftheexpressionpatterns ofseveralpairsofTF-targetgenes,theGinicorrelationcoefficient cancompensatefortheshortcomingsofthePearson,Spearman, Kendall,andTukeysbiweightcorrelationsindetectingtransient regulatoryrelationshipsbetweentranscription factorsandtheir targets.Metricssuchasareabetweenexpressioncurves[62],Z -score[63],andothersappearintheliteraturebuthavenotbeen extensivelyevaluated.

Relationshipsbetweenindividualgenesoracrossestablished groupsofgenescanbegeneratedbasedonthesesimilaritymetrics. Atypicalprocedureforestimatingrelationshipsbetween individ-ualgenesistosetathresholdvalueandassignconnectionsbetween geneswhosepairwisesimilarity valueishigherthan aselected threshold[62,58].Thestatisticalsignificanceofthesimilaritycan alsobetakenintoconsiderationwhenestablishingaconnection

[57].Groupsofsimilarlybehavinggenesareinmostcasesidentified usingclusteringalgorithms.Clusteringalgorithmsapplysimilarity metricstoisolategroupsofco-expressedgenes.k-means cluster-ing[64],theMarkovClusteralgorithm[65,66],biclustering[67], self-organizingmaps[68],hierarchicalclustering[69],and affin-itypropagation[70]areexamplesofclusteringalgorithmsapplied totranscriptomicdata.Martinetal.[64]appliedk-means cluster-ing,hierarchicalclustering,andself-organizingmapstotimeseries transcriptomicdatafrommice.Theresultssuggestedthatk-means wasabletoconveycomparablegroupingtohierarchicalclustering, andself-organizingmaps(morethan80%agreement)while main-taininglessofacomputationalloadthanotherapproaches.Freyand Dueck[70]showedthattheaffinitypropagationalgorithmyields morecompactclusterscomparedtok-meansintermsofthesum ofinterclusterdistanceswhichmightimplytighterrelationships betweengenesinthesamecluster.

Clustering has also been used to reduce the complexity of buildingtranscriptionalnetworksbyreducing highdimensional networkswithmanygenestolowerdimensionalnetworksof clus-tersof genesor “metagenes”,which represent groups ofgenes withsimilarexpressionactivity.Theexpressionpatternofa meta-gene may be defined as thecluster average or the expression pattern of the gene with the highest sum of similarities with itsclustermembers.Somealgorithmshaveextracted metagene expressionpatternsfirstbyapplyingprincipalcomponentanalysis (PCA)orsingularvaluedecomposition(SVD)totheoverall expres-siondataset.Theclustersarethenassembledbasedonsimilarities betweengeneandmetageneexpressionpatterns[71,72].

Type3algorithmsaimtoinfercausalrelationshipsbetweengenes. Causalinferenceproceduresareoftenbasedontheassumptionthat achangeinonegene(g1)willresultina subsequentchangein anothergene(g2)atsomelatertimeifg1activatesorinhibitsg2

[73–77].Thus,theapproachissimilartoco-expressionanalysisin

thatitaimstofindgeneswithsimilartemporalexpression pat-terns.Thekeydifferencedistinguishingthisapproachfromthose inType2istheassumptionthatthesesimilaritieswilloccurata delay,allowingforinferenceonthedirectionofregulation(which genecomesfirstinaregulatorycascade)inadditiontoa relation-shipconnection.TheequationforPearsoncorrelationcoefficient, forexample,canbemodifiedtoassessthistemporalcharacteristic byincorporatingatimedelay.Eq.(1)reflectssimilarityatthedelay ofonetimeunit.Thealgorithmscapturetheregulationdelayfor apairofgenesbyselectingthetimeunitdurationthatmaximizes thecorrelationcoefficient[75].

g1→g2 =

M−1 j=1 (g1(tj)−g¯1)(g2(tj+1)−g¯2)

M−1 j=1 (g1(tj)−g¯1)2

M−1 j=1 (g2(tj+1)−g¯2)2 , where g¯1= 1 M1 M−1

j=1 g1(tj), g¯2= 1 M1 M−1

j=1 g2(tj+1) (1)

Twosets ofsimilarity values,each correspondingtoa range of delaysforacertaindirectionofshift,arecalculatedtoassessthe strength anddirectionality ofconnectionin each pairofgenes. Smallsimilarityvalues,correspondingtoalowprobabilityof reg-ulation,canberemoved,leavingtheremaininghighconfidence connectionstocharacterizegenesthathavepotentialcausal rela-tionships.Approachesthatusemodificationsofthemetricin(1)

havebeeneffectiveforsingledatasetswith50and27timepoints andsamplingintervalsof20min[75]andfor acollectionof18 datasetswith7timepointsineachandsamplingintervalsranging from0.5to12h[78].Othersampletimesmayberelevant depend-ingonthefeaturesthatexistinthedata.

(4)

Anotherclassofalgorithmsthatinferregulatoryinteractions betweengenesisBayesiannetworks[79,80].Bayesiannetworks arecapableofinferringregulatoryconnectionsfromtimecourse andnon-timecoursedata.Thesealgorithmsattempttofindcausal connectionsbasedonBayes’rulebyexplicitlychoosinganetwork structure that best describesexperimental data. The algorithm considersanetworkofgeneregulationsasasetofdependencies wheretheprobabilityofexpressionofatargetisconditionedon theexpressionofitsregulator.Theseregulationsaredescribedas conditionalprobabilities.Algorithmsthentrytofinda network structurethatbestdescribesthedatabasedonascoringfunction. Identificationofthenetworkstructureisacomputationally inten-siveproblem.Complexitygrowsexponentiallywithanincreasing numberofnodes[11].Forexample,around1018different topolo-giesariseforanetworkofonly10genes[12].Thus,mostofthe approachesusingBayesiannetworksconcentrateonasmallsubset ofgenes(typicallywhensomeportionofageneregulatorynetwork isalreadyknown)oremploysub-optimalbutlesscomputationally intensesolutionstohandlelargernetworks[11].

DynamicBayesianNetworks(DBN)[81,82]incorporate order-inginformationintimecoursedatatoallowforfeedbackloops(not allowedinstandardBayesiannetworks).Thesefeedbackloopsare allowedbytreatingexpressionofthesamegeneatdifferenttime pointsasdifferentnodes.Nodescorrespondingtothesamegene arecombinedafterthestructureinferenceprocedure.This algo-rithmleadstoanincreaseincomplexitysincethenumberofnodes involvedinstructureinferenceroutineisaproductofthenumber ofgenesandthenumberoftimepoints.

Type4algorithmsaimtoinfercombinationsofregulator expres-sionstatesthatarenecessarytoresultinaparticularstateoftarget. Thesealgorithms canbeconceptualizedasa search fora func-tionalrelationshipbetweenatargetanditsregulator(s)(gi=f(g1,

g2,...,gN)).Inthiscase,aqualitativemeasureofgenebehavior canbeused,withgeneexpressionvalues representedaseither highorlow, activeor inactive,or“ON”or“OFF”tosimplifythe problem.An“ON”stateofonlyacoupleregulatorsmaysufficeto upregulatetheexpressionofthetarget.Thisqualitativeassumption allowstheuseofBooleannetworks[83]inType4inference prob-lems.ExpressionvaluesinBooleannetworkinferenceapproaches arediscretizedmostlyintwostates,representinganactivitylevel ateachtimepoint[84–86].Regulatoryconnectioninference algo-rithmstrytofindabinaryfunctionthatcomputesthenextstateof agenebasedonacombinationofothergenes’statesusingsimple Booleanoperations,e.g.AND(&)ifmorethanoneregulatorshould haveacertainstatetoinfluenceacommontarget,OR(|)ifanyof theregulatorstatessufficeforthesamepurpose,andNOT(¬)inthe caseofrepression(Fig.2).Thegoalofthisapproachistofindthe simplestfunctionforeachgene,whichisthefunctionthatdepends onthefewestregulatorgenespossible.

Adirectapproach tofindthesimplestBooleanfunctionthat satisfiesagivendatasetistocompareallpossiblefunctions capa-bleofgeneratingtheobservedexpressionpattern.Thenumberof Booleanfunctionsthatcanrepresenttheexpressionactivityofa generegulatedbyasmanyasntranscriptionfactorsis22n

[87],

makingtheproblemcomputationallyinfeasibleforalarge(more than10)numberofgenes.Somealgorithmsusepriorknowledge toconfinethenumber of genestoanalyze.Othersrelyon net-workstructuresinferredbyothertypesofalgorithmstoconfine thenumber,type and directionalityofpossibleregulatory rela-tionshipsbetweenindividualgenesorgroupsofgenes.Another factorconstrainingtheuseofBooleannetworksinwholegenome datasetanalysisisthesmallnumberofsamples(timepoints) asso-ciatedwithmostdatasets.Thesesmallsamplesizestypicallydo notprovidethediversityneededtouniquelydefinerelationships acrossalargenumberofindividualgenes.Forexample,for5time points,which isthemedian numberin typicalgeneexpression

Fig.2.Booleannetworkrepresentationingraphicalandfunctionalforms. Combi-nationsoftranscriptionfactorsg1,g2,andg3influenceexpressionofeachotherand targetgenes.Thestateofg6,forexample,isinfluencedbyacombinationofg2and g3orbyg1alone.

Fig.3. Type5algorithmsoutputintermsofthesystemofODEsandpredictedgene expressiondynamics(gi(t))basedonexperimentalvalues(gi(tj)).Inthisexample,

theexpressionpatternofeachgeneisinfluencedbytheexpressionofatleastone othergene,withsomegenes(g4)influencedbytheirownexpression(feedback loop).

datasets[88],thenumberofgeneswithdistinctBoolean expres-sionpatternsislimitedtoonly25=32.Anyattemptedanalysisof morethan32geneswithsuchadatasetwouldresultinatleast 2geneswithidenticalbehaviorwhichwouldlimitresolutionto groupsofgenes(e.g.metagenes)asopposedtoindividualgenes.

Type5algorithmsaimtodescribedynamicbehaviorina trans-criptionalnetwork.Theresultingnetworkrepresentationallowsfor thereconstructionofcontinuouschangesintranscriptsovertime (Fig.3).Ordinarydifferentialequationsarecommonlyusedto cap-turethedynamicsassociatedwithgeneexpressionchanges[89]. Theseequationsallowfortheestimationofgeneexpression val-uesatanygiventimepointeitherbetweensamples(interpolation) orbeyondthelastcollectedsample(extrapolation)[29].Whena generegulatorynetworkisrepresentedintermsoflinear differ-entialequations,theinstantaneouschangeinexpressionofagene isrelatedtothesumofweightedexpressionvaluesofinfluencing genes: dgi dt = N

k=1 aikgk, (2)

(5)

whereaikrepresentinfluencecoefficients.Coefficientsforlinear differentialequationsareofteninferredusingtheLeastAbsolute ShrinkageandSelectionOperator(LASSO)algorithm[90],a modi-ficationofthelinearregressionapproach.WhenLASSOisusedfor ODEinferencepurposes,thechangesinexpression,i.e.differences betweenexpressionvaluesatconsecutivetimepoints,are approx-imatedbyalinearcombinationofothergenes’expressionvalues. Expressionpatternsfortargetgenesarereplacedwithpatternsof changesinexpression[91–93]toinferinfluencecoefficients.Given thatbiologicalprocessesareassumedtobeinherentlynonlinear, linearOrdinaryDifferentialEquation(ODE)inferencealgorithms fortranscriptionalnetworksrelyontheassumptionthatthe sys-temoperatesclosetoastabilitypoint[93].Thesystemmaynotstay closetoastabilitypointinthecaseofstressinducedresponses, where a plant may transition from one stable steady state to another.NonlinearODEs,thoughpotentiallymorebiologically rel-evantbecausetheydonotrely onthesteadystateassumption, typically requirethe estimation of more coefficients associated withnonlinearterms[94].Coefficientestimationroutinesfor infer-encealgorithms searchtheparameterspacetofindcoefficients that yield solutions closest to measured expression values

[95,96].

Allof the described algorithms requireimplementation and validation in biological systems in order to assess theirutility. Anumberofvalidationtechniquesexist,dependingonthetype ofalgorithm[97–113].Thesevalidationtechniquesarevisualized withkeyreferencesinFig.4.ValidationforalgorithmsofTypes1

and2,whichpredictassociationsbetweenageneandaprocess oragene anda groupof genes,are limitedtoanalysisof Gene Ontology(GO)enrichmentorphenotypesinmutantsof transcrip-tionalregulators.Thesephenotypesrangewidelydependingonthe stressresponseinquestion,andcouldinvolveextensive experi-mentationtosearchforaphenotypeofinterest.Awiderrangeof techniquesexistforalgorithmsofTypes3–5,algorithmsthat pre-dictrelationshipsbetweentranscriptionfactorsandtargetgenes. Theserelationshipscanbetested indirectly throughexpression profiling,computationallythroughpromoteranalysis,ordirectly throughbindinginteractions.Giventhatno“goldstandard” vali-dationtechniqueexists[114],convincingsupportofteninvolves thecombinationofmultiplevalidationtechniques,suchas expres-sionanalysis and binding activityfor a regulator and target of interest.Similarly,complexpredictionssuchasthosederivedfrom

Type4andType5algorithmsrequireacombinationofstaticand dynamicvalidationtechniques–includingexpressionprofilingat multipletimepoints,preferablyalongwithdeterminationof bind-ingactivity.

4. Computationalapproaches

Computationalapproachesareusedwidelytogaininsightinto processesunderlyingplantresponse tostressconditions. These approacheshaveasimilarstructureintermsofthetypesof algo-rithms theyuseand differin thecombination of and order in whichthesealgorithmsareapplied.Inthefollowingexamples,we describehowalgorithmsofdifferenttypeshavebeencombinedin particularcomputationalapproachestoanswerresearchspecific questions.

4.1. Relevantgeneidentification

A large number of current computational approaches are focusedonidentifyinggenesthatplayakeyroleinaprocessof interest.The importanceofthese genesisthen typically tested throughmutantphenotypicanalysis.Maetal.[56]analyzedaset ofA.thalianaabioticstressresponsetranscriptomedatasetswith

6timepointstoidentifystressrelatedgenes.Thecomputational approachstartedbypartitioning eachstressdatasetinto “infor-mative” and “noninformative” genesusing differential network analysis(Type1algorithm).Theauthorsstatedthatdifferential net-workanalysisthatinvolvesmachinelearningandtrainingbasedon

aprioriinformationismoresensitivethandifferentialexpression analysis,whichisstatisticsoriented.TheGinicorrelationcoefficient wasthencalculatedforpairsof“informative”genestoestablish sig-nificantconnections(Type2algorithm).Stressrelatedgeneswere identifiedfromtheresultingnetworkbasedonthecombinationof 33topologyscoresobtainedfromthenetworkofsignificant con-nections(Type1algorithm).Theauthorsvalidatedtheiralgorithm byperformingaphenotypicscreenfor89candidatesidentifiedas saltstressrelated.Mutantsof2previouslyunreportedsalt stress-relatedgenesshowedphenotypes.

Dinnenyetal.[25]conductedDNAmicroarrayexperimentson

A.thalianarootresponsetoirondeficientmediawith7timepoints spanning72htoidentifycommonstressresponsebehavior pat-terns.Theauthorsapplieddifferentialexpressionanalysis[115]to identifygeneshavingatleasta1.5-foldchangeinexpressionwith afalsediscoveryratevaluelessthan10−4atasamplingtimepoint comparedtonotreatment(Type1algorithm).Theanalysisshowed thatthestrongesttranscriptionalresponseoccurredafter24hof treatment.Dinnenyetal.[25]thenappliedtheaffinity propaga-tionclusteringalgorithm[70]toformgroupsofsimilarlyexpressed genesandthusidentifygeneralpatternsofgeneexpression(Type 2algorithm).Longetal.[28]usedtheresultsofthisanalysisand screenedthroughmutantsof38identifiedgenescodingfor coex-pressedtranscriptionfactors.Thescreensledtoidentificationof importantironhomeostasisregulatorsPOPEYE(PYE)andBRUTUS (BTS).

Linetal.[30] investigatedtheeffectofphosphatestarvation onA.thalianarootgene signalingusingaDNAmicroarray time coursewith 3 time pointsto infer functional modules in early transcriptionalresponses.Theauthorsuseddifferentialexpression analysiswiththerequirementofa2-foldchangeinexpressionwith ap-valuecutoffof0.05toidentifystressrelatedgenes(Type1 algo-rithm).Additionalinformationfrom2671experimentaldatasets, 300ofwhicharerootspecific,wasusedtoselect187rootspecific genes(Type1algorithm).TheauthorsusedtheMulti-Array Corre-lationComputationUtility(MACCU)toolboxbasedonthresholding pairwise Pearson correlation coefficients toobtain 3 functional modulesofstressspecificgenes(Type2algorithm).Tovalidatethe results,Linetal.[30]conductedmutantscreenson31members ofaclusterwheremostofthegenesareknowntoparticipatein rootdevelopment.Only5testedlinesdidnotshowastatistically significantroothairlengthphenotype.

4.2. Genefunctionelucidation

Anothergroupofcomputationalapproachesaimtoassociate geneswithaspecificfunctionduringaprocessofinterest.The guilt-by-associationheuristic[43]isoftenusedtoassignafunctiontoan unknowngenebasedonknownfunctionsofco-regulatedgenes (GeneOntologyenrichment).Polanskietal.[48]analyzedsixA. thalianastressresponse transcriptomedatasetstoidentifygene modulesshowingevidenceforco-regulation.Thecomputational approachrevealed78modulesofco-regulatedgenes,71ofwhich wereoverrepresentedinGeneOntologycategoriesand51ofwhich wereenrichedin transcription factorbindingmotifs(compared to24and6of78randomlyassignedmodules,respectively).The approachusedinformationaboutwhichgenesweredifferentially expressedineachstressresponseasaninput(previously deter-minedinotherpublications usingType 1algorithms).For each genedifferentiallyexpressedunderatleast2conditions,the algo-rithmassembledasetofcorrelatedgenesforeachcondition(Type

(6)

Fig.4. Validationtechniquesforalgorithmtypeswithkeyreferences.Examplesshownarethosetypicallyseenincurrentcomputationalresearchapproaches,specifically forresearchprojectsinA.thaliana.

2algorithm).Aco-regulationrelationshipinapairofgeneswas establishedifthesegeneshadsharedasignificantnumberof corre-latedgenesacrossstressconditions(Type2algorithm).Theauthors usedGeneOntologyenrichment,promoteranalysis,andyeast one-hybridprotein–DNAinteractionstovalidatetheresultingmodules ofco-regulatedgenes.

MaandBohnert[116]integratedtimecourseandcellspecific transcriptomics data with gene promoter structures to iden-tifystressrelatedcis-elementsinA. thaliana.Thecomputational approach used in this work detectedknown stress related cis -elementsandidentifiedsecondarymotifs.Theauthorscombined abioticandbioticstress,hormoneand chemicaltreatmenttime coursesanddifferentlightconditionsamplestocreateone com-bined expression pattern of 145values pergene. Differentially expressedgeneswereidentified bycombiningtheresultsfrom fuzzyk-meansclustering[117]appliedtoallgeneprobesandthe ‘limma’statisticalprogram[118]whichidentifiedgenes differen-tiallyexpressedinatleastonecondition(Type1algorithm).Fuzzy

k-meansclusteringwasagainappliedtotheresultingsetto iden-tifystressrelatedclustersofgenes(Type2algorithm).Theauthors assignedfunctionstoclustersbasedonGOenrichment.Binding motif analysis using Plant Cis-acting Regulatory DNA Elements (PLACE)database[105]revealedmotifssignificantlyoverexpressed inthefunctionrelatedclusters.Furtheranalysisof22majorclusters resultedintheidentificationofnewDNAregulatorymotifs[119].

4.3. Generelationshipinference

Computationalapproachesthataimtounravelinfluential rela-tionshipsbetweenregulatorsandtheirtargetsarelesscommonbut areincreasinginfrequency.Windrametal.[22]applieda computa-tionalapproachtoidentifytranscriptionfactorfamiliesoperating atdifferentstagesofA.thalianapathogendefenseresponse.The authorsanalyzedtranscriptionalprofilesat24time pointswith 4 replicates per time point. The computational approach pre-dictedgeneregulatoryinteractions,confirmedexperimentallyor by bindingmotif enrichment.The analysisstarted with assess-ment of differentiallyexpressed genesbased ona combination ofMAANOVA(MicroArrayANalysisOf VAriance)[120], approx-imateFtests,GP2S(Gaussianprocess2sample)test[121],and Hotellingstatistic(T2)[122](Type1algorithm).Next,a

SplineClus-ter[123]algorithmseparateddifferentiallyexpressedgenesinto

clustersassociatedwithdifferentstagesofstressresponse(Type 2algorithm).TheclusterswerevalidatedbyGOenrichment anal-ysis.NonparametricmodificationofBayesiannetwork inference algorithm [124] wasapplied to clusterrepresentatives toinfer regulatoryconnectionsbetweenclusters(Type3algorithm).The authors validated the regulatory effect of one of the clusters throughexperimentswithaknockoutmutantlineforthe tran-scription factor TGA3. Experimental data showed altered gene expressioninpredictedTGA3targetclustersinthetga3-2mutant,

(7)

Fig.5. Transitionsbetweenalgorithmsofdifferenttypes.Typicalexperimentaltransitionsbetweenalgorithmsareindicatedwithbluearrowsandperspectivefuture transitions,lesscommonbutpossiblewithmorereliablesupportingalgorithms,areindicatedwithwhitearrows.

whereastargetsregulatedbynonTGA3clusterswerelessaffected. Theeffectofanothertranscriptionfactor,ANAC055,wasvalidated bybindingmotifenrichmentintargetclusters.

Redestigetal.[78]analyzedasetof18DNAmicroarraytime seriescorrespondingtoninedifferentabioticstresseswithseven timepointsobtainedfromrootandshootofA.thalianaseedlings withtheaimofassociatingstressresponsivetranscriptionfactors with theirtargets. The authors concluded that their computa-tional approach delivered a usable number of high-confidence targetgenes(12–59%ofidentifiedtruetargets)forstressrelated transcriptionfactors.Thecomputationalapproachidentifiedstress relatedtranscriptionfactorsbyselectingoneswithmaximum over-allresponseandmaximumchangeinresponsesatisfyingaspecific thresholdcriteria(Type1algorithm).Covariancevaluesbetween atranscriptionfactorandothergenesoverasetofdelayswere calculatedforasetofconditions(Type3algorithm).Highscores correspondedtoahighprobabilityofregulation.

Krouketal.[29]conductedDNAMicroarrayexperimentsonA. thaliananitrateresponsewithsixtimepointsspanning20minutes tocaptureageneregulatorynetworkunderlyingplantadaptation tonitrateprovision.Theinferredtemporalmodelofthereaction processbuiltfor20 clusterrepresentativesresultedin 70% cor-rect predictions of expression value direction changeafter the lasttimepoint inthetimecourse.Thecomputationalapproach startedwithANOVAtoidentifynitrogenregulatedgenes(Type 1algorithm).Next,MeVsoftware[125]wasusedtoseparatethe nitrogenregulatedgenesinto20clusters,eightofwhichappeared tohaveover-representedbiologicalfunctions(Type2algorithm). TheapplicationofLASSObasedalgorithmtocluster representa-tivesprovidedcoefficientsforasystemoflinearODEsdescribing thedynamicsofeachcluster(Type5algorithm).Predictionsonthe directionofchangeobtainedfromODEsweretestedbycomparing themwithexpressionvaluesfromatimepointthatwasnotused forinferencepurposes.

4.4. Summary

Ascanbesurmisedfromtheexamplesgiven,algorithmsfrom

Type1,Type2,andType3aremorecommonincurrent experimen-talapproachesappliedtoplants.Theproblemofdimensionality preventstheextensiveuseofType4andType5algorithmsfor individualgenes based onwhole genomedatasets due todata requirementsforsuchtype of inference[13].Thus, the dimen-sionoftheproblemistypicallyreducedbylimitingasetofgenes toonesknowntointeractorparticipate inthesame biological process.Recentnon-stressrelatedapproachesinA.thalianahave

employedsuchtechniques.Espinosaetal.[126]used experimen-tally obtainedknowledge aboutrelationshipsof 15 genes in A. thalianaflowerdevelopmentprocesstopredictdevelopment sce-nariosusingBooleannetworksapproach(Type4algorithm).Sankar

et al. [127] built a model to predict states of thecomponents

fromauxinandbrassinosteroidsignalingnetworksinA.thaliana

byapplyingBooleanlogicapproach(Type4algorithm)andthen transformed theresulting discrete network representation to a setofordinarydifferentialequations(Type5algorithm)toobtain quantitativepredictions.Cruz-Ramirezetal.[128]investigatedthe dynamicsofasymmetriccelldivisionwithintheA.thalianarootby analyzingasystemofnonlineardifferentialequationsfor7 inter-actingcomplexes(Type5output).Theanalysispredictedabistable behavioroftheprocess.Finally,Pokhilkoetal.[129]refinedthe interactionmodeldescribingcircadianrhythmsinA.thalianaby modelingtheprocesswithasystemofnonlinear ODEs(Type5

output).

Similaritiesinregulatoryprocesses onagenomiclevel allow fortheapplicationofcomputationalapproachesthatwere devel-opedfornon-plantspecies.Somecomputationalapproachesare availableinsoftware packages.Anextensiveuseofthese pack-ages showsthat even ifa technique wasdeveloped and tested for one species, it can be applied to a similar dataset from anotherspecies.Examplesoftheseapproachesarebrieflydescribed here. Vermeirssen et al. [130] combined the Learning Module Networksalgorithm[131]developedforyeast,ContextLikelihood ofRelatednessalgorithm[132]testedonEscherichiacoli,and Dou-bleTwo-wayt-testsalgorithmtestedonE.colitoidentifyoxidative stressregulatorytranscriptionfactorsinA.thaliana(Type3output). TheAlgorithmfortheReconstructionofGeneRegulatoryNetworks (ARACNE)[58]wasdevelopedtoinfertranscriptionalregulations inhumanBcells,butthenusedforotherapplicationsincluding theinferenceoftranscriptionalinteractionsunderlyingroot devel-opmentandphysiological processesinA.thaliana[133](Type3

output).Othersoftwarepackagesthatshowedtheabilitytorecover generegulatorynetworksfromtranscriptomicdatainclude CLR

[132],MRNET[134],C3NET[57],andARTIVA[135].TheDialogue

onReverseEngineeringAssessmentandMethods(DREAM)project attemptedto comparesuch GRNinferencemethods appliedto

E.coli,Staphylococcusaureus,Saccharomycescerevisiaeandinsilico

microarraydata[136].Theauthorsdiscoveredthatthesemethods havecomplementaryadvantagesandlimitationsunderdifferent contexts.Inthecaseofmulticellularorganisms,theperformance oftechniqueshassofarbeenmeasuredbasedongoalsachieved foraspecificapplication.Suchperformanceisdifficulttocompare betweenmethodssincegoalsandapplicationsareoftendiverse.

(8)

5. Conclusions

We presented a classification of computational algorithms basedonthetype of information theyaimtoinfer.This struc-turewasusedtodescribeapproachesintheliteraturethathave been used to gain insight into biological processes of interest basedontranscriptomicsdata.Examplesofexistingcomputational approachesappliedtoplantstresstranscriptionaldatasets demon-stratedapatternoftransitionbetweenalgorithmsofdifferenttypes (displayedgraphicallyinFig.5).Thisprogressiondemonstratesthat thequalityofpredictionsmadebyanalgorithminthescopeofa computationalapproachoftendependsonthequalityof predic-tionsmadebyaprecedingalgorithmaswellasonthequalityofthe originalbiologicaldata.Basedonavailablealgorithmsandexample implementations,wecanstatethateventhoughbothstressrelated geneidentificationandgroupingalgorithms(Type1andType2) arestill evolving,confidenceinType 2algorithm predictionsis sufficienttoallowforatransitiontocausalityinference(Type3).

Type3algorithmshavethepotentialtosupplyType4andType 5algorithmswithinformationaboutthestructureofgene regula-torynetworks.Thisinformationwillreducethenumberofpossible functionalrelationshipstoconsiderforthesetypesofalgorithms dramaticallyandthusallowfortheincreaseinscopeand predic-tivepower.Therefore,theperspectivetransitionsshowninFig.5

willlikelyappearmoreofteninfuturecomputationalapproaches asreliabilityofType3algorithmspredictionsincrease.

Acknowledgements

Thismaterialisbasedupon worksupportedbytheNational ScienceFoundationunderGrantNo.1247427andbytheNational ScienceFoundationGraduateResearchFellowshipunderGrantNo. 1252376.WethankRosangelaSozzaniandRobertFranksforcritical readingofthismanuscript.

References

[1]R.Mittler,E.Blumwald,Geneticengineeringformodernagriculture: chal-lengesandperspectives,Annu.Rev.PlantBiol.61(2010)443–462. [2]L.Vaahtera,M.Brosché,Morethanthesumofitsparts–howtoachievea

specifictranscriptionalresponsetoabioticstress,PlantSci.180(3)(2011) 421–430.

[3]W.Wang,B.Vinocur,A.Altman,Plantresponsestodrought,salinityand extremetemperatures:towardsgenetic engineeringforstresstolerance, Planta218(1)(2003)1–14.

[4]A.E.Valdés,Forcedadaptation:plantproteinstofightclimatechange,Front. PlantSci.5(2014)762.

[5]J.Riechmann,J.Heard,G.Martin,L.Reuber,J.Keddie,L.Adam,O.Pineda, O.Ratcliffe,R.Samaha,R.Creelman,etal.,Arabidopsistranscriptionfactors: genome-widecomparativeanalysisamongeukaryotes,Science290(5499) (2000)2105–2110.

[6]M.K.Udvardi,K.Kakar,M.Wandrey,O.Montanari,J.Murray,A.Andriankaja, J.-Y.Zhang,V.Benedito,J.M.Hofer,F.Chueng,etal.,Legumetranscription factors:globalregulatorsofplantdevelopmentandresponsetothe environ-ment,PlantPhysiol.144(2)(2007)538–549.

[7]R.Melzer,G.Theißen,MADSandmore:transcriptionfactorsthatshapethe plant,in:PlantTranscriptionFactors,Springer,2011,pp.3–18.

[8]G.R.Cramer,K.Urano,S.Delrot,M.Pezzotti,K.Shinozaki,Effectsofabiotic stressonplants:asystemsbiologyperspective,BMCPlantBiol.11(1)(2011) 163.

[9]S.Friedel,B.Usadel,N.VonWirén,N.Sreenivasulu,Reverseengineering:a keycomponentofsystemsbiologytounravelglobalabioticstresscross-talk, Front.PlantSci.3(2012)294.

[10]G.Krouk,J.Lingeman,A.M.Colon,G.Coruzzi,D.Shasha,etal.,Generegulatory networksinplants:learningcausalityfromtimeandperturbation,Genome Biol.14(6)(2013)123.

[11]C.Sima,J.Hua,S.Jung,Inferenceofgeneregulatorynetworksusing time-seriesdata:asurvey,Curr.Genomics10(6)(2009)416.

[12]K.Cho,S.Choo,S.Jung,J.Kim,H.Choi,J.Kim,Reverseengineeringofgene regulatorynetworks,Syst.Biol.IET1(3)(2007)149–163.

[13]M.Hecker,S.Lambeck,S.Toepfer,E.vanSomeren,R.Guthke,Gene regu-latorynetworkinference:dataintegrationindynamicmodels–areview, Biosystems96(1)(2009)86.

[14]G.Karlebach,R.Shamir,Modellingandanalysisofgeneregulatorynetworks, Nat.Rev.Mol.CellBiol.9(10)(2008)770–780.

[15]A.M.Middleton,E. Farcot,M.R.Owen, T.Vernoux, Modeling regulatory networkstounderstandplantdevelopment:smallisbeautiful,PlantCell24 (10)(2012)3876–3891.

[16]N.J.Atkinson,P.E.Urwin,Theinteractionofplantbioticandabioticstresses: fromgenestothefield,J.Exp.Bot.63(10)(2012)3523–3543.

[17]C.deSassi,J.M.Tylianakis,Climatechangedisproportionatelyincreases her-bivoreoverplantorparasitoidbiomass,PLOSONE7(7)(2012)e40557. [18]J.Kilian,D.Whitehead,J.Horak,D.Wanke,S.Weinl,O.Batistic,C.D’Angelo,E.

Bornberg-Bauer,J.Kudla,K.Harter,TheAtGenExpressglobalstressexpression dataset:protocols,evaluationandmodeldataanalysisofUV-Blight,drought andcoldstressresponses,PlantJ.50(2)(2007)347–363.

[19]L.López-Maury,S.Marguerat,J.Bähler,Tuninggeneexpressiontochanging environments:fromrapidresponsestoevolutionaryadaptation,Nat.Rev. Genet.9(8)(2008)583–593.

[20]R.F.Ditt,K.F.Kerr,P.deFigueiredo,J.Delrow,L.Comai,E.W.Nester,The

ArabidopsisthalianatranscriptomeinresponsetoAgrobacteriumtumefaciens, Mol.PlantMicrobeInteract.19(6)(2006)665–681.

[21]R.J.O’Connell,M.R.Thon,S.Hacquard,S.G.Amyotte,J.Kleemann,M.F.Torres, U.Damm,E.A.Buiate,L.Epstein,N.Alkan,etal.,Lifestyletransitionsinplant pathogeniccolletotrichumfungidecipheredbygenomeandtranscriptome analyses,Nat.Genet.44(9)(2012)1060–1065.

[22]O.Windram,P.Madhou,S.McHattie,C.Hill,R.Hickman,E.Cooke,D.J.Jenkins, C.A.Penfold,L.Baxter,E.Breeze,etal.,ArabidopsisdefenseagainstBotrytis cinerea:chronologyandregulationdecipheredbyhigh-resolutiontemporal transcriptomicanalysis,PlantCell24(9)(2012)3530–3557.

[23]B.-h.Lee,D.A.Henderson,J.-K.Zhu,TheArabidopsiscold-responsive transcrip-tomeanditsregulationbyICE1,PlantCell17(11)(2005)3155–3175. [24]A.S.Iyer-Pascuzzi,T.Jackson,H.Cui,J.J.Petricka,W.Busch,H.Tsukagoshi,P.N.

Benfey,Cellidentityregulatorslinkdevelopmentandstressresponsesinthe

Arabidopsisroot,Dev.Cell21(4)(2011)770–782.

[25]J.R.Dinneny,T.A.Long,J.Y.Wang,J.W.Jung,D.Mace,S.Pointer,C.Barron, S.M.Brady,J.Schiefelbein,P.N.Benfey,Cellidentitymediatestheresponseof

Arabidopsisrootstoabioticstress,Science320(5878)(2008)942–945. [26]S.González-Pérez,J.Gutiérrez,F.García-García,D.Osuna,J.Dopazo,Ó.

Lorenzo,Ó.Lorenzo,J.L.Revuelta,J.B.Arellano,Earlytranscriptionaldefense responsesinArabidopsiscellsuspensioncultureunderhigh-lightconditions, PlantPhysiol.156(3)(2011)1439–1456.

[27]T.J.Buckhout,T.J.Yang,W.Schmidt,Earlyiron-deficiency-induced transcrip-tionalchangesinArabidopsisrootsasrevealedbymicroarrayanalyses,BMC Genomics10(1)(2009)147.

[28]T.A.Long,H.Tsukagoshi,W.Busch,B.Lahner,D.E.Salt,P.N.Benfey,ThebHLH transcriptionfactorPOPEYEregulatesresponsetoirondeficiencyin Arabidop-sisroots,PlantCell22(7)(2010)2219–2236.

[29]G.Krouk,P.Mirowski,Y.LeCun,D.E.Shasha,G.M.Coruzzi,etal.,Predictive networkmodelingofthehigh-resolutiondynamicplanttranscriptomein responsetonitrate,GenomeBiol.11(12)(2010)R123.

[30]W.-D. Lin, Y.-Y. Liao, T.J. Yang, C.-Y. Pan, T.J. Buckhout, W. Schmidt, Coexpression-basedclusteringofArabidopsisrootgenespredictsfunctional modules in early phosphate deficiencysignaling, Plant Physiol. (2011) 110.

[31]L.Rizhsky,H.Liang,J.Shuman,V.Shulaev,S.Davletova,R.Mittler,When defensepathwayscollide.TheresponseofArabidopsistoacombinationof droughtandheatstress,PlantPhysiol.134(4)(2004)1683–1696. [32]S.Rasmussen,P.Barah,M.C.Suarez-Rodriguez,S.Bressendorff,P.Friis,P.

Costantino,A.M.Bones,H.B.Nielsen,J. Mundy,Transcriptomeresponses tocombinationsofstressesinArabidopsis,PlantPhysiol. 161(4)(2013) 1783–1794.

[33]C.M.Prasch,U.Sonnewald,Simultaneousapplicationofheat,drought,and virustoArabidopsisplantsrevealssignificantshiftsinsignalingnetworks, PlantPhysiol.162(4)(2013)1849–1866.

[34]N.Sewelam,Y.Oshima,N.Mitsuda,M.Ohme-Takagi,Asteptowards under-standingplantresponsestomultipleenvironmentalstresses:agenome-wide study,PlantCellEnviron.37(9)(2014)2024–2035.

[35]A.Hahn,J.Kilian,A.Mohrholz,F.Ladwig,F.Peschke,R.Dautel,K.Harter,K.W. Berendzen,D.Wanke,Plantcoreenvironmentalstressresponsegenesare systemicallycoordinatedduringabioticstresses,Int.J.Mol.Sci.14(4)(2013) 7617–7641.

[36]J.L.Riechmann,Transcriptionalregulation:agenomicoverview,The Ara-bidopsisBook16(1)(2002)1.

[37]X.Cui,G.A.Churchill,etal.,StatisticaltestsfordifferentialexpressionincDNA microarrayexperiments,GenomeBiol.4(4)(2003)210.

[38]G.K.Smyth,J.Michaud,H.S.Scott,Useofwithin-arrayreplicatespotsfor assessingdifferentialexpressioninmicroarrayexperiments,Bioinformatics 21(9)(2005)2067–2075.

[39]S.Anders,W.Huber,Differentialexpressionanalysisforsequencecountdata, GenomeBiol.11(10)(2010)R106.

[40]M.D.Robinson,D.J.McCarthy,G.K.Smyth,edgeR:abioconductorpackagefor differentialexpressionanalysisofdigitalgeneexpressiondata,Bioinformatics 26(1)(2010)139–140.

[41]K.Aoki,Y.Ogata,D.Shibata,Approachesforextractingpracticalinformation fromgeneco-expressionnetworksinplantbiology,PlantCellPhysiol.48(3) (2007)381–390.

[42]B.Zhang,S.Horvath,etal.,Ageneralframeworkforweightedgene co-expressionnetworkanalysis,Stat.Appl.Genet.Mol.Biol.4(1)(2005)1128.

(9)

[43]C.J.Wolfe,I.S.Kohane,A.J.Butte,Systematicsurveyrevealsgeneral appli-cabilityof“guilt-by-association”withingenecoexpressionnetworks,BMC Bioinform.6(1)(2005)227.

[44]B.Usadel,T.Obayashi,M.Mutwil,F.M.Giorgi,G.W.Bassel,M.Tanimoto,A. Chow,D.Steinhauser,S.Persson,N.J.Provart,Co-expressiontoolsforplant biology:opportunitiesforhypothesisgenerationandcaveats,PlantCell Envi-ron.32(12)(2009)1633–1651.

[45]I.Lee,B.Ambaru,P.Thakkar,E.M.Marcotte,S.Y.Rhee,Rationalassociationof geneswithtraitsusingagenome-scalegenenetworkforArabidopsisthaliana, Nat.Biotechnol.28(2)(2010)149–156.

[46]A. Gupta, C.D. Maranas, R. Albert, Elucidation of directionalityfor co-expressedgenes:predictingintra-operonterminationsites,Bioinformatics 22(2)(2006)209–214.

[47]J.Ehlting,V.Sauveplane,A.Olry,J.-F.Ginglinger,N.J.Provart,D. Werck-Reichhart,Anextensive(co-)expressionanalysistoolforthecytochrome P450superfamilyinArabidopsisthaliana,BMCPlantBiol.8(1)(2008)47. [48]K.Polanski,J.Rhodes,C.Hill,P.Zhang,D.J.Jenkins,S.J.Kiddle,A.Jironkin,

J.Beynon,V.Buchanan-Wollaston,S.Ott,etal.,Wigwams:identifyinggene modulesco-regulatedacrossmultiplebiologicalconditions,Bioinformatics 30(7)(2014)962–970.

[49]R.Balasubramaniyan,E.Hüllermeier,N.Weskamp,J.Kämper,Clusteringof geneexpressiondatausingalocalshape-basedsimilaritymeasure, Bioinfor-matics21(7)(2005)1069–1077.

[50]J.Nie,R.Stewart,H.Zhang,J.Thomson,F.Ruan,X.Cui,H.Wei,TF-Cluster:a pipelineforidentifyingfunctionallycoordinatedtranscriptionfactorsvia net-workdecompositionofthesharedcoexpressionconnectivitymatrix(SCCM), BMCSyst.Biol.5(1)(2011)53.

[51]X.Cui,T.Wang,H.-S.Chen,V.Busov,H.Wei,Tf-finder:asoftwarepackage foridentifyingtranscriptionfactorsinvolvedinbiologicalprocessesusing microarraydataandexistingknowledgebase,BMCBioinform.11(1)(2010) 425.

[52]H.Kishino,P.J.Waddell,Correspondenceanalysisofgenesandtissuetypes andfindinggeneticlinksfrommicroarraydata,GenomeInform.11(2000) 83–95.

[53]A.Wille,P.Zimmermann,E.Vranová,A.Fürholz,O.Laule,S.Bleuler,L.Hennig, A.Prelic,P.vonRohr,L.Thiele,etal.,SparsegraphicalGaussianmodelingofthe isoprenoidgenenetworkinArabidopsisthaliana,GenomeBiol.5(11)(2004) R92.

[54]J.Schäfer,K.Strimmer,Anempiricalbayesapproachtoinferringlarge-scale geneassociationnetworks,Bioinformatics21(6)(2005)754–764. [55]P.D’haeseleer,etal.,Howdoesgeneexpressionclusteringwork?Nat.

Bio-technol.23(12)(2005)1499–1502.

[56]C.Ma,M.Xin,K.A.Feldmann,X.Wang,Machinelearning-baseddifferential networkanalysis:astudyofstress-responsivetranscriptomesinArabidopsis, PlantCell26(2)(2014)520–537.

[57]G.Altay,F.Emmert-Streib,Inferringtheconservativecausalcoreofgene regulatorynetworks,BMCSyst.Biol.4(1)(2010)132.

[58]A.A.Margolin,I.Nemenman,K.Basso,C.Wiggins,G.Stolovitzky,R.D.Favera, A.Califano,ARACNE:analgorithmforthereconstructionofgeneregulatory networksinamammaliancellularcontext,BMCBioinform.7(Suppl1)(2006) S7.

[59]R.Steuer,J.Kurths,C.O.Daub,J.Weise,J.Selbig,Themutualinformation: detectingandevaluatingdependenciesbetweenvariables,Bioinformatics18 (suppl2)(2002)S231–S240.

[60]S.Kumari,J.Nie,H.-S.Chen,H.Ma,R.Stewart,X.Li,M.-Z.Lu,W.M.Taylor, H.Wei,Evaluationofgeneassociationmethodsforcoexpressionnetwork constructionandbiologicalknowledgediscovery,PLOSONE7(11)(2012) e50411.

[61]C.Ma,X.Wang,Applicationoftheginicorrelationcoefficienttoinfer regu-latoryrelationshipsintranscriptomeanalysis,PlantPhysiol.160(1)(2012) 192–203.

[62]L.Rueda,A.Bari,A.Ngom,Clusteringtime-seriesgeneexpressiondatawith unequaltimeintervals,in:TransactionsonComputationalSystemsBiology X,Springer,2008,pp.100–123.

[63]M.Triska,D.Grocutt,J.Southern,D.J.Murphy,T.Tatarinova,cisExpress:motif detectioninDNAsequences,Bioinformatics29(17)(2013)2203–2205. [64]S.Martin,Z.Zhang,A.Martino,J.Faulon,Booleandynamicsofgenetic

regu-latorynetworksinferredfrommicroarraytimeseriesdata,Bioinformatics23 (7)(2007)866–874.

[65]S.VanDongen,Aclusteralgorithmforgraphs,Rep.Inf.Syst.(10)(2000)1–40. [66]W.I.Mentzen,E.S.Wurtele,RegulonorganizationofArabidopsis,BMCPlant

Biol.8(1)(2008)99.

[67]Y.Zhang,H.Zha,C.-H.Chu,Atime-seriesbiclusteringalgorithmfor revea-lingco-regulatedgenes,in:InformationTechnology:CodingandComputing, 2005.ITCC2005,vol.1,IEEE,2005,pp.32–37.

[68]P. Tamayo,D. Slonim,J. Mesirov,Q. Zhu,S.Kitareewan,E.Dmitrovsky, E.S.Lander,T.R.Golub,Interpretingpatternsofgeneexpressionwith self-organizingmaps:methodsandapplicationtohematopoieticdifferentiation, Proc.Natl.Acad.Sci.U.S.A.96(6)(1999)2907–2912.

[69]M.B.Eisen,P.T.Spellman,P.O.Brown,D.Botstein,Clusteranalysisanddisplay ofgenome-wideexpressionpatterns,Proc.Natl.Acad.Sci.U.S.A.95(25) (1998)14863–14868.

[70]B.J.Frey,D.Dueck,Clusteringbypassingmessagesbetweendatapoints, Sci-ence315(2007)972–976.

[71]X.Li,Y.Ye,M.Ng,Q.Wu,MultiFacTV:moduledetectionfromhigher-order timeseriesbiologicaldata,BMCGenomics14(Suppl4)(2013)S2.

[72]J.-X.Liu,C.-H.Zheng,Y.Xu,Extractingplantscoregenesrespondingtoabiotic stressesbypenalizedmatrixdecomposition,Comput.Biol.Med.42(5)(2012) 582–589.

[73]T.Chen,V.Filkov,S.S.Skiena,Identifyinggeneregulatorynetworksfrom experimentaldata,in:ProceedingsoftheThirdAnnualInternational Con-ferenceonComputationalMolecularBiology,ACM,1999,pp.94–103. [74]A.T.Kwon,H.H.Hoos,R.Ng,Inferenceoftranscriptionalregulation

relation-shipsfromgeneexpressiondata,Bioinformatics19(8)(2003)905–912. [75]W.A.Schmitt,R.M.Raab,G.Stephanopoulos,Elucidationofgeneinteraction

networksthroughtime-laggedcorrelationanalysisoftranscriptionaldata, GenomeRes.14(8)(2004)1654–1663.

[76]W.Zhao,E.Serpedin,E.R.Dougherty,Inferringgeneregulatorynetworksfrom timeseriesdatausingtheminimumdescriptionlengthprinciple, Bioinfor-matics22(17)(2006)2129–2135.

[77]P.C.Ma,K.C.Chan,Inferringgeneregulatorynetworksfromexpressiondata bydiscoveringfuzzydependencyrelationships,IEEETrans.FuzzySyst.16(2) (2008)455–465.

[78]H.Redestig,D.Weicht,J.Selbig,M.A.Hannah,Transcriptionfactortarget pre-dictionusingmultipleshortexpressiontimeseriesfromArabidopsisthaliana, BMCBioinform.8(1)(2007)454.

[79]D.Heckerman,AtutorialonlearningwithBayesiannetworks,Innov.Bayesian Netw.(2008)33–82.

[80]N.Friedman,M.Linial,I.Nachman,D.Pe’er,UsingBayesiannetworksto ana-lyzeexpressiondata,J.Comput.Biol.7(3–4)(2000)601–620.

[81]K.Murphy,S.Mian,etal.,Modellinggeneexpressiondatausingdynamic Bayesiannetworks,Tech.rep.,TechnicalReport,ComputerScienceDivision, UniversityofCalifornia,Berkeley,CA,1999.

[82]N.Dojer,A.Gambin,A.Mizera,B.Wilczy ´nski,J.Tiuryn,Applyingdynamic Bayesiannetworkstoperturbedgeneexpressiondata,BMCBioinform.7(1) (2006)249.

[83]S.Liang,S.Fuhrman,R.Somogyi,etal.,REVEAL,ageneralreverseengineering algorithmforinferenceofgeneticnetworkarchitectures,in:Pacific Sympo-siumonBiocomputing,vol.3,1998,p.2.

[84]R.Albert,Booleanmodelingof geneticregulatorynetworks,in:Complex Networks,Springer,2004,pp.459–481.

[85]E.Dimitrova,L.García-Puente,F.Hinkelmann,A.Jarrah,R.Laubenbacher,B. Stigler,M.Stillman,P.Vera-Licona,ParameterestimationforBooleanmodels ofbiologicalnetworks,Theor.Comput.Sci.412(26)(2011)2816–2826. [86]R.Laubenbacher,B.Stigler,Acomputationalalgebraapproachtothereverse

engineeringofgeneregulatorynetworks,J. Theor.Biol. 229(4) (2004) 523–537.

[87]T.Akutsu,S.Miyano,S.Kuhara,etal.,Identificationofgeneticnetworksfroma smallnumberofgeneexpressionpatternsunderthebooleannetworkmodel, in:PacificSymposiumonBiocomputing,vol.4,WorldScientificMaui,Hawaii, 1999,pp.17–28.

[88]B.A.Rosa,J.Zhang,I.T.Major,W.Qin,J.Chen,Optimaltimepointsamplingin high-throughputgeneexpressionexperiments,Bioinformatics28(21)(2012) 2773–2781.

[89]G.Bernot,J.-P.Comet,A.Richard,M.Chaves,J.-L.Gouzé,F.Dayan, Model-ingandanalysisofgeneregulatorynetworks,in:ModelinginComputational BiologyandBiomedicine,Springer,2013,pp.47–80.

[90]R.Tibshirani,Regressionshrinkageandselectionviathelasso,J.R.Stat.Soc. B:Methodological(1996)267–288.

[91]R.Guthke,U.Möller,M.Hoffmann,F.Thies,S.Töpfer,Dynamicnetwork recon-structionfromgeneexpressiondataappliedtoimmuneresponseduring bacterialinfection,Bioinformatics21(8)(2005)1626–1634.

[92]M.Gustafsson,M.Hornquist,A.Lombardi,Large-scalereverseengineeringby thelasso,arXivpreprintq-bio/0403012.

[93]M.S.Yeung,J.Tegnér,J.J.Collins,Reverseengineeringgenenetworksusing singularvaluedecompositionandrobustregression,Proc.Natl.Acad.Sci.U. S.A.99(9)(2002)6163–6168.

[94]M.Gustafsson,M.Hörnquist,J.Lundström,J.Björkegren,J.Tegnér,Reverse engineeringofgenenetworkswithLASSOandnonlinearbasisfunctions,Ann. N.Y.Acad.Sci.1158(1)(2009)265–275.

[95]L.Palafox,N.Noman,H.Iba,Reverseengineeringofgeneregulatorynetworks usingdissipativeparticleswarmoptimization,IEEETrans.Evol.Comput.17 (4)(2013)577–587.

[96]M.Kabir,N.Noman,H.Iba,Reverseengineeringgeneregulatorynetworkfrom microarraydatausinglineartime-variantmodel,BMCBioinform.11(Suppl 1)(2010)S56.

[97]M.Ashburner,C.A.Ball,J.A.Blake,D.Botstein,H.Butler,J.M.Cherry,A.P.Davis, K.Dolinski,S.S.Dwight,J.T.Eppig,etal.,GeneOntology:toolfortheunification ofbiology,Nat.Genet.25(1)(2000)25–29.

[98]S.Maere,K.Heymans,M.Kuiper,BiNGO:aCytoscapeplugintoassess overrep-resentationofgeneontologycategoriesinbiologicalnetworks,Bioinformatics 21(16)(2005)3448–3449.

[99]J.M.Alonso,A.N.Stepanova,T.J.Leisse,C.J.Kim,H.Chen,P.Shinn,D.K. Stevenson,J.Zimmerman,P.Barajas,R.Cheuk,etal.,Genome-wide inser-tional mutagenesis of Arabidopsis thaliana, Science 301 (5633) (2003) 653–657.

[100] A.Sessions,E.Burke,G.Presting,G.Aux,J.McElver,D.Patton,B.Dietrich,P. Ho,J.Bacwaden,C.Ko,etal.,Ahigh-throughputArabidopsisreversegenetics system,PlantCell14(12)(2002)2985–2994.

[101] M.G.Rosso,Y.Li,N.Strizhov,B.Reiss,K.Dekker,B.Weisshaar,An Arabidop-sisthalianaT-DNAmutagenizedpopulation(GABI-Kat)forflankingsequence tag-basedreversegenetics,PlantMol.Biol.53(1–2)(2003)247–259.

(10)

[102] B.Ülker,E.Peiter,D.P.Dixon,C.Moffat,R.Capper,N.Bouché,R.Edwards,D. Sanders,H.Knight,M.R.Knight,Gettingthemostoutofpubliclyavailable T-DNAinsertionlines,PlantJ.56(4)(2008)665–677.

[103] P.Hilson,J.Allemeersch,T.Altmann,S.Aubourg,A.Avon,J.Beynon,R.P. Bhalerao,F.Bitton,M.Caboche,B.Cannoot,etal.,Versatilegene-specific sequencetagsforArabidopsisfunctionalgenomics:transcriptprofilingand reversegeneticsapplications,GenomeRes.14(10b)(2004)2176–2189. [104] A.Coego,E.Brizuela,P.Castillejo,S.Ruíz,C.Koncz,J.C.delPozo,M.Pineiro,

J.A.Jarillo,J.Paz-Ares,J.León,TheTRANSPLANTAcollectionofArabidopsis

lines:aresourceforfunctionalanalysisoftranscriptionfactorsbasedontheir conditionaloverexpression,PlantJ.77(6)(2014)944–953.

[105] K.Higo,Y.Ugawa,M.Iwamoto,T.Korenaga,Plantcis-actingregulatoryDNA elements(PLACE)database:1999,NucleicAcidsRes.27(1)(1999)297–300. [106] R.V.Davuluri,H.Sun,S.K.Palaniswamy,N.Matthews,C.Molina,M.Kurtz,E. Grotewold,AGRIS.Arabidopsisgeneregulatoryinformationserver,an infor-mationresourceofArabidopsiscis-regulatoryelementsandtranscription factors,BMCBioinform.4(1)(2003)25.

[107] S.K.Palaniswamy,S.James,H.Sun,R.S.Lamb,R.V.Davuluri,E.Grotewold, AGRISandAtRegNet.Aplatformtolinkcis-regulatoryelementsand transcrip-tionfactorsintoregulatorynetworks,PlantPhysiol.140(3)(2006)818–829. [108] T.R. O’Connor, C. Dyreson, J.J. Wyrick, Athena: a resource for rapid visualizationandsystematicanalysisofArabidopsispromotersequences, Bioinformatics21(24)(2005)4411–4413.

[109] S.M.Brady,N.J.Provart,Web-queryablelarge-scaledatasetsforhypothesis generationinplantbiology,PlantCell21(4)(2009)1034–1051.

[110] T.L.Bailey,M.Boden,F.A.Buske,M.Frith,C.E.Grant,L.Clementi,J.Ren,W.W. Li,W.S.Noble,MEMESUITE:toolsformotifdiscoveryandsearching,Nucleic AcidsRes.(2009)gkp335.

[111] M.K.Das,H.-K.Dai,AsurveyofDNAmotiffindingalgorithms,BMCBioinform. 8(Suppl.7)(2007)S21.

[112] K.W. Berendzen,K.Harter, D. Wanke,Analysisof plantregulatorydna sequencesbytransientprotoplastassaysandcomputeraidedsequence eval-uation,in:PlantSignalTransduction,Springer,2009,pp.311–335. [113] N.Wehner,L.Hartmann,A.Ehlert,S.Böttner,L.Onate-Sánchez,W.

Dröge-Laser, High-throughput protoplast transactivation (PTA) system for the analysisofArabidopsistranscriptionfactorfunction,PlantJ.68(3)(2011) 560–569.

[114] C.Olsen,K.Fleming,N.Prendergast,R.Rubio,F.Emmert-Streib,G.Bontempi, B.Haibe-Kains,J.Quackenbush,Inferenceandvalidationofpredictivegene networksfrombiomedicalliteratureandgeneexpressiondata,Genomics103 (5)(2014)329–336.

[115] T.-M.Chu,B.Weir,R.Wolfinger,Asystematicstatisticallinearmodeling approachtooligonucleotidearrayexperiments,Math.Biosci.176(1)(2002) 35–51.

[116] S.Ma,H.J.Bohnert,IntegrationofArabidopsisthalianastress-relatedtranscript profiles,promoterstructures,andcell-specificexpression,GenomeBiol.8(4) (2007)R49.

[117] A.P.Gasch,M.B.Eisen,Exploringtheconditionalcoregulationofyeastgene expressionthroughfuzzyk-meansclustering,GenomeBiol.3(11)(2002) 1–22.

[118] G.K.Smyth,Linearmodelsandempiricalbayesmethodsforassessing differ-entialexpressioninmicroarrayexperiments,Stat.Appl.Genet.Mol.Biol.3 (1)(2004)1–25.

[119] S.Ma,S.Bachan,M.Porto,H.J.Bohnert,M.Snyder,S.P.Dinesh-Kumar, Dis-coveryofstressresponsiveDNAregulatorymotifsinArabidopsis,PLOSONE 7(8)(2012)e43198.

[120]H.Wu,M.K.Kerr,X.Cui,G.A.Churchill,MAANOVA:asoftwarepackageforthe analysisofspottedcDNAmicroarrayexperiments,in:TheAnalysisofGene ExpressionData,Springer,2003,pp.313–341.

[121]O.Stegle,K.J.Denby,E.J.Cooke,D.L.Wild,Z.Ghahramani,K.M.Borgwardt,A robustbayesiantwo-sampletestfordetectingintervalsofdifferentialgene expressioninmicroarraytimeseries,J.Comput.Biol.17(3)(2010)355–367. [122]Y.C.Tai,T.P.Speed,etal.,AmultivariateempiricalBayesstatisticforreplicated

microarraytimecoursedata,Ann.Stat.34(5)(2006)2387–2412.

[123]N.A.Heard,C.C.Holmes,D.A.Stephens,D.J.Hand,G.Dimopoulos,Bayesian coclusteringofanophelesgeneexpressiontimeseries:studyofimmune defenseresponsetomultipleexperimentalchallenges,Proc.Natl.Acad.Sci. U.S.A.102(47)(2005)16939–16944.

[124]S.Klemm,etal.,CausalStructureIdentificationinNonlinearDynamical Sys-tems,Dept.ofEng.,Univ.ofCambridge,UnitedKingdom,2008(Master’s thesis).

[125]E.Howe,K.Holton,S.Nair,D.Schlauch,R.Sinha,J.Quackenbush,MeV: multi-experimentviewer,in:BiomedicalInformaticsforCancerResearch,Springer, 2010,pp.267–277.

[126]C.Espinosa-Soto,P.Padilla-Longoria,E.R.Alvarez-Buylla,Ageneregulatory networkmodelforcell-fatedeterminationduringArabidopsisthalianaflower developmentthatisrobustandrecoversexperimentalgeneexpression pro-files,PlantCell16(11)(2004)2923–2939.

[127]M.Sankar,K.S.Osmont,J.Rolcik,B.Gujas,D.Tarkowska,M.Strnad,I.Xenarios, C.S.Hardtke,Aqualitativecontinuousmodelofcellularauxinand brassinos-teroidsignalingandtheircrosstalk,Bioinformatics27(10)(2011)1404–1412. [128]A.Cruz-Ramírez,S.Díaz-Trivino,I.Blilou,V.A.Grieneisen,R.Sozzani,C. Zamioudis,P.Miskolczi,J.Nieuwland,R.Benjamins,P.Dhonukshe,etal.,A bistablecircuitinvolvingscarecrow-retinoblastomaintegratescuestoinform asymmetricstemcelldivision,Cell150(5)(2012)1002–1015.

[129]A.Pokhilko,A.P.Fernández,K.D.Edwards,M.M.Southern,K.J.Halliday,A.J. Millar,TheclockgenecircuitinArabidopsisincludesarepressilatorwith additionalfeedbackloops,Mol.Syst.Biol.8(1)(2012)574.

[130]V.Vermeirssen,I.DeClercq,T.VanParys,F.VanBreusegem,Y.VandePeer,

Arabidopsisensemblereverse-engineeredgeneregulatorynetworkdiscloses interconnectedtranscriptionfactorsinoxidativestress,PlantCell(2014), tpc-114.

[131]A.Joshi,R.DeSmet,K.Marchal,Y.VandePeer,T.Michoel,Modulenetworks revisited:computationalassessmentandprioritizationofmodelpredictions, Bioinformatics25(4)(2009)490–496.

[132]J.J.Faith,B.Hayete,J.T.Thaden,I.Mogno,J.Wierzbowski,G.Cottarel,S.Kasif, J.J.Collins,T.S.Gardner,Large-scalemappingandvalidationofEscherichiacoli

transcriptionalregulationfromacompendiumofexpressionprofiles,PLoS Biol.5(1)(2007)e8.

[133]R.A.Montes,G.Coello,K.L.González-Aguilera,N.Marsch-Martínez,S.de Folter,E.R.Alvarez-Buylla,ARACNe-basedinference,usingcurated microar-raydata,ofArabidopsisthalianaroottranscriptionalregulatorynetworks,BMC PlantBiol.14(1)(2014)97.

[134]P.E.Meyer,K.Kontos,F.Lafitte,G.Bontempi,Information-theoreticinference oflargetranscriptionalregulatorynetworks,EURASIPJ.Bioinform.Syst.Biol. 2007(2007)79879.

[135]S.Lebre,J.Becq,F.Devaux,M.P.Stumpf,G.Lelandais,Statisticalinferenceof thetime-varyingstructureofgene-regulationnetworks,BMCSyst.Biol.4(1) (2010)130.

[136]D.Marbach,J.C.Costello,R.Küffner,N.M.Vega,R.J.Prill,D.M.Camacho,K.R. Allison,M.Kellis,J.J.Collins,G.Stolovitzky,etal.,Wisdomofcrowdsforrobust genenetworkinference,Nat.Methods9(8)(2012)796–804.

Figure

Fig. 1. Conceptual view of the information flow in a computational approach. Biological data is used to identify genes of interest (Type 1 algorithm), infer connections between these genes (Type 2 algorithm), and predict types of these connections (Type 3 a
Fig. 3. Type 5 algorithms output in terms of the system of ODEs and predicted gene expression dynamics (g i (t)) based on experimental values (g i (t j ))
Fig. 4. Validation techniques for algorithm types with key references. Examples shown are those typically seen in current computational research approaches, specifically for research projects in A
Fig. 5. Transitions between algorithms of different types. Typical experimental transitions between algorithms are indicated with blue arrows and perspective future transitions, less common but possible with more reliable supporting algorithms, are indicat

References

Related documents

There has been growing interest in the detection of functional disabilities in young children.”2 A va- riety of developmental screening tests are effective in the early

Results: We discovered that spinal nerve ligation (SNL) produced neuropathic pain-related anxiety-like behaviors in rats, which could be specifically inhibited by

Email: [email protected] ABSTRACT The ethanolic extract of leaves from Anisomeles malabarica was tested for in vitro antioxidant using various free radical scavenging assays, such

They represent costs that cannot be identified as attributable to specific services or licensees such as general international consultation, propagation research covering many

widespread reforms in the agricultural sector, involving a move away from the previous collective regime to a system in which farmers have greater freedom in making production

Techniques: Sentiment analysis can be performed using machine learning approach or lexicon based approach or a.. combination

thetic HMR - E silencer: In the context of the wild-type (DRY1364; Figure 4) or SAS5 (DRY2109; Figure 5) re- stored silencing in an HMR-SS D I strain to wild-type HMR-E silencer

Figure 13: Normalized correlation results of di ff erent test images under noise attacks (a) embedding the watermark in the highest frequency subband of the host image (b) embedding