An analysis of quantitative PCR reliability through replicates using the Ct method

(1)

J. Biomedica doi: 10.4236/jb

Published On

An ana

using t

Chris C. St

1_{Bioprocess R} 2_{Department o} 3_{Department o}

Email: erik.bo Received 6 Fe

ABSTRAC

There is con suring nucle tions. The m thod is the r analyzed wit (Ct) method. performed h of initial con a concentrat resultant Ct dard and no riability/relia supports th replicates, th tically distin ten orders o tion. As exp grow as the demonstrate confound qu tion at low t that a miscl 3000 initial c tion region thermal wea vide data th detection str and plate fi classification becomes unr Keywords:M Molecule Co Replicates an

1. INTRO

Real time Po

al Science an bise.2010.3506

nline May 201

alysis of

the C

t

m

towers1, Fre

R&D Division D of Biomedical E of Biomedical In oczko@vanderb

ebruary 2010; re

CT

nsiderable in eic acids from most common real-time poly

th the crossin . Utilizing a m hundreds of re nditions whose tion range of value distribu ovel statistica ability of the e following he mean and/o nguishable an of magnitude ected, the var

number of in e that these v uantitative cla template conc lassification t copies of temp

correlates w ar of the TAQ hat indicate rategy based lling statistic n transition w reliable.

Misclassificati ounting; Rank nd Reliability

ODUCTION

olymerase Ch

nd Engineerin 64 Published On

0 in SciRes.h

f quanti

method

ederick R. H

Dow AgroScien Engineering Van

nformatics Van bilt.edu

evised 1 March

nterest in qua m single cells

nly employed ymerase chain ng point or cr multiwell plate eplicate react e initial numb

ten orders of utions are an al techniques

e PCR proce conclusions. or median Ct d can be ran in initial tem riances in the nitial copies d variances are assification of

centrations. T transition is plate DNA an with independ Q polymerase that an alter on the theor cs is accurate where the re

ion Transition Ordering Run

N

hain Reaction

ng, 2010, 3, 45 nline May 2010

http://www.scir

itative P

Haselton2, Er

nces LLC Indian nderbilt Univer nderbilt Univers

h 2010; accepted

antitatively m to small pop d laboratory

n reaction (PC rossing thresh e reader we h tions each at a ber of copies s f magnitude.

alyzed with s to assess the ess. Our anal

Given suffic values are sta nk ordered ac mplate concen e Ct distribut declines to 1.

large enough f the initial co The data indi centered aro nd that the tra dent data on enzyme. We p rnative endp ry of well mix

e below the m eal time met

n; Single nning Title; P

(PCR) is wi

59-469 0 (http://www.S

rp.org/journal/j

PCR re

rik M. Boczk

napolis, Indiana rsity Nashville, sity Medical Ce

d 2 March 2010

mea- pula- me-CR) hold have a set span The stan-e va-lysis cient atis-cross ntra-tions We h to ndi-icate ound ansi-the pro-point xing mis- thod PCR; dely use cal mo [4-6 pro to u to s pro fact per mu a m I eme cro amp con dan clo amp kno that the cate cos to u pen

T in t num the low 1 init tud 2 betw 3 init SciRP.org/journa /jbise

eliability

ko3* a, USA; Nashville, USA enter Nashville, 0.

ed for quantita and research dified foods, v 6]. In a real tim ocess is record

use the amplif solve the inve oxy for the am

tory solution red by the am

ltivariate natu model that acco

In its stead, th erged [12]. T ssing thresho plification cu nditions diverg nce with this

se to the dete plification err own result fro

t averaging ov number of re es appears typ st, time, effort

understand th ndence on initi The goal of th the distributio mber of identi

initial templ wing:

1) Are measu tial template c de?

2) How many ween differen 3) How does tial template c

al/jbise/).

y throu

A;

, Nashville, US

ative analysis areas includin vaccine effica me PCR react ded. The goal o

fication time s rse problem o mount of initia of this invers mplification o

ure of the enzy ounts for the v he ad-hoc cros The heuristic b ld is predicat urves from id

ge dramatical observation a ection thresho ror. But varia om the statist ver replicates eplicates. In p

pical. Since t t, and accurac he requiremen

ial template co his study was on of Ct valu

cally prepared late concentra

ures of centra concentration

y replicates ar nt initial templ the number o concentration?

gh repl

A.

[1-3] in a va ng the study o acy, and in sys

tion the DNA of a quantitati series data, of determining al template,

se problem ha f error, diluti yme system an variability [1,7 ssing point, C

behind the sel ted on the ob

dentically pre lly with iterati a threshold va ld to limit va ability remains

tics of Subsec reduces varian practice, from there is a bala cy it is of pra nt for replicate

oncentration. to examine t ues generated

d replicates as ation to determ

al tendency in over 10 orde

re required to late concentra of replicates d ?

JBiSE

licates

ariety of clini-of genetically stems biology amplification ive analysis is

g a reasonable

0 . A satis-as been ham-ion error, the nd the lack of 7-11].

Ct, method has

lection of the servation that epared initial ion. In accor-alue is chosen ariability from s. It is a well ction 4.8 [13] nce in ratio to m 1 to 5 repli-ance between actical interest es and its

de-the variability from a large s a function of mine the

fol-nformative of ers of

magni-o discriminate ations? depend on the

(2)

460 C. C. Stowers et al. / J. Biomedical Science and Engineering 3 (2010) 459-469

Using a multiwell plate format we measured hundreds of replicates to produce Ct value distributions. Using

standard and novel statistical techniques we analyze the

Ct value distributions and demonstrate that the sample

mean and/or median Ct values are statistically

signifi-cantly distinguishable over ten orders of magnitude. Furthermore, we show that the sample mean Ct values

are reliably ordered according to the initial concentration of template. In other words, if x and y are initial template

concentrations with x < y and µx and µy are the

corres-ponding sample mean Ct values then µx > µy. The order

reverses because less initial template requires more cycles of PCR to amplify. We utilize ordering as a con-venient and natural device to quantify the role of repli-cates on reliability. We ask and answer the following question: Given an unlabeled dilution series how many replicates are required to reliably order the tubes? We find that the answer depends on the range of initial tem-plate.

A focus of this work was to cover as broad a range of initial conditions as possible with the same experimental format. We observed that the mean and/or median Ct

values had the smallest variance above 104_{initial copies.}

Most published standard curves focus on this range [2]. Few studies have analyzed issues of variability and ro-bustness below this range. We show that below 104

ini-tial copies the probability of misclassification of the ini-tial template concentration given a Ctvalue grows

ra-pidly and saturates near a half. The dispersion in the Ct

value distributions and the rise in misclassification cor-relate with an independent measure of the thermal wear of the TAQ polymerase enzyme.

Driven by the observed broadening of the Ct value

distributions below a thousand initial copies, and in-spired by elegant methods that sidestep the issues created by the dynamics of exponential growth [14-18], we examined a format for single molecule detection uti-lizing an endpoint analysis and the statistical properties of well mixing and plate filling. We present data that such an assay is accurate where the real time method becomes unreliable.

2. MATERIALS AND METHODS

2.1. PCR

Rt-PCR results were generated using linearized double stranded EC3 plasmid DNA containing the ybdO gene. The plasmid was linearized by digestion with the restric-tion enzyme BamH1 prior to PCR. The following primer sequences were used.

1) Forward: 5’-AAT TAT TCT AAA ACC AGC GTG TC-3’

2) Reverse: 5’-TTT GGG ATT GAA TCA CTG TTT C-3’

The PCR supermix was prepared as described in [19], with the exception that we used Qiagen HotStarTaq Cat

# 203203, Roche dNTPs Cat# 13583000, DMSO Sigma # D8418 at 2%, and Sybr Green (Sigma # 86205) at 5-times the recommended concentration. Primers were used at a concentration of 1 µM. All samples were run

on the 384 well plate platform using an Applied Biosys-tems 7900HT thermocycler and the SDS 2.3 software. The Ct value threshold was set at 5.0 RFU (Relative

Fluorescence Units) for all samples. The DNA concen-trations of concentrated stocks were measured using a Nano-Drop 100 spectrophotometer prior to use. Subse-quent dilutions were performed using sterile, nuclease free water from Ambion # AM9937. The following thermo-cycling program was used.

1) 2 min at 50°C Initial Warmup Phase; 2) 15 min at 95°C Initial TAQ Activation Step; 3) 1 min at 95°C DNA Denaturation;

4) 1 min at 50°C Primer Annealing; 5) 1 min at 72°C DNA Extension;

6) 0.25 min at 80°C Fluorescence Measurement; 7) Repeat Steps 3-6 forty times.

2.2. Preparation of Identical Replicates

To ensure uniformity in the face of pipetting error the PCR supermix was prepared in well-mixed batches in a 14-mL conical tube. Each sample consisted of 184 rep-licates and 8 negative controls, requiring exactly half of a 384 well plate. All of the components except for DNA were loaded into a 14 mL conical tube in the following order: 800 µL PCR buffer, 5.6 mL of nuclease free water,

160 µL DMSO, 320 µL MgCl2 (Qiagen Cat #

124113012), 160 µL of a primer mix (a 1:1 mix of the

forward and reverse primer stored at a concentration of 50 M each), 160 µL Sybr Green (100X stored in DMSO),

160 µL of dNTPs, and lastly 80 µL of Taq polymerase.

We have noticed that the order at which these are added affects the reproducibility of the assay. The mixture was vortexed at high speed for 1 minute. 335 µL of supermix

was then removed to be used as a negative control and placed into a 1 mL eppendorf tube and 25 µL of water

was added. This mixture was then briefly vortexed to ensure well mixing. The remaining 7.105 mL of super-mix was then split equally four ways into 2 mL cryostat tubes, and 134 µL of water plus the amount of desired

DNA was added to each cryostat tube. Each tube was then briefly vortexed. For each reaction contained within a single well of the plate, 10 µL of the respective

reac-tion mix was loaded into a well of the 384 well plates.

2.3. TAQ Polymerase Pre-Wear Assay

(3)

C. C. Stowers et al. / J. Biomedical Science and Engineering 3 (2010) 459-4 461

added to the preworn enzyme with subsequent resump-tion of cycling. An efficiency was calculated by averag-ing the derivative over the resultant amplification curve.

2.4. Statistical Analysis of Ct Distributions

The sample mean Ct values for each initial template con-

centration were compared pairwise using a permutation test that is asymptotically valid and robust in situations where the distributions are not necessarily normal and/or the ratio of the variances is unknown, indicating that a t-test is not supported [20,21]. The test statistic T [20], measures the difference in mean rank of the samples within their union, scaled by a consistent estimator of their variance. Because the Ct value distributions may be

skewed by outliers, we also considered the median as a measure of central tendency. The median Ctvalues were

compared pair-wise using a bootstrap test that has been shown to outperform all reasonable alternative methods [22].

Given a linear regression, ymxb, of the mean/

median Ct values against x = log(n), the log of the

num-ber of initial copies of template, a relative error was cal-culated from the quantiles of the Ct value distributions as

follows. Allow x h

Q and x l

Q to be high and low

quan-tile values chosen from the Ct value distribution

gener-ated by initial log template x. Since the Ct value

gener-ally increases with decreasing amount of initial template the slope m of the regression line(s) is negative. Thus,

the difference in the predicted amount of initial template DNA from the distributions divided by the input amount is given as:

x

m b Q m b

Q_lx _hx

n n

10 10 10(  ) _ ( ) 



(1)

Let U represent the universe of possible Ct values, and

let T stand for the collection of possible initial template

concentrations. The initial template concentrations are thought of as the class labels. We consider the probabil-ity of misclassifying an observed Ctvalue given a known

class label. Suppose that we draw a Ctvalue from a

giv-en class, how likely is it to find that value in any of the other classes? The mean misclassification probability is estimated from the Ct value distributions corresponding

to different initial template concentrations according to the following formula.



 

U i

x \ T | i P x | i P x

P( ) ( ) ( ) (2)

where P(i|x) is the conditional probability of finding the Ct value i, given the initial template concentration x,

and the P(i|T\x) is the conditional probability of observing that same value given any initial template concentration other than x. The later conditional prob-ability is interpreted as the probprob-ability of misclassi-

fication. The conditional probabilities are estimated from the measured Ct value frequency distributions.

2.5. Plate Filling with Microbeads

Experiments were performed using 20µm latex beads

from Beckman Coulter (#PN6602798) using flat bottom 96 well plates from Becton Dickinson Labware. 96 well plates were used in place of 384 well plates for ease of microscopic analysis. Various dilutions of beads were prepared using a Beckman Multisizer Coulter Counter 2. 25 µL of each dilution was loaded into each well of the

96 well plate. The number of beads in each well was counted with a Nikon TE-2000 microscope.

2.6. Plate Filling Simulations

The statistics of the plate filling stochastic process were modeled using Monte-Carlo simulation. For instance, the expected number of empty wells in a 96 well plate was estimated by simulation using the following function:

Table[Mean[ Table[Length[

Complement[Range[96],

RandomInteger[Range[96], m]]], {10000}]], {m, 1, 600}]

Here m is the number of molecules being plated from a well mixed solution. The mean is estimated from 10,000 realizations. The standard deviation is computed by replacing the function Mean by Standard Deviation. A graph of these functions is shown in Figure 9. All

simulations and analysis were carried out in Mathemati-ca 6.03 (Wolfram Research), and the notebooks are available upon request.

3. RESULTS

The Ct value data are summarized in Figure 1. The

fig-ure shows that above 104_{copies the data are distributed}

about the median with smaller variance than those below. Outliers exist across all the data, mostly trending upward of the median, indicative of reactions lagging behind the pack. The data show the distributions as collected across ten orders of magnitude in initial template.

3.1. Mean and Median Ct values

While the Ct value distributions below 104 copies of

ini-tial template DNA are broad and noisy, the sample means and medians form a monotone increasing series when stratified according to initial template. These data are shown in Table 1. The means and medians are very

nearly equal in all cases and the data distributions appear unimodal.

At initial template concentrations larger than 104

(4)

462

F o r d

to identify d no hypothesi template con not within a another. Wh count the me a level of sig ception of th cally differen result of the in Table 2.

The Ctva

because we riances, stan

[image:4.595.107.493.101.495.2]

Table 1. Samp Observe that th

Distribution N

Log Copies T

Replicates

Mean Ct Valu

Median Ct Va

Figure 1. Summ of DNA amplif run at each initi dots. Outliers ar

istinguishable is test appears ncentrations it a fraction of hen all of the eans and medi gnificance gre he smallest tw

nt at the 3/10 significance t

alue distributi do not a pri dard hypothe

C. C. Stowers

Res.

ple mean and m he sample mean

Number

Template

ue

alue

mary of the Ct fied. As describ

ial condition. T re defined as po

e mean and m s necessary. H

is not clear th f a standard

data points a ians are statist eater than 1/20

wo medians t 000 level of test for the m

ions may no ori know the sis tests to co

s et al. / J. Bio

median Ct values ns and medians 1 2 9.0 8.0 184 182 6.6 10.6 6.6 10.4

value data, stra bed in the mater The box covers t oints beyond 3/

median values However at lo hat the means deviation of are taken into tically differen 0000 with the that were stat significance. means is displa

t be normal e ratio of the ompare means

omedical Scien

s. The correspon form an ordere

3 4 7.3 6.3 183 183 6 12.8 15.6 4 12.8 15.7

atified as a func rials and metho two quartiles ab /2 the discrete in

and ower s are one o

ac-nt at e

ex- tisti-The ayed

and va-s are

not bee hav con dist dec upp fou var exp trib larg ima crim

nce and Engin

nding distributi ed sequence stra

5 6 5.7 4.1 184 183 6 19.5 25.5 7 19.3 25.5

ction of the log ods section, a m

bout the median nterquantile ran

t justified. S en proposed f ve shown to ntains the T-tributions. Th clining as tem per boundary urth and fifth riance estimat plains the larg butions contai ger variances ately 184 rep mination over

neering 3 (201

ions are shown atified by initia

7 8 9

3.1 2.1 1.9 184 179 730 27.6 28.8 30. 28.1 28.7 30.

g of the initial n minimum of 175 n with outliers nge from the ed

Several non-p for this comm

have power. -statistic com he values of th mplate decreas of the null distributions tor takes its ge T value. Al in some over s. The data in

licates the C

r nine orders

10) 459-469

in Figure 1. al template.

10 11 9 1.1 0.3

0 175 178 7 35.5 36.4 7 35.0 35.7

number of copie 5 replicates wer shown as the re dge of the box.

parametric m mon situation The last row mparing temp he T-statistic ses and come distribution. s are non-ove

minimum va ll of the other rlap and henc ndicate that

Ct method can

of magnitude

JBiSE

es re ed

methods have n [20,21] and w of Table 2

plate-adjacent are generally closer to the Because the erlapping the alue and this r pairs of dis-ce have much with approx-n provide dis-e. One goal of

E

e d

2

t y e e e s -h

(5)

Copyright

our work, to analysis of h make the sam

3.2. Standa

A linear reg initial copies The regressi over the entir contrast, Fig

described by contains at l also be seen gression line In an inde and supermi experiences

C. C. Sto

o be describe how few repl me claim.

ard Curves

gression of th s of DNA tem ion line captu

re range of in

gure 1 gives th

y a function t least two sigm n in the oscilla in Figure 2.

ependent expe x, we examin as a function

[image:5.595.132.458.377.675.2]

Table 2. applied p treme val with the T Table 1. applied to side of th thesis is re

Property Min Max T

Figure 2. sion line i on the ind tributions.

owers et al. /

Res.

ed below, is licates are req

he mean Ct v

mplate is sho ures the data nitial template

he impression that is initially moid like tran ation of the d

eriment with t ned the wear n of thermocy

Results of test airwise to Ct v lues of the dist

T-statistic for a In each test, th o the pooled da he range indicat ejected with co

y 2v1 3v

-5.02 -3

3.93 4.

684.33 39

Linear regress is shown in red dividual data po

.

/ J. Biomedic

to provide so quired to relia

alues against own in Figur

a reasonably w concentration n that the data

y linear and t nsitions. This data about the

the same enzy that the enzy ycling alone.

ting the null hy value distributio ribution of the adjacent distrib he null distribut ata as described ting that the p-v

nfidence. v2 4v3

.85 -4.63

.17 3.87

9.61 51.45 2

sion of the mea d along with a 9 oints reflect on

cal Science a

ome ably

log

re 2.

well n. In a are then can e

re-yme yme The

mid of a wea whe ind tran gre

G mea from equ and in F

pro sha as tran

ypothesis of sto ons adjacent in

T-statistic und

butions. The dis tion was simul d in [20]. In eac

value is less th

5v4 6v5

-4.25 -4.06

3.74 4.13

23809.7 91.0

an Ct values wi 95% confidence ne standard dev

and Engineer

ddle of the tra approximately ar transition c ere the distrib dependent obse

nsition point ssion. The dat Given a regre

asure of the m the spread uation for n, a

d third quartile

Figure 5. The

oximately 20% arply below. T the red line, nsition region,

ochastic equalit n initial templat der the null hyp stributions, e.g. lated using 20,0 ch test the calc han 1/20000. In

7v6 8v7

-4.10 -3.94

5.03 4.77

24.27 7.29

th log initial co e interval in da

iation compute

ring 3 (2010)

ansition region y 25. As seen corresponds to butions begin t ervation, we s and consider ta are summar ssion model o relative error

in the Ct val

as described in es were used e data show th % above 104

The piecewise produces a l , and agrees e

ty. A hypothes te concentration pothesis are sh . “2v1”, are lab 000 random pe

ulated T-statist

n each case the

9v8 10v9

-4.62 -4.50

4.87 3.73

18.37 81.7

opy number. Th shed line. The ed from the Ct v

) 459-4

n corresponds n in Figure 1

o the region ne to broaden. B split the data red a piecew rized in Figur

of the data w r of the inver

lue distributio n Subsection to produce th hat the relativ

4_{initial copi}

e linear regre larger relative

lsewhere.

is test was n. The ex-hown along

beled as in ermutations tic fell

out-null

hypo-9 11v10

0 -3.57

3.74

4.22

he regres-error bars value

463

JBiSE

s to a Ct value 1, the thermal

ear 104_copies

ecause of this in two at this ise linear

re-re 4.

e computed a rsion process, ons, using the 2.4. The first he data shown ve error is ap-es, and rises ession, shown e error in the 3

E

e l s s s

(6)

464

3.3. Miscla

The graph o shown in Fi

that the broa the inverse p centration fr attempt to su

Ct value dist

initial templ value we con in the equatio The constr

[image:6.595.139.460.83.293.2] [image:6.595.136.462.345.544.2]

Figure compute Error ba

Figure to the tr ssification

f the relative

gure 5 is an

adening of the problem of ass rom a measur ummarize the tributions on t ate concentra nsidered stati on for P(x) de

ruction of a s

C. C. Stowers

Res.

3. TAQ effici ed as the avera ars represent a s

4. Piecewise lin rends observed

error of the attempt to qu e Ct value dis

signing an init red Ct value.

impact of th the process of ation based o istics such as escribed in Sub standard curve

s et al. / J. Bio

iency as a fun age derivative o standard deviati

near regression in Figures 1 an

linear regress uantify the ef stributions has tial template c In an alterna he variance of f categorizing on a measured

that summar bsection 2.4. e, as describe

omedical Scien

nction of therm of relative fluor ion over three i

n of the mean C

nd 3.

sion, ffect s on con-ative f the g the d Ct

rized

ed in

the con ano rep plat ure like clas the valu qua data

nce and Engin

mo-cycling pre-escence over th independent exp

Ct values. The d

previous sec ncentration to other. Consid resenting a c te concentrati d Ct value co

ely is it to m ss label? The distributions ue will be mi antifies this n a set, are show

neering 3 (201

wear. An effic he amplification periments.

data were split a

ction, is one o a measured der each dist lass. The cla ion. The heuri

oming from misassociate t larger the ov , the more lik isclassified. T notion. The re wn in Figure 6

10) 459-469

ciency is n curves.

according

way to assig d Ct value. N

tribution of ss label is th istic is this: G one of these this value wit verlap between

kely it is that The formula f esults, condit

6.

JBiSE

gn a template Now consider

Ct values as

he initial tem-Given a meas-classes, how th the wrong n the meat of t an unknown for P(x) given

tioned on our

E

e r s -w

(7)

Copyright

The analys results show probability o Over a narro concentration quently, at lo less saturate

C. C. Sto

[image:7.595.135.460.86.290.2] [image:7.595.133.461.374.578.2]

Figure and the ues cor whose l given in

Figure Subsect Hill's fu proxima

sis indicates, a wn in Figure

of misclassific ow range span n the probabi ower concentr es at the val

owers et al. /

Res.

5. Relative err e standard curve rresponding to limits were cal n Subsection 2.

6. Misclassifica tion 2.4 and con unction is show ately 2950 copi

as is clearly c

1, that abov

cation are sma nning the nex ility rises to rations, the pr lue of one h

/ J. Biomedic

ror of the PCR es shown in Fig the first and t culated using th 4.

ation frequency nditioned upon wn as the dash es of initial tem

orrelated with ve 105_copies

all, less than xt two decade one half. Su robability mor half and equ

cal Science a

process as calc gures 2 and 4, B third quartile w

he regression li

y calculated acc the Ct value dis hed line. The m mplate DNA.

h the the 0.1. es of

ubse-re or uates

clas the clas Hil that cur can

and Engineer

culated from th Blue and Red r were used to ca

ine(s), accordin

cording to the d stributions show midpoint of the

ssification to flip is interpr ss or out, in l’s function, s t the midpoin rs at 103.47_≈₂

n be seen in F

ring 3 (2010)

he Ct value dist respectively. Th alculate a ∆DN ng to the defini

definition of P(x

wn in Figure 1. e transition occ

coin flipping reted as the ch

a one versus shown as the nt of the misc 2950 initial co

Figure 2, the

) 459-4

tributions he Ct val-NA value

ition of n

x) given in

. A best fit curs at

ap-g. The two sid hance of bein

all sense. Fr dotted curve, classification opies of templ

ordered pair

465

JBiSE

ded nature of ng in the right rom a best-fit it is deduced transition oc-late DNA. As (3.47,25) lies 5

E

(8)

466

close to the stripe, indica error, misclas

3.4. Rank O

In the first s sample mean data are stat This was the plate concen stand how few

In this sec Suppose that in a set of tub a correctly ca taining the scrambled as that the amou series. The q PCR runs are

Ct values cor

given level o is one level o closely relate

Conditione Monte-Carlo we draw k r

their sample

k-replicates. T

numerical va class labels, x

fraction of po sample mean of k. The resu

regression li ating the con ssification and

Ordering wi

section we sum n and median C

tistically disti case using al ntration. Doub w data are req ction we imag

an investigato bes, with no er alibrated pipett dilution serie s to order, but unt of DNA w question is to e required such rrectly order, a of confidence?

of quantitation ed pre-requisite ed on our data simulation. F replicates unif

mean . Th The resulting s alues and if thi

x, we score thi

[image:8.595.134.465.484.687.2]

ositive draws t ns, from a tota ults are shown

Figure 7. rank order

C. C. Stowers

Res.

ine and well cordance betw d pre-wear dat

ith Replicate

mmarized the

Ct values comp

inguishable an ll 184 replicat

tless it is of quired to make gine the follo or has produce rror other than

teman. Howev es are unlabe t not contents. within the tubes determine ho h that the resul and hence labe

We imagine n removed from

e.

we can answe From each Ct

formly at rand his mimics an set are sorted is order agrees is trial positive that resulted in l of 20,000 dr in Figures 7a

Reliability of r ring the initial t

s et al. / J. Bio

within the e ween the rela ta.

es

findings that mputed from all nd rank orde tes per initial t interest to un e this same clai owing experim ed a serial dilu n that coming f ver, the tubes c eled and bec

It is unequiv s form a mono ow many repli ltant sample m el, the tubes wi

that rank orde m inversion, b

er this question value distribu dom and com n experiment w d according to t

s with that of t e. We compute n correctly ord raws at each v and8.

replicates given template concen omedical Scien error ative t the l the ered. tem- nder-im. ment. ution from con-ome vocal otone icate mean ith a ering but a n by ution mpute with their their e the dered value F ord cop our acc whe ind 90% ure wit vio 3.5

In t vari 100 dard by mo sep con and we reve S mix mu the tion the tive as t F

n the task of ran ntrations with x

nce and Engin

Figure 7 show

der the highes pies with grea r data. The use curacy. The in en all the da dicate that 35 o

% accuracy ov

e 8 shows tha

th smaller init r of the sampl

5. End Point

the previous s iance that sho 0 initial copies d curves or cl error. As an a lecule detectio arated the ana nsider the proc d experiment a show that the eals the pattern Suppose, as b xed, aliquots

ltiwell plate a total number n limitation of expected num e statistic that the total numb

Figure 9 sho

nk ordering a dil x ≥ 104_.

neering 3 (201

ws that indiv st concentratio

ater than 90% e of 8 or more nset to Figure

ata are consid or more replic ver the entire t the larger va tial template i

le over the ent

Detection

sections we h ows that below s of template D lassification vi alternative we on utilizing an alysis into two cess of plate fi

are in good ag e process of am

n of plate fillin before, that w

of a DNA so and perform 4

of wells that f the machine. mber of unamp

t has the prop ber of molecul ows the exce

lution series. Th

10) 459-469

idual data po ons from 104

% accuracy co e replicates gu

e 8 shows the

dered together cates are requi e concentratio ariance of the is responsible

tire range.

have detailed a w 104_{and ce}

DNA, quantita ia a Ct value i

describe a for endpoint anal o parts. In the

filling. We sho greement. In th mplification by ng.

we pipette id olution into th 40 cycles of PC amplified abo . Our data dem plified wells i perty that its e

les of DNA de llent congrue

he results for

JBiSE

oints can rank to 109_initial

onditioned on uarantees 99% e performance r. These data ired to exceed on range.

Fig-e distributions for the

beha-an beha-analysis of ertainly below ation via stan-is confounded rmat for single lysis. We have e first part we ow that theory he second part y PCR simply

(9)

Copyright

theory and e solutions of methods. Uti DNA from th ical bead cou endpoint ana pendent expe for PCR wer

C. C. Sto

Figure results results

Figure pected buted a from th dotted

experiment for 20 micron la ilizing PCR to hose that are n unting. Figure

alysis. Each d eriment. The re determined

owers et al. /

Res.

8. Reliability for rank orderi over the entire

e 9. Agreement number of em among the well heory is shown green line. Exp

r filling of 96 atex beads as o discriminate

not, is more c

e 10 describes

data point rep concentration d through dilu

/ J. Biomedic

of replicates g ing the initial t

range of initial

t between exper mpty wells is sh ls of a 96-well p n as the solid b perimental data

6-well plates w described in e wells filled w complex than

s the results o presents an in ns of DNA pl ution as descri

cal Science a

iven the task o template distrib template DNA

rimental and th hown as a funct plate. The expe lue line, while

are shown in re

with n the with opt-of an

nde-lated

ibed

in m are rem the

4. C

Wit var

and Engineer

of rank ordering butions with x A, see Table 1.

heoretical plate tion of the tota

cted number of one standard d ed.

methods. Wh uniformly sc markably well plate filling c

CONCLUS

th a sensitive riability of re

ring 3 (2010)

g a dilution ser < 104_{. Inset sh}

filling statistic l number of be f unfilled wells deviation is sho

hen these put caled by a fa with the theo curve.

SIONS

PCR plate rea al time PCR

) 459-4

ries. The hows the

cs. The ex-eads distri-calculated own as the

ative DNA c actor of 2.4 th ory over the en

ader we have process. We

467

JBiSE

concentrations he data agree ntire length of

examined the e have shown 7

E

s e f

(10)

468

that using le template con are statistica ponding with have shown gressed over

The Ctva

copies of in independent tion. Indepen enzyme expe in the corresp

Given a st used to quan values on th concentration over the hig ing near 20%

fication analy to high. The perhaps mor ror analysis

fication ana approach to ing a standar template con cation into o We are curre A question tribution data of replicates riance of th

[image:10.595.140.458.81.297.2]

Figure cules sp theory in

ess than a few ncentration en lly distinguish h the initial a

that the me ten orders of alue distributi itial template

statistical tec ndent data on eriences a tran ponding regio tandard curve, ntify the impa he process of n. The relative

hest initial te %. Both the re

ysis capture th misclassificat e intuitive to perhaps mor lysis suggest solving the in rd curve to co ncentration we ne of a discre ently exploring n central to th a concerns th

stems from t e distribution

C. C. Stowers

Res.

10. The numbe pread over a 38 n blue. The red

w hundred rep nsures that the

hable and ran amount of tem ean/median va

magnitude. ions appear DNA and th chniques conf

n TAQ-wear nsition of decr on.

, the Ctvalue d

act of the var predicting th e error varies emplate conce elative error a he transition f tion frequency

interpret whi re quantitative ts an alterna nverse proble onvert a Ct va

e consider pro ete number of

g this idea. he analysis of he role of repl the statistical n of sample m

s et al. / J. Bio

er of unamplifi 84 well plate. T d dots represent

plicates per in e mean Ct va

nk ordered cor mplate DNA. alues can be

noisy below he results of firm this obse indicate that reasing efficie

distributions w riability of th he initial temp

from 7% to 5 entrations ave and the miscla from low varia y is smoother le the relative e. The miscla ative clasifica

m. Instead of alue into an in babilistic clas f template clas

f the Ct value

licates. The v fact that the means is sma

omedical Scien

ied wells as a f The expected nu

experimental P

nitial alues

rres-We e

re-104 two erva-t erva-the ency were he Ct

plate 50% erag-assi- ance and e er-assi- ation f us-nitial ssifi- sses. dis-value va-aller than can cate cibi nien Ran num init data 35 reg dish lica sca trat met wel T dem tion tim DN How hun qua alte cou ana W lecu hav tha

nce and Engin

function of the umber of empt PCR endpoint d

n the variance n the data dist es are require ility? Rank or nt and meanin nk ordering si mber of replic tial template. a points provi or more repli ion for the sam heartening res ates are requir rce. For samp tions it may b

thod using th lls than to con These data, an monstrated tha n with statistic me PCR capab NA samples ra wever the dat ndreds of initi antitation. We ernative meth unting. In this alysis shows si We have des

ule counting ve demonstrat t the expecte

neering 3 (201

number of DN ty wells calcula data.

e of the data d tributions tea d in practice f rdering of the ngful device f imulations wit cates required Below the tr ide better than icates are sugg

me degree of sult for the fo red precisely ples with sma e more accura he expected n nsider replicat

nd the work at the use of cal replicates ble of quanti anging from u ta presented a

al copies real e and other gr hods for singl s regard, a p ignificant prom cribed a deco into plate fill ted through s ed number of

10) 459-469

NA mole-ated from

distributions. ach about how

for resolution e sample mean

for exploring th our data su d depends on ransition regi n 90% rank ac gested below rank accuracy ollowing reaso where the sa all initial temp

ate to conside number of (u es.

of other grou the Ct metho

renders the p itative analys upwards of 1 above show th

time PCR is roups have be le molecule d process involv

mise. omposition o

ing and ampl simulation an f (un)amplifie

JBiSE

What lessons w many repli-n arepli-nd reprodu-ns is a conve-this question. uggest that the the range of on individual ccuracy, while the transition y. But this is a on: More rep-ample may be plate concen-er an endpoint un) amplified

ups [24] have d in conjunc-process of real sis for initial 04_molecules.

hat for tens to unreliable for een exploring detection and ving endpoint

of single mo-lification. We d experiment ed wells is a

(11)

C. C. Stowers et al. / J. Biomedical Science and Engineering 3 (2010) 459-4 469

robust statistic on which to base an inverse problem or a rigorous hypothesis test to count small numbers of single molecules. The observed linear scaling between expected and perceived DNA concentration indicates that amplification by PCR is directly related to plate

filling.

It remains an open problem to determine the condi-tional probability with which PCR can amplify above threshold in 40 cycles from a single strand of DNA. While it is currently impossible to enumerate individ-ual molecules or particles smaller than a nanometer, it is straightforward to count macroscopic objects such as latex beads or single yeast cells with a Coulter counter. Haploid yeast cells provide a convenient and verifiable means to deliver single copies of Bacillus subtilis genes such as ybdO into the wells of a multiwell PCR plate.

In this way, we are currently exploring the relation-ship between amplification and DNA copy number.

5. ACKNOWLEDGEMENTS

This work was partially supported through NSF-DMS 0443855, NSF ECS 0601528, NIH EB009235, and the short-lived W. M. Keck Foun-dation Grant#062014.

REFERENCES

[1] Freeman, W., Walker, S. and Vrana, K. (1999) Quantita-tive rt-PCR: Pitfalls and potential. Biotechniques, 26, 124-125.

[2] Larionov, A., Krause, A. and Miller, W. (2005) A stan-dard curve based method for relative real time PCR data processing. BMC Bioinformatics, 6, 62.

[3] Yuan, J., Reed, A., Chen, F. and Stewart, C. (2006) Sta-tistical analysis of real-time PCR data. BMC Bioinfor-matics, 7, 85.

[4] Abramov, D., Trofimov, D. and Rebrikov, D. (2006) Accuracy of a real-time polymerase chain-reaction assay for quantitative estimation of genetically modified food sources in food products. Applied Biochemistry and Mi-crobiology, 42, 485-488.

[5] Diehl, F., Li, M., Dressman, D., Yiping, H., Shen, D., Szabo, S., Diaz, L., Goodman, S., David, K., Juhl, H., Kinzler, K. and Vogelstein, B. (2005) Detection and quantification of mutations in the plasma of patients with colorectal tumors. Proceedings of the National Academy of Sciences, 102, 16368-16373.

[6] Roussel, Y., Harris, A., Lee, M. and Wilks, M. (2007) Novel methods of quantitative real-time PCR data analy-sis in murine HelicobaCter pylori vaccine model. Vac-cine, 25, 2919-2929.

[7] Jagers, P. and Klebaner, K. (2003) Random variation and concentration effects in PCR. Journal of Theoretical Bi-ology, 224, 299-304.

[8] Lalam, N., Jacob, C. and Jagers, P. (2004) Modeling the PCR amplification process by a size-dependent

branch-ing process and estimation of the efficiency. Advanced Applied Probability, 36, 602-615.

[9] Liu, W. and Saint, D. (2002) A new quantitative method of real time reverse transcription polymerase chain tion assay based on simulation of polymerase chain reac-tion kinetics. Analytical Biochemistry, 302, 52-59. [10] Nedelman, J., Haegerty, P. and Lawrence, C. (1992)

Quantitative PCR: Procedures and precision. Bulletin of Mathematical Biology, 54, 477-502.

[11] Vaerman, J., Saussoy, P. and Inargiola, I. (2004) Evalua-tion of real-time PCR data. Journal of Biological Regu-lators and Homeostatic Agents. 18, 212-214.

[12] Pfaffl, M. (2001) A new mathematical model for relative quantification in real-time PCR. Nucleic Acid Research, 29, e00.

[13] DeGroot, M.H. (1986) Probability and statistics. 2nd Edition, Addison Wesley Reading MA,Massachusetts. [14] Cady, N., Stelick, S., Kunnavakkam, M., Lui, Y. and

Batt, C. (2004) A microchip-based DNA purification and real-time PCR biosensor for bacterial detection. Pro-ceedings of IEEE Sensors,Vienna, 1191-1194.

[15] Matsubara, Y., Kerman, K., Kobayashi, M., Yamamura, S., Morita, Y. and Tamiya, E. (2004) Microchamber assay based DNA quantification and specific sequence detection from a single copy via PCR in nanoliter vo-lumes. Journal of Biosensors and Electronics, 20, 1482-1490.

[16] Mitra, R. and Church, G. (1999) In situ localized amplification and contact replication of many individual DNA molecules. Nucleic Acids Research, 27, e34. [17] Samatov, T., Chetverina, H. and Chetverin, A. (2006)

Real-time monitoring of DNA colonies growing in a po-lyacrylamide gel. Analytical Biochemistry, 356, 300-302. [18] Williams, R., Peisajovish, S., Miller, O., Magdassi, S.,

Tawfik, D. and Griths, A. (2006) Amplification of com-plex gene libraries by emulsion PCR. Nature Methods, 3, 545-550.

[19] Karsai, A., Muller, S., Platz, S. and Hauser, T. (2002) Evaluation of a home-made SYBR Green 1 reaction mixture for real-time PCR quantification of gene expres-sion. Biotechniques-Short Technical Reports, 32, 790- 796.

[20] Neubert, K. and Brunner, E. (2007) A studentized per-mutation test for the non-parametric Behrens-Fisher problem. Computational Statistics & Data Analysis, 51, 5192-5204.

[21] Reiczigel, J., Zakarias, I. and Rozsa, L. (2005) A boot-strap test of stochastic equality of two populations.

American Statistical Association, 59, 156-161.

[22] Wilcox, R.R. (2006) Comparing medians. Computational Statistics & Data Analysis, 51, 1934-1943.

[23] Stowers, C. and Boczko, E.M. (2008) Platescale: The birthday problem applied to single molecule PCR. Bio-comp’08, Las Vegas. http://www.ucmss.com/cr/main/

papersNew/papersAll/BIC9135.pdf