Near Infrared Spectroscopy (NIRS) as a Process Analytical Technology (PAT) Tool to Enable Golden Batch Performance Using a Novel MIDUS Control Automation Platform.

(1)

ABSTRACT

VANN, LUCAS RICHARD. Near Infrared Spectroscopy (NIRS) as a Process Analytical Technology (PAT) Tool to Enable Golden Batch Performance Using a Novel MIDUS Control Automation Platform. (Under the direction of John Sheppard).

Developing a process with a quality by design approach must begin with the

appropriate identification of both Critical Quality Attributes (CQAs) and Critical Process

Parameters (CPPs). This holistic approach is required to realize maximum process

understanding and knowledge, increase potential for regulatory approval and equip

manufacturers to evaluate changes and implement change control procedures. Establishing a

robust control strategy during process development that can be scaled up and transferred to a

manufacturing process is crucial to ensure product quality throughout the product lifecycle.

Implementation of a strategy that can not only accurately monitor CPPs but also the

interdependencies between them is key to enable the advanced control required to ensure

consistent ideal or “golden” batch performance. This is specifically true in the case of large

molecules where there are thousands of metabolic pathways that can be altered based on

minute environmental changes that are often extremely synergistic in nature. To fully

develop a design space that characterizes acceptable performance, multivariate tools are

necessary. This is clear from a process development standpoint where DOE is routinely

utilized; however, there is a disconnect when transitioning to manufacturing where

monitoring and control is simplistic and primarily univariate. Multivariate tools such as

predictive models to create soft sensors are a requirement to maintain the level of

characterization of the process that enables advanced control. Currently the

(2)

from control systems (DCS, SCADA, PLC), advanced in-line monitoring from tools such as

near infrared spectroscopy (NIRS), at-line monitoring from sample analysis, offline data

from laboratory systems (LIMS) and multivariate (MVA) models to elucidate the

interdependencies of the various CPPs requires a platform that enables full integration that

has not been fully realized to date. The current research is aimed at overcoming these

challenges to bridge the gaps that currently exist in the industry.

The first study demonstrates the novel application of NIRS in a mammalian cell

culture process for feedback control of optimal harvest time in an influenza virus production

process. The second expands on the use of NIRS in a novel application for feedforward

control in beer fermentation. The final study culminates in demonstrating a novel platform,

MIDUS Control, that bridges the current automation gaps which exist in the industry thereby

(3)

(4)

Near Infrared Spectroscopy (NIRS) as a Process Analytical Technology (PAT) Tool to Enable Golden Batch Performance Using a Novel MIDUS Control

Automation Platform

by

Lucas Richard Vann

A dissertation submitted to the Graduate Faculty of North Carolina State University

in partial fulfillment of the requirements for the degree of

Doctor of Philosophy

Food Science

Raleigh, North Carolina

2018

APPROVED BY:

_______________________________ _______________________________

John Sheppard Tyre Lanier

Committee Chair

(5)

DEDICATION

This dissertation is dedicated to the love of my life, my best friend and companion through

(6)

BIOGRAPHY

Lucas Vann was born in Montreal Canada and is the second of 4 children of Rick and Donna

Vann. He received a Bachelors (2000) and Masters (2002) from McGill University in

Biosystems Engineering under the direction of Dr. John Sheppard. Lucas began working for

the Biomanufacturing Training and Education center at North Carolina State University in

2008. In 2009 he started to pursue a Ph.D, part time, in Bioprocessing in the Food Science

(7)

ACKNOWLEDGMENTS “

I can do all things through Christ who strengthens me” Philippians 4:13.

I am nothing without my Lord and Savior, Jesus Christ, with whom all things are possible

and I must first acknowledge His awesome grace and mercy.

I want to thank my beautiful wife (Ilse Zainos Vann) for her love, patience and support

during this time. I would be lost without you. Thanks to my family as well for all you have

done and continue to do for us.

I need to thank Dr. Sheppard not only for my Ph. D. but also for my job at NCSU and (a

“little” ways back in time) my Masters at McGill. You have played such a large role in who I

am professionally today and I thank you for your guidance as well as for your friendship that

I value highly.

Thanks to the Sheppard lab and especially Blake Layfield and John Durham (honorary

labmate). You guys helped me get through some tough times and I will always consider you

brothers.

I would lastly like to acknowledge all my professors and also my coworkers at BTEC who

have helped me grow and learn both professionally and personally. NCSU is an awesome

(8)

TABLE OF CONTENTS

LIST OF TABLES ... ix

LIST OF FIGURES... x

LIST OF ABBREVIATIONS ... xvii

CHAPTER 1: LITERATURE REVIEW. Historical Overview of Biomanufacturing and the Regulatory Evolution that has Resulted in the Current State of Process Understanding and Control ... 1

Introduction ... 2

Quality by design (QbD) ... 7

Process Analytical Technology (PAT) ... 9

Multivariate Analysis ... 10

Analytical Tools for Monitoring and Control ... 11

Optical / Spectroscopic Sensors ... 13

Raman Spectroscopy ... 13

Near Infrared Spectroscopy (NIRS) ... 14

Chemometrics ... 15

Chemometric Model Building Methodology... 17

Current Industry Limitations ... 21

Novel Approach ... 23

MANUSCRIPT 1. Evaluation of Near-Infrared Spectroscopy (NIRS) for On-line Determination of Virus Titers in Cell Culture Influenza Vaccine Manufacturing ... 25

Abstract ... 26

Introduction ... 27

Process Development and Optimization Strategies ... 30

Use of NIR for Monitoring Cell Metabolism and Virus Production ... 32

Chemometrics ... 32

Materials and Methods ... 34

(9)

Near Infrared Spectroscopy (NIRS) ... 35

Hemagglutination (HA) Assay ... 36

Chemometrics ... 37

Hemagglutination (HA) Assay Development ... 38

2 L Bioreactor Runs and Virus Model Validation ... 39

Results and Discussion ... 40

Shake Flask Study and Chemometric Modeling ... 40

Hemagglutination HA Reference Method and Model development... 48

2 L Bioreactor Validation Batch ... 51

Conclusions... 53

MANUSCRIPT 2. The Application of Near-Infrared Spectroscopy (NIRS) in Beer Fermentation for On-line Monitoring of Critical Process Parameters and their Integration into a Novel Feedforward Control Strategy... 54

Abstract ... 55

Introduction ... 56

Yeast Strains ... 64

Wort Production ... 65

Propagation and IPA Pilot-Scale Fermentation ... 66

FAN Analysis... 67

IPA Calibration Batches for NIR Models ... 67

IPA 300 L Validation Batch ... 68

Blonde Ale 300 L Validation Batch ... 69

IPA Bench Scale Batches for Repitching ... 70

Near Infrared Spectroscopy Fermentation Analysis ... 71

Chemometrics ... 72

Model Building ... 73

(10)

Blonde Ale Validation Batch ... 85

Feed Forward Control Strategy ... 86

Conclusions ... 92

MANUSCRIPT 3. Use of Near-Infrared Spectroscopy (NIRs) in the Biopharmaceutical Industry for Real-Time Determination of Critical Process Parameters and Integration of Advanced Feedback Control Strategies using MIDUS Control ... 94

Abstract ... 95

Introduction ... 96

Monitoring ... 98

Data Analysis and Integration ... 100

Advanced Control Implementation ... 102

Scale-down 2 L Bioreactor Study ... 105

Batch Monitoring ... 106

Data Analytics ... 107

Scale-up to 300 L Manufacturing batches ... 109

Spectroscopy ... 110

Chemometrics ... 110

Additional 30 L Batch Runs ... 111

Advanced Control Strategy Designer ... 113

Scale-down 2 L Bioreactor Study ... 116

Scale-up to 300 L for NIR Model Building ... 120

B Vitamin Analysis ... 124

Advanced Control Strategy... 127

Conclusions ... 133

SUMMARY AND CONCLUSIONS ... 136

(11)

APPENDICES ... 149 Appendix A. Record of Invention 1: Cell Separator and Repitcher... 140

(12)

LIST OF TABLES

MANUSCRIPT 2 ... 54 Table 1. Four consecutive re-pitches used to determine limits on

performance parameters based on process capability ... 89

Table 2. Data from selected batches of 7 consecutive re-pitches indicating a reduction in population performance in pitch 7 and the result of an

increased pitch rate in batch 8. Data entries in bold signify values

(13)

LIST OF FIGURES

MANUSCRIPT 1 ... 25 Figure 1. Common pre-processing and data transformation techniques used

for spectra prior to building models. The raw spectra are shown on

the top left that have been base-line corrected on the top right to

eliminate drift. Data must be mean centered and scaled as shown

on the bottom left which results in the spectra as seen on the

bottom right ... 41

Figure 2. A simplified graphical example showing one principle component axis (PC1) derived from three variables X1, X2 and X3. The data

points are mean centered and scaled prior to plotting and the best

fit plane (PC1) is drawn. Scores (t) are related to the projection of

each point onto the plane and loadings (p) refer to how closely

related the original axis was to the new plane ... 43

Figure 3. Two plots used to evaluate a principle component analysis model. The left plot shows the amount of residuals of the spectra produced

from the model compared to the original spectra when 3 factors or

principle components are used. The right plot shows the comparison

of scores for the first and second principle components which are

orthogonal to each other ... 44

Figure 4. Common evaluation plots from 2 Partial Least Square (PLS) prediction models. The top 4 plots are an initial model and the

bottom a refined model. Cross validation is used to evaluate

prediction accuracy while residuals look at which samples are not

well described on a spectral level. The scores plots are beneficial at

evaluating outliers or clusters and the standard error helps to

identify the required number of principle components required to

(14)

Figure 5. Cross validated results (one sample out at a time) for Glucose (R2 = 0.97), Lactate (R2 = 0.99), Ammonia (R2 = 0.98),

Glutamine (R2 = 0.99) and Total Cell Count (R2 = 0.98). Results

indicate that near infrared spectroscopy is well suited for these

cell culture applications ... 47

Figure 6. Four variations of the same PLS model. In each case a different reference method was used to generate the actual concentrations.

The standard method (top left) shows spectral variation in samples

with the same actual concentration result. Averaging multiples

assays (top right) or performing additional dilutions (bottom left)

show improvement however, the best results are shown using a

novel microscope method (bottom right) ... 49

Figure 7. Representation of the standard Hemagglutination Assay (HA) used to determine hemoglutanin concentration. The measurements (in

millimeters) were performed using a microscope. The standard

assay result yields a value of 32 however, it is evident additional

virus is present which was calculated based on the resulting area of

the intermediate blood spot formed ... 50

Figure 8. The final model evaluation results indicating 3 factors or Principle components are required to describe most of the variation.

Concentration and spectral residuals were reduced by a factor of 5

and 10 respectively and the R2 increased from 0.64 to 0.95 ... 51

Figure 9. Validation batch results. The left plot shows the evolution of HA concentration over time in real-time (solid line) including the off-

line assay results (●) from samples taken during the run. The right

plot indicates the prediction residuals of each scan for determination

(15)

MANUSCRIPT 2 ... 54 Figure 1. Principle Component Analysis score plot of scans taken from 3

IPA calibration batches to determine relative factor weighting. All

3 batches predominantly reside in the same general area as indicated

by the elliptical outline. Arrows indicate 7 potential outliers solely

based on spectral variation which help identify samples that may

introduce calibration model error ... 74

Figure 2. Partial Residual Error Sum Squared (PRESS) plots of initial (♦) and final (■) calibration models for ethanol ... 75

Figure 3. Actual vs predicted concentration plots for each constituent

calibration model indicating best fit line and R2 values. Free Amino

Nitrogen (●), Total Cell Count (▲), Specific Gravity (♦), and

Ethanol (■) ... 76

Figure 4. Residual concentration plots for each constituent to determine bias for low or high concentrations based on the absence of inherent

trends. Free Amino Nitrogen (●), Total Cell Count (▲), Specific Gravity (♦), and Ethanol (■). All samples are ordered from low to

high "actual" concentration with the x axis indicating sample

number. Only Total Cell count displays an upward trend indicating

a concentration bias ... 77

Figure 5. Predicted on-line NIR data for Specific Gravity (♦) and ethanol (■) as well as off-line measurements of Specific Gravity (●) and ethanol

(▲) concentrations in a 300 L IPA fermentation batch. The arrow

indicates the time point at which glucose was totally depleted and

the yeast began to metabolize maltose as the main carbon source ... 79

Figure 6. Sugar profiles determined using HPLC over the fermentation of a 300L pilot scale batch of IPA. Maltose (♦), Glucose (■) and

(16)

Figure 7. Prediction residual plots for Specific Gravity (♦) and Ethanol (■). Residuals were determined based on segmented linear regressions

of prediction plots and used as an indicator of variability in

prediction ... 81

Figure 8. On-line NIR predictions (●) and off-line spectrophotometric

measurements (■) of FAN levels in a 300 L IPA batch ... 82

Figure 9. On-line NIR predictions (▲) and off-line automated cell counter measurements (■) of TCC concentrations in a 300 L IPA

fermentation batch ... 83

Figure 10. Predicted on-line NIR data for Specific Gravity (♦) and ethanol (■) as well as off-line measurements of Specific Gravity (●) and ethanol

(▲) concentrations in a 300 L Blonde Ale fermentation batch using

models generated from IPA calibration batches... 86

Figure 11. On-line NIR predictions (●) and off-line spectrophotometric measurements (■) of FAN in a 300 L Blonde Ale fermentation

batch using models generated from IPA calibration batches ... 86

Figure 12. Four consecutive re-pitches used to determine process specification limits based on inherent batch variability. Pitch 1 (■), pitch 2 (♦),

pitch 3 (●) and pitch 4 (▲) ... 88

Figure 13. SG (top) and ethanol (bottom) data for 5 batches identified as pitch 2 (♦), pitch 4 (■), pitch 6 (▲), pitch 7 (●) and pitch 8 (×) and taken

from 8 consecutive batches (7 re-pitches). The plots indicate

reduced population performance in pitch 7 which is corrected by an

increased pitch rate in the subsequent batch ... 90

MANUSCRIPT 3 ... 94 Figure 1. The relationship between Critical Process Parameters (CPPs) and

Critical Quality Attributes (CQAs). Cell environment (CPPs)

(17)

critical parameters that can be controlled within the bioreactor

(CCPs) therefore have a direct impact on cell performance as

measured by KPAs which are directly linked to final product

quality. The lists shown for each group is not comprehensive but

serves as an example of possibilities ... 97

Figure 2. Examples of in-line, on-line, at-line and off-line monitoring. These are variable data types that are often stored in different locations

making integration a common challenge. Standard parameter

control and data storage can accomplished using Distributed

Control Systems (DCS), Supervisory Control and Data Acquisition

(SCADA) systems as well as simple Programmable Logic

Controllers (PLC) ... 100

Figure 3. Trends of “good” (left) and “bad” (right) batches. Failed batches were identified based on an abnormal reduction in growth rate near

the end of the batch and that the main carbon source (glucose) was

not totally consumed during the total batch time of 6 hours ... 115

Figure 4. Amino acid percent weight comparison between two lots of yeast extract (YE) from the same supplier and manufacturer. The old lot

with acceptable performance is represented by solid bars and the

new lot with poor performance by light textured bars ... 116

Figure 5. Averaged glucose (decreasing trend lines) and optical density (increasing trend lines) data for 3 batches at each media condition.

Yeast Extract new lot at 12 gL-1 (♦), YE new lot at 16 gL-1(●) and YE old lot at 12 gL-1 (▲). Error bars are plus and minus one

standard deviation ... 117

Figure 6. Scores plot of data points from all 9 batches. The black ellipse to the bottom left represents all points from batches with the original

(18)

Figure 7. The three main contributors to variation between good and bad batches (isolated from all the sources shown in the embedded top

left plot) based on a PLS Batch Level Model (BLM) relating in-

line, at-line and off-line data to Key Performance Attributes (KPA).

Shown is the time range of 4 to 6 hours where the metabolic shift

occurred, alanine (♦), agitation (●) and methionine (▲) ... 119

Figure 8. NIR model calibrations for 3 constituents, alanine (top left), methionine (top right) and glucose (bottom left). All cross

validated (one sample out at a time) R2 values were above 0.95

which indicates a relatively good fit for each model. The final plot

(bottom right) shows the reduction in error as additional factors are

added to the glucose model with a final recommendation of 5

factors based on the predicted residual error sum squared (PRESS)

values ... 121

Figure 9. 300 L validation batch data with NIR scans (average of 128 scans taken every 15 minutes) for the following models: glucose (♦),

methionine (×) and alanine (+) as well as at-line YSI measurements

for glucose (▪) and offline UPLC measurements for methionine (▲) and alanine (●) determined from manual samples taken at periodic

intervals ... 123

Figure 10. Comparison of various B vitamin concentrations, determined by HPLC, in 2 lots of yeast extract at 12 g L-1. The solid bars represent

the old lot with acceptable performance and the textured bars

represent the new lot with poor performance. (Pantothenic values

are shown divided by a factor of 10 for scaling purposes) ... 125

Figure 11. Top: Glucose (decreasing trend lines) and optical density (increasing trend lines) data for three 30 L batches at identical

(19)

4.75 hours (▲). The bottom plot shows the real time agitation

speed for each batch, 12 gL-1 YE (bottom trend line) 12 gL-1 with addition of B vitamins at 4.75 hours (middle trend line) and 16 gL-1 YE (top trend line). Agitation was in cascade control and

automatically changed based on oxygen demands of the culture

(increased respiration rate requires higher agitation speeds) ... 126

Figure 12. Unique strategy designer for advanced process control that initiates at the start of every batch. Three control workflows are shown

including a univariate model monitoring abnormal depletion rate

of alanine (1), a set of rules implemented using a case block (taken

from the menu on the bottom left and easily configured as shown

on the bottom right) that when satisfied trigger induction (2), and a

complex control strategy that predicts a filter clog will occur using

a multivariate model and closes the loop to effect changes to save

the filter and the batch (3) ... 129

Figure 13. Trend data of a run that exhibited a filter clog resulting in over pressurization causing a lost batch. Bioreactor pressure is shown in

red with the pressure set-point in orange. The light blue is the

backpressure control output and dark blue is the airflow rate ... 131

Figure 14. Continuous process verification (CPV) control charts from an E. coli fermentation of a Univariate Analysis of pressure

differential between set-point and current value (top) and a soft

sensor multivariate analysis PCA model characterizing

backpressure control (bottom). Pressure deviates outside of

specification limits (± 3 sigma) at 9.1 hours while the multivariate

model exceeds warning limits (2 sigma) at 5.7 hours and exceeds

specification limits (3 sigma) at 5.8 hours essentially predicting

(20)

Figure 15. Schematic of the novel Monitoring, Integration, Data analysis, Understanding, Strategy & Control (MIDUS Control) platform

(21)

LIST OF ABBREVIATIONS

A&C: Analytics & Control

ABE: Acetone, Butanol, and Ethanol

AEP: Average Error of Predictions

APC: Advanced Process Control

AVP: Average Variability in Prediction

BEM: Batch Evolution Model

BLM: Batch Level Model

CCPs: Critical Control Parameters

CIP: Clean-In-Place

Cp: Process Capability

CPPs: Critical Process Parameters

CPV: Continuous Process Verification

CQAs: Critical Quality Attributes

CUs: Concentration Units

CV: Coefficient of Variation

DCS: Distributed Control Systems

DNA: Deoxyribonucleic Acid

DO: Dissolved Oxygen

DOE: Design of Experiment

EPR: Ethanol Production Rate

FAN: Free Amino Nitrogen

FDA: Food and Drug Administration

FT-NIR: Fourier Transforms- Near Infrared

GFP: Green Fluorescent Protein

HA: Hemagglutination Assay

HPLC: High-performance liquid chromatography

(22)

HWE: Hot Water Extract

ICH: International Conference on Harmonization

IPA: Indian Pale Ale

IPTG: Isopropyl β-D-1-thiogalactopyranosid

IR: Infrared

KPAs: Key Performance Attributes

LIMS: Laboratory Information Management System

LSL: Lower Specification Limit

LT: Lag Time

MDCK: Madin-Darby Canine Kidney

MIDUS: Monitoring, Integration, Data analytics, Understanding, Strategy design

MIR: Mid-Infrared

MOI: Multiplicity of Infection

MSC: Multiplicative Scatter Correction

MVA: Multi Variate Analysis

MVDA: Multivariate Data Analysis

NA: Neuraminidase

NIRS: Near Infrared Spectroscopy

OPA: O-phthaldialdehyde

OPC: Object Linking and Embedding for Process Control

PAT: Process Analytical Technology

PBS: Phosphate Buffer Solution

PC: Principle Component

PCA: Principle Component Analysis

PDA: Photo Diode Array

PID: Proportional, Integral, Derivative

PLC: Programmable Logic Controllers

PLS: Partial Least Squares

(23)

QbD: Quality by Design

QTPP: Quality Target Product Profile

RI: Refractive Index

SCADA: Supervisory Control and Data Acquisition

SCR: Sugar Consumption Rate

SECV: Standard Error of Cross Validation

SG: Specific Gravity

SIP: Steam In Place

SNV: Standard Normal Variate

SPC: Statistical Process Control

TC: Total Carbohydrates

TCC: Total Cell Count

TCID 50:Tissue Culture Infectious Dose 50%

UPLC:Ultra performance liquid chromatography

USL: Upper Specification Limit

UVA FD: Univariate Analysis Fault Detection

WHO: World Health Organization

(24)

CHAPTER 1. LITERATURE REVIEW

(25)

INTRODUCTION

Biomanufacturing in its simplest definition refers to the use of a living organism to

manufacture a product. It arguably has the potential to be a defining technology of this

century. The field of Biomanufacturing is vast and interdisciplinary incorporating

biotechnology, bioengineering, biochemistry and microbiology, among others, in the

production of various products from fermented foods such as yogurt and beer to therapeutic

proteins such as monoclonal antibodies. There are numerous other examples which include

vaccines, amino acids, industrial enzymes, biochemicals and biofuels to name a few. Based

on the products mentioned it is clear that Biomanufacturing is both a very old as well as a

very new science with advances being made continually. Regulations, based primarily on

patient safety, have evolved alongside Biomanufacturing in a unique fashion. At first this

occurred primarily in response to major tragedies; however the focus has changed over the

past 15 years due to the observation that the very regulations that were in place to increase

safety were also deterring innovation, process understanding and advancements in

automation and control. This realization led to the introduction of Quality by Design (QbD)

and Process Analytical Technology (PAT) guidelines which encouraged manufacturers to

incorporate additional analyzers that would lead to increased understanding of their process.

Demonstration of this understanding to the FDA would then allow for flexibility in

manufacturing to effect more advanced control and achieve increased consistency and

quality. While the concept is sound the implementation has been limited and a major gap

still exists in the integration of advanced process control (APC) in biomanufacturing.

(26)

manner due to advanced control challenges. The following chapters will outline 3 novel

applications of APC utilizing near infrared spectroscopy (NIRS) as a PAT tool integrated in a

systems based approach. The results show the versatility and effectiveness of NIR

spectroscopy combined with other PAT tools such as multivariate models and soft sensor

technology in a platform approach that can be used as a template for manufacturers on a

global scale. The platform developed and shown here enables manufactures to achieve more

consistent ideal batch or, so called, “golden-batch” results and can only be described as

MIDUS Control.

While the term “biomanufacturing” may not have been used thousands of years ago,

even by today’s definition it was being employed. Production relied on microorgamnisms

(primarily bacteria and yeast) that were naturally found on, or were associated with, the raw

materials involved in the process. These organisms were used to make products for human

consumption such as bread, beer (documented Egyptian evidence as early as 2300 BC), wine,

cheese, pickles, yogurt, kimchee and soy sauce in what we now refer to as fermentation

processes. The first significant mention of bio-therapeutic production was in China around

500 BC where boils were reportedly treated with antibiotics from moldy soybean curds.

However, it was not until the 1900s that the microbial industrial revolution began for the

production of primary metabolites such as ethanol, butanol and acetone. In fact, Clostridial

fermentation of carbohydrates to acetone, butanol, and ethanol (ABE) was one of the first

large-scale biotechnological processes developed during the early part of the 20th century, ranking second in importance only to ethanol fermentation by yeast [1]. After World War I,

(27)

dramatic increase in car production and a subsequent increase in demand for a solvent that

would produce quick drying lacquers. Previously amyl-acetate had served this purpose which

was a by-product of alcoholic fermentation by yeast. However, Prohibition came into effect

in 1920, which meant this solvent was no longer available and automakers had to turn to

butyl acetate from butanol fermentation. Up to 1950, 66% of butanol used worldwide was

derived through fermentation and plants were located around the world in the USA, Canada,

South Africa, China, and the Soviet Union [2]. Biotechnological production of ethanol,

acetone and butanol gave way to synthetic chemical processes beginning in the 1950s due to

two main factors. Primarily, the petrochemical industry grew at an unprecedented rate,

quickly satisfying the demand for these solvents and forcing the decline of the fermentation

industry. Secondly, the low cost of crude oil for chemical synthesis compared with the

escalation in price of substrates for the fermentation process meant the fermentation process

was no longer economically feasible [3]. Renewed interest in alternative methods of

obtaining fuels through microbial conversion occurred in the 1970s due to the abrupt increase

in petroleum prices. This led to intensified research into the fermentative production of

biofuels directed at improving overall process yields and productivity which is still ongoing.

Biomanufaturing from a therapeudic or medicinal perspective began around 1940

where Howard Florey, Ernst Chain and Norman Heatly began studying penicillin, which had

been discovered by Alexander Fleming in 1928 when he observed that a blue green mold had

caused bacterial growth to diminish on a contaminated petri dish [4]. Penicillin’s lack of

toxcicity as well as efficacy to treat bacterial infections opened the door to a new type of

(28)

project which forced the researchers to pursue alternative avenues with American companies

[5]. Antibiotics are usually produced once the cells have exhausted primary nutrients and

have entered into a stationary phase of growth and are therefore referred to as secondary

metabolites. Attempts at increasing production of such compounds began with selection and

employment of mutant strains with high yields. Numerous techniques aimed at discovering

“superproducers” were employed up to and through the 1970s with the most common,

primarily due to low cost, being the use of well-defined chemical media to isolate mutants

grown in smaller colonies [5]. These activities produced strains which were unstable and

prone to further mutation and further improvement was desired. The key to solving this

dilemma lay with molecular biology and genetic engineering. The advent of recombinant

DNA technology in 1973 by S. Cohen and H. Boyer enabled researchers to make controlled

modifications of a microorganism’s genomes to create new higher antibiotic producing

strains as well as strains capable of producing completely foreign proteins. Insulin, which

prior to recombinant DNA technology had to be purified from animals and/or cadavers, was

the first such human recombinant protein to be produced in 1978. However, it was not until

1982 that it was approved by the FDA as the first biotech drug produced in genetically

modified bacteria. In 1986 the first anticancer drug, interferon, was produced through

biomanufacturing as was the first recombinant vaccine for humans targeting hepatitis B [6].

To date, methods in this field have continued to advance and recombinant DNA

techniques are common practice for host development and improvement in most

biopharmaceutical companies. However, as with any manufacturing process, quality and

(29)

consumer. It must be made clear that safety of the end consumer is the primary objective and

all efforts to increase quality and consistency are to ensure that main purpose. Specifically,

this is critical in the case of manufacturing biopharmaceuticals which are often injected and

can have dire and life threatening consequences if standards are not met. Regulation of the

manufacture of these drugs became increasingly crucial as a result of, and in response to,

poor and unsafe practices. One well known example occurred in 1902 when the Biologics

control act was instituted after several deaths were caused when patients received diphtheria

vaccines that had been contaminated with tetanus. However, the evolution of these

regulations caused several undesirable consequences as well [7].

By the year 2000 drug regulations seemed effective in ensuring that product quality

was sufficient. However, companies were extremely hesitant to implement new technologies

because of past regulatory hurdles and the uncertain response of regulators and how they

would perceive such innovations. The traditional manufacturing approach was based on end

product testing and adherence to defined process specifications to deliver acceptable product

quality. Emphasis was placed on empirical data for validation as well as manufacturing and

there was little understanding as to the cause of deviations or failures. As such, there was a

high amount of product waste (reported to be as much as 50%), poor control, inability to

predict scale-up effects and low manufacturing efficiencies [7]. In response to these issues

the FDA launched a new initiative in 2002 to modernize the biomanufacturing industry

(30)

Quality by Design (QbD)

Quality by Design is a risk- and science-based approach to process development and

manufacturing that ensures quality through a clear understanding of all components and key

process parameters and their effect on final product quality. Since the FDA’s first Risk Based

Approach publication in 2002 there have been 4 major parts to the QbD initiative. 1) the

International Council on Harmonization (ICH) guidance Q8(R2): Pharmaceutical

Development, 2) ICH guidance Q9: Quality risk management, 3) ICH guidance Q10:

Pharmaceutical Quality System, and 4) an FDA guidance on Process Analytical Technology

(PAT) [8], [9]. While the main objectives of these guidances are clear, interpretation of these

initiatives by industry has been an ongoing process. The main overall objective is to develop

a process that consistently maintains the quality attributes of the product such as “identity,

strength, purity, and other quality characteristics designed to ensure the required levels of

safety and effectiveness” (ICH Q8). To accomplish this there are a number of actions that

must be taken. The first is to accurately define quality specifications that are performance

based. This can be termed as defining the product design goal where the quality target

product profile (QTPP) is determined. The QTPP provides an understanding of what will

ensure the quality, safety, and efficacy of a specific product [10]. Once a clear definition of

quality is understood the next step involves determining critical quality attributes (CQAs) for

the product. A CQA is defined in ICH Q8(R2) as “ A physical, chemical, biological, or

microbiological property or characteristic that should be within an appropriate limit, range, or

distribution to ensure the desired product quality”. Examples may include moisture content,

(31)

provides the framework for true process understanding and is crucial in enhancing root cause

analysis and post approval change management. The ability to determine which attributes

are in fact critical lies with effective risk assessment. There are a number of risk assessment

tools that can be utilized and are described in ICH Q9 which include the following: Basic risk

management facilitation methods (flowcharts, check sheets, etc.), Fault tree analysis, Risk

ranking and filtering, Preliminary hazard analysis, Hazard analysis and critical control points,

Failure mode effects analysis, Failure mode, effects, and criticality analysis and Hazard

operability analysis [10]. Therefore the attribute is determined to be critical based on the

probability of occurrence and the potential severity to the patient if the attribute were to

deviate outside specified ranges. It is important to note however that while detectability does

not in itself impact the criticality of an attribute it does have an impact on overall risk. One

such example is a tool described by Martin-Moe et al, 2011[9] that was developed for risk

ranking where a criticality score can be determined for each attribute. The degree of impact

to safety or efficacy is multiplied by the degree of confidence that the impact is known

(criticality = impact x uncertainty) which results in scores ranging from 2 (low risk) and 140

(high risk).

Once the CQAs have been identified the next step is to discover the process design

space. This step demonstrates that there is a clear understanding of the process by identifying

critical process parameters (CPPs) and linking them to CQAs. ICH Q8(R2) defines design

space as the multidimensional combination and interaction of input variables and process

parameters that have been demonstrated to provide assurance of quality. Therefore CPPs are

(32)

of the product. This is effected through the application of design of experiment (DOE)

activities to ensure all synergistic interactions are captured and optimal operating conditions

can be determined. A well-defined design space examines all possible iterations of parameter

variations and their resulting impact on quality attributes and provides the framework for

effective process control. Therefore a control space, which is a subsection of the design

space, can be determined based on the manufacture’s desired operation range. In essence,

while the CQAs are directly related to the product, they are in fact dependent on how the

product is manufactured. As such, if the CPPs are correctly identified and controlled, the

ideal CQAs will be achieved [11]. This is best accomplished through process analytical

technology (PAT).

Process Analytical Technology (PAT)

PAT is defined by the FDA as “a system for designing, analyzing, and controlling

manufacturing through timely measurements (i.e., during processing) of critical quality and

performance attributes of raw and in-process materials and processes, with the goal of

ensuring final product quality”. PAT effects control of CPPs that affect CQAs and is the

framework for enhancing process understanding and providing process robustness. The

ability to monitor these critical parameters fosters increased understanding and allows for

more efficient control and the ability to deliver acceptable quality products in the midst of

variable process parameters and raw materials which in turn has the effect of increasing

process robustness. It is important to realize the significance of the terminology used in the

(33)

based approach to increased understanding. An individual can purchase the latest tools to

repair a car engine but if that person does not know how an engine operates there is no hope

that those tools will be used effectively. True process understanding occurs when all critical

sources of variability are identified and explained and that variability is then managed by the

process. Understanding implies that product quality attributes can be predicted by monitoring

all relevant process parameters. In effect, a well-integrated PAT system enables quality to be

built into the process instead of the traditional methods of testing quality into the product.

When it comes to application of PAT there are three key components including multivariate

analysis, analytical tools for monitoring and control and finally, knowledge management and

continuous optimization tools [12].

Multivariate Analysis

Biomanufacturing processes are extremely complex and present a unique set of

challenges. These are predominantly cell-based systems that are extremely sensitive to

minute environmental variations. A cell population establishes a complex dynamic

relationship with its environment where changes in this environment can have both positive

and negative metabolic consequences. The immediate environment surrounding a cell

includes parameters such as temperature, pH, dissolved oxygen, nutrients (which often

include variable complex raw materials), products and by-products. In addition,

contamination and impurities are always a concern especially when present at levels that fall

below detection limits. As such, collecting real-time data of all critical variables during

(34)

batches creates an even larger challenge which requires multivariate statistical analysis.

Multivariate data analysis (MVDA) relates raw information from process variables (which

can include PAT analyzers) to CQAs which generates understanding and allows for effective

process control. Predictive mathematical models are determined that describe relationships

between CPPs and CQAs using various software tools which allow users to employ

multivariate techniques and integrate empirical models (without expert knowledge) and / or

mechanistic models (requiring expert knowledge) [8], [11], [13]. This type of analysis relies

on the availability of accurate and reliable analytical results in the required time frame to

make necessary changes that ensure quality.

Analytical Tools for Monitoring and Control

Effective process control depends on timely, accurate, selective, sensitive, and

reliable measurements of all critical process parameters. Reliability refers to attributes such

as repeatability (consistently deliver similar results), robustness (consistent results

irrespective of process conditions) and stability (retaining selectiveness and sensitivity over

time). Accuracy relates to the degree of difference between the true and measured value

while selectivity describes the ability to measure one variable in the midst of numerous

others. Sensitivity refers to the detection or quantification level of the sensor. Without such

measurement there is no method for the precise feedback control required for efficient

processing. Measurements must be taken in real-time and be stable and reproducible so to be

able to ensure high-quality products. Real-time measurements can be accomplished in two

(35)

1) On-line: A sample stream is diverted from the process for measurement and

subsequently returned to the process or discarded to waste.

2) In-line: A measurement where there is no need for a sample to be removed from the

process such as with a probe incorporated into the system.

The two other types of measurements are:

1) At-line: A sample is removed (in an automated or manual fashion) and analyzed in

close proximity to the process which can make it effective for process control

decisions and is often considered “almost” real-time.

2) Off-line: A sample is taken (in an automated or manual fashion) and is sent off to a

lab for analysis. Results are often obtained days after the sample is taken and

therefore can not be used for process control purposes.

While there are basic parameters which are routinely monitored by in-lineprobes such as

temperature, pH, cell density and dissolved oxygen there are a very high number of

parameters that are currently monitored by time consuming off-line analysis which is not

suitable for effective process control. Bioreactor applications pose additional challenges in

that sensors must be able to withstand sterilization temperatures and harsh cleaning

chemicals. These Bioprocesses are also three phase systems where cells are dispersed as a

solid phase in a liquid medium phase being aerated by a gas phase. These complex

hydrodynamics create increased difficulty in reliable monitoring especially if the vessel is not

homogenous. For a probe to be functional it must also be non-invasive and have no

(36)

of sensors available, only optical and spectroscopic meet the aforementioned criteria and can

be used in the liquid phase for biomanufacturing applications. These sensors are unique in

that they offer the possibility of measuring various analytes simultaneously and actual

sampling is only required for initial calibration and model building purposes. The most

popular and relied upon technique is Near-Infrared Spectroscopy (NIRS) which will be

focused upon in this dissertation however Raman will also be discussed since it is also a

highly used tool for in-line applications. Infrared (IR), Mid-Infrared (MIR), UV/Vis, and

Fluoresence spectroscopy will not be covered due to their saturation properties resulting in

limitations to be used real time as a true in-line PAT tool [13].

Optical / Spectroscopic Sensors

Spectroscopic analyzers are based on the interaction and subsequent absorbance or

scattering of varying frequencies of light energy by specific molecular bond vibrations.

Chemometric data analysis is required to extract relevant information from complex

overlapping spectral scans and will be covered in more detail in reference to NIRS.

Raman Spectroscopy

Raman spectroscopy deals with the ability to detect molecule vibrations based on

light scattering. In general, Raman has historically been used less than NIRS due to sample

degradation and fluorescence problems. However, with recent advances in instrument

technology these issues have diminished. The underlying principle is that photon energy will

(37)

related to the molecule of interest based on the angle of incidence. Only one in every 106-108 photons will be related to Raman scattering and as such the signal is extremely weak. This

causes the requirement for very long exposure times of photon energy to the sample (up to 15

minutes) which makes application to fast growing cultures somewhat limited. Conversely,

NIRS requires scan times of less than a minute to obtain the required information which is

another reason why it dominates with in-line applications [14], [15].

Near Infrared Spectroscopy (NIRS)

Near Infrared Spectroscopy (NIRS) deals with wave numbers between 12000 - 4000

cm-1and operates based on the principle that the atoms of molecules are in constant motion and vibrate at specific frequencies. The type and frequency of each vibration is molecule

specific and directly dependent upon the mass of each atom in the molecule and the strength

of the chemical bond between atoms. Light frequencies that correspond to molecular

vibrations are absorbed by the sample and the resulting infrared spectrum comprises peaks of

well-defined frequencies, band shapes and heights that correlate to molecule concentrations

present in the sample. The technology has increasingly been used, over the past decade, in

the pharmaceutical industry where it has served to monitor solid dosage forms [16] as well as

process variables involved in monitoring drying, granulating, powder blending and online

conformity testing [17]–[19]. More recently research has focused on implementing NIR

spectroscopy in the fermentation and cell culture stages of bioprocesses for

biopharmaceutical process optimization and control. The most significant NIR in situ

(38)

rates as are found with bacterial, aerobic yeast and fungal bioprocesses. In these cases, most

often NIRS is incorporated at-line as a rapid detection method for specific substrates or

products. Mammalian cell culture provides a relatively simple spectroscopic challenge as

agitation and aeration rates are generally low and media is usually homogeneous, non viscous

and Newtonian. A great deal of work has been done at-line which does not offer the optimal

level of control, however, use of in situ probes has also been studied. Riley M. and

colleagues [20] have used at-line NIRS to monitor pyruvate and 14 other amino acids,

quantifying in total 19 cellular nutrients and waste products in culture medium with

concentrations as low as 0.3mM. The most common key analytes that have been monitored

in situ using NIRS are glucose, glutamine, glutamate, lactate and ammonia. However, pH ,

viable cells and recombinant proteins have also been monitored [21]. The ability to monitor

constituents using NIR spectroscopy hinges on the advanced mathematical modeling,

referred to as chemometrics, that must be done to develop calibration models relating spectral

properties to quantitative results.

Chemometrics

Chemometrics refers to the discipline that applies multivariate empirical models to

data in order to extract meaningful results. The term was coined by a Swedish scientist,

Svante Wold, in 1971 where he used it to describe the application of mathematical and

statistical methods to analytical chemical measurements [22]. Chemometrics is a practical

application of mathematics and has evolved to play a major role in analytical chemistry. This

(39)

of multiple responses from a single analysis) creating complex and large data sets, as well as

with increased computer processing speed and data storage capabilities. The increased

process monitoring has not only created challenges related to data management but

integration (of the sensors as well as the data coming from them) as well. Spectroscopy based

analytical methods are extremely suited to chemometric applications due to the nature of the

spectra that often contain pertinent information which is very difficult to interpret. NIRS in

particular suffers from broad and overlapping peaks that create complex data matrices which

require chemometrics in order to extract essential information. Chemometrics is based on the

larger discipline of multivariate data analysis and therefore the same mathematical tools are

applied. However, instead of analyzing relationships between a number of process variables

with different units of measure (such as pH, temperature, dissolved oxygen etc.) the analysis

is normally applied to variables with identical units of measure (such as absorbance at

different wavelengths). To further clarify, MVDA would be applied to various sensor values

over the duration of a process where one of those sensors might be a NIRS measurement.

The value from a NIRS probe would be determined based on a chemometric model that

predicts the concentration of a constituent of interest. Chemometric models are strictly

empirical (often referred to as soft) and are generated from a limited set of available data.

Therefore, extrapolation is not advisable and it is critical that the model contains all possible

data variation which might be observed in the process. These models must also be validated

with data sets that were not used to create the model in order to test their true effectiveness

and functionality [23]. In general, there is a standard methodology that should be employed

(40)

Chemeometric Model Building Methodology

In order to construct calibration models a matrix must be built relating constituent

concentrations to spectral properties. Initially, samples are taken at the time of spectra

acquisition and constituents in the samples are determined using a primary reference method.

The calibration model relates these two matrices to each other so that constituent

concentrations can be predicted from future unknown spectra. The first step in this process is

to ensure that the model data is good and that there is sufficient data that represents all the

process variation over which the model will be expected to perform. Data can be described as

the combination of information and noise and the old adage remains true that “garbage in

equals garbage out”. When taking scans it is crucial that the proper reference be used and that

the probe and optics are in good working condition. However, defining good data is not

always straight forward. Initially the spectra are analyzed using principal component analysis

(PCA) without relationship to any reference data. This is referred to as unsupervised

classification since no sample information is provided and classification only occurs based on

the spectra itself without any prior expectations [24]. PCA reduces the large number of initial

variables (in this case absorbance at various wavelengths for a number of samples) by

projecting them onto a few latent variables (or principal components) that should describe

most of the variation of the original variables. The first PC describes the greatest amount of

the original data with each additional PC containing less than the previous [25]. The general

equation for a PCA model is the following:

(41)

X represents the entire matrix of data which is often extremely large. T and P’ are the scores

and loadings of each principal component and E represents the residual matrix of the

information that was not described by the model. Scores and loadings will be described

further in Chapter 2. In layman terms the equation can be described as the following:

Data = Model + Error (2)

If too many PCs are selected then the model begins to describe the error itself and noise

becomes incorporated which is detrimental [25]. The removal of noise from the models

becomes a critical step in PCA and is therefore done prior to building the model by

performing pre-processing to the spectra. Spectra in the NIR region are influenced by

non-linearity’s arising from light scatter effects. Light scattering occurs due to particles in the

sample media such as cells and insoluble compounds (which causes base-line shifts) as well

as bubbles and density fluctuations to name a few. However, these effects can be eliminated

by applying the correct pre-processing techniques. These techniques are employed to reduce

the variability between samples that are a result of scattering effects. The main scatter

correction methods are Multiplicative Scatter Correction (MSC), Standard Normal Variate

(SNV) and normalization which all estimate correction coefficients and apply them in

slightly different ways. In addition, derivatives are commonly used as well as straight

forward baseline correction. The challenge is to not apply too severe of a technique that will

result in removing valuable information as well [26]. Most often, in addition to

(42)

is mean centered, where the mean of the observations is subtracted from each individual

observation to reduce the effect of high and low offsets, and then scaled or transformed by

dividing each observation by their standard deviations [27]. Thankfully, these transformation

and pre-processing techniques are applied with simple selections in the software package that

is being used to perform the modeling. With the PCA model built it is then possible to

visualize the multivariate data based on the main PCs using a scatter plot of each sample’s

scores or loadings. This helps to identify outliers or clusters. Clusters can be identified

visually or by using techniques based on geometric properties of the scatter plots such as

Mahalanobis distances or nearest neighbor classifications where, for example, points that are

of a certain distance to 3 other points in a cluster are considered members of that cluster.

Identifying clusters in this way is referred to as supervised classification and additional

context can be added to help describe the clusters. Another supervised technique is to link

the qualitative PCA model to specific properties of the sample using Partial Least Squares

(PLS) thereby developing a quantitative prediction model.

PLS modeling is the next step in the overall process where reference data is added to

the data matrix. Here the multivariate data now is associated with one specific variable. This

correlation can be made for a number of different variables since each spectral scan contains

information about a range of constituents in the sample. PLS is very effective at analyzing

collinear and noisy data with numerous independent X variables (such as a spectra data

matrix) and modeling these to one or several dependent Y variables. The same PCs are

(43)

Y = X.W(PT.W)-1.CT + G (3)

X = T.PT + E (4)

Where W is defined as weight, PT is the loading matrix for the X data set, CT is the loading matrix for the Y space, G is the residuals matrix of the PLS model and E is the residual

matrix of the X data set as described previously for equation (1) [24], [28].

Building the PLS model is an iterative procedure and often specific wavelength

regions of the spectra are selected based on correlations to the reference data or previous

knowledge of where the primary molecular bonds associated with the constituent of interest

will adsorb photon energy. Since future spectral scans will be used to predict concentrations

of constituents it is critical that the reference method be accurate and precise. It must be

noted that the calibration model will not perform better than the reference method used in its

construction. Verification of the model is initially performed through cross validation. Cross

validation is the process of removing one sample, a number of samples or even one batch

from the original data set and using the remaining data to build the model and predict the

sample(s) that were removed. The final step is to validate the model by analyzing spectra

from a batch or batches that were not used in the original construction of the model. The

question is always raised concerning the required number of batches or samples. This

depends on the process, total number of samples including range of analysis, and the medium

(44)

from 3-4 different batches can provide the basis for good calibration models as long as those

batches span the inherent variability of the process.

Current Industry Limitations

It has been identified that the entire Pharmaceutical industry is in dire need of

innovative technological solutions which includes advanced process control [29]. The

advancement in real-time monitoring has exposed a critical limitation in the ability to

integrate the data with the modeling tools in a complete package that enables advanced

process control. This holistic approach is required to realize maximum process understanding

and knowledge, increase potential for regulatory approval and equip manufacturers to

evaluate changes and implement change control procedures. Establishing a robust control

strategy during process development that can be scaled up and transferred to a manufacturing

process is crucial to ensure product quality throughout the product lifecycle. Implementation

of a strategy that can not only accurately monitor CPPS but also the interdependencies

between them is key to enable the advanced control required to ensure consistent

performance. This is specifically true in the case of large molecules where there are

thousands of metabolic pathways that can be altered based on minute environmental changes

that are often extremely synergistic in nature. To fully develop a design space that

characterizes acceptable performance multivariate tools are necessary. This is clear from a

process development standpoint where DOE is routinely utilized. However, there is a

disconnect when transitioning to manufacturing where monitoring and control is primarily

(45)

requirement to maintain the level of characterization of the process that enables advanced

control. The ideal advanced control platform should also have the capability of improving

throughout the lifecycle of the product. While the main control strategy is developed and

validated as a part of the overall regulatory approval of the product there should be room for

continual improvement. This is especially true in the case where multivariate techniques are

being employed. This full characterization that incorporates standard inline monitoring from

control systems (DCS, SCADA, PLC), advanced monitoring from sensors such as NIRS,

at-line monitoring from sample analysis, offat-line data from laboratory systems (LIMS) and

MVA models to elucidate the interdependencies of the various CPPs requires a platform that

enables full integration. This type of control strategy platform valuable in bringing drugs to

market faster as well as ensuring greater quality throughout the lifecycle. Current automation

and control for the production of Biopharmaceuticals is well behind other highly automated

industries such as semi-conductors. This fact is understood not only by regulators but by

consortiums such as the National Institute for Innovation in Manufacturing

Biopharmaceuticals (NIIMBL). NIIMBL has identified process control as a prime area for

improvement calling out the need for more research into advanced real-time analytics for

process control, predictive modeling and increased integration of process systems.

BioPhorum Operations Group (BPOG), the world’s leading biopharmaceutical manufacturers

and suppliers partnership, is developing a Biomanufacturing Technology Roadmap which

also calls out the need for improvements in real-time monitoring and the conversion of big

data sets into “smart data”. It is very clear that routine monitoring and PID controllers do not

(46)

While advanced monitoring tools are available there is a challenge in utilizing these tools

effectively. The main reason for this is the challenge that exists integrating multiple control

systems along with various monitoring systems for effective data analytics that can be

implemented real-time for CPV, UVA FD, MVA FD and predictive functionality. There is an

essential need to have the capability of creating soft sensors and predicting what might occur

within a batch or unit operation and then act upon it. These are all aspects required for a true

advanced control solution. This must be solved using a system based approach with an

adaptive platform that is able to integrate into existing systems, perform data analytics,

design strategies to use all possible information and ultimately close the loop for advanced

control. Research must be conducted to elucidate this need and determine the optimal

platform solution to address these issues. In addition, research is required to determine the

optimal reporting interface for alerts/alarms and dashboards to ensure information is

disseminated and acted upon and alarms can be managed appropriately.

Novel Approach

The next 3 chapters highlight the novel work being done to bridge this gap and

implement PAT systems for advanced process control. First, a study focused on process

development utilizing advanced monitoring (NIRS) to not only advance process development

activities but show how these tools, specifically chemometrics, can also be used to improve

analytical assays through improved qualitative and quantitative characterization. A

(47)

selected due to the transient nature of the process from year to year and the dire need for a

strong platform to speed development times for rapid manufacturing.

The second application focused on implementation in an anaerobic fermentation

manufacturing process of alcohol production utilizing yeast. The work shows how NIRS can

be utilized in a novel feed-forward control strategy to monitor yeast performance and predict

repitching rate. The final chapter is an E.coli case study that shows PAT implementation

spanning from process development to manufacturing and culminates with a novel platform

enabling the implementation of an advanced control strategy using multivariate models,

(48)

MANUSCRIPT 1.

Evaluation of Near-Infrared Spectroscopy (NIRS) for on-line

determination of virus titers in Cell Culture based Influenza Vaccine

Manufacturing

The content of this chapter has been submitted for publication in: Biotechnology and Bioengineering

(49)

ABSTRACT

Traditional egg based influenza vaccine manufacturing processes suffer from a

number of shortcomings and limitations. Recent advances have been made in the use of cell

culture for the production of influenza vaccines. However, uncertainties exist due to the

inherent variability of cell population kinetics as well as the sensitive nature of the population

to slight environmental shifts. Therefore, it is not possible to develop a single process that

will yield similar results for every possible virus strain. Since new virus strains are

potentially selected each year and the threat of a new pandemic strain looms on the horizon it

becomes crucial to design both a process development and manufacturing platform that

would be applicable to any and every eventuality. To satisfy these criteria, such a platform

must be capable of designing robustness into the overall process while yielding acceptable

results in the shortest possible time. This work suggests the use of a novel platform

incorporating advanced process monitoring to better elucidate cause and effect relationships

as well as insuring that infection and harvest occur at optimal time points to maximize virus

titers and overall manufacturing process quality. By incorporating Near Infrared

spectroscopy (NIRS) to determine virus titers, substrate concentrations, cell turbidity and

by-product formation in real time, the design would also satisfy the PAT initiative set forth by

the FDA to increase process understanding and control. NIRS has already been shown to

effectively monitor most of these parameters with the exception of virus titer. The current

research demonstrates the effectiveness of this novel application of NIRS. Models were

(50)

validation batch. Validation predictions showed an average prediction residual of only 20.5

HAU/50uL and average percent difference between predicted and measured values of 15.5%.

Key words: NIRS, cell culture, influenza virus, HA, process development, quality, process

monitoring, control

INTRODUCTION

Over 20% of the population is infected with the influenza virus each year resulting in

250,000 to 500,000 influenza-related deaths globally and approximately 38,000 deaths in the

US alone. Of even further concern is the potential for pandemic outbreaks which pose a

severe global threat to public health [30], [31]. Vaccination has proven to be a critical tool in

controlling the spread of infectious diseases as evidenced by the eradication of polio,

smallpox and diphtheria in most parts of the world. A caveat of the influenza virus when

compared to the aforementioned viruses is that the two main surface glycoproteins,

Hemagglutinin (HA) and Neuraminidase (NA), which cause an immune response and the

resulting flu like symptoms, experience antigenic “drift” and “shift”, which enable new

mutant virons to avoid recognition by host antibodies. As such, re-occurring flu pandemics

occur and, while vaccination is the most effective means of prevention, new vaccines must be

created each year to target the most prevalent virus mutations [32]. Seasonal flu vaccines

consist of three virus strains selected by the World Health Organization (WHO) based on

surveillance data from the previous flu season. Selections are typically made in February in

order to generate a tri-valent vaccine ready for distribution in September. Therefore, it is