• No results found

Big Data in Evaluating Transformative Scientific Research: Concepts and a Case Study

N/A
N/A
Protected

Academic year: 2021

Share "Big Data in Evaluating Transformative Scientific Research: Concepts and a Case Study"

Copied!
31
0
0

Loading.... (view fulltext now)

Full text

(1)

Big Data in Evaluating Transformative

Scientific Research: Concepts and a

Case Study

Bhavya Lal

Vanessa Peña

IDA Science and Technology Policy Institute (STPI)

Understanding Federal R&D Impact

Big Data: Measuring the Impact of the Government’s Research and Development Investments National Press Club

March 19, 2013

(2)

About Us:

Science and Technology Policy Institute

• Federally funded research and development center chartered by Congress

• Provide rigorous and objective analysis of science and technology policy issues for the White House Office of Science and Technology Policy and other offices within the executive branch of the U.S. government and federal agencies

• Conduct science and technology analysis to inform policy decision-makers

(3)

Today’s Presentation

• Challenges in evaluating non-traditional

research

– Conceptual

– Data related

• Selected case study

(4)

Background

• Belief that the research enterprise is becoming more conservative

• Programs have been created to support transformative research

(5)

5

A Smorgasbord of Strategies in

Federal Programs

Pr

ocess

Can Be Combined With  Approaches That Are

Pe

o

p

le

DESIGN REVIEW RESEARCH

OUTCOME Transformative 

Outcomes

Simplified Proposal 

Submission

Can Be Combined With

Different Role for 

Program Managers  or Leadership Traditional Review  Mechanism Non‐Traditional  Selection Criteria T raditional Programs Non-T raditional Programs Incremental  Outcomes  Will Achieve And/Or A Program With Traditional Grant  Program Incremental  Approaches MANAGEMENT  Annual Reports/PI  Meetings

Can Be Combined With Can Be Combined With

Hands‐On – Must Accomplish  Stated Goals or  Funding  Withdrawn Interdisciplinary Topic/Outcome  Focused High‐Risk Length (e.g. > 3  years) In Funding “Frontier”  Research Topics “Pioneering”  Researcher(s) Environment That  Nurtures Creativity “Pioneering “ Reviewers Interdisciplinary  Review Panels

Can Be Combined With  Management  Approaches That Range 

From Hands‐off – Management  Allows Research to  Change Topics or  Direction Enlightened  Managers Team‐Based/  Partnerships

Can Be Combined With

Seed Funding Large (e.g. > $1M) 

Funding

Unique Criteria or 

Ranking Schemes

Can Be Combined With  Mechanisms Such As

To

Visionary Leaders Risk Taking 

Program Managers Stakeholder Input Spillovers:Create  CommunityInstitutional  ChangeTrain  Students With 5

(6)

ocess Can Be Combined  With Approaches  That Are Pe o p le OUTCOME Transformative  Outcomes Can Be Combined  With raditional Programs Incremental  Outcomes  Will Achieve And/Or A Program With Can Be Combined 

With Can Be Combined  With High‐Risk Length (e.g. > 3  “Pioneering”  Researcher(s) “Pioneering “ Reviewers Can Be Combined  With Management  Approaches That  Range From Hands‐off – Management  Allows Research to  Can Be Combined  With Large (e.g. > $1M)  Funding Can Be Combined  With Mechanisms  Such As Spillovers:Create  CommunityInstitutional  ChangeTrain  Students With

NIH Director’s Pioneer Award Program

(7)

Pr

ocess

Can Be Combined With  Approaches That Are

Pe o p le OUTCOME Transformative  Outcomes

Can Be Combined With

Non‐Traditional  Selection Criteria Non-T raditional Programs Incremental  Outcomes  Will Achieve And/Or

A Program With Can Be Combined With Can Be Combined With

Interdisciplinary Topic/Outcome  Focused High‐Risk Length (e.g. > 3  years) In Funding “Frontier”  Research Topics Interdisciplinary  Review Panels

Can Be Combined With  Management  Approaches that Range 

From Hands‐off – Management  Allows Research to  Change Topics or  Direction Team‐ Based/Partnerships

Can Be Combined With

Large (e.g. > $1M) 

Funding

Unique Criteria or 

Ranking Schemes

Can Be Combined With  Mechanisms Such As Visionary Leaders Stakeholder Input Spillovers:Create  CommunityInstitutional  ChangeTrain  Students With 7

NSF Emerging Frontiers in

Research & Innovation

DESIGN REVIEW RESEARCH MANAGEMENT

(8)

Policy Question

• Set-aside transformative research programs

are viewed as taking resources from

traditional programs

• Do they offer more transformative outcomes

in return?

(9)

Conceptual Challenge

• How does one operationalize and

measure “transformative” research?

(10)

Defining Transformative

Research

Tolstoyian thesis: although all conventional practitioners in the life sciences may be said to be conventional in the same way, all rebels seem to rebel in their own particular fashion

The dominant paradigm with respect to the definition is

“I know it when I see it”

(11)

Working Definition

11

Transformative means:

• Research that involves ideas, discoveries, or tools that

a. radically change our understanding of an important

existing scientific or engineering concept or educational practice or

b. leads to the creation of a new paradigm or field of science, engineering, or education

• Characterized not only by exceptional innovation, but also by the conscious taking of risks in the choice of its research subjects and methods

(12)

Operationalizing

Transformative Research is Challenging

• (1) No known metrics for transformative research except

in retrospect

• Darwin's theory of natural selection (replacement of Lamarckism as 

the mechanism for evolution)

• Development of quantum mechanics (supplanted classical mechanics)

• Plate tectonics  (which combined the hypotheses of continental drift 

and sea floor spreading into the theory of plate tectonics) replaced 

the static geosynclinals theory in describing continental drift

• If an evaluation is sought too soon, the transformative nature of the results may not be evident

(13)

Operationalizing

Transformative Research is Challenging (2)

• (2) Results may be controversial - especially if

contradicting prevailing wisdom ‐ not even possible to publish them

– George Akerlof’s path‐breaking economic theory of asymmetric 

information and adverse selection, which ultimately garnered him a 

Nobel Prize, was initially rejected by three major economics journals 

– Barry Marshall and Robin Warren’s discovery that peptic ulcers are 

caused by bacterium H. pylori and not stress brought them ridicule in 

the biomedical community

– Barbara McClintock on cytogenetics was ignored for decades 

Imagine Einstein submitting an application in 1905: “I propose to explore the 

possibility that time slows down as things speed up.”

(14)

Operationalizing

Transformative Research

• In a traditional research portfolio, we can expect that:

– Some fraction are breakthroughs

– Some are incremental

• If a set-aside program is focused specifically on producing breakthrough research, it should lead to

– A larger number of

research breakthroughs

Entire population distribution better

Subset (tail) of the population does better

Increasing performance on metric of interest

(15)

Evaluation Approach

• Identify comparison group(s)

• Identify indicators for comparison

– Whether impact is (more/differently)

transformative

– Whether research approaches are (more)

unusual

• Compare outcomes

(16)

Data Challenge

• Comparison research has to be “similar

enough” to the transformative research

• Finding comparable comparison groups

(17)

Limitations of Bibliometrics

• Traditional bibliometrics cannot tell the

difference between traditional and

transformative outcomes

(18)

Big Data Helped Us Partly

Big data” refers to

datasets whose

size is beyond the

ability of typical

database software

tools to capture,

store, manage,

and analyze

• Potential for

emerging tools to

(19)

CASE

 

STUDY:

 

USING

  

BIG

 

DATA

 

IN

 

EVALUATING

 

A

 

TRANSFORMATIVE

 

RESEARCH

 

PROGRAM

(20)

Transformative Research Program

Outcome Evaluation

• Does the program’s research produce

higher

impact

as compared to research funded by

other programs

?

• Are the approaches used by the program’s

researchers

more innovative

as compared to

research funded by

other programs

?

(21)

Uses of Text Mining

• Comparison Group Selection, selecting

researchers working in similar sub-fields

• Data Analysis:

– Selecting subject-specific experts to assess

research

– Comparing similarity of research topics from

test and comparison groups

(22)

Evaluation Methods

Test

 

and

 

Comparison

 

Groups

Groups Source

Test Group – Test Researchers 35 Researchers

Transformative Research Program

Groups Source

Test Group – Test Researchers 35 Researchers

Transformative Research Program

Group 1 – Similar Researchers

35 Researchers

One Program in the Same Funding Agency

Groups Source

Test Group – Test Researchers 35 Researchers

Transformative Research Program

Group 1 – Similar Researchers

35 Researchers

One Program in the Same Funding Agency

Group 2 – Similar Program

39 Researchers

High-Prestige Program from a Different Funding Organization

Groups Source

Test Group – Test Researchers 35 Researchers

Transformative Research Program

Group 1 – Similar Researchers

35 Researchers

One Program in the Same Funding Agency

Group 2 – Similar Program

39 Researchers

High-Prestige Program from a Different Funding Organization

Group 3 – Program Finalists 35 Researchers

Applied to Same Funding Agency

Groups Source

Test Group – Test Researchers 35 Researchers

Transformative Research Program

Group 1 – Similar Researchers

35 Researchers

One Program in the Same Funding Agency

Group 2 – Similar Program

39 Researchers

High-Prestige Program from a Different Funding Organization

Group 3 – Program Finalists 35 Researchers

(23)

Selection of Comparison Group 1

• Match researcher and

research dimensions:

• Research area • Year of award

• Years since degree • Institutional prestige

• Prior research program funding

• Terminal degree(s) received

• Receipt of early career awards

23

Comparison Groups Matched?

Group 1 –

Similar Researchers Yes

Group 2 – Similar Program By design Group 3 – Program Finalists Group 4 – Random Researchers No

• Matching helped us select

an equal or balanced

(24)

Identifying Similar Research Areas

• All other matching indicators are straightforward

– quantitative

• Research area is a qualitative indicator

– Accessed and gleaned from award proposal text

– 40,000 award proposals used in the analysis, typical award proposal title and abstract length ~500 words

(25)

Selecting Similar Research Areas

• Topic Modeling and Similarity Index

– Based on topic modeling using Latent Dirichlet Allocation to assess similarity (Blei 2003, Talley 2011)

– Topic co-occurrence in unstructured text in NIH award titles and abstracts from 2007 to 2010

– Values 0 to 1, representing proportion of co-occurrence of topics (1 being identical)

• Each award was matched with 7 - 200 similar

awards with their respective index, these values

were used in matching

• Selected 35 researchers from more than 12,000 NIH

awardees

25 Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet Allocation. Journal of Machine Learning Research, 3, 993-1022.

Talley, E. M. et al (2011). Database of NIH grants using machine-learned categories and graphical clustering. Nature Methods, 8, 443-444.

(26)

Data Analyses

Qualitative

 

and

 

Quantitative

 

Analyses

Examination of researchers’ publications:

1. Expert Assessments – quantitative / qualitative

2. Text-Mining Analysis – quantitative / qualitative

3. Bibliometric Analysis – quantitative

Test and Comparison Groups Experts Text-Mining Bibliometrics

Test 162 354 3,287

Group 1 – Select Researchers 154 243 3,274

(27)

Expert Assessments

• 94 experts conducted over 1,500 ratings of over

500 publications

• For each test and comparison group researcher,

the five most impactful publications were

identified (represented the “body of work” for a

researcher)

• “Body of work” and individual papers reviewed at

least three times

27 Expert Assessments

Impact Innovativeness

(28)

Identifying Subject-Specific Experts

• Sought experts from past NIH awardees in

similar fields to the test program, used field

similarity from topics in researchers’ awards (~

730)

– Excluded awards used in comparison groups – Selected those with recent early career awards

– Selected those with previous long-term awards (e.g., NIH/MERIT)

(29)

Analyzing Research Topic Divergence

• Used award-attributed

publications in PubMed through NIH SPIRES from awards 2007-2010 • Helps support study

findings and could be used to identify emerging research 29 0. 00 0. 05 0. 10 0. 15 0. 20 0. 25 0. 30 Divergence P u bli ca ti on P ropor ti on 2.6 3 3.4 3.8 4.2 4.6 5 5.4 5.8 6.2 6.6 NDPA matched-R01 Random R01 HHMI

No statistical difference (Kolmogorov-Smirnov p = 0.44, 0.64, 0.15)

Test Group Group 1 Group 2 Group 4

• Investigated how similar research was to broader scientific community

(30)

Opportunities for Use of Big Data and

Text Mining in Evaluations

• Selection of comparison groups

– Topic modeling and similarity metrics help match closely related topics

• Selecting experts for assessing research

– Subject-specific experts can be identified using topic modeling approaches

• Comparing research similarity

– Validates results and may identify innovative research areas not similar to research in the broader scientific community

(31)

Thank you!

Bhavya Lal – [email protected] Vanessa Peña – [email protected]

1899 Pennsylvania Avenue NW Washington, D.C. 20006

31

For our full evaluation study:

http://commonfund.nih.gov/pdf/IDA_Paper_P_4899.pdf

Acknowledgements: Seth Jonas, Elizabeth Lee, Amy Richards, and Alyson Wilson (STPI), and Ned Talley (NIH)

References

Related documents

The previously mentioned equivalency means that the overall proposed spatial model estimator which comprises three distinct steps (instrumentation, screening, and estimation) is

While the traditional understanding of human rights is to restrain state authority to prevent abuses against the individual, in the last few decades human rights have been recast in

(3) The aeration pool and discharge outlet of SY power plant were focused, mercury isotope composition of dissolved gaseous mercury in the post-desulfurization

Thus, instead of defining a mi- crobial signature as a linear combination in real space corresponding to the abun- dances of a group of taxa, we consider microbial signatures given

Luke’s Place envisions a family court system that responds efficiently to end violence against women and effectively provides for the safety, emotional, and financial needs of

Table 9: Mean scores by school location for students drawn from main NAEP who took the Writing Online test and for a different group of students responding to the same essays

Suntory’s Toki Japanese Whiskey, Cointreau Orange Liqueur, Barrow’s Intense Ginger Liqueur, Fresh Lemon Juice, Cane Sugar, Green Chartreuse Herbal Liqueur Wash.

We proposed an extremely efficient, although slightly less accurate method for graph clustering that uses filtering of Kronecker deltas and the idea of propagation of the solution