1
The Development of a Function Concept Inventory
This paper describes the development of a concept inventory, a test designed to investigate undergraduate students’ understanding of the concept of function. A central purpose was to address conceptual understanding. We outline a set of elements of the understanding of function, based on key properties of the function concept, which were used to construct test items. We describe the design and validation process for the concept inventory and comment on some implications for the refinement of the instrument and its use.
Keywords: function; concept inventory; conceptual understanding; assessment design.
Introduction
Functions are central to present day mathematics. As Selden and Selden (1992) explain,
the function concept has evolved with mathematics and now plays a unifying role. For instance,
going beyond calculus, functions are widely used in the comparison of abstract mathematical
structures. Yet comprehension of the function concept is remarkably complex and studies have
shown that undergraduate students often have difficulties with this concept (Carlson 1998) and
even with the notion of variable (Trigueros and Ursini 2003). Pettersson (2012) identified
function as being a threshold concept in mathematics; that is, that it is transformative
(understanding the concept leads to a new perception of the subject), irreversible (the change in
perception is unlikely to be forgotten), integrative (the new understanding reveals connections
and relations with other topics), bounded (in the sense that these concepts often lie at the
borders of disciplinary areas), and troublesome (in that it presents difficulties to students) as
described by Meyer and Land (2003). Since students usually first encounter the concept of
function in school, university lecturers might assume that undergraduates have crossed this
particular threshold of understanding. However due to the complexity and troublesome nature of
the concept, this is often not the case (Pettersson, Stadler and Tambour 2013).
Working with functions in various contexts and in diverse areas of mathematics,
2
and processes but as mathematical objects. The importance of this has been recognised by many
authors. Based on Piagetian constructivism, Dubinsky and colleagues (e.g., Briedenbach,
Dubinsky, Hawks and Nichols 1992) postulated a hierarchy of concept development, in which
the student starts from actions, shifting to processes, developing mathematics objects and
ultimately mental schemas. Transitions through the stages of
action -->process-->object-->schema
are usually not linear, but involve shifting between stages through which a concept develops.
Dubinsky and McDonald elaborate actions with reference to the concept of function as follows:
With an action conception of function, for example, an individual may be limited to thinking about formulas involving letters which can be manipulated or replaced by numbers and with which calculations can be done. We think of this notion as preceding a process conception, in which a function is thought of as an input-output machine. What actually happens, however, is that an individual will begin by being restricted to certain specific kinds of formulas, reflect on calculations and start thinking about a process, go back to an action interpretation, perhaps with more sophisticated formulas, further develop a process conception and so on. In other words, the construction of these various conceptions of a particular mathematical idea is more of a dialectic than a linear sequence. (Dubinsky and McDonald 2001 p.277)
They suggest that “An object is constructed from a process when the individual
becomes aware of the process as a totality and realizes that transformations can act on it” and “a
schema for a certain mathematical concept is an individual’s collection of actions, processes, objects, and other schemas which are linked by some general principles to form a framework in
the individual’s mind that may be brought to bear upon a problem situation involving that
concept” ( p.276-277). Sfard (1991) talks of a duality, with processes and objects acting as two
sides of the same coin and conceptualisation shifting between them. She suggests a process of
reification, consisting of 3 stages of concept construction - interiorization, condensation and
reification -- through which processes become objects. In interiorization, operations on lower
3
eventually give rise to a new concept; condensation involves the squeezing of lengthy
sequences of operations into more manageable units, thinking of a process as a whole without
going into details but attaching a label for a new concept to be born; reification involves an
ontological shift, a quantum leap, a sudden ability to see something familiar in a totally new
light. A new entity (an object) is soon detached from the process that produced it. Sfard (1991)
speaks of reification as a “rather complex phenomenon” (p.30), causing obstacles and
frustration for learners, illustrating why reaching an understanding of a function, for instance,
as an object can be said to be troublesome though transformative.
Given the importance of function in mathematics, it is useful for a lecturer to have
information on their students’ level of understanding of this concept. In this paper, we will
describe the development of an instrument to elicit this information, namely a function concept
inventory. In particular, we were hoping to gain insight into students’ understanding of some
key properties of a function object.
Literature Review
A concept inventory is an instrument, or test, designed to explore conceptual or relational
understanding as opposed to procedural or instrumental competence (e.g., Skemp 1976; Hiebert and Lefevre 1986). The term concept inventory originates in the Physics Education literature.
Hestenes, Wells and Swackhamer (1992) developed a concept inventory for the concept of
force. Their intention was to explore students’ knowledge and understanding about this basic
concept. To do this, they decomposed the force concept into six conceptual dimensions (similar
to what we have described as ‘key properties’), and then designed multiple-choice questions to
illuminate understanding and misunderstanding in each dimension. Hestenes et al. (1992) claim
their inventory is a very good detector of Newtonian thinking and probes commonsense
misconceptions. They say their instrument has proven valuable at every level of physics
instruction from high school to university, providing sound technical knowledge required for
effective teaching. Not only this, but it has been widely used as a pre- and post-test to evaluate
4
Force Concept Inventory has “spawned a dramatic movement of reform in physics education”
(p.1018). Concept inventories have also been used in other subjects such as biology and
chemistry (Garvin-Doxas and Klymkowsky 2008; Mulford and Robinson 2002).
Carlson and her colleagues developed the Precalculus Concept Assessment (PCA) to
test students’ understanding of function and of rates of change (Carlson 1998; Carlson,
Oehrtman and Engelke 2010). In a similar manner to Hestenes et al. (1992), they first developed
the PCA taxonomy to articulate foundational knowledge for beginning calculus. Their
taxonomy consists of three categories of reasoning abilities and three categories of
understandings which they claim are essential for using central concepts of precalculus and
understanding key concepts of beginning calculus. The reasoning abilities identified in the PCA
taxonomy are: process view of a function (viewing a function as a process instead of an action);
covariational reasoning (dealing with change in two related variables); computational abilities.
The categories of understandings are: understand meaning of function concepts (such as
composition, inverse, rate of change, evaluation); understand growth rates of function types (for
example linear, rational, exponential functions); understand function representations (graphical,
algebraic, numerical, contextual) (Carlson et al. 2010 p. 120). Carlson and her team then used
the taxonomy to design, develop and validate the PCA in a four-phase process. This included
reviewing existing research on learning precalculus and beginning calculus, conducting a series
of focussed studies on characterizing reasoning abilities and understandings, carrying out
clinical interviews to validate questions and distractors for multiple choice items, and using
quantitative data from the final version of the PCA to establish the meaning of a PCA score.
They explain how the PCA is useful for assessing pre-post learning and thus for comparing
various approaches to teaching precalculus courses, and suggest it may have potential as a
‘calculus readiness’ assessment tool.
O’Callaghan (1998) developed a conceptual model to describe the understanding of
functions. The elements of his framework stem from applying theory about the sources of
5
and he described this model in terms of four competencies: modelling (the ability to represent a
problem situation using functions); interpreting (the ability to interpret different representationsof functions); translating (the ability to move from one representation to another); reifying (the
creation of a mental object from a process or procedure) (O’Callaghan 1998 p 25). He then
designed an instrument to test these competencies, by attempting to operationalise these
abstractions and formulate them in terms of a problem-solving environment, and used it
to investigate the effects of different types of instruction on students’ knowledge of functions. Infact, the desire to carry out such an investigation motivated his development of a conceptual
model.
In our study, we wished to investigate aspects of students’ understanding of some key
properties of function as an object. Carlson et al. (2010) recognised a ‘process view of function’
as central to the understanding of precalculus and beginning calculus and included it as one of
the three reasoning abilities in their PCA taxonomy. They explain that for many students taking
precalculus modules “reasoning is dominated by a static image of arithmetic computation used
to evaluate a function at a single numerical value” (p.115). This is problematic as students who
are unable to imagine a continuum of input values producing a continuum of output values, that
is, conceptualise a function as a process, have difficulty inverting and composing functions,
which can in turn hamper their effective use of functions to solve word problems. The PCA
attempts to assess students’ understandings of the meanings of function concepts such as
composition and inverse among others. The four competencies described by O’Callaghan
(1998), forming components of his model for function, include ‘reifying’ which goes beyond
the process view of a function. Reification represents the “ultimate goal” or “final stage in the
acquisition of function” and is defined as “the creation of a mental object from what was
initially perceived as a process” (O’Callaghan 1998 p.25). The research question which
accompanies this competency (reification) involves students performing operations (such as
6
quadratic) and thus overlaps with the assessment of the understandings outlined in the PCA
taxonomy (understanding meaning of function concepts and growth rate of function type).
Furthermore, it would seem that the questions O’Callaghan poses to assess reifying
competence could be successfully completed by students using a process view of function. For
instance, one of these questions gives expressions for two functions, C=0.10(p-1000) and
p=100n-n2, (O’Callaghan 1998 p.30) and asks the students to find C(p(50)) and then an
expression for C(p(n)), which it would appear could be completed using an operational
approach (action or process view) rather than a ‘structural’ approach (object view).
Breidenbach et al. (1992, p.251) contend that students with an action conception of function
would be able to perform the steps necessary to find an expression such as that for C(p(n)) when given specific formulae for the functions involved. In an attempt to avoid this type of response, we hoped that the inclusion of questions in which a specific formulaexample or description of a function was not given would give better insight into students’ understanding of function and
this influenced the design of our questions (for instance, Questions 5 and 10 described below).
Another instrument, called the Calculus Concept Inventory (CCI), was developed by
Epstein (2013) and his co-workers. The aim of the CCI is to measure conceptual understanding
rather than computational skill, with a focus on understanding the concept of derivative, and as
such involves material beyond the notion of function itself. The authors report that the first
version, drafted in 2005, was too difficult for freshman students and was revised in 2006. The
revised test has 22 multiple choice items and has been administered as a pre- and post-test in a
variety of universities, predominantly in North America. Epstein (2013) reports that the results
suggest that the class performance (measured by a normalized gain) in almost all traditional
courses showed little or no improvement from the pre-test to the post-test, however the courses
taught using interactive engagement (IE) methods showed significant gains. Epstein defines IE
methods as those which involve ‘activities which yield immediate feedback through discussion
7
While alternative instruments to test understanding of the function concept exist
(O’Callaghan (1998), Carlson et al. (2010)) they were not all readily available to us at the time
when we designed our concept inventory, nor were they precisely aligned with the elements of
understanding we identified as most important for our students. For instance, our own
experience of teaching has shown us that students often have difficulty in determining whether a
given relationship (in the form of a formula(e), graph, table or verbal description) represents a
function or not and so we thought this was an important aspect to have included in a concept
inventory. However, neither Carlson et al. (2010) nor O’Callaghan (1998) seem to address this. We also believed that existing instruments would require more time (to administer and correct) than we felt was practical for an instrument which would ideally be available for use as a pre and post-test with each cohort of students. Therefore, we set about designing a new instrument.
Development of the Function Concept Inventory
Motivation
The three authors of this paper have each taught courses to first year undergraduate students in
which functions play an important role. At the time of this study, we were teaching groups of
finance or humanities students attending one Irish university, pre-service primary teachers or
humanities students at a second Irish institution, and materials-engineering students in a UK
university. In all three cases, we were concerned that our students seemed to come to their
university programme with a procedural view of functions, seeing a function merely as a
formula relating variables, a machine for finding the output from a given input, or a form of
equation, but unappreciative of key conceptual ideas such as uniqueness of image or existence
conditions for an inverse function. We wanted to investigate these concerns with a view to
gaining insights into how to address them in our programmes. Thus, in an effort to evaluate our
students’ understanding of the function concept we developed a function concept inventory. We
made attempts to seek a pre-existing instrument, but neither that of O’Callaghan (1998) or
8
did not cover all aspects of the concept of function that we wanted to investigate. At the time,
we were not aware of the CCI, although retrospectively we see that CCI questions go beyond
the material we were addressing. Also, we were concerned that O’Callaghan’s instrument,
containing 14 constructed response questions with multiple parts would take too long to both
administer and correct if it were to be used year after year. Thus we designed our own
assessment tool.
Process
The process that we followed in creating the concept inventory is similar to that advocated by
the AERA, APA and NCME (1999) in the Standards for Psychological and Educational
Testing. In that document, four phases of test development are identified (p. 37):
Phase 1: Delineation of the purpose of the test and scope of the construct or the extent of the domain to be measured;
Phase 2: Development and evaluation of the test specifications;
Phase 3: Development, field testing, evaluation, and selection of the items and scoring guides and procedures; and
Phase 4: Assembly and evaluation of the test for operational use.
In the next section, we will outline the stages of the development of our inventory.
Theoretical Basis for Elements of Understanding Identified
Following the example of Hestenes et al. (1992), we began by identifying key properties of the
function concept with which we were concerned; we did this by drawing on the literature on the
subject as well as by drawing on our own experience as teachers. For example, from our own
experience we felt that students sometimes struggle with the difference between a function and
an equation. The literature provided more evidence of this; Vinner (1983) investigated students’ concept definitions and concept images of the notion of function and found that one of the
major categories of definition was that a function is an algebraic term, a formula, or an equation
(p. 300). Sajka (2003) also found that the concept of function is closely related to that of
equations in some students’ minds (p. 238). If we think of the concept of functions from a
9
functions and equations, in which their important difference is only vaguely perceived, towards
a recognition of them as different conceptual objects related in important ways.
Carlson et al. (2010) and O’Callaghan (1998) recognised the value of being able to
work with different representations of functions, with O’Callaghan noting that two of the mostcommon representational systems for functions to this day are graphs or tables. This is perhaps
not surprising considering the historic development of the function concept can be traced from
tables to curves and on to the formal definition in analysis (Balacheff and Gaudin 2010). The
work of Vinner (1983) and Vinner and Dreyfus (1989) highlighted the difficulties that students
have with defining functions and being able to classify a relationship as being a function or not.
Slavit (1997) argued that a property-oriented view of function can help students appreciate
functions as objects. In addition, Carlson et al. (2010) included understanding the meaning of
function concepts or properties as one of their dimensions of understanding. The ability to use
functions in context or as part of a mathematical model has been recognised as important in the literature, for example O’Callaghan (1998) cited modelling and interpreting as two of his four
competencies in this area. He explained these as formulating a mathematical representation of a
problem situation and reversing this process. Covariational reasoning, or the ability to
determine how the output values of a function are changing by imagining changes to the input
values, was included by Carlson et al. (2010) as one of their core reasoning abilities and this
resonated with our experience as lecturers.
Design of a set of elements of understanding
Based on these insights we suggested a set of elements of understanding which we
should like students to develop, enabling them to work with key properties of functions. The
aim of mathematics teaching at university level should be to give students tools and
opportunities to develop their understanding of concepts, that is, where students appear to have
only an action and/or process view of functions, to move to the reification or object stage of
concept construction. It is difficult to judge whether a person has an object view of function, but
10
allow us to tell if a student has made some progress on the action/process/object continuum. The
six elements of understanding identified are:
1. the ability to distinguish between functions and equations;
2. the ability to recognise and relate different representations of functions and use them
interchangeably;
3. the ability to classify relationships as functions or not functions;
4. the ability to have a working familiarity with properties of functions such as
one-one/many-one, increasing/decreasing, linearity, composition, inverses;
5. the ability to use functions in context, modelling and interpreting;
6. the ability to engage with co-variational reasoning.
Design of Instrument
Using these elements of understanding, we designed an initial set of fourteen
assessment questions. With considerable discussion of what to include, we wrote thirteen of
these questions ourselves, drawing on and modifying questions we had used in our courses, and
also choosing one item on co-variational reasoning available from the PCA (Carlson et al.,
2010). We intended that the test would be administered in class-time and so did not want it to be
too long. After further discussion we reduced the number of questions to twelve which spanned
all elements of understanding listed above. .
Q1. Let f(x) =3x+5. Given the equation 3a+5=2, which of the following are true? (There may be more than one true statement.)
(a) f(a)=3a+5, True False
(b) f(a) = 2, True False
(c) f(x)=2 for all x, True False
(d) f(x)=2 for some value of x. True False
11
For example, Question 1 (Figure 1) tests the ability to distinguish between function and
equation, seeing these as two distinct but related objects. For parts (b) and (d) students need to
make a connection between the function f and the equation 3a+5=2, but in part (c) students need
to recognise the difference between the equation and the function. Students may recognise f as a
linear function and thus realise it has unique functional values for all real numbers; thereby
recognising that f(x) = 2 => x = -1, and that no value of x other than -1 can give a functional
value of 2. The question thus tests Element 1 and relates also to Element 4.
Figure 2: Question 5
Question 5 (Figure 2) tests Element 2 and touches on Element 4. Students need to be able to
think in terms of a function defined for all real values and visualise its square. As f has not been
specified, students cannot take an operational approach but must focus on structure. They
should recognise that (b) does not represent the square of a function, that (c) excludes values for
which the square would be defined and that (d) is acceptable for a constant function (something
that students frequently ignore). Note that we assume here that students will recognise the horizontal axis in each graph as the line y=0 and therefore that the function graphed in (a) takes no negative values.
Q5. Suppose f(x) is a function defined for all real values of x. Which of the following are possible graphs of g(x) = [f(x)]2? (Please circle all of the options you think are possibilities.)
(a)
(b)
(c)
12
Except for Element 6, most of the elements were assessed using more than one question: for
example, Element 3 was tested using three questions. These questions are given in Figure 3 (the
full concept inventory can be found in Breen, Jaworski and O’Shea (2012)). Questions 8 and 9
were designed to examine the students’ ability to decide whether relationships were functions or
not when the information was presented in a graphical or tabular form, while Question 10
concerns elements of the formal definition of a function. We see each of these questions as
testing the students’ ability to work with the key properties of the function concept. In
Question 8 students are asked to deduce function relationships from a graphical representation;
this requires them to identify relationships which are functions and those which are not. As no
formula or analytical expression has been given for the function, it is difficult for the student to
take an operational approach in terms of following through a series of actions or a process.
Question 9 challenges the view of functions as always defined by formulae: for example, a
student with an action or process view of function might look for a formula relating house size
to its selling price, whereas a student with an object view might see a function as a relationship
between two sets of numbers which is required to have unique outputs. In particular, Question
10 requires students to think in terms of the formal definition of a function allowing (i) but not
(ii). Since a specific function is not given here, we hoped to test whether students were able to
think about properties of functions without performing actions or calculations or working
through a process.
Ten of the twelve questions contained in the inventory were multiple choice. The
remaining two were short constructed response questions for which partial credit could be
awarded. Such questions are much more difficult to mark, and would probably cause problems
when comparing results if the test was to be used widely; however we feel that the high quality
of information on students’ understanding gleaned from this type of question makes the case
for including them in a test compelling. We hope to report on the findings relating to these
questions elsewhere. Some of the multiple choice questions had multiple parts, and when
13
Q8. Consider the curve below which describes a relationship between t and s.
Which of the following is true? (Please circle one of (a)-(d), or (e).) a) s is a function of t but t is not a function of s.
b) t is a function of s but s is not a function of t. c) s is a function of t and t is a function of s.
d) s is not a function of t and t is not a function of s. e) I don’t know.
Q9. Let s denote the size of a house and p denote its selling price. The tables below show the sizes and prices of houses sold each month by a suitable estate agent.
May S p 100m2 €300,000 120m2 €375,000 75m2 €257,000 90m2 €300,000 110m2 €350,000 June s p 100m2 €302,000 120m2 €370,000 70m2 €200,000 131m2 €400,000 120m2 €350,000 July s P 110m2 €330,000 90m2 €270,000 80m2 €240,000 50m2 €125,000 120m2 €360,000 For which months is it true that p could be a function of s? (Please circle all of the options you think could be functions.)
(a) May (b) June (c) July (d) I don’t know Q10. Suppose f(x) is a function defined for all real values of x. For each of the following statements decide if the statement is always, sometimes or never true.
i. There are two different real numbers a and b such that f(a)=f(b).
Always Sometimes Never
ii. There are three different real numbers a, b, c such that f(a)=b and f(a)=c.
Always Sometimes Never
14
Validation
Piloting and Refinement
The inventory was piloted in three different ways: it was administered to a group of students
attending a bridging course at an Irish university; it was sent to a group of Irish second level
mathematics teachers for comment; it was given to university lecturers for comment. Based on
the feedback from this piloting process, some of the wording of the questions was altered and
one question was changed substantially. This question concerned the property of injectivity, a
topic with which the Irish mathematics teachers felt their students would not be acquainted. The
question was adapted so that it asked about increasing functions rather than injective ones; we
wished to administer the assessment before any instruction on the function concept at university
took place and thus it was necessary to make sure that the material was covered by the school
syllabi. The authors considered sample answers to the partial credit items and together
developed an agreed marking scheme.
Rasch Analysis
Following the pilots, the concept inventory was administered to three groups of first year
students in October 2011: a group of 53 first year engineering students in a UK university
studying a basic mathematics course; a group of 37 BA and BEd students in one Irish
university, and a group of 127 BA and Finance students in a second Irish university. All of the
students in Ireland had chosen to study Mathematics and were taking a first Calculus course.
The test was taken during class time at the beginning of the students’ first semester at
university before the topic of functions was covered in any of the three modules. Ethical
approval for this study was sought and received prior to administration of the concept inventory.
The students were told that participation was voluntary and were given 30 minutes to complete
the inventory. The scripts were marked by research assistants and the data was compiled into a
15
When designing a test such as this one, it is important to consider whether it is valid and
reliable. Validity here refers to the extent to which the concept inventory measures the variable
that it is intended to measure (Gravetter and Furzano 2012, p. 78). In our case, we would want
to know if our test instrument measures the trait of conceptual understanding of functions. An
important aspect of validity in this is content validity (Sireci, 2007), that is the extent to which
subject experts agree that the test concerns the concept in question and covers all aspects of this
concept. As noted previously, we piloted the instrument in various ways, with students,
secondary teachers and with university mathematicians. The experts in this pilot phase (i.e. the
teachers and mathematicians) did not raise any questions as to the content validity of the
concept inventory. A further question regarding the validity of the concept inventory is whether
the test items combine to give a measure of one single trait i.e., that of conceptual understanding
of function. In our design phase we outlined the different aspects or elements of this type of
understanding using the literature and our own experience. We designed the test items based on
these elements, and then it was necessary to check whether these items actually measured a
single construct as set out above: that is, whether the six elements described were contributing
to an understanding of the concept of function. Note that this scenario is common in test
construction. For example, when measuring mathematical literacy, the PISA studies use items
that concern different mathematical content areas as well as thinking processes but together
form an instrument which measures the underlying construct (OECD 2014).
In order to further study this aspect of the validity of the concept inventory we used
Rasch Analysis (Bond and Fox 2007) by means of the computer software Winsteps (Linacre
2009). The Rasch model is an Item Response Theory model which can be used to evaluate tests,
especially those that claim to measure one construct. This is for example, how the PISA studies
validate their test instruments and construct measures of mathematical literacy (OECD 2014).
Similarly, Wilson and MacGillivray (2007) used Rasch Analysis to validate a test designed to
measure the basic mathematical skills of first year university students. The test, in their case,
16
Pantziara and Philippou (2012) used the Rasch model to investigate the validity and reliability
of a test to measure understanding of fractions; we will use similar methods here.
The Rasch model is based on the assumption that useful measurement involves the
consideration of a single trait or construct at a time (i.e. assumption of unidimensionality), and
it incorporates a quality control mechanism using error estimates and fit statistics to verify this.
For an introduction to this model, please see Edwards and Alcock (2010). The Rasch analysis
computes weighted and un-weighted mean square statistics (called the infit and outfit statistics
respectively) for each item. These are chi-square statistics divided by their degrees of freedom
and thus have expected values of 1. Bond and Fox (2007 p.243) report that a reasonable range
of infit and outfit statistics for test items is 0.7 to 1.3. We used the dichotomous Rasch model;
that is we graded each multiple choice item on our test as either correct or incorrect. Fit
statistics for all items were computed, with infit statistics ranging from 0.85 to 1.18 and outfit
statistics between 0.74 and 1.29. Thus, all items are shown to be behaving well and
contributing to the measurement, providing evidence that the items on the test are working
together to measure a single construct. (Full details of the item infit and outfit statistics can be
found in Table 1.) In addition, the point-measure correlation (equivalent to the point biserial
correlation) was computed for each item. This measures the correlation between scores on an
item with the average scores on the remainder of the test. Wolfe and Smith (2007 p 206)
recommend that these correlations should ideally be above 0.3, but report that for a test such as
a concept inventory any positive correlation is acceptable. There seems to be a difference of
opinion in the literature about this cut-off with Jarrett et al. (2012) quoting various sources who
recommend that point biserial correlations should be above 0.2. In our case, all of these
correlations were positive, with only those of Q9 (point-measure correlation =0.2) and Q3b
(point-measure correlation =0.23) lying significantly below 0.3 and none lying below 0.2. This
17
QuestionNumber
Measure Infit MNSQ Outfit MNSQ Point Measure Correlation 1(a) -1.39 1.07 1.18 0.33 1(b) -0.71 1.05 1.12 0.34 1(c) -1.44 0.86 0.75 0.53 1(d) -0.62 0.97 0.93 0.42 3(a) -1.33 1.01 1.03 0.39 3(b) -1.31 1.18 1.29 0.23 4 0.15 1.08 1.17 0.29 5 2.5 0.92 0.74 0.29 6 -.11 0.87 0.84 0.49 8 1.98 0.97 0.96 0.27 9 1.9 1.06 1.05 0.2 10(i) -0.28 1.03 1.02 0.35 10(ii) 0.32 0.97 0.95 0.38 11 -0.47 0.91 0.85 0.48 12 0.8 1.03 0.99 0.32
Table 1: Fit statistics and measures of multiple choice questions
In order for a test to lead to a useful measure, it is important that it is able to
discriminate between students with high and low levels of the trait in question (Wolfe and Smith
2005). For example, if all questions are very easy or all are very hard then most students will
have similar scores and the test will not give good information. Rasch Analysis computes item
difficulties for each question (using the number of correct answers in the sample) and person
measures for each participant (using the number of correct answers for that person). These
measures and difficulties are given on the same scale. Therefore to have a good test it is
18
of person measures. If there is a significant difference in the range of person measures and item
difficulties the test may be too easy or too hard for the group in question. The results of the
Rasch analysis using the responses to the multiple choice questions showed that there was a
good spread of item difficulties (see also Table 1 below), and an item-person map showed that
the range of item difficulties matched well on the whole with the range of individuals’ scores.
Questions 5, 8 and 9 have measures that are located above the highest person measure for this
group. Since this group consists of students at the beginning of first-year in university, it is
desirable that some items on our test are difficult for these students. This allows the concept
inventory to be responsive to increases in the understanding of the function concept if it was
used as a pre and post-test. Therefore, the assessment instrument seems to be appropriate for
this group.
As well as being a valid test, we need to examine whether our concept inventory is
reliable or not. In this context, reliability refers to the stability or consistency of the measure
(Gravetter and Furzano 2012 p. 85). According to Adams and Wieman (2011 p. 1303), there are
three main methods of testing reliability of an instrument such as ours. The first is to create
different versions of the test and administer them; they point out that this is very
time-consuming. An alternative would be to administer the same test to the same group at a later
point in time; this was not possible for us since our courses covered the notion of function in
some detail and thus the students’ performance on a re-test would be affected. The second
method advocated by Adams and Wieman (2011) is to administer the same test to equivalent
populations and compute a stability coefficient. In our analysis, we randomly split our sample in
two parts and computed the resulting item difficulties for each sample. The correlation between
the item measures for the two parts was 0.971. The third is to consider internal consistency
measures. We used the Rasch model to do this. To begin with, we looked at the item reliability
index. According to Bond and Fox (2007 p.311) this index estimates ‘the replicability of item
placement within a hierarchy of items along the measured variable if these same items were to
be given to another sample of comparable ability’. This index is given on a scale running from 0
19
unlikely to change. Similarly, we computed the person reliability index (which is analogous to
the Cronbach alpha coefficient and measures how robust the person ordering would be if a
similar test was used with the same group of students). In our case the person reliability was
0.45 and the Cronbach alpha coefficient was 0.531. We removed some items from the test
(namely Q3b and the Q9) and recomputed the person reliability and Cronbach alpha coefficients
but this did not lead to significant increases. The data indicates that the concept inventory would
not be a good high-stakes test but it does allow us to discriminate between students with high
and low levels of understanding of function. Indeed Adams and Wieman (2011 p.1304) state
that low values of Cronbach’s alpha on a concept inventory are acceptable given the restrictions
on the size of such a test and the possible uses of the resulting data. They also posit that high
values of such an index could indicate redundant questions. Jarrett et al. (2012) quote Miller
(1995) who said that for instruments such as concept inventories tests of internal consistency,
like the person reliability index or the Cronbach alpha coefficients, could under-estimate
reliability.
The maximum possible score (for the whole test including the partial credit questions)
was 26 marks. A marking scheme is available (Breen et al. 2012). The mean score for the group
was 12.9862, with a standard deviation of 4.33171. The median score was 13 marks. No student
achieved a perfect score; in fact the highest personal score was 23 marks while the lowest was
0. Table 1 shows the (multiple-choice) questions and parts of questions and their difficulty
measures. Note that low measures correspond to easy items and high measures correspond to
difficult ones: questions 5, 8, and 9 were the hardest questions for the students tested, while
questions 1 and 3 were the easiest questions (note that this remained the case for each subgroup
when the group was randomly divided into two halves). When computing these measures and
the fit statistics, we graded the responses as right or wrong and did not award partial credit.
Discussion
Our aim in this paper was to describe the design of a function concept inventory. As function is
20
it was believed that a concise valid and reliable instrument exploring conceptual understanding
of functions would be of considerable value, not only to ourselves but to the wider community.
The design began with the identification of elements of understanding of key properties
of the function concept. This was related directly to ideas of action/process/object as set out
above. We drew on the literature to recognise that an object perspective was desirable while
recognising the partial steps on the way to this as indicated by Dubinsky and colleagues (e.g.
Breidenbach et al. 1992; Dubinsky and McDonald 2001) and by Sfard (1991). Thus the
questions as a whole, while addressing key properties of functions, as indicated in the literature,
were designed to provide insight into students’ understanding of function. We then wrote
questions related to these elements and pilot-tested them. Our questions sometimes concerned
more than one element (for instance, Question 9 addressed Elements 3 and 5); this was
deliberate on our part since we wanted the questions to span the types of understanding we had
identified as important, we wanted each element to have more than one question associated to it
if possible, and we did not want the test to be too long. We endeavoured to present a number of
questions in such a way that it would be difficult for students to adopt an action or process
approach in response, in an attempt to ascertain whether the student has made some progress on
the action/process/object continuum.
Epstein (2013) reports that the results for 250 students who completed a pilot CCI
assessment prior to undertaking Calculus I appeared to be mostly at the random guess level. The
same could be said of the students’ performance (collectively) on some but not all of the
questions on our concept inventory instrument as reported in Table 1. However Table 1 shows
that the concept inventory does have a good range of item difficulties. The Rasch analysis also
supported this finding. Adams and Wieman (2011) comment on the necessity of a test like this
to have such a range, since it allows discrimination between students and gives the researchers
information about the levels of mastery of the concept in question. In addition, if the questions
are not accessible to a student before studying the course, they remark that the test would only
21
secondary teachers, since we wanted to be sure that all items concerned material familiar to
school-leavers.
The majority of students who completed the inventory had problems working with
functions in real-life contexts. It would be interesting to run the concept inventory test again in
the next few years in Ireland, since the second level curriculum has recently been revised and
now has as its aim the teaching of mathematics ‘in contexts that allow learners to see
connections within mathematics, between mathematics and other subjects and between
mathematics and applications to real-life’ (NCCA 2012 p.6). The students in this study would
not have had the benefit of working with the new curriculum; therefore, it would be interesting
to investigate whether students who have studied mathematics with a more applicable focus
would do better on the concept inventory. It would also provide a means of attempting to
measure a change in the implemented curriculum.
From our experience of administering this concept inventory, we would make some
changes to the test. Consider, for example Question 3 given in Figure 4.
Q3. Suppose f(x) is an increasing function.
a) Is it true that 3f(x) must be an increasing function?
Yes No
b) Is it true that f(x)+3 must be an increasing function?
Yes No
Figure 4: Question 3
Reflection on our results shows that the correct answer to part (b) of this question may
have been chosen for invalid reasons– by this we mean that students may have assumed the ‘+3’
term guaranteed f(x)+3 was increasing. The question may have discriminated better or elicited
responses more reflective of the students’ understanding of the ‘increasing’ property if it had
22
value and the concept of increasing, and required a more object-like consideration of the nature
of this function. We would also label the axes in Question 5 in order to avoid confusion.
Before modifying our concept inventory for future use, we would first like to evaluate it
using think-aloud interviews, as other researchers (see for example Epstein 2013; Carlson et al
2010; O’Callaghan 1998) have done. This method of evaluation would allow us to delve deeper
into the type of understanding required to attempt each of our questions and the reasoning
behind the students’ choices of responses.
Many universities already administer diagnostic tests to students in first-year
mathematics modules and then provide support for those students who do not do well on these
tests. The diagnostic tests usually focus on basic mathematical skills; however the mathematical
community places high value on conceptual understanding and concept inventories could also
be used as a diagnostic tool. Hestenes et al. (1992) suggest their Force Concept Inventory
can be used thus to identify and classify misconceptions, with errors made by students
being more informative than correct answers to the questions posed. Thus, they claim
their inventory is particularly useful for teachers in raising their awareness of the
misconceptions among their own students. As teachers, we gained important information
from the responses of our students on our function concept inventory; the results showed us thatour students’ understanding of the function concept was limited. This information has been used
when redesigning our courses and in particular when designing learning activities. We are more
aware that, since the concept of function is so fundamental to a Calculus course, it can be
revisited many times in a module and the students can be given many different opportunities to
develop their understanding of the concept (Meyer and Land 2003). Thus, the information from
the concept inventory can give a teacher data on the areas where their students had most
difficulty, and the teacher can then tailor the module appropriately.
The Function Concept Inventory described here could also be used by researchers or
23
conceptual understanding and, in particular, the elements of understanding underlying the
instrument’s design. O’Callaghan’s (1998) conceptual model for describing an understanding of
functions was initially developed as part of a project to test whether students following a
‘Computer-Intensive Algebra (CIA)’ curriculum would develop a richer understanding of
functions that their counterparts following a ‘Teaching Algebra (TA)’ or more traditional
curriculum. The test O’Callaghan developed to probe each of the competencies he had
identified (modelling, interpreting, translating, reifying) was used to investigate whether CIA
students were more competent than TA students when working with functions in each of these
ways. Hestenes et al. (1992) also used the Force Concept Inventory to test the effectiveness of
pedagogical techniques in both school and university settings. They claim they have abundant
evidence that their inventory is a very accurate and reliable instrument for evaluating
instruction. However, Caballero et al. (2012) point out that challenges arise when concept
inventories are used to make comparative evaluations of curricular course reforms if core course
content is affected by the reforms.
In particular, we were intrigued by Epstein (2013) reporting on results obtained on the
CCI following the use of interactive engagement (IE) methods between pre- and
post-applications of their inventory. Perhaps the teaching of O’Callaghan’s CIA curriculum would
have been considered to include IE methods. It would be interesting to see if the use of IE
methods similarly improved students responses to the Function Concept Inventory presented
here. This is certainly an area which begs further research.
Furthermore, the Function Concept Inventory may also be useful as a placement
assessment in institutions in which there is a choice of precalculus and first calculus/analysis
courses available to undergraduate students. Diagnostic tests have been used to date in Irish
institutions in the placement of students (Burke, Mac an Bhaird and O’Shea 2012) but,
depending on the programme of study followed by the students concerned there are cases where
24
Acknowledgements
The authors would like to acknowledge the support of a NAIRTL (National Academy for Integration of Research, Teaching and Learning) grant for this research.
References
Adams, W.K. and Wieman, C.E. (2011). Development and validation of instruments to measure learning of expert-like thinking. International Journal of Science Education, 33, 1289-1312.
AERA (American Educational Research Association), APA (American Psychological Association) and NCME (National Council on Measurement and Education) (1999). Standards for educational and psychological testing. Washington DC.
Balacheff, N. and Gaudin, N. (2010). Modelling students’ conceptions: The case of function. Research in Collegiate Mathematics Education VII, Conference Board of the
Mathematical Sciences, Issues in Mathematics Education, 16, 207-234.
Bond, T. G. and C.M. Fox. (2007). Applying the Rasch model – Fundamental measurement in the human sciences (Second edition). New Jersey, Lawrence Erlbaum Associates.
Burke, G., Mac an Bhaird, C., O'Shea, A. (2012) 'The impact of a monitoring scheme
on engagement in an online course.'. Teaching Mathematics and Its
Applications, 31:191 - 198.
Breen, S., Jaworski, B., and O’Shea, A. (2012). Concept Inventory for Functions.
Available from:
http://staff.spd.dcu.ie/breens/documents/ConceptInventoryforFunction.pdf
[Accessed 19 February 2015].
Breidenbach, D., Dubinsky, E., Hawks, J. and Nichols, D. (1992). Development of the process conception of function. Educational Studies in Mathematics, 23 (3), 247-285.
Caballero, M.D., Greco, E.F., Murray, E.R., Schatz, M.F., Bujak, K.R., Marr, M.J., Catrambone, R. and Kohlmyer, M.A. (2012) Comparing large lecture mechanics curricula using the Force Concept Inventort: A five thousand student study. American Journal of Physics, 80(638).
Carlson, M. (1998). A cross-sectional investigation of the development of the function concept. Research in Collegiate Mathematics Education III, Conference Board of the
Mathematical Sciences, Issues in Mathematics Education, 7(2), 114-162.
Carlson, M., Oehrtman, M. and Engelke, N. (2010). The precalculus concept assessment: A tool for assessing students’ reasoning abilities and understandings. Cognition and
25
Ding, L. & Beichner, R. (2009). Approaches to data analysis of multiple-choice
questions. Physical Review Special Topics - Physics Education Research, 5,
(020103), 1-17.
Dubinsky, E. and McDonald, M.A., 2001.
APOS: A Constructivist Theory of Learning
in Undergraduate Mathematics Education Research
In: D. Holton ed., The
Teaching and Learning of Mathematics at University Level. Dordrecht: Kluwer
Academic Publishers, pp.275-282.
Edwards, A. & Alcock, L. (2010). Using Rasch analysis to identify uncharacteristic responses to undergraduate assessments. Teaching Mathematics and Its Applications, 29, 165-175. Epstein, J. (2013). The calculus concept inventory – Measurement of the effect of teaching
methodology in mathematics. Notices of the American Mathematical Society, 60 (8), 1018-1026.
Garvin-Doxas, K and Klymkowsky, M.W. (2008). Understanding randomness and its impact on student learning: Lessons learned from building the biology concept inventory (BCI). CBE Life Sciences Education, 7 (2), 227-233.
Gravetter, Frederick J.; Forzano, Lori-Ann B. (2012).
Research Methods for the
Behavioral Sciences
(4th ed.). Belmont, CA: Wadsworth.
Hestenes, D., Wells, M. and Swackhamer, G. (1992). Force concept inventory, The Physics Teacher, 30, 141-158.
Hiebert, J., and Lefevre, P. (1986). Conceptual and procedural knowledge in mathematics: An introductory analysis. In J. Hiebert (Ed.), Conceptual and procedural knowledge: The case of mathematics (pp. 1-27). Hillsdale, NJ: Lawrence Erlbaum Associates.
Jarrett, L., Ferry, B., Takacs, G. (2012). Development and validation of a concept inventory for introductory-level climate change science. International journal of Innovation in Science and Mathematics Education, 20(2), 25-41.
Kaput, J. J. (1989) Linking representations in the symbol systems of algebra. In S. Wagner & C. Kieran (Eds.) Research issues in the learning and teaching of algebra (pp.167-194) Reston, VA: National Council of Teachers of Mathematics; Hillsdale, NJ; Lawrence Erlbaum Associates.
Linacre, J. M. (2009) Winsteps (Version 3.68) [Computer Software]. Chicago: Winsteps.com. Mulford, D. .A. and Robinson, W. R. (2002). An inventory for alternate conceptions among
first-semester general chemistry students. Journal of Chemical Education, 79 (6), 739-744.
Meyer, J.H.F. and Land, R. (2003). Threshold concepts and troublesome knowledge: linkages to ways of thinking and practicing. Edinburgh: ETL Project (Occasional Report 4).
26
Available from: http://www.etl.tla.ed.ac.uk/docs/ETLreport4.pdf [Accessed 20 June 2013].
Miller, M.B. (1995). Coefficient alpha: A basic introduction from the perspectives of classical test theory and structural equation modelling. Structural Equation Modelling :A Multidisciplinary Journal, 2(3), 255-273.
NCCA, (2012). Leaving Certificate Mathematics Syllabus. Available from
http://www.ncca.ie/en/Curriculum_and_Assessment/Post-Primary_Education/Project_Maths/Syllabuses_and_Assessment/LC_Maths_for_examin
ation_in_2014.pdf [Accessed 8 June 2014].
O’Callaghan, B.R. (1998). Computer-intensive algebra and students’ conceptual knowledge of functions. Journal for Research in Mathematics Education, 29(1), 21-40.
OECD, (2014). PISA 2012 Technical Report. Paris.
Pantziara, M. and Philippou, G. (2012). Levels of students’ ‘conception’ of fractions. Educational Studies in Mathematics, 79, 61-83.
Pettersson, K., (2012). The Threshold Concept of a Function - A Case Study
of a Student’s Development of Her Understanding. Sweden: MADIF-8. Available from:
http://www.mai.liu.se/SMDF/madif8/Pettersson.pdf [Accessed 26 April 2013].
Pettersson, K., Stadler, E. and Tambour, T. (2013) Development of students’ understanding of the threshold concept of function. Paper presented at CERME8. Available at
www.cerme.org [Accessed 26 April 2013].
Sajka, M. (2003). A secondary school student’s understanding of the concept of function – A case study. Educational Studies in Mathematics, 53, 229-254.
Selden, A. and Selden, J. (1992) Research perspectives on conceptions of functions: Summary and overview. In G. Harel and E. Dubinsky (Ed.s) The Concept of Function: Aspects of Epistemology and Pedagogy (pp.1-16): MAA Notes Vol. 25. Mathematical Association of America.
Sfard, A. (1991). On the dual nature of mathematical conceptions: Reflections on processes and objects as different sides of the same coin. Educational Studies in Mathematics, 22, 1-36.
Sireci, S. (2007). Content Validity. In N. J. Salkind and K. Rasmussen (Eds).
Encyclopedia of Measurement and Statistics. (pp. 182-184). Thousand Oaks,
CA: Sage Pubications. Relational
Skemp,R. (1976).Understanding and Instrumental Understanding,
Mathematics
Teaching, 77, 20-26, Dec 76
Slavit, D. (1997). An alternate route to the reification of function. Educational Studies in Mathematics, 33, 259-281.
27
Trigueros, M. and Ursini, S. (2003). First year undergraduates’ difficulties in working with different uses of variable, CBMS Issues in Mathematics Education, 12, 1-29.
Vinner, S. (1983). Concept definition, concept image, and the notion of function. International Journal of Mathematical Education in Science and Technology, 14(3), 293-305. Vinner, S. and Dreyfus, T. (1989). Images and definitions for the concept of function. Journal
for Research in Mathematics Education, 20(4) , 356-366.
Wilson, T.M. and Macgillivray, H.L. (2007). Counting on the basics: mathematical skills among tertiary entrants. International Journal of Mathematical Education in Science and Technology, 38(1), 19-41.