THE USE OF STUDENTS’ SELF ASSESSMENT IN LEARNING MATHEMATICS

(1)

International Research Journal of Human Resources and Social Sciences Impact Factor- 2.561

Vol. 2, Issue 12, Dec 2015 ISSN(O): (2349-4085) ISSN(P): (2394-4218)

© Associated Asia Research Foundation (AARF)

Website: www.aarf.asia Email : [email protected] , [email protected]

THE USE OF STUDENTS’ SELF-ASSESSMENT IN LEARNING

MATHEMATICS

Cabasag Lota1, Garvida, Marc1, Zipagan Renante1

1

Faculty, Mathematics Department, College of Education-Laboratory High School,

Rizal Technological University

ABSTRACT

This research paper aimed to test the effectiveness of the use of students’

self-assessment in mathematics class. Two independent experiments were conducted to assess

whether the use of self-assessment as a form of formative assessment will be more effective in

terms of mathematics performance and mathematics attitude to the Grade eight students of

the Laboratory High School of Rizal Technological University during the school year

2014-2015. Two groups in each experiment were exposed to self-assessment while the other two

groups were exposed to an ordinary formative assessment method which is the conduct of

daily quizzes which is more of objective type.

It has been found that both formative assessment were equally effective in terms of the

mathematics performance but in terms of attitude level, the use of self-assessment is

statistically more effective. The performance of the students between the controlled and

experimental group is not significantly different due to attribution theory and the maturity

level of the respondents.

It is recommended to teachers to vary their assessment and consider self-assessment as part

of their formative assessment strategy in mathematics especially to a more mature group of

students. It is also important to develop more standardized self-assessment for teachers and

use more books that are rich in self-assessment exercises.

Keywords: Self-assessment, formative assessment, mathematics performance, mathematics

(2)

Introduction

Assessment practices have started to change over the last several years with teachers

building a larger repertoire of assessment tools and strategies. There is a greater

understanding of the importance of timely assessments for learning as well as regular

assessments of learning.

Assessment is commonly used as means to evaluate students’ performance and thus

students sometimes may likely to study only for the sake of passing the exam and not driven

by internal desire to learn it because it is a must.

One type of assessment that has been shown to raise students’ achievement

significantly is student self-assessment (Chappuis & Stiggins, 2002; Rolheiser & Ross, 2001).

A self-assessment is the process by which the student gathers information about and reflects

on his or her own learning. It is the student’s own assessment of personal progress in

knowledge, skills, processes, or attitudes. Self-assessment leads a student to a greater

awareness and understanding of himself or herself as a learner” (Ministry of Education, 2013,

p. 3).

That is why the authors of this paper prompted them to investigate the effectiveness of

the self-assessment to the grade eight students of the Laboratory High School.

Statement of the Problem

It is the purpose of this study to determine the effect of self-assessment in improving

the performance of the 8th grade students of the Laboratory High School

Specifically, the study sought to find answers to the following research questions:

1. What is the performance of the students in the pre and posttest?

2. Is there a significant difference between the performance of the controlled and

experimental groups?

3. What is the attitude level of the respondents in the pre and post test?

4. Is there a significant difference between the attitudes of the controlled and

experimental groups?

Hypotheses

The following hypotheses were tested:

1. There is no significant difference between the respondents’ performance of the

(3)

2. There is no significant difference between the respondents’ attitude of the controlled

and experimental groups

Scope and Delimitation of the Study

The respondents of this study were purposively chosen based on the results of the test

and a five-point-scale checklist administered to all the four blocks of grade eight students of

the Laboratory High School during the school year 2013-2014.

The result of this study is limited to the result of the validated pre and post test

administered as a primary basis for students’ performances prior and after the treatment.

Conceptual Framework

This study is backed by the theory of self-regulation. Under this theory, students tend

to be more productive and will likely to improve on their endeavor when they are personally

aware of their own strengths and limitations in learning. Thus, the assumption is through

self-assessment wherein students are given the opportunity to be self-aware of their own learning,

the outcome will likely to occur significantly positive in the both performance and in attitude

as illustrated in the foregoing model.

[image:3.595.76.488.345.506.2]

Figure 1

Conceptual Model

1.3 Studies of Self-Assessment and Rubric Use

Having examined the theoretical arguments, the question of empirical evidence arises.

Several studies of the effectiveness of student self-assessment have been undertaken. One

such study by Olina and Sullivan (2002) investigated the effect of teacher- and

self-evaluation on student learning and self-efficacy of 189 Latvian high school students and their

6 teachers in science classrooms. Using ratings of student projects, posttest scores, student

attitude surveys, and teacher attitude surveys, Olina and Sullivan compared the learning gains

Mathematics

Performance

Mathematics

Attitude

Formative Assessment



Self-assessment

(4)

of three groups of students: a teacher-evaluation condition, a self-plus-teacher evaluation

condition, and a no-evaluation condition. The results were interesting. Students in the

teacher- and self-plus-teacher-evaluation groups outperformed students in the no-evaluation

group, as expected. However, the effect produced on student attitudes provides great

evidence for the benefits of self-assessment. Although students in the no-evaluation group

reportedly enjoyed conducting experiments more than the other students, Olina and Sullivan

report that ―students in the self-plus-teacher evaluation group had greater confidence about their ability to independently conduct experiments in the future than the other two groups‖ (p.

19). This lends credibility to theories of assessment leading to greater student

self-efficacy.

Another notable study was conducted by Andrade, Wang, Du, and Akawi (2009) on

the relation between long- and short-term rubric use (including assessment) and

efficacy for writing by 268 elementary and middle school students. Although average

self-efficacy ratings of all condition groups increased over time, Andrade, et al. reported a greater

increase in self-efficacy of girls in the treatment group (using rubrics to check their work)

than in the control group. The effect of the rubrics on self-efficacy of boys appeared to be

statistically insignificant. To explain these results, the authors relied on research on

attribution theory stating:

[G]irls tend to be more concerned with mastering a writing task than do boys,

who, on average, tend to be more concerned with showing someone else that

they are capable. Our findings regarding the differences in increases in

self-efficacy after self-assessment may reflect these different achievement goals:

Girls may derive more satisfaction and confidence from self-generated

evidence of progress on a writing assignment than do boys, who seek

confirmation of progress from others, including perhaps their teachers and

peers. (Andrade, et al., 2009, p. 296)

Thus, the researchers concluded, self-assessment shielded female students from

―potentially debilitating effects of negative adult feedback‖ (p. 296). This may have led girls

to attribute shortcomings to effort, not inherent ability, which allowed them to see how they

could improve. This, in turn, led to increased self-efficacy ratings. ―The boys in the study,

however, might have been less influenced by the presence of the rubric because they placed

less value on their own feedback‖ (p. 296).

(5)

A quasi-experimental pre and posttest method of research with pre and posttest

matched group design was used in this study. In this design, two groups are pre tested and

matched in terms of certain variables and as in this study, gender and the performance in the

pre-test (Calderon, 2013). One group, called the treatment group, was exposed to the use of

self-assessment while the other group, called the control group, was exposed to the

conventional formative assessment which is the daily quizzes. After the experimental period

which covers the whole grading period, the two groups were given the same test covering the

subject matter studied during the period (post test). In this study, two experiments were

conducted by two different teachers. The four blocks of the grade eight students were handled

by two teachers with two blocks each. Two experiments were made so as to verify and

compares results for a more valid conclusions. In has been ensured that all other variables

except the independent variable were kept constant or equal during the experimental period.

The excess of the achievement of both groups between the pre and posttest will be attributed

to the assessment method and the excess of the achievement of the experimental group over

the achievement of the control group is attributed to the experimental factor. The matched

group experimental design was employed so as to eliminate one variable which is the

sectioning which was believed that may affect the result of the experiment.

Population and Sample Size

The following table shows the number of samples taken from this study.

Table1

Distribution of the Respondents of the Study

Groups No. of

Students

No. of

Samples

with match

Experiment A Section Amethyst Treatment 1 34 19

Section Tourmaline Controlled 1 34 19

Experiment B Section Opal Treatment 2 35 16

Section Onyx Controlled 2 34 16

Table 1 shows the distribution of the respondents. The numbers of students enrolled in

each section or blocks are almost the same, however, 19 pairs or 55.88% were obtained in

(6)

were in terms of gender and the pre-test performance of the students which was administered

prior to the experimentation.

Sampling Technique

The sampling technique used in this study was a purposive sampling technique. The

purposive sample is selected from a population by an arbitrary method. This method is

decided on the basis of available information thought to be representative of the total

population or by intuition on the basis of criteria deemed to be self-evident (Macaballug,

2009). This kind of sampling is a probability sampling and therefore subjected to a

non-parametric test.

Description of Respondents

The respondents of the study are grade 8 students of the Laboratory High School

during the school year 2013-2014. The grade 8 respondents with a matched in the experiment

were included in the study.

Research Instrument

The primary instruments used in this study are the assessment tools used by the

teachers in their day to day conduct of the lesson. Among which are the samples of questions

and rubrics which were used by the students to self-assess their own work which are

individual, pairs or groups.

Another instrument was the pre and posttest which were validated through item

analysis. The pre and posttest is parallel test covering the same content and level of difficulty.

The pretest was one of the researchers’ bases in matching the respondents while the posttest

served as basis for the performance of the respondents.

The attitude scale was used to determine the respondents’ attitude in math class.

Attitude scale is of five parts namely the: selecting a math class with five indicators; time and

place for studying with six indicators; study strategies for the class with eleven indicators;

math test with seven indicators; and anxiety with four indicators. Two equivalent set of this

scale was made which were also parallel one which was administered prior to the experiment

while the other one was administered after the experiment.

Statistical Treatment of Data

Since the data was obtained from a non-random sampling, the statistical tool used is a

(7)

whether significant difference exist between two independent groups, the Wilcox on

Rank-Sum Test will be used. This is a counterpart of a t-test in the parametric test. Also mean and

standard deviation were also used to describe the performance and attitude of the students.

The performance scale can be interpreted through the following arbitrary values:

Score Verbal interpretation

0-10 Unsatisfactory (US)

11-20 Fair Satisfactory (FS)

21-30 Satisfactory (S)

31-40 Very Satisfactory (VS)

41-50 Outstanding (O)

Also, the following arbitrary scale will be used to describe the attitude level of the

respondents.

Score Verbal interpretation

4.21-5.00 Strongly Agree -Very High

3.41-4.20 Agree - High

2.61-3.40 Fair - Average

1.81-2.60 Disagree -Low

1.0-1.80 Strongly Disagree - Very Low

Results and Discussions

(8)

[image:8.595.72.526.119.261.2]

Table 2

Mean Performance and Standard Deviations of the respondents in the Pre and Post

Test Groups Before Treatment After the Treatment Difference

Mean SD Mean SD Mean SD

Experiment A Treatment 1 20.87(FS) 3.79 27.13 (S) 6.41 6.26 -2.62 Controlled

1

20.87(FS) 3.79 28.6(S) 4.54 7.73 0.75

Experiment B Treatment 2

19.4(FS) 2.87 26.33(S) 6.33 6.93 -3.46

Controlled 2

19.2(FS) 2.83 24(S) 5.07 4.8 -2.24

Table 2 displays the mean and standard deviations of the two experiments conducted. In all of

the four groups, an increase in the mean are observed. Difference among the pre and posttest

are 6.26 and 7.73 in Experiment A while 6.33 and 5.07 in experiment B. This remarkable

increase in the mean implies that after the grading period in which the teacher discussed the

lesson, there is indeed a positive change in the performance whether or not a treatment was

made.

It implies that a traditional form of assessment and the self-assessment are both effective

assessment strategy. It is therefore undeniably claimed that the traditional assessment form is

by giving only quizzes is still indispensible.

Table 3

Difference between Performances of the Respondents in the Post Test

Groups

Mean Computed

t-value Critical Value Interpretation Experiment A

Treatment 1 27.13

166.5 123 No Difference

Controlled 1 28.6

Experiment

B

Treatment 2 26.33

96 83 No Difference

Controlled 2 24

Table 3 shows the corresponding computed and critical values of the two groups in each of

the experiment. The computed values of 166.5 and 96 are less than the critical value of 123

(9)

difference between the performance of students who are exposed to the use of

self-assessment and those that are not. It further implies that the use of self-self-assessment is better

[image:9.595.68.524.160.311.2]

than the traditional one.

Table 4

Difference between Respondents Attitude before and after the Treatment

Groups

Before

Treatment

After the

Treatment

Difference

WM Int. WM Int. Mean

Experiment A Treatment 1 3.41 High 4.8 V. High 1.39

Controlled 1 3.41 High 4.69 V. High 1.28

Experiment B Treatment 2 3.30 Average 3.99 High 0.69

Controlled 2 3.50 Average 3.87 High 0.37

Table 4 displays the students’ means and corresponding verbal interpretations of the

attitude level. Experiment A and B in both the controlled and experimental group mark a

significant change positively in their attitude as reflected from the mean difference of 1.39,

1.28, 0.69 and 0.37, respectively.

This change in the attitude might be the influence of the assessment employed by the

[image:9.595.63.533.536.705.2]

teacher.

Table 5

Difference between the Attitude of the Treatment and Controlled Group after the

Experiment

Groups

WM Computed

t-value

Critical

Value

Interpretation

Experiment

A

Treatment 1 4.67

119 123 There is a

difference Controlled 1 4.96

Experiment

B

Treatment 2 3.99

79.95 83 There is a

(10)

Table 5 shows the computed and critical values of the experiments. The computed

values are less than the critical values, thus rejection of the null hypothesis is rejected, and

therefore, there is a significant difference between the groups of students who are exposed to

the use of self assessment than those who are not.

This is the same with the findings of the other researcher (e.g. Olina and Sullivan

(2002)), wherein groups who were exposed to the use of self –assessment outperformed the

grouped which are not exposed.

Conclusions

1. Self-assessment and the traditional formative strategy such as giving daily quizzes are

both effective

2. The grade 8 students of the Laboratory High School improved their performance in

mathematics

3. The use of self – assessment does not directly increased mathematics performance

however it does affect in the attitude.

Recommendations

1. Encourage teachers to vary their assessment strategies and include self-assessment

strategy

2. Conduct trainings and seminars to teachers pertaining to the preparation of self

assessment , administering and scoring of self-assessment

3. Create modules, books and other instructional materials which are rich in

self-assessment activities.

References

Black, P and Harrison C. Lee, Marshall, B & William D. (2004) Working inside the

black box: Assessment for learning in the classroom. Phi Delta Kappan.

Ministry of Education, (2006) A guide to effective instruction in

Mathematics,k-8,volume 4. Toronto, Assessment and home connections. ON: Queen’s Printer for

Ontario.

Olina , Z & Sullivan, H.J. (2002) Effects of teacher and self assessment on student

performance. Paper presented atthe annual convention of the American

Education Research Association, April 1-5,2002, New Orleans ,LA.

(11)

Rolheiser, C., &Ross, J. (2001) Student self-evaluation : What research says and

what practice show. Retrieved December 28, 2014 from

http://www.cdl.org/resource-library/articles/selfeval.php.

Stiggins, R., & Chapuis, J (2006) What a difference a word makes: Assessmenmt

FOR learning Rather than assessment OF learning helps student succed.

Journal of Staff Development.

Stiggins, R. (2008). An introduction to student-involved assessment FOR learning.