The primary conclusion we draw from these examples is that it is possible to design tasks and contexts in which teachers elicit students’ thinking about disci- plinary core ideas and crosscutting concepts by engaging them in scientific prac- tices. Tasks designed with the characteristics we have discussed (three dimensions, interconnections among concepts and practices, a way to identify students’ place on a continuum) produce artifacts, discussions, and activities that provide teachers with information about students’ thinking and so can help them make decisions about how to proceed or how to adjust subsequent instruction or to evaluate the learning that took place over a specified period of time.
Questions have been raised about whether students can achieve the ambi- tious performance expectations in the NGSS. The implementation of the NGSS is a complex subject that is beyond the scope of our charge; however, each of the examples shown has been implemented with diverse samples of students,21 and there have been students who succeeded on them (although there are also students who did not). The tasks in our examples assess learning that is part of a well- designed, coherent sequence of instruction on topics and in ways that are very similar to NGSS performance expectations. Each example offers multiple opportu- nities to engage in scientific practices and encourage students to draw connections among ideas, thus developing familiarity with crosscutting concepts.
CONCLUSION 4-1 Tasks designed to assess the performance expectations in the Next Generation Science Standards will need to have the following characteristics:
21Samples included students from rural and inner-city schools, from diverse racial and ethnic
• multiple components that reflect the connected use of different scientific practices in the context of interconnected disciplinary ideas and crosscutting concepts;
• reflect the progressive nature of learning by providing information about where students fall on a continuum between expected beginning and ending points in a given unit or grade; and
• an interpretive system for evaluating a range of student products that is specific enough to be useful for helping teachers understand the range of stu- dent responses and that provides tools to helping them decide on next steps in instruction.
CONCLUSION 4-2 To develop the skills and dispositions to use scientific and engineering practices needed to further their learning and to solve prob- lems, students need to experience instruction in which they (1) use multiple practices in developing a particular core idea and (2) apply each practice in the context of multiple core ideas. Effective use of the practices often requires that they be used in concert with one another, such as in supporting expla- nation with an argument or using mathematics to analyze data. Classroom assessments should include at least some tasks that reflect the connected use of multiple practices.
CONCLUSION 4-3 It is possible to design assessment tasks and scoring rubrics that assess three-dimensional science learning. Such assessments pro- vide evidence that informs teachers and students of the strengths and weak- nesses of a student’s current understanding, which can guide further instruc- tion and student learning and can also be used to evaluate students’ learning.
We emphasize that implementing the conception of science learning envi- sioned in the framework and the NGSS will require teachers who are well trained in assessment strategies such as those discussed in this chapter. Professional devel- opment will be essential in meeting this goal.
CONCLUSION 4-4 Assessments of three-dimensional science learning are challenging to design, implement, and properly interpret. Teachers will need extensive professional development to successfully incorporate this type of assessment into their practice.
On the basis of the conclusions above, the committee offers recommen- dations about professional development and for curriculum and assessment development.
RECOMMENDATION 4-1 State and district leaders who design professional development for teachers should ensure that it addresses the changes called for by the framework and the Next Generation Science Standards in both the design and use of assessment tasks and instructional strategies. Professional development must support teachers in integrating practices, crosscutting con- cepts, and disciplinary core ideas in inclusive and engaging instruction and in using new modes of assessment that support such instructional activities.
Developing assessment tasks of this type will require the participation of several different kinds of experts. First, for the tasks to accurately reflect science ideas, scientists will need to be involved. Second, experts in science learning will also be needed to ensure that knowledge from research on learning is used as a guide to what is expected of students. Third, assessment experts will be needed to clarify relationships among tasks and the forms of knowledge and practice that the items are intended to elicit. Fourth, practitioners will be needed to ensure that the tasks and interpretive frameworks linked to them are usable in classrooms. And fifth, as we discuss further in Chapter 6, this multidisciplinary group of experts will need to include people who have knowledge of and experience with population subgroups, such as students with disabilities and students with varied cultural backgrounds, to ensure that the tasks are not biased for or against any subgroups of students for reasons irrelevant to what is being measured.
We note also that curricula, textbooks, and other resources, such as digi- tal content, in which assessments may be embedded, will also need to reflect the characteristics we have discussed—and their development will present similar chal- lenges. For teachers to incorporate tasks of this type into their practice, and to design additional tasks for their classrooms, they will need to have worked with many good examples in their curriculum materials and professional development opportunities.
RECOMMENDATION 4-2 Curriculum developers, assessment developers, and others who create resource materials aligned to the science framework and the Next Generation Science Standards should ensure that assessment activities included in such materials (such as mid- and end-of-chapter activi-
ties, suggested tasks for unit assessment, and online activities) require students to engage in practices that demonstrate their understanding of core ideas and crosscutting concepts. These materials should also reflect multiple dimensions of diversity (e.g., by connecting with students’ cultural and linguistic identi- ties). In designing these materials, development teams need to include experts in science, science learning, assessment design, equity and diversity, and sci- ence teaching.
I
n Chapter 4, we focused on assessments that are used as a part of classroom instructional activities. In this chapter we turn to assessments that are distinct from classroom instruction and used to monitor or audit student learning over time. We refer to them as “monitoring assessments” or “external assessments.”1 They can be used to answer a range of important questions about student learning, such as: How much have the students in a certain school or school system learned over the course of a year? How does achievement in one school system compare with achievement in another? Is one instructional technique or curricular program more effective than another? What are the effects of a particular policy measure, such as reduction in class size? Table 5-1 shows examples of the variety of questions that monitoring assess- ments may be designed to answer at different levels of the education system.The tasks used in assessments designed for monitoring purposes need to have the same basic characteristics as those for classroom assessments (discussed in Chapter 4) in order to align with the Next Generation Science Standards (NGSS): they will need to address the progressive nature of learning, include multiple components that reflect three-dimensional science learning, and include an interpretive system for the evaluation of a range of student products. In addition, assessments for monitoring need to be designed so that they can be given to large numbers of students, are suf- ficiently standardized to support the intended monitoring purpose (which may involve
1External assessments (sometimes referred to as large-scale assessments) are designed or selected
outside of the classroom, such as by districts, states, countries, or international bodies, and are typi- cally used to audit or monitor learning.