초록

일반적으로 검사점수는 검사의 성격과 결과의 활용 목적에 따라 그 형태와 산출 및 보고 방식이 결정된다. 우리나라 대학수학능력시험의 경우 검사 목적(대학 신입생 선발)에 보다 부합하는 결과 활용을 위하여 다양한 변화를 겪어 왔다. 특히 2008학년도 대학수학능력시험에서는 표준점수와 백분위가 폐지되고 등급만이 보고되며, 이때의 등급은 측정 이론적으로 스테나인(stanines)에 기반한다. 수능 점수 형태는 검사의 목적과 결과 활용에 직접적으로 연관된 것으로 새로운 변화에 따른 수능의 성격과 점수 체제 전반에 대한 논의가 필요하다. 이를 위해 미국의 SAT와 ACT, 일본의 대학입시센터시험, 영국 GCE의 성격과 점수체제를 비교ㆍ분석하였다. 미국의 SAT와 ACT의 척도점수체제, 일본 대학입시센터시험의 원점수체제, 영국 GCE의 등급체제는 각국의 독자적 검사 특성에 부합하는 점수체제로서, 대학수학능력시험의 개선, 특히 점수의 일관성과 공정성 확보를 위한 점수체제 개선이라는 측면에서 많은 시사점을 제공한다.

From 2008 School Year College Scholastic Ability Test(CSAT), only 9-grade scores will be reported without standardized scores and percentile ranks. Our 9-grade scores are based on traditional stanines. So when the statistical assumptions of stanines are violated, resulted scores would be out of targeted distribution. It means correspondence between purpose and use of CSAT can be made not only by type of scores but also by test characteristics. In order to get directions for reforming CAST's test and scoring procedures, we reviewed several foreign College Entrance Exams in U.S., Japan, and U.K. We found that scaled scores of SAT and ACT in U.S., raw scores of National Center Test for University Admissions in Japan, and 5-grade scores of GCE in U.K. came from and corresponded to its own test characteristics. We also discussed consistency and equity of CSAT scores compared with those of foreign tests.