<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="ru">
	<id>http://wiki.mipt.ru/index.php?action=history&amp;feed=atom&amp;title=Development%3AQuiz_report_statistics</id>
	<title>Development:Quiz report statistics - История изменений</title>
	<link rel="self" type="application/atom+xml" href="http://wiki.mipt.ru/index.php?action=history&amp;feed=atom&amp;title=Development%3AQuiz_report_statistics"/>
	<link rel="alternate" type="text/html" href="http://wiki.mipt.ru/index.php?title=Development:Quiz_report_statistics&amp;action=history"/>
	<updated>2026-05-07T14:05:55Z</updated>
	<subtitle>История изменений этой страницы в вики</subtitle>
	<generator>MediaWiki 1.42.1</generator>
	<entry>
		<id>http://wiki.mipt.ru/index.php?title=Development:Quiz_report_statistics&amp;diff=11732&amp;oldid=prev</id>
		<title>Олег Давидович: 1 версия импортирована</title>
		<link rel="alternate" type="text/html" href="http://wiki.mipt.ru/index.php?title=Development:Quiz_report_statistics&amp;diff=11732&amp;oldid=prev"/>
		<updated>2024-10-21T08:53:04Z</updated>

		<summary type="html">&lt;p&gt;1 версия импортирована&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;ru&quot;&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Предыдущая версия&lt;/td&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Версия от 08:53, 21 октября 2024&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-notice&quot; lang=&quot;ru&quot;&gt;&lt;div class=&quot;mw-diff-empty&quot;&gt;(нет различий)&lt;/div&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;</summary>
		<author><name>Олег Давидович</name></author>
	</entry>
	<entry>
		<id>http://wiki.mipt.ru/index.php?title=Development:Quiz_report_statistics&amp;diff=11731&amp;oldid=prev</id>
		<title>1&gt;Mudrd8mz: Text replacement - &quot;class=&quot;nicetable&quot;&quot; to &quot;class=&quot;wikitable&quot;&quot;</title>
		<link rel="alternate" type="text/html" href="http://wiki.mipt.ru/index.php?title=Development:Quiz_report_statistics&amp;diff=11731&amp;oldid=prev"/>
		<updated>2021-07-14T13:24:39Z</updated>

		<summary type="html">&lt;p&gt;Text replacement - &amp;quot;class=&amp;quot;nicetable&amp;quot;&amp;quot; to &amp;quot;class=&amp;quot;wikitable&amp;quot;&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;Новая страница&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;These statistics are designed for use with summative tests where students have just one attempt and complete that attempt.&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
== Test statistics ==&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Average grade&amp;#039;&amp;#039;: For discriminating, deferred feedback, tests aim for between 50% and 75%. Values outside&lt;br /&gt;
these limits need thinking about. Interactive tests with multiple tries invariably lead to higher averages.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Median grade&amp;#039;&amp;#039;: Half the students score less than this figure.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Standard deviation&amp;#039;&amp;#039;: A measure of the spread of scores about the mean. Aim for values between 12% and&lt;br /&gt;
18%. A smaller value suggests that scores are too bunched up.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Skewness&amp;#039;&amp;#039;: A measure of the asymmetry of the distribution of scores. Zero implies a perfectly symmetrical&lt;br /&gt;
distribution, positive values a ‘tail’ to the right and negative values a ‘tail’ to the left.&lt;br /&gt;
&lt;br /&gt;
[[Image:Skewness.jpg]]&lt;br /&gt;
&lt;br /&gt;
Aim for a value of -1.0. If it is too negative, it may indicate lack of discrimination between students who do better than average. Similarly, a large positive value (greater than 1.0) may indicate a lack of discrimination near the pass fail border.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Kurtosis&amp;#039;&amp;#039;: Kurtosis is a measure of the flatness of the distribution.&lt;br /&gt;
&lt;br /&gt;
A normal, bell shaped, distribution has a kurtosis of zero. The greater the kurtosis, the more peaked is the&lt;br /&gt;
distribution, without much of a tail on either side.&lt;br /&gt;
&lt;br /&gt;
Aim for a value in the range 0-1. A value greater than 1 may indicate that the test is not discriminating&lt;br /&gt;
very well between very good or very bad students and those who are average.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Coefficient of internal consistency (CIC)&amp;#039;&amp;#039;: It is impossible to get internal consistency much above 90%.&lt;br /&gt;
Anything above 75% is satisfactory. If the value is below 64%, the test as a whole is unsatisfactory and&lt;br /&gt;
remedial measures should be considered.&lt;br /&gt;
&lt;br /&gt;
A low value indicates either that some of the questions are not very good at discriminating between&lt;br /&gt;
students of different ability and hence that the differences between total scores owe a good deal to chance&lt;br /&gt;
or that some of the questions are testing a different quality from the rest and that these two qualities do not&lt;br /&gt;
correlate well – i.e. the test as a whole is inhomogeneous.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Error ratio (ER)&amp;#039;&amp;#039;: This is related to CIC according to the following table: it estimates the percentage of the&lt;br /&gt;
standard deviation which is due to chance effects rather than to genuine differences of ability between&lt;br /&gt;
students. Values of ER in excess of 50% cannot be regarded as satisfactory: they imply that less than half&lt;br /&gt;
the standard deviation is due to differences in ability and the rest to chance effects.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! CIC&lt;br /&gt;
! ER&lt;br /&gt;
|-&lt;br /&gt;
| 100&lt;br /&gt;
| 0&lt;br /&gt;
|-&lt;br /&gt;
| 99&lt;br /&gt;
| 10&lt;br /&gt;
|-&lt;br /&gt;
| 96&lt;br /&gt;
| 20&lt;br /&gt;
|-&lt;br /&gt;
| 91&lt;br /&gt;
| 30&lt;br /&gt;
|-&lt;br /&gt;
| 84&lt;br /&gt;
| 40&lt;br /&gt;
|-&lt;br /&gt;
| 75&lt;br /&gt;
| 50&lt;br /&gt;
|-&lt;br /&gt;
| 64&lt;br /&gt;
| 60&lt;br /&gt;
|-&lt;br /&gt;
| 51&lt;br /&gt;
| 700&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Standard error (SE)&amp;#039;&amp;#039;: This is SD x ER/100. It estimates how much of the SD is due to chance effects and is a&lt;br /&gt;
measure of the uncertainty in any given student’s score. If the same student took an equivalent test, his&lt;br /&gt;
or her score could be expected to lie within ±SE of the previous score. The smaller the value of SE the&lt;br /&gt;
better the test, but it is difficult to get it below 5% or 6%. A value of 8% corresponds to half a grade&lt;br /&gt;
difference on a typical scale – if the SE exceeds this, it is likely that a substantial proportion of the&lt;br /&gt;
students will be wrongly graded in the sense that the grades awarded do not accurately indicate their true&lt;br /&gt;
abilities.&lt;br /&gt;
&lt;br /&gt;
== Question statistics ==&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Facility index (F)&amp;#039;&amp;#039;: The mean score of students on the item.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! F&lt;br /&gt;
! Interpretation&lt;br /&gt;
|-&lt;br /&gt;
| 5 or less&lt;br /&gt;
| Extremely difficult or something wrong with the question.&lt;br /&gt;
|-&lt;br /&gt;
| 6-10&lt;br /&gt;
| Very difficult.&lt;br /&gt;
|-&lt;br /&gt;
| 11-20&lt;br /&gt;
| Difficult.&lt;br /&gt;
|-&lt;br /&gt;
| 20-34&lt;br /&gt;
| Moderately difficult.&lt;br /&gt;
|-&lt;br /&gt;
| 35-64&lt;br /&gt;
| About right for the average student.&lt;br /&gt;
|-&lt;br /&gt;
| 66-80&lt;br /&gt;
| Fairly easy.&lt;br /&gt;
|-&lt;br /&gt;
| 81-89&lt;br /&gt;
| Easy.&lt;br /&gt;
|-&lt;br /&gt;
| 90-94&lt;br /&gt;
| Very easy.&lt;br /&gt;
|-&lt;br /&gt;
| 95-100&lt;br /&gt;
| Extremely easy.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Standard deviation (SD)&amp;#039;&amp;#039;: A measure of the spread of scores about the mean and hence the extent to which&lt;br /&gt;
the question might discriminate. If F is very high or very low it is impossible for the spread to be large.&lt;br /&gt;
Note however that a good SD does not automatically ensure good discrimination. A value of SD less than&lt;br /&gt;
about a third of the question maximum (i.e. 33%) in the table is not generally satisfactory.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Random guess score (RGS)&amp;#039;&amp;#039;: This is the mean score students would be expected to get for a random guess at&lt;br /&gt;
the question. Random guess scores are only available for questions that use some form of multiple choice.&lt;br /&gt;
All random guess scores are for deferred feedback only and assume the simplest situation e.g. for multiple&lt;br /&gt;
response questions students will be told how many answers are correct.&lt;br /&gt;
&lt;br /&gt;
Values above 40% are unsatisfactory – and show that True/False questions must be used sparsely in summative tests.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Intended weight&amp;#039;&amp;#039;: The question weight expressed as a percentage of the overall test score.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Effective weight&amp;#039;&amp;#039;: An estimate of the weight the question actually has in contributing to the overall spread of&lt;br /&gt;
scores. The effective weights should add to 100% - but read on.&lt;br /&gt;
&lt;br /&gt;
The intended weight and effective weight are intended to be compared. If the effective weight is greater&lt;br /&gt;
than the intended weight it shows the question has a greater share in the spread of scores than may have&lt;br /&gt;
been intended. If it is less than the intended weight it shows that it is not having as much effect in&lt;br /&gt;
spreading out the scores as was intended.&lt;br /&gt;
&lt;br /&gt;
The calculation of the effective weight relies on taking the square root of the covariance of the question&lt;br /&gt;
scores with overall performance. If a question’s scores vary in the opposite way to the overall score, this&lt;br /&gt;
would indicate that this is a very odd question which is testing something different from the rest. And the&lt;br /&gt;
computer cannot calculate the effective weights of such questions resulting in warning message boxes&lt;br /&gt;
being displayed.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Discrimination index&amp;#039;&amp;#039;: This is the correlation between the weighted scores on the question and those on the&lt;br /&gt;
rest of the test. It indicates how effective the question is at sorting out able students from those who are&lt;br /&gt;
less able. The results should be interpreted as follows&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Index&lt;br /&gt;
! Interpretation&lt;br /&gt;
|-&lt;br /&gt;
| 50 and above&lt;br /&gt;
| Very good discrimination&lt;br /&gt;
|-&lt;br /&gt;
| 30 – 50&lt;br /&gt;
| Adequate discrimination&lt;br /&gt;
|-&lt;br /&gt;
| 20 - 29&lt;br /&gt;
| Weak discrimination&lt;br /&gt;
|-&lt;br /&gt;
| 0 - 19&lt;br /&gt;
| Very weak discrimination&lt;br /&gt;
|-&lt;br /&gt;
| -ve&lt;br /&gt;
| Question probably invalid&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Discrimination efficiency&amp;#039;&amp;#039;: This statistic attempts to estimate how good the discrimination index is relative&lt;br /&gt;
to the difficulty of the question.&lt;br /&gt;
&lt;br /&gt;
An item which is very easy or very difficult cannot discriminate between students of different ability,&lt;br /&gt;
because most of them get the same score on that question. Maximum discrimination requires a facility&lt;br /&gt;
index in the range 30% - 70% (although such a value is no guarantee of a high discrimination index).&lt;br /&gt;
&lt;br /&gt;
The discrimination efficiency will very rarely approach 100%, but values in excess of 50% should be&lt;br /&gt;
achievable. Lower values indicate that the question is not nearly as effective at discriminating between&lt;br /&gt;
students of different ability as it might be and therefore is not a particularly good question.&lt;br /&gt;
&lt;br /&gt;
--[[User:Phil Butcher|Phil Butcher]] 15:53, 27 September 2010 (UTC)&lt;/div&gt;</summary>
		<author><name>1&gt;Mudrd8mz</name></author>
	</entry>
</feed>