=>
Statistics in Language Studies (a)
Gap-fill exercise
Fill in all the gaps using the AWL words in the list, then press "Check" to check your answers. Use the "Hint" button to get a free letter if an answer is giving you trouble. Note that you will lose points if you ask for hints or clues!
adequate analyse assume chapter chapters classical constructing data deviation emphasising illustrate infer inference Inference inferences involved obtain obtained obviously potential relevant resources scope selected similar Statistical statistical straightforward
4.1 The problem
Until now we have been considering how to describe or summarise a set of
considered simply as an object in its own right. Very often we want to do more than this: we wish to use a collection of observed values to make
about a larger set of
values; we would like to consider a particular set of
we have
as representing a larger class. It turns out that to accomplish this is by no means
. What is more, an exhaustive treatment of the difficulties
is beyond the
of this book. In this
we can only provide the reader with a general outline of the problem of making
from observed values. A full understanding of this exposition will depend to some degree on familiarity with the content of later
. For this reason we suggest that this
is first read to
a general grasp of the problem, and returned to later for re-reading in the light of subsequent
.
We will
the problem of
by introducing some of the cases which we will
in greater detail in the
to come. One, for example, in
8, concerns the size of the comprehension vocabulary of British children between 6 and 7 years of age. It is
not possible, for practical reasons, to test all British children of this age. We simply will not have the
. We can only test a sample of children. We have learned, in
2 and 3, how to make an
description of an observed group, by, for example,
a histogram or calculating the mean and standard
of the vocabulary sizes of the subset of children
. But our interest is often broader than this; we would like to know the mean and standard
which would have been obtained by testing all children of the
age. How close would these have been to the mean and standard
actually observed? This will depend on the relationship we expect to hold between the group we have
to measure and the larger group of children from which it has been
. How far can we
the characteristics of this latter group to be
to those of the smaller group which has been observed? This is the
problem of
: how to
from the properties of a part the likely properties of the whole. It will turn up repeatedly from now on. It is worth
at the outset that because of the way in which samples are
in many studies in linguistics and applied linguistics, it is often simply not possible to generalise beyond the samples. We will return to this difficulty.
Check
Hint
OK
=>