Sample (statistics)


Sample (statistics)

In statistics, a sample is a subset of a population. Typically, the population is very large, making a census or a complete enumeration of all the values in the population impractical or impossible. The sample represents a subset of manageable size. Samples are collected and statistics are calculated from the samples so that one can make inferences or extrapolations from the sample to the population. This process of collecting information from a sample is referred to as sampling.

The best way to avoid a biased or unrepresentative sample is to select a random sample, also known as a probability sample. A random sample is defined as a sample where the probability that any individual member from the population being selected as part of the sample is exactly the same as any other individual member of the population. Several types of random samples are simple random samples, systematic samples, stratified random samples, and cluster random samples.

A sample that is not random is called a nonrandom sample or a nonprobability sample. Some examples of nonrandom samples are convenience samples, judgment samples, purposive samples, quota samples, snowball samples, and quadrature nodes in quasi-Monte Carlo methods.

Mathematical description of random sample

In mathematical terms, given a random variable "X" with distribution "F", a random sample of length "n" =1,2,3,... is a set of "n" independent, identically distributed (iid) random variables with distribution "F". [Samuel S. Wilks, "Mathematical Statistics", John Wiley, 1962, Section 8.1 ]

A sample concretely represents "n" experiments in which we measure the same quantity. For example, if "X" represents the height of an individual and we measure n individuals, X_i will be the height of the "i"-th individual. Note that a sample of random variables (i.e. a set of measurable functions) must not be confused with the realizations of these variables (which are the values that these random variables take). In other words, X_i is a function representing the measurement at the i-th experiment and x_i=X_i(omega) is the value we actually get when making the measurement.

The concept of a sample thus includes the "process" of how the data are obtained (that is, the random variables). This is necessary so that mathematical statements can be made about the sample and statistics computed from it, such as the sample mean and covariance.

References

ee also

*Estimation theory
*Sampling (statistics)
*Sample size
*Multiset#Cumulant generating function

External links

* [http://www.socialresearchmethods.net/kb/sampstat.htm Statistical Terms Made Simple]


Wikimedia Foundation. 2010.

Look at other dictionaries:

  • Статистика малых выборок (small-sample statistics) — Принято считать, что начало С. м. в. или, как ее часто называют, статистике «малых п», было положено в первом десятилетии XX века публикацией работы У. Госсета, в к рой он поместил t распределение, постулированное получившим чуть позже мировую… …   Психологическая энциклопедия

  • statistics — /steuh tis tiks/, n. 1. (used with a sing. v.) the science that deals with the collection, classification, analysis, and interpretation of numerical facts or data, and that, by use of mathematical theories of probability, imposes order and… …   Universalium

  • Sample size determination — is the act of choosing the number of observations to include in a statistical sample. The sample size is an important feature of any empirical study in which the goal is to make inferences about a population from a sample. In practice, the sample …   Wikipedia

  • Statistics — is a mathematical science pertaining to the collection, analysis, interpretation or explanation, and presentation of data. Also with prediction and forecasting based on data. It is applicable to a wide variety of academic disciplines, from the… …   Wikipedia

  • Statistics New Zealand — Tatauranga Aotearoa Statistics New Zealand logo Agency overview Jurisdiction New Zealand Headquarters …   Wikipedia

  • Sample mean and sample covariance — are statistics computed from a collection of data, thought of as being random.ample mean and covarianceGiven a random sample extstyle mathbf{x} {1},ldots,mathbf{x} {N} from an extstyle n dimensional random variable extstyle mathbf{X} (i.e.,… …   Wikipedia

  • sample — [sam′pəl, säm′pəl] n. [ME, aphetic for asample < Anglo Fr, for OFr essample: see EXAMPLE] 1. a part, piece, or item taken or shown as representative of a whole thing, group, species, etc.; specimen; pattern [samples of wallpaper] 2. an… …   English World dictionary

  • sample — ► NOUN 1) a small part or quantity intended to show what the whole is like. 2) Statistics a portion of a population, serving as a basis for estimates of the attributes of the whole population. 3) a specimen taken for scientific testing or… …   English terms dictionary

  • Sample maximum and minimum — Box plots of the Michelson–Morley experiment, showing sample maximums and minimums. In statistics, the maximum and sample minimum, also called the largest observation, and smallest observation, are the values of the greatest and least elements of …   Wikipedia

  • Sample size — The sample size of a statistical sample is the number of observations that constitute it. It is typically denoted n , a positive integer (natural number).Typically, all else being equal, a larger sample size leads to increased precision in… …   Wikipedia