Dot plot (statistics)

A dot chart or dot plot is a statistical chart consisting of data points plotted on a simple scale, typically using filled-in circles. There are two common, yet very different, versions of the dot chart. The first, described by Wilkinson, was used in hand-drawn (pre-computer era) graphs to depict distributions.[1] The second, described by Cleveland, is an alternative to the bar chart in which dots depict the quantitative values (e.g. counts) associated with categorical variables.[2]

Wilkinson dot plots

A dot plot, as described by Wilkinson, of 50 random values from 0 to 9.

The dot plot as a representation of a distribution consists of a group of data points plotted on a simple scale. Dot plots are used for continuous, quantitative, univariate data. Data points may be labelled if there are few of them.

Dot plots are one of the simplest statistical plots and are suitable for small to moderately sized data sets. They are useful for highlighting clusters and gaps, as well as outliers. Another advantage is that they conserve numerical information. For larger data sets (around 20–30 or more data points), the related stemplot, box plot or histogram may be more efficient, as dot plots may become too cluttered beyond this point. Dot plots may be distinguished from histograms in that the dots are not spaced uniformly along the horizontal axis.

Although the plot appears simple, its computation and the statistical theory underlying it are not. The algorithm for computing a dot plot is closely related to kernel density estimation. The size chosen for the dots affects the appearance of the plot: choosing the dot size is equivalent to choosing the bandwidth for a kernel density estimate.
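The effect of dot size can be illustrated with a minimal sketch of one stacking rule. It is a simplification for illustration, not Wilkinson's full published algorithm: values are sorted, each stack starts at the smallest unassigned value, and it absorbs every value lying less than one dot width above that starting value. The function name dot_stacks, the parameter dot_width, and the choice to place each stack at the mean of its members are assumptions made here.

```python
from statistics import mean


def dot_stacks(values, dot_width):
    """Group values into stacks of dots (simplified, illustrative rule).

    Each stack starts at the smallest value not yet assigned and absorbs
    every value less than dot_width above that starting value.
    Returns a list of (stack position, number of dots) pairs, where the
    position is the mean of the values in the stack (an assumption).
    """
    stacks = []
    current = []
    for v in sorted(values):
        # Close the current stack once v is a full dot width past its start.
        if current and v - current[0] >= dot_width:
            stacks.append((mean(current), len(current)))
            current = []
        current.append(v)
    if current:
        stacks.append((mean(current), len(current)))
    return stacks


sample = [1, 2, 3, 10, 11, 20]  # hypothetical data
narrow = dot_stacks(sample, dot_width=3)
wide = dot_stacks(sample, dot_width=10)
```

With the hypothetical sample above, a dot width of 3 yields three stacks (of 3, 2 and 1 dots), while a dot width of 10 merges them into two stacks (of 4 and 2 dots). A larger dot smooths the picture in much the same way a larger bandwidth smooths a kernel density estimate.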

Cleveland dot plots

Dot plot may also refer to plots of points that each belong to one of several categories. They are an alternative to bar charts or pie charts, and look somewhat like a horizontal bar chart in which the bars are replaced by dots at the values associated with each category. Cleveland argues that, compared to (vertical) bar charts and pie charts, dot plots allow readers to interpret the graph more accurately by making the labels easier to read, reducing non-data ink (or graph clutter) and supporting table look-up.
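The layout described above can be sketched as a plain-text rendering: one row per category, the label on the left, and a single dot placed along a common scale. The function name cleveland_dot_chart, the width parameter, the light dotted guide line, the descending sort order and the sample counts are all assumptions for illustration, and values are assumed to be non-negative.

```python
def cleveland_dot_chart(data, width=40):
    """Render a text dot chart for a mapping of category -> value.

    Rows are sorted by value (largest first); each row shows the label,
    a faint guide line, and one 'o' whose column is proportional to the
    value on a scale running from 0 to the largest value.
    """
    top = max(data.values())
    label_w = max(len(label) for label in data)
    lines = []
    for label, value in sorted(data.items(), key=lambda kv: kv[1], reverse=True):
        # Column of the dot on a scale of width characters starting at 0.
        pos = round(value / top * (width - 1)) if top else 0
        lines.append(f"{label:>{label_w}} |{'.' * pos}o  {value}")
    return lines


counts = {"apples": 30, "pears": 12, "figs": 5}  # hypothetical counts
for line in cleveland_dot_chart(counts, width=31):
    print(line)
```

Because every label sits on its own horizontal row next to its dot, long category names stay readable, which is harder to achieve under the bars of a vertical bar chart.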

In the R programming language this type of plot is also referred to as a stripchart[3] or stripplot.[4]

References

  1. Wilkinson, Leland (1999). "Dot plots". The American Statistician (American Statistical Association) 53 (3): 276–281. doi:10.2307/2686111. JSTOR 2686111.
  2. Cleveland, William S. (1993). Visualizing Data. Hobart Press. ISBN 0963488406. hdl:2027/mdp.39015026891187.
  3. Dalgaard, Peter. Introductory Statistics with R. Springer. ISBN 0387954759.
  4. Murrell, Paul (2005). R Graphics. Chapman & Hall/CRC. ISBN 158488486X. http://www.stat.auckland.ac.nz/~paul/RGraphics/rgraphics.html.

Other references

  • Wild, C. and Seber, G. (2000) Chance Encounters: A First Course in Data Analysis and Inference John Wiley and Sons. ISBN 0-471-32936-3

Wikimedia Foundation. 2010.
