Scagnostics

Scatterplot diagnostics measures
You can help expand this article with text translated from the corresponding article in German. (March 2022) Click [show] for important translation instructions.
  • Machine translation, like DeepL or Google Translate, is a useful starting point for translations, but translators must revise errors as necessary and confirm that the translation is accurate, rather than simply copy-pasting machine-translated text into the English Wikipedia.
  • Consider adding a topic to this template: there are already 9,118 articles in the main category, and specifying|topic= will aid in categorization.
  • Do not translate text that appears unreliable or low-quality. If possible, verify the text with references provided in the foreign-language article.
  • You must provide copyright attribution in the edit summary accompanying your translation by providing an interlanguage link to the source of your translation. A model attribution edit summary is Content in this edit is translated from the existing German Wikipedia article at [[:de:Scagnostics]]; see its history for attribution.
  • You may also add the template {{Translated|de|Scagnostics}} to the talk page.
  • For more guidance, see Wikipedia:Translation.
Scatterplot matrix of the scagnostics measures for the 91 scatterplots of the variables of the Boston Housing data set

Scagnostics (scatterplot diagnostics) is a series of measures that characterize certain properties of a point cloud in a scatter plot. The term and idea was coined by John Tukey and Paul Tukey, though they didn't publish it; later it was elaborated by Wilkinson, Anand, and Grossman. The following nine dimensions are considered:[1][2]

  1. For the outliers in the data:
    1. outlying
  2. For the density of data points:
    1. skewed
    2. clumpy
    3. sparse
    4. striated
  3. For the shape of the point cloud:
    1. convex
    2. skinny
    3. stringy
  4. For trends in the data:
    1. monotony

References

  1. ^ Wilkinson, Leland (23 April 2008). "Scagnostics". Retrieved 25 March 2022. {{cite journal}}: Cite journal requires |journal= (help)
  2. ^ Wilkinson, Leland; Anand, Anushka; Grossman, Robert (2005). "Graph-theoretic scagnostics". In Proc. 2005 IEEE Symp. On Information Visualization (INFOVIS: 157–164. CiteSeerX 10.1.1.329.1315.

External links

  • Python library pyscagnostics
  • R package scagnostics
  • v
  • t
  • e
Statistics
Descriptive statistics
Continuous data
Center
  • Mean
  • Median
  • Mode
Dispersion
Shape
Count data
Summary tables
Dependence
Graphics
Study design
Survey methodology
Controlled experiments
Adaptive designs
Observational studies
Statistical theory
Frequentist inference
Point estimation
Interval estimation
Testing hypotheses
Parametric tests
Specific tests
  • Z-test (normal)
  • Student's t-test
  • F-test
Goodness of fit
Rank statistics
Bayesian inference
Correlation
Regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical
Multivariate
Time-series
General
Specific tests
Time domain
Frequency domain
Survival
Survival function
Hazard function
Test
Biostatistics
Engineering statistics
Social statistics
Spatial statistics
  • Category
  • icon Mathematics portal
  • Commons
  • WikiProject
Stub icon

This statistics-related article is a stub. You can help Wikipedia by expanding it.

  • v
  • t
  • e