Scatterplot

A scatter plot uses Cartesian coordinates to display values for typically two variables for a set of data. The data are displayed as a collection of points, each having the value of two variables determining the position.

The major role of scatterplots lies in revealing associations (EDAV - Dependency Relationship) between variables, not just linear associations, but any kind of association.

Features

An Example

ggplot2movies: rating vs. votes (y ~ x)

We can observe that

Overplotting

As we can see in the above example, scatterplots may overplot, making dots stacked together. There are some strategies for this problem

Smooth Scatterplot

A smooth scatterplot is a combination of a scatterplot, a Heatmap, and a Density Contour Plot. It plots the points and uses colors and shapes to show the magnitude of clusters. Use the R built-in function smoothScatter(x,y) to create a smooth scatterplot.

Creative Commons License by zcysxy