In RAWGraphs, scatterplot and bubble graph are presented as variants of the same layout. This layout is useful to see correlations among dimensions. In this example, we will use it to visualize some data related to a list of movies.
GOAL OF THE TUTORIAL: comparing movies according box office, production budget, return on investment and genre.
The first step is to copy and paste your data into RAWGraphs.
The dataset must contain at least two columns, one for the X Axis (in our case, Production Budget) and one for the Y Axis (Box Office). Optionally, you can add a column for the Size (ROI), a column for the Color (Genre), and one or more for the label (Movie).
Here you can see the dataset we are going to use:
You can also download the dataset through the link on the left of this post or load the data from using the “Try our samples” button then choosing “Movies – Dispersion”.
In few second you should see the green bar saying “26 records in your data have been successfully parsed!”
After pasting your data, scroll down and select the “scatter plot” visual model.
The scatterplot layout has five visual variables.
|X Axis||Number, date||[fa class=”fa-asterisk”]||The horizontal position of the circles.|
|Y Axis||Number, date||[fa class=”fa-asterisk”]||The vertical position of the circles.|
|Size||Number||Area of the circles. If not provided, size will be proportional to the count of rows.|
|Color||String, numbers, dates||For each unique value found in the dimension, a color is created.|
|Label||String, numbers, dates||[fa class=”fa-tags”]||Values will be used as labels. If more than one dimension is dragged, the label will contains the values separated by comma.|
It’s time to map your data and create the visualization. On the left you have the list of dimensions in your dataset, on the right the available visual variable of the scatter plot layout.
Drag and drop the dimensions as following:
In this way we created a circle for each movie in the dataset, disposing it on the cartesian plane according to its production budget and its box office. We then set the area of each circle proportional to the ROI. We colored circles according to genres, and finally put the movie’s name as label.
To reach the final results we can customize the visualization with some parameters you can find next to the visualization.
For the scatterplot the options available are the following:
|Width||artboard width in pixels|
|Height||artboard height in pixels|
|Max Radius||The radius for the circle with the highest value in the dimension mapped as Size.|
|Set origin at (0,0)||If selected, the scatterplot origin will be set at the zero value for both the axes. If not selected, origin will be set to the minimum value for the two axes.|
|colour scale||list of uniques values in the dimension mapped as “color”. If set to ordinal, you can set a color for each value. If set to linear, the app will try to find the minimum and maximum value contained in the dimension, and then creating a gradient among those two values.|
For the final results we have set the options as following:
Here is the finished chart. You can export it in svg or png using the download function at the bottom of the page.
"How to make a scatterplot", by RAWGraphs Team. Licensed under CC BY-NC-SA 4.0. Accessed: July 17, 2020, from undefined
Copy to clipboard