Please note: this guide has been written in relation to version 2.0 of the spatial portal due to brought into production in 2019. A different version of the portal will allow similar functions with some small differences in the user interface. If you are unable to access a required function, please contact support@ala.org.au.


The scatterplot function links the sampled values of any two environmental variables on a species (or genus etc) with the map. Points on the scatterplot represent the environment found at each occurrence record, as given by the environmental variables of the two axes of the scatterplot.


The scatterplot (environmental space) and the map (geographic space) are linked. Dragging a rectangle over an area of the scatterplot to enclose occurrence points will highlight the corresponding points on the map. You can also define an active area on the map and have all occurrences within that area highlighted on the scatterplot.







The scatterplot allows you to:

  • Examine the environmental niche of one or two taxa or taxonomic groups. Using a species as the primary taxa and its genus as the secondary taxa can be highly informative. For example, survey effort for the species may be able to be evaluated. Two similar species can be plotted to see how they may be partitioning the environment.
  • Identify where species do not seem to occur. Areas in the scatterplot that do not contain occurrences are just as interesting as areas that do. Species may not occur in an area of the scatterplot for at least four reasons:
    1. The species may not be able to survive in that environment.
    2. The species may occur in that environment but there has been no surveys to look for it.
    3. The environment may have been surveyed but the species was not found, even though it was there.
    4. The species may be in that environment, but misidentified.(the scatterplot functionality has been designed to help discern what reason seems the most likely)
  • Identify where a species tolerates a wide range of environmental conditions.
  • Identify sub-populations that may adapt best to various climate change scenarios.
  • Identify what environmental combinations exist within an area.


From the menu option, select Tools and then Scatterplot - single.


Options

The scatterplot requires a minimum of three parameters – a species or taxonomic group and two environmental variables.

  • A primary species or any taxonomic group. The occurrences of this group are the points on the scatterplot (e.g. Eucalyptus gunnii).
  • Two environmental layers (e.g. Temperature – annual mean (Bio01) and Precipitation – annual (Bio12)).

In addition, a second species or taxonomic group can be selected as a background (e.g. the species Eucalyptus nitida). 


In the example below, the primary species is shown with blue dots on the scatterplot and on the map, the background species is shown on the scatterplot in grey dots and the environmental layers are the 2 axes.


Faceting the scatterplot

Any legend permits modification of the display of the associated layer. Faceting the scatterplot modifies both the points (occurrences) in both the map (geographic) and environmental (scatterplot) space. Facets are available in the Facets drop down list.



In the example below, the facet River regions has been selected. After a little while (for many points) the points on the scatterplot and the map will be coloured according to the this facet.

For more detailed information on Scatterplot faceting »




Additional Options

  • Highlight active area occurrences. Selecting this option will highlight those occurrences on the scatterplot occurring within the map’s defined active area.
  • Display possible environments in area produces a grey-scale background on the scatterplot that delineates the combination of environments that occur within the full range of possible environmental values. Environmental combinations outside the environmental envelope do not occur in nature and are shown in light blue. The grey-scale represents the geographic (map) area of the environmental combinations. Light colours imply a greater area of the environmental combination than darker colours, thus providing an indication of the spatial extents of the environments.
  • The Select records with missing values check box will highlight those occurrences on the map that have one or more missing environmental values. In most cases, missing values will occur when occurrences are off the extent of the environmental surface, e.g. terrestrial occurrences occurring in marine or limnetic environments or vice versa. The number of occurrences with missing environmental values is also listed.
  • Selecting occurrence points on the scatterplot. A rectangle area can be created on the scatterplot by dragging the mouse between any two points. When the left mouse button is released:
    • the area perimeter will be displayed on the scatterplot with a black border.
    • primary taxa in this area will be highlighted on the map.
    • the number of records selected will be listed.
    • the range of both environmental variables for the rectangle area will be listed.
    • the add in/out layers to the map button creates two new layers to the layers list: an IN-group that includes all the occurrences within the environmental rectangle and an OUT-group that contains all occurrences not within the environmental rectangle. The legend and look of the IN/OUT layers can be altered the same way as for other species layers. See the Layer Interaction Panel for a screenshot of the species legend.
  • The Download data button creates a CSV (comma-separated values) file of the data used to create the scatterplot. The variables are the:
    • Occurrence record identifier.
    • Status of the scientific name occurrence: Uploaded or In Active Area (for those records highlighted in the active area of the scatterplot, there will be both an uploaded and an in active area record).
    • X – environmental variable value.
    • Y – environmental variable value.
  • The Download image button will create a png image file of the scatterplot as displayed when the button is pressed. This file can be viewed or downloaded.


A case study

A case study on using the scatterplot tool to investigate the distribution of Banksia integrifolia in Australia, is given by Dr Ben Raymond of the Australian Antarctic Division, Hobart.

Read the Case Study »


A worked example

  • Australia as the area to be analysed.
  • Banksia marginata as the foreground species.
  • Banksia (genus) as the background taxonomic group.
  • Environmental parameter 1 is Temperature – annual mean (Bio01).
  • Environmental parameter 2 is Precipitation – annual (Bio12).


Note: to select a foreground or background species choose the option Search for a species by scientific or common name.




In Step 4, we are selecting the environmental/contextual layers. 



In the context of the scatterplot, it is likely that you will want to use the Add from search box and either enter part of the name of the layer or its short name (e.g., Bio01 for Temperature – annual mean) and then check the box on the left of the name to select it. 


Ditto with the second layer. At the bottom of the window, you will see the number of layers selected and this should equal 2 before clicking on the Next key (bottom right).



An alternative is to import the names of the two layers from a file – and this can be done from the top dropdown box. The names of the two selected layers can of course also be exported using the Export set button at the bottom of the table. You can then re-import the list using the import option from the top dropdown box.










The background taxa group is the genus Banksia. This gives us a good indication of what environments the genus covers and what portion of that environment is covered by B. Marginata. The genus occurrences are only mapped on the scatterplot in grey in the background. The B. Marginata is shown by blue points on both the scatterplot and map. If the highlight records in the active area was selected, then those records would be ringed with a red circle.


In the worked example, we will use temperature (Temperature – annual mean (Bio01)) and precipitation (Precipitation – annual (Bio12)) as the two environmental variables to define the environment. Once these two variables have been added, the scatterplot is generated. As there is a large number of occurrences (Banksia has over 117,500 records), processing can take up to a minute or so. The distribution of Banksia (grey dots) covers a significant portion of the scatterplot, thereby indicating that the genus can handle a wide range of temperature and rainfall conditions. To learn more about the environment used by the genus, make it the primary taxa.


Banksia marginata is located toward the bottom of the scatterplot distribution and clearly follows the outline of the genus ‘envelope’ on the low temperature end, but over a broad range of precipitation. It covers annual precipitation of less than 500mm and greater than 3000mm with mean annual temperatures from 5°C to 20°C – a very impressive range!


Let’s look at some of the outliers to see where they occur. First, the low temperature end. Drag a rectangle over the lower end occurrences on the scatterplot. This highlights the corresponding points on the map, highland areas inTasmania and one occurrence on alpine NSW. 


Let’s do the same at the high temperature end to see where these occurrences are located. Drag the rectangle on the scatterplot and then examine the highlighted occurrences on the map. Not unexpectedly – the high temperature occurrences are found in the northern end of it's distribution around Perth and in northern NSW.


Note: the range of temperature and rainfall values of the rectangle are listed above the scatterplot. In this case, a mean annual temperature range of 18.4655°C to 21.2818°C and rainfall between 44.8943mm and 1963.0616mm. Also note that there are 208 records selected.



The selected occurrences could be used to create two new mapped layers – an ‘IN-group’ containing only those 20 occurrences and an ‘OUT-group’ containing all the rest. This option can be useful for filtering/separating out a subset of occurrences for further analysis in say the spatial prediction model. 


Where there are occurrences that have one or two missing environmental values of temperature or rainfall, if an IN/OUT groups are created these occurrences are added to the OUT-group by default. If you click the checkbox saying Select records with missing values, then the corresponding occurrences will be highlighted on the map and added to the IN-group. 


Next, let’s consider why B. Marginata doesn’t occur in a few environments on the scatterplot.



There is a hole in the distribution of B. Marginata at around 12.5°C and 1500 mm that is filled by other banksia species (shown by the grey Banksia background points) so that environment exists in nature. But why are there no occurrences here? There are at least four possibilities:

  1. B. Marginata may not be able to survive in that environment. This is possible here but unlikely.
  2. B. Marginata may occur in that environment but there had been no surveys in the area represented by that environment. This is a likely scenario and best addressed by a targeted survey.
    NOTE: You can use the Environmental Envelope option for defining an Active Area (see Active Area Help) to map the locations that conform to this environmental combination.
  3. The environment may have been surveyed but the species was not seen, even though it was there. 
  4. B. Marginata may have been seen in that environment, but misidentified. Not an uncommon taxonomic problem! The Atlas is incorporating identification keys using Identify Life (http://www.identifylife.org/). You could examine what banksias occur in the area represented by this hole by using the Map All option on the Environmental Envelope noted in (2. above) and see if their characters could be confused with B. Marginata.

The same situation doesn’t occur with the ‘dent’ in the environment at around 14°C and 2500mm. Obviously, that environment doesn’t exist in Australia (at least not represented by the environmental layers we have chosen) – and it is therefore not surprising that no banksias are to be found. The banksias background covers much of the potential environmental range indicating the ubiquity of the genus. The grey-scale of the ‘display possible environments in area’ indicate the size of the corresponding mapped areas, with black representing only a small area with this environment in Australia, and reversely white, a large area. For example, there are only small areas of Australia with extreme rainfall (around Tully in Northern Queensland), and a large area of very low rainfall. This can be examined further by examining the environmental layers: Temperature – annual mean (Bio01) and Precipitation – annual (Bio12).


Mean annual temperature and annual rainfall were chosen because these variables were very likely to constrain the spatial distribution of banksias. You may wish to use the Prediction Tool (MaxEnt) to find out which environmental variables best seem to control the distribution of B. Marginata.


Demonstration Youtube Video

By Lee Belbin, Geospatial Team Leader

Please note: these videos were recorded in an earlier version of the spatial portal. Functions are still available within the new version.