User Guide

This is the user guide for FINE. You can access the Shiny application here.

Important Terminology

  • (Fire) hotspots: Locations where the temperature detected by monitoring satellites is higher than surrounding areas (Indonesian Meteorology, Climatology, and Geophysical Agency, 2023)

  • Administrative Divisions of Indonesia:

    • Province: First-level administrative division - corresponds to “provinsi” or “daerah istimewa” (as in the case of Aceh)

    • City/Regency: Second-level administrative division - corresponds to “kota”/“kabupaten” respectively

    • District: Third-level administrative division - corresponds to “kecamatan”

    • Sub-district: Fourth-level administrative division - corresponds to “desa” or “kelurahan”

Exploratory Data Analysis

Point Map

The first tab of the Exploratory Data Analysis page is the interactive point map. It allows users to view information about individual fire hotspots that have been detected.

There are several aspects users can filter by. Users can select a specific area to examine, be it by province, city/regency, district, or sub-district. Users can also select one or multiple satellite types that the points were detected by, as different satellites may give different levels of detail and accuracy in detecting fire hotspots. Additionally, users can also select a date range that they would like to filter by using a slider. The left knob represents the start date and the right knob represents the end date.

By default, the latest hotspots within the date range will be highlighted in red, whereas all other points will be colored pink.

However, users can also choose to color the points by the confidence levels with which the satellites detected the hotspots. This is done by selecting the “Color by confidence level?” checkbox:

Time Series

The second tab of the Exploratory Data Analysis page allows users to view time series plots of aggregated count of hotspots within a year.

Like the point map, users can select a specific area to examine, be it by province, city/regency, district, or sub-district. However, instead of a flexible date range, users can only select a specific year from 2015-2019, as the count of hotspots are aggregated by month.

Users can also select one or more confidence levels to filter by. In this case, the count of hotspots will be aggregated by both month and confidence level, with each time series being displayed in a different color according to the confidence level.

Kernel Density

The kernel density estimation mapping is done to show the density of point data on the map. In this case, the point data is the forest fire hotspots across the map.

The users can select city, then from city they can then select which province and from that province, which city they want to observe. Some cities will take longer to load in than others. The city field must be filled in before the map could be plotted. The user then has to choose the year they wish to observe, their choices are from 2015 to 2019. The users can pick whether they want automatic bandwidth for the city being observed, or if they would rather pick fixed bandwidth so that they can pick their bandwidth, their choices ranging from 100 to 1000 bandwidth. For fixed bandwidth, there are four kernel functions that the user can pick from : quartic, gaussian, disc, and epanechikov.

After selecting all the desired fields, the user may press submit. After submitting the fields, the user needs to wait for a bit before the map is plotted out and loaded in. The legend will give context to the density of the points observed.

Spatial Cluster

In order to investigate the presence of spatial point patterns such as spatial clustering, we can use a statistical test known as the Complete Spatial Randomness Test, or CSR for short. This test is a second order spatial point pattern analysis, and it aims to examine how individual points impact the occurance of other points, and what spatial point patterns emerge as a result. In particular, we will be looking out for random, clustered or dispersed point patterns for occurances of fire hospots within the selected city during the specifed month.

When carrying out the test for CSR, users have 4 function types to choose from: the F function, G function, K function and L function. More info on these in the tabset below. Users may also specify the desired province, city, year and month they desire to examine. Other customisable parameters include the number of simulations to run the test for. This number of simulations correlates to the significance level of the test, can can be viewed in the test parameters in the test parameters box on the top right hand side of the display. Users can also select their desired correction method, which is the edge correction to be applied to the test function in order to reduce bias.

The hypotheses for the CSR statistical test are as follows:

H0, the Null Hypothesis: The distribution of forest fires in this region is random.

H1, the Alternative Hypothesis: The distribution of forest fires in this region is not random.

In order to evaluate this statistical test, we must look at the grey envelope on the plot. This envelope represents the Confidence Interval for the chosen significance level. When the observed value lies outside the grey envelope, we can conclude that the observed data is statistically significant at the stated stated significance level. We can therefore conclude that the points have some form of interaction with one another, and the spatial clustering pattern is not random. For specific interpretation on how to determine the spatial clustering pattern observed for each function, please refer to the below tabset.

Warning

This statistical test requires sufficient data in order to function properly. As such, one may encounter an error popup when the specified parameters lack sufficient data. If this occurs, please change parameters and try again. For more on error popups, please refer to the Errors section.

Description

The F Function, also known as the Empty Space function, measures the distribution of distances from an arbitrary location (not necessarily a point) to its nearest observed point.

For more information, please refer to the official documentation.

Interpretation

If the observed F Function follows a concave upwards curve (increases slowly at first, then more rapidly at longer distances), we can say that the observed fires follow a clustered pattern. However, if the observed F function follows a concave downwards curve (increases rapidly at first, then more slowly at longer distances), we can say that the observed fires follow a dispersed pattern.

Description

The G Function measures the distribution of distances from an arbitrary point to its nearest neighbour.

For more information, please refer to the official documentation.

Interpretation

When the observed G Function value is greater than the estimate (lies above the theoretical), we can say that the observed fires follow a clustered pattern. Generally, this lies below the theoretical estimate. On the other hand, when the observed value is less than the estimate, we can say the observed fires follow a dispersed pattern. Generally, this lies above the theoretical estimate.

Description

Ripley’s K Function attempts to combat the limitation of the nearest neighbours approach of using only the nearest distance by using an estimate of spatial dependence instead. This is done by taking a measure of how many events lie about each points at varying radius length, and computing this calculation for all point events.

For more information, please refer to the official documentation.

Interpretation of Statistical Test Result

When the observed K Function value is greater than, or lies above the theotetical, we can say that the forest fire hotspots are clustered and this radius. However, when the observer value lies below the theotetical value, we can assume that the forest fires are dispersed at this radius.

Description

The L function is the normalised version of the K function, where we take the K function divided by pi, the squared rooted to get the corresponding L function value.

For more information, please refer to the official documentation.

Interpretation

When the L value lies above the theoretical value, the we can say that the forest fire hotspots for this region are spatially clustered at that particular distance. On the other hand, when the L value lies below the theoretical value, the forest fire hotspots are spatially dispersed at this distance.

Spatiotemporal

Space-time K-function

The space-time K-function is a statistical tool used in spatiotemporal analysis to study the clustering or dispersion of point events in both space and time. It extends the classical spatial Ripley’s K-function to incorporate the time dimension and allows for the analysis of spatiotemporal point patterns. In the inhomogenous case, it takes into account the spatial and temporal variation in the intensity of the point process under study. Meanwhile, the homogenous case assumes that the underlying process generating the point pattern is constant in space and time.

Users can input a specific sub-district to look for spatiotemporal autocorrelation throughout a specific year. All fields of the filter must be filled before an output can be computed.

There are three possible types of outputs, adjusted by the Plot Type selection dropdown: Contour, Image, and Perspective.

For perspective plots, there are two additional parameters that can be adjusted: azimuthal angle and colatitude. They are in the form of sliders bounded from 0-360. This allows users to adjust the perspective of the 3D plot. Azimuthal Angle rotates the perspective horizontally, whereas Colatitude rotates the perspective vertically.

Emerging Hot Spot Analysis (EHSA)

Emerging Hot Spot Analysis (EHSA) is a spatiotemporal analysis technique used to identify locations where the density of a specific event is increasing or decreasing rapidly over time, referred to as “hot spots” and “cold spots” respectively.

‘Hot spot’ in this context does not refer to fire hotspots, but spatial clusters with significantly high number of fire hotspots.

Users can perform EHSA on an areal basis using aggregated count of fire hotspots within a province or a city over a certain year. The analysis is done by computing local Gi* statistics on each of the sub-districts and performing a Mann-Kendall statistical test. Users can further customise how the test is performed by specifying number of time lags, number of simulations, and a significance threshold.

Mann-Kendall

The Mann-Kendall test is a non-parametric statistical test used to detect whether a set of data values is monotonically increasing or decreasing, as well as the statistical significance of the trend. It is particularly useful when the data is not normally distributed or when the data has a significant amount of noise. This section is aimed to help users further drill down on the insights they have gotten from the EHSA tab by performing Mann-Kendall tests on computed Gi* statistics within a sub-district in a specified year.

Additionally, users can specify the number of simulations and type of contiguity weights they want to use to compute the Gi* statistics.

The output is a time series plot showing the change of Gi* statistic over the year and a table containing the result of the Mann-Kendall test.

Errors

When an error occurs, a popup message stating the cause of the error will appear. Click “Ok” to close the popup, then adjust the parameters accordingly to fix the error.