how_to_use_gbay
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
how_to_use_gbay [2018/12/13 12:40] – stritiha | how_to_use_gbay [2023/04/21 15:30] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 8: | Line 8: | ||
When hovering with your mouse above a node, additional options appear. A node can be selected as a target node, which means that the output of running the network will contain the posterior probability distribution of this node. When we select a node as a target node, a target icon will appear in its upper right corner. | When hovering with your mouse above a node, additional options appear. A node can be selected as a target node, which means that the output of running the network will contain the posterior probability distribution of this node. When we select a node as a target node, a target icon will appear in its upper right corner. | ||
- | You may want to save the configuration of the run (includSing | + | You may want to save the configuration of the run (including |
=== 2.2 Set non-spatial evidence === | === 2.2 Set non-spatial evidence === | ||
Line 21: | Line 21: | ||
When the input data represents **hard evidence** (we know the state of the node at each pixel, with 100% certainty, e.g. we know that the land cover of a pixel is forest), then the input raster has one band, where the value of each pixel corresponds to the number of a state of the node. For continuous nodes, the value of the pixel represents the real value (e.g. forest cover of 75%). | When the input data represents **hard evidence** (we know the state of the node at each pixel, with 100% certainty, e.g. we know that the land cover of a pixel is forest), then the input raster has one band, where the value of each pixel corresponds to the number of a state of the node. For continuous nodes, the value of the pixel represents the real value (e.g. forest cover of 75%). | ||
- | When we use **soft evidence** (a probability distribution for each pixel), the input raster should have as many bands as the number of states of the corresponding node. Each band represents the probability of a state (e.g. the probability that the land cover of a pixel is forest). The values of all bands should sum up to 100. | + | When we use **soft evidence** (a probability distribution for each pixel), the input raster should have as many bands as the number of states of the corresponding node. Each band represents the probability of a state (e.g. the probability that the land cover of a pixel is forest). The values of all bands should sum up to 100. |
+ | |||
+ | If soft evidence is given (to a root or intermediate node), the effect is identical to the Netica' | ||
=== 3.2 Vector === | === 3.2 Vector === | ||
- | When using vector data (**.shp** files or ESRI geodatabase files), the input nodes are represented in the attribute table of the dataset. | + | When using vector data (zipped |
- | For **hard evidence**, the attribute table should have a column corresponding to the name of the input node, with values corresponding to the states of the node. | + | For **hard evidence**, the attribute table should have a column corresponding to the name of the input node, with values corresponding to the states of the node. A value of 0 means NODATA. |
- | For **soft evidence**, each state of the input node should be represented by a column of the attribute table, with values representing the probabilities of the states (which should sum up to 100). The column names should be the node name, followed by two underscores, | + | For **soft evidence**, each state of the input node should be represented by a column of the attribute table, with values representing the probabilities of the states (which should sum up to 100). The column names should be the node name, followed by two underscores, |
+ | |||
+ | Example: | ||
+ | | ||
The vector file can be added to the network by dragging it to the box labelled **“Vector file”**. | The vector file can be added to the network by dragging it to the box labelled **“Vector file”**. | ||
Please note that gBay does not modify the geometry of the vector file, but simply performs inference on each object, using information from the attribute table. | Please note that gBay does not modify the geometry of the vector file, but simply performs inference on each object, using information from the attribute table. | ||
+ | |||
+ | ==== 4. Run the network ==== | ||
+ | |||
+ | Once you have set up the network, selected the target nodes, and uploaded the spatial inputs, you can click on **“Run”** to run the network. gBay will use your spatial data to perform inference in the BN for each pixel or feature, and produce an output of the results. | ||
+ | |||
+ | Any potential **errors or warnings** will appear as pop-ups. If you want to see the progress of the processing, select the option to “Show console”. | ||
+ | |||
+ | If you have a complex network and are running it with a **large** spatial dataset, this may take some time. In this case, you should enter your **email** address, where you will receive a notification when the process is completed, along with a link where you can download the data. | ||
+ | |||
+ | {{: | ||
+ | |||
+ | ==== 5. Outputs ==== | ||
+ | |||
+ | === 5.1 Raster === | ||
+ | The output of a gBay run is a posterior probability distribution raster of each target node (named target_node.tif). This raster has one band for each state of the target node, where the value represents the probability of the state. In addition, the last band of the raster represents the most probable state. | ||
+ | |||
+ | In addition, for each target node, some metrics of the posterior probability are calculated and stored in an additional raster file called target_node_stats.tif. For discrete target nodes, this file contains one band with Shannon’s evenness index of the posterior probability distribution: | ||
+ | |||
+ | \(J = H'/ | ||
+ | (We use \(log_2\) to reflect the common use in information theory.) | ||
+ | |||
+ | |||
+ | The index is a standardized measure of entropy (can be compared between nodes with different numbers of states) and expresses uncertainty. It has values between 0 and 1, where 1 denotes a uniform distribution between all possible states (maximum uncertainty), | ||
+ | |||
+ | If the target node is continuous, the stats output contains six bands with the following values: | ||
+ | 1. Evenness index | ||
+ | 2. Mean | ||
+ | 3. Median | ||
+ | 4. Standard deviation | ||
+ | |||
+ | ==== Overview of gBay spatial inputs and outputs ==== | ||
+ | |||
+ | | ^ **Input format** | ||
+ | ^ Raster | ||
+ | | ::: | ::: | Value = node state (discrete nodes) or continuous value (continuous nodes) | ||
+ | | ::: | ::: | **Soft evidence: | ||
+ | | ::: | ::: | One band per state | additional last band: | | ||
+ | | ::: | ::: | value = probability of state | value = most likely state | | ||
+ | | ::: | ::: | | **target_stats.tif**: | ||
+ | | ::: | ::: | | 1. Evenness index | | ||
+ | | ::: | ::: | | Only for continuous nodes: | ||
+ | | ::: | ::: | | 2. Mean | | ||
+ | | ::: | ::: | | 3. Median | ||
+ | | ::: | ::: | | 4. Standard deviation | ||
+ | ^ Vector | ||
+ | | ::: | ::: | Column of attribute table with same name as input node | attribute table with a column for each state of the target node: | | ||
+ | | ::: | ::: | value = node state (discrete nodes) or continuous value (continuous nodes) | ||
+ | | ::: | ::: | **Soft evidence: | ||
+ | | ::: | ::: | Columns with node name and state number | ||
+ | | ::: | ::: | (lu_t0__s1, lu_t0__s2, lu_t0__s3): | ||
+ | | ::: | ::: | value = probability of state | | |
how_to_use_gbay.1544701249.txt.gz · Last modified: 2023/04/21 15:30 (external edit)