---
title: "Wildcards"
output: rmarkdown::html_vignette
vignette: >
%\VignetteIndexEntry{Wildcards}
%\VignetteEngine{knitr::rmarkdown}
%\VignetteEncoding{UTF-8}
---
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>"
)
```
## What are wildcards?
Wildcards are special characters that represent any valid alternative
character. In the context of c-squares, wildcards can be used to compactly
notate a range of raster cells. Let's say you have the following string of four
c-squares defined:
```{r defs, warning = TRUE}
library(csquares)
csq <- as_csquares("1000:1|1000:2|1000:3|1000:4")
print(csq)
```
These are four squares of 5° by 5°. As you can see the first four digits
of each of the four squares are identical. Therefore, these squares could be
notated as `"1000:*"` where `*` is the wildcard symbol. The wildcard thus represent
each of the four posibble values that are allowed at its position. Constructing a
`csquares` object with `as_csquares()` results in an identical object when the either
the long format or the compact notation with wildcard is used.
```{r from_wild}
csqw <- as_csquares("1000:*")
print(csqw)
identical(csq, csqw)
```
## Expanding wildcards
*'Ceci n'est pas une pipe'*
The `csquares` object does not store wildcards, as it will use the expanded format
with explicit notation, where each raster cell is included in the character string and
is separated by the pipe `"|"` character. The function `as_csquares()` wraps a
call to `expand_wildcards()` which expands c-square codes containing wildcards. It will
replace the short notation with wildcard by pipe seperated csquares codes.
```{r expand}
expand_wildcards("1000:*") |>
as.character()
```
## Matching wild cards
Wildcards can also be used to query spatial data and select specific blocks of
raster cells. When used for searching, the wildcard is sometimes represented
by the percentage character instead of an asterisk. For this R package it
doesn't matter. Bother characters are interpreted exactly the same.
The example below shows how you can filter a specific quadrant from the
global killer whale data set using a wildcard notation.
```{r filter}
library(dplyr, warn.conflicts = FALSE)
library(sf, warn.conflicts = FALSE)
orca_sf <-
orca |>
as_csquares(csquares = "csquares") |>
st_as_sf()
plot(orca_sf["orcinus_orca"])
## Note that the first number in the csquares code (1)
## represents the North East quadrant
## The remainder of the code consists of wildcards.
plot(
orca_sf |>
filter(
in_csquares(csquares, "1***:*")
) |>
drop_csquares() |>
select("orcinus_orca")
)
```
### Mode
Consider the following `csquares` object:
```{r mode}
csq_example <- as_csquares(c("1000:100|1000:111|1000:206|1000:207", "1000:122"))
```
It is a `vector` of two elements, the first containing 4 squares, the second 1 square.
Let's say we want to check if `"1000:1**"` matches with any of the elements in the `vector`.
```{r mode_any}
in_csquares(csq_example, "1000:1**")
```
Which is `TRUE` for both elements in the vector. This makes sense as both elements indeed contain
csquares codes that match with `"1000:1**"`. However, the first element also contains elements that
don't match. The reason that the example still returns `TRUE` in both cases is because the `mode` is
set to `"any"`. Meaning that it will return `TRUE` if any of the codes in an element matches.
You should set the mode to `"all"` if you only want a positive match when all codes in an element match:
```{r mode_all}
in_csquares(csq_example, "1000:1**", mode = "all")
```
### Strict
Now, let's compare the same `csquares` object with `"1000:*"` which has a lower resolution than the
squares in the object:
```{r not_so_strict}
in_csquares(csq_example, "1000:*")
```
Again, it matches with all elements in the `vector`, even though the resolution is different. This
is because `in_csquares()` isn't very strict by default. Meaning that it will also match with any
'child' square with a higher resolution. If you want a strict match where the resolution also matches
use `strict = TRUE`:
```{r strict}
in_csquares(csq_example, "1000:*", strict = TRUE)
```