Package: vectra 0.7.1

Gilles Colling

vectra: Columnar Query Engine for Larger-than-RAM Data

A minimal columnar query engine with lazy execution on datasets larger than RAM. Provides 'dplyr'-like verbs (filter(), select(), mutate(), group_by(), summarise(), joins, window functions) and common aggregations (n(), sum(), mean(), min(), max(), sd(), first(), last()) backed by a pure C11 pull-based execution engine and a custom on-disk format ('.vtr'). Reads and writes 'GeoTIFF' (including tiled and 'BigTIFF' layouts) and a tiled raster format ('.vec') with overview pyramids and time cubes for larger-than-RAM raster data.

Authors:Gilles Colling [aut, cre, cph]

vectra_0.7.1.tar.gz
vectra_0.7.1.tar.gz(r-4.7-arm64)vectra_0.7.1.tar.gz(r-4.7-x86_64)vectra_0.7.1.tar.gz(r-4.6-arm64)vectra_0.7.1.tar.gz(r-4.6-x86_64)
vectra_0.7.1.tgz(r-4.6-emscripten)
manual.pdf |manual.html
card.svg |card.png
vectra/json (API)
NEWS

# Install 'vectra' in R:
install.packages('vectra', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/gcol33/vectra/issues

Pkgdown/docs site:https://gillescolling.com

Uses libs:
  • openmp– GCC OpenMP (GOMP) support library

On CRAN:

Conda:

openmp

4.57 score 25 scripts 220 downloads 77 exports 7 dependencies

Last updated from:8c680df70d. Checks:6 OK. Indexed: no.

TargetResultTimeFilesSyslog
linux-devel-arm64OK221
linux-devel-x86_64OK161
source / vignettesOK244
linux-release-arm64OK170
linux-release-x86_64OK154
wasm-releaseOK157

Exports:acrossanti_joinappend_vtrarrangebind_colsbind_rowsblock_fuzzy_lookupblock_lookupchunk_feedercollectcollect_chunkedcountcreate_indexcross_joindelete_vtrdescdiff_vtrdistinctexplainfilterfull_joinfuzzy_joinglimpsegroup_bygroup_mapgroup_modifyhas_indexinner_joinleft_joinlinklookupmaterializemutateoffloadpullreframerelocaterenameright_joinselectsemi_joinsliceslice_headslice_maxslice_minslice_tailsummarisesummarizetallytbltbl_csvtbl_sqlitetbl_tifftbl_xlsxtiff_band_namestiff_crstiff_extract_pointstiff_metadatatransmuteungroupvec_build_overviewsvec_close_rastervec_extract_pointsvec_open_rastervec_raster_layoutvec_raster_timesvec_read_pixel_seriesvec_read_time_slicevec_read_windowvec_to_tiffvec_write_rastervec_write_time_cubevtr_schemawrite_csvwrite_sqlitewrite_tiffwrite_vtr

Dependencies:cligluelifecyclerlangtidyselectvctrswithr

Format Backends

Rendered fromformats.Rmdusingknitr::rmarkdownon Jun 12 2026.

Last update: 2026-05-08
Started: 2026-04-21

Getting Started with vectra

Rendered fromquickstart.Rmdusingknitr::rmarkdownon Jun 12 2026.

Last update: 2026-04-21
Started: 2026-04-21

Indexing and Query Optimization

Rendered fromindexing.Rmdusingknitr::rmarkdownon Jun 12 2026.

Last update: 2026-04-21
Started: 2026-04-21

Joins

Rendered fromjoins.Rmdusingknitr::rmarkdownon Jun 12 2026.

Last update: 2026-04-21
Started: 2026-04-21

Offloading: streaming, monoids, and out-of-core fits

Rendered fromoffload.Rmdusingknitr::rmarkdownon Jun 12 2026.

Last update: 2026-06-12
Started: 2026-06-12

Species Distribution Models

Rendered fromsdm.Rmdusingknitr::rmarkdownon Jun 12 2026.

Last update: 2026-06-12
Started: 2026-06-12

Star Schemas and Lookup

Rendered fromschema.Rmdusingknitr::rmarkdownon Jun 12 2026.

Last update: 2026-05-08
Started: 2026-04-21

String Operations and Fuzzy Matching

Rendered fromstring-ops.Rmdusingknitr::rmarkdownon Jun 12 2026.

Last update: 2026-05-08
Started: 2026-04-21

vectra Engine Reference

Rendered fromengine.Rmdusingknitr::rmarkdownon Jun 12 2026.

Last update: 2026-04-21
Started: 2026-04-21

Working with Large Data

Rendered fromlarge-data.Rmdusingknitr::rmarkdownon Jun 12 2026.

Last update: 2026-05-08
Started: 2026-04-21

Readme and manuals

Help Manual

Help pageTopics
Apply a function across multiple columnsacross
Append rows to an existing .vtr fileappend_vtr
Sort rows by column valuesarrange
Bind rows or columns from multiple vectra tablesbind_cols bind_rows
Fuzzy-match query keys against a materialized blockblock_fuzzy_lookup
Probe a materialized block by column valueblock_lookup
Turn a query into a resettable chunk generatorchunk_feeder
Execute a lazy query and return a data.framecollect
Fold a function over a query, one batch at a timecollect_chunked collect_chunked.default collect_chunked.vectra_node collect_chunked.vectra_partition
Count observations by groupcount tally
Create a hash index on a .vtr file columncreate_index
Cross join two vectra tablescross_join
Logically delete rows from a .vtr filedelete_vtr
Mark a column for descending sort orderdesc
Compute the logical diff between two .vtr filesdiff_vtr
Keep distinct/unique rowsdistinct
Print the execution plan for a vectra queryexplain
Filter rows of a vectra queryfilter
Fuzzy join two vectra tables by string distancefuzzy_join
Get a glimpse of a vectra tableglimpse
Group a vectra query by columnsgroup_by
Apply a function to each shard of a partitiongroup_map group_map.vectra_partition group_modify group_modify.vectra_partition
Check if a hash index exists for a .vtr columnhas_index
Limit results to first n rowshead.vectra_node
Join two vectra tablesanti_join full_join inner_join left_join right_join semi_join
Define a link between a fact table and a dimension tablelink
Look up columns from linked dimension tableslookup
Materialize a vectra node into a reusable in-memory blockmaterialize
Add or transform columnsmutate
Spill a query to disk and stream it back (the offload functor)offload
Print a vectra query nodeprint.vectra_node
Extract a single column as a vectorpull
Summarise with variable-length output per groupreframe
Relocate columnsrelocate
Rename columnsrename
Select columns from a vectra queryselect
Select rows by positionslice
Select first or last rowsslice_head slice_max slice_min slice_tail
Summarise grouped datasummarise summarize
Create a lazy table reference from a .vtr filetbl
Create a lazy table reference from a CSV filetbl_csv
Create a lazy table reference from a SQLite databasetbl_sqlite
Create a lazy table reference from a GeoTIFF rastertbl_tiff
Create a lazy table reference from an Excel (.xlsx) filetbl_xlsx
Read per-band names from a GeoTIFFtiff_band_names
Read CRS metadata from a GeoTIFFtiff_crs
Extract raster values at point coordinatestiff_extract_points
Read GDAL_METADATA from a GeoTIFFtiff_metadata
Keep only columns from mutate expressionstransmute
Remove grouping from a vectra queryungroup
Build overview pyramids for a .vec rastervec_build_overviews
Close a .vec raster handlevec_close_raster
Extract band values at (x, y) points from a .vec rastervec_extract_points
Open a .vec rastervec_open_raster
Tile layout of an open .vec rastervec_raster_layout
Distinct time stamps stored in a .vec time cubevec_raster_times
Read the full time series at a single pixel from a .vec time cubevec_read_pixel_series
Read a single time slice from a .vec time cubevec_read_time_slice
Read a window of pixels from a .vec rastervec_read_window
Export a .vec raster to GeoTIFFvec_to_tiff
Write a raster matrix or 3D array to a .vec raster filevec_write_raster
Write a 4D time-cube raster to .vecvec_write_time_cube
Create a star schema over linked vectra tablesvtr_schema
Write query results or a data.frame to a CSV filewrite_csv
Write query results or a data.frame to a SQLite tablewrite_sqlite
Write query results to a GeoTIFF filewrite_tiff
Write data to a .vtr filewrite_vtr