--- title: "File upload and Volumes reference guide for Seven Bridges API R Client" date: "`r Sys.Date()`" output: rmarkdown::html_document: toc: true toc_float: true toc_depth: 4 number_sections: false theme: "flatly" highlight: "textmate" css: "sevenbridges.css" vignette: > %\VignetteEngine{knitr::rmarkdown} %\VignetteIndexEntry{File upload and Volumes reference guide for Seven Bridges API R Client} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>", eval = FALSE ) ``` # File upload Seven Bridges platforms provide a few different methods for data import: - Import from FTP or HTTP with the web interface - The file upload API that you can directly call with the `sevenbridges2` package - The command line uploader - Import from cloud storage - Volume - Import from a DRS server In this chapter we will explain how you can use the `sevenbridges2` API library to upload your files to the Platform. Although it is more intuitive to have these operations available on the `File` object, they are separated and stored directly on the authentication object `Auth`, because there are a separate group of endpoints themselves. ## Upload single file You can upload files from your local computer to the Platform using the `upload()` method on your `Auth` object. The method allows you to upload only a single file for now. To upload a file, you should provide its full path on your local computer as the `path` parameter. To specify the upload destination for your file you can use either `project` or `parent` parameter. These two parameters should not be used together. * **project** - `Project` object or project ID. * **parent** - `File` object (of type `Folder`) or its ID. By calling the `upload()` method you are creating an upload job that by default starts to run immediately. If you don't want to start the job immediately, just set the `init` parameter to `TRUE` in order to only initialize the object. This upload job is wrapped into an object of the class `Upload` where you can see its details and call other actions on it. Let's initialize an upload job that will upload a file into a project: ```{r} # Authenticate a <- Auth$new(platform = "aws-us", token = "") # Get the desired project to upload to destination_project <- a$projects$get(project = "") # Create upload job and set destination project upload_job <- a$upload( path = "/path/to/your/file.txt", project = destination_project, overwrite = TRUE, init = TRUE ) ``` If you would like to upload your file into a folder, you need to set the `parent` parameter: ```{r} # Get destination folder object destination_folder <- a$files$get(id = "") up <- a$upload( path = "/path/to/your/file.txt", parent = destination_folder, overwrite = TRUE, init = TRUE ) ``` ## Upload fields and operations Since we have initialized the upload job, let's see which actions can we run. ### Print upload job First, let's print the `Upload` object to see what the API returned as the response. ```{r} up$print() ``` ``` ── Upload ───────────────────────────────────────────────────────────────────── • initialized: TRUE • part_length: 1 • part_size: 33554432 • file_size: 232 • overwrite: FALSE • filename: file.txt • project: /api-testing • path: /path/to/your/file.txt • upload_id: 4OvRx8Z9vghNoAUqsgYtNuM2IsiIM8kghhjgi7igu79HX9QKZpDEh5TZDrmhPxF ```
top
### File size, part size and part number/length In the previous example we can see that the API returned the upload id and some information about sizes. First we see the `file_size` in bytes (232), which is the real size of the file. File upload actually splits files into parts in the background; parts are then being uploaded one by one or in parallel and then merged again on destination. Each part can weigh a maximum of 5 GB, while the default `part_size` is recommended and set to be 32MB (which is 33554432B in our example). Lastly, number of parts or `part_length` field, is also an important measure. Maximum number of parts can be 10.000. Since users can control part size through the `part_size` parameter in `upload()` function, they should be careful not to set a size that is too small for very large files, so that total number of parts doesn’t exceed the limit of maximum 10.000.
top
### Start upload Call the `start()` method on the upload job object do start the upload process. ```{r} # Start upload up$start() ``` If you want to skip the step where you need to call the `start()` method to start the actual upload process, just set the `init` parameter back to `FALSE` when creating the upload job and the upload process will start right away. ```{r} # Create upload job and start it immediately up <- a$upload( path = "/path/to/your/file.txt", project = destination_project, overwrite = TRUE, init = FALSE ) ```
top
### Get status information about the job In order to track the progress of the job, you can call the `info()` method on the upload object. ```{r} # Get upload progress info up$info() ``` Apart from basic information, the result will also provide the info on the number of uploaded parts up to that moment.
top
## List all ongoing uploads Going back to the authentication object, there are two more operations for uploads manipulation. One is the method `list_ongoing_uploads()` that allows you to see the list of all ongoing upload processes. ```{r} # List ongoing uploads a$list_ongoing_uploads() ```
top
## Abort upload process The other one is `abort()`. You can abort any upload process using the `upload_abort()` method. To do so, you need to provide the ID of a process within the `upload_id` parameter. ```{r} # Abort upload a$abort_upload(upload_id = "") ``` **Note** that in practice, if you start a big upload job, your R session will be blocked until this process is finished. This functionality is work in progress but the idea is to not block your main session in the future, while the upload is running. For now, you can create another R session on your own and track the progress of the upload job there. # Volumes Cloud storage providers come with their own interfaces, features, and terminology. At a certain level, though, they all view resources as data objects organized in repositories. Authentication and operations are commonly defined on those objects and repositories, and while each cloud provider might call these things different names and apply different parameters to them, their basic behavior is the same. Seven Bridges environments mediate access to these repositories using volumes. A volume is associated with a particular cloud storage repository that you have enabled Seven Bridges to read from (and, optionally, to write to). Currently, volumes may be created using two types of cloud storage repositories: Amazon Web Services' (AWS) S3 buckets and Google Cloud Storage (GCS) buckets. A volume enables you to treat the cloud repository associated with it as external storage. You can 'import' files from the volume to your Seven Bridges environment to use them as inputs for computation. Similarly, you can write files from the Seven Bridges environment to your cloud storage by 'exporting' them to your volume. Learn more about volumes on the [Seven Bridges Platform](https://docs.sevenbridges.com/docs/connecting-cloud-storage-overview), [CGC](https://docs.cancergenomicscloud.org/docs/connect-cloud-storage-overview), [BDC](https://sb-biodatacatalyst.readme.io/docs/connect-cloud-storage-overview) and [CAVATICA](https://docs.cavatica.org/reference/connect-cloud-storage-overview). All volume related operations for querying volumes, fetching a single volume, and creating volumes are grouped under `volumes` path (`Volumes` resource class) on the authentication object. When operating with a single volume, it is represented as an object of the `Volume` class which stores all volume information returned from the API and additional methods you are able to call directly on the volume, like volume update, deactivation, listing content, volume members management etc. **Note** that all operations with volumes require `advance_access` parameter to be set to TRUE. In most of the volume operations it is pre-set to `TRUE` by default. ## List volumes You can list all volumes you've registered by calling the `volumes$query()` method from the authentication object. The method doesn't have any additional query parameters that could allow you to search for volumes by specific criteria, except the ones that control the number of results returned using `limit` and `offset` parameters. ```{r} # Query volumes a$volumes$query() ``` The result returned is the `Collection` object with pagination ability. ## Get single volume information In order to retrieve information about a single volume of interest, you can get it using the `volumes$get()` method using its id as parameter. Volume ID is usually presented in the `/` form for Enterprise users, while for public program users it can be in the `/` form. ```{r} # Get volume a$volumes$get(id = "/") ``` ## Create volumes - AWS (S3) using IAM User authentication type For creating volumes we have exposed several functions for different cloud providers and authentication types: * `create_s3_using_iam_user`: creates S3 volume using IAM User authentication type * `create_s3_using_iam_role`: creates S3 volume using IAM Role authentication type * `create_google_using_iam_user`: creates GC volume using IAM User authentication type * `create_google_using_iam_role`: creates GC volume using IAM Role authentication type * `create_azure`: creates Azure volume (only RO privileges allowed) * `create_ali_oss`: creates AliCloud volume (only RO privileges allowed) For each of the functions it is possible to provide parameters via path (`from_path`) to a JSON file where all required fields should be listed. Examples of use are shown below: ```{r} # Create AWS volume using IAM User authentication type aws_iam_user_volume <- a$volumes$create_s3_using_iam_user( name = "my_new_aws_user_volume", bucket = "", description = "AWS IAM User volume", access_key_id = "", secret_access_key = "" ) aws_iam_user_volume_from_path <- a$volumes$create_s3_using_iam_user( from_path = "path/to/my/json/file.json" ) # Create AWS volume using IAM Role authentication type aws_iam_role_volume <- a$volumes$create_s3_using_iam_role( name = "my_new_aws_role_volume", bucket = "", description = "AWS IAM Role volume", role_arn = "", external_id = "" ) aws_iam_role_volume_from_path <- a$volumes$create_s3_using_iam_role( from_path = "path/to/my/json/file.json" ) # Create Google Cloud volume using IAM User authentication type gc_iam_user_volume <- a$volumes$create_google_using_iam_user( name = "my_new_gc_user_volume", access_mode = "RW", bucket = "", description = "GC IAM User volume", client_email = "", private_key = "" ) gc_iam_user_volume_from_path <- a$volumes$create_google_using_iam_user( from_path = "path/to/my/json/file.json" ) # Create Google Cloud volume using IAM Role authentication type # by passing configuration parameter as named list gc_iam_role_volume <- a$volumes$create_google_using_iam_role( name = "my_new_gc_role_volume", access_mode = "RO", bucket = "", description = "GC IAM Role volume", configuration = list( type = "", audience = "", subject_token_type = "", service_account_impersonation_url = "", token_url = "", credential_source = list( environment_id = "", region_url = "", url = "", regional_cred_verification_url = "" ) ) ) # Create Google Cloud volume using IAM Role authentication type # by passing configuration parameter as string path to configuration file gc_iam_role_volume_config_file <- a$volumes$create_google_using_iam_role( name = "my_new_gc_role_volume_cnf_file", access_mode = "RO", bucket = "", description = "GC IAM Role volume - using config file", configuration = "path/to/config/file.json" ) # Create Google Cloud volume using IAM Role authentication type # using from_path parameter gc_iam_role_volume_from_path <- a$volumes$create_google_using_iam_role( from_path = "path/to/full/config/file.json" ) # Create Azure volume azure_volume <- a$volumes$create_azure( name = "my_new_azure_volume", description = "Azure volume", endpoint = "", container = "top ## Volume object operations When you've created a new volume, you can notice it is represented as an object of the `Volume` class. To preview all volume information, use the `print()` method: ```{r} # Print volume info print(aws_iam_user_volume) ``` Within this volume you have the following operations available to execute: * `update`: update volume information * `list_contents` : list volume content * `get_file`: get single volume file info * `deactivate` : deactivate volume * `reactivate` : reactivate previously deactivated volume * `list_members`: list all volume members * `add_member`: add new volume member * `remove_member`: remove volume member * `get_member`: get a volume member information * `modify_member_permissions`: modify member permissions on the volume * `delete` : delete previously deactivated volume * `reload` : reload volume object to sync information * `list_imports`: list all imports from the specified volume * `list_exports`: list all exports to the specified volume ### Update volume You can update volume's `description`, `access_mode` and `service` information. Please consult our [API documentation](https://docs.sevenbridges.com/reference/update-a-volume-v2) on how to use the `service` parameter. ```{r} # If the volume is created with RO access mode and RO credential parameters, # and now we want to change it to RW, we should also set proper credential # parameters that are connected to the RW user on the bucket. # If it's created with RW credentials, but access mode is set to RO, then no # change is needed in the credentials parameters. aws_iam_user_volume$update( description = "Updated to RW", access_mode = "RW", service = list( credentials = list( access_key_id = "", secret_access_key = "", ) ) ) ``` ### Reload volume To keep your local `Volume` object up to date with the volume on the platform, you can always call the `reload()` function: ```{r} # Reload volume object aws_iam_user_volume$reload() ``` ### List volume's content This operation lists all volume files in the root directory of the bucket, unless the `parent` parameter is specified. In that case, it lists the content of that directory on the bucket. The output is a `VolumeContentCollection` collection object, that contains two fields: -`items` for storing a list of `VolumeFile` objects (files on the volume) and -`prefixes` for storing a list of `VolumePrefix` objects or folders on the volume. You can also specify the `limit` parameter to control the number of results returned. Same as `Collection` objects, here we also have pagination functions to return either the next page of results or all results. However, backward pagination is not available for volume contents. Users can also navigate through pages of results by using the `continuation token` parameter or `link` to fetch the next chunk of results. If you use the `link` parameter, it will overwrite all other parameters if set, since it already contains the `limit` and `continuation_token` info. ```{r} # List all files in root bucket directory content_collection <- aws_iam_user_volume$list_contents(limit = 20) # Print collection content_collection # List all files from a specific directory on the bucket folder_files_collection <- aws_iam_user_volume$list_contents( prefix = "" ) # Get the next group of results by setting the continuation token content_collection <- aws_iam_user_volume$list_contents( limit = 20, continuation_token = "" ) # Preview volume files content_collection$items # Preview volume prefixes/folders content_collection$prefixes # Preview links aws_iam_user_volume$links # Get the next group of results by setting the link parameter aws_iam_user_volume$list_contents(link = "") # Or use VolumeContentCollection object's next_page() method for this: content_collection$next_page() # You can also fetch all results with the all() method content_collection$all() ``` ### Volume files and prefixes Volume files and prefixes are also treated as objects and they contain some operations that can be called on them. #### Get VolumeFile info This operation returns a single volume file information. The input parameter can be file's id which is represented as location on the bucket (`location)`, or a link to that file resource. The link is a `href` field of the desired file received from the response when returning a list of volume contents with `list_contents()`. Empty arguments are not allowed along with setting both parameters together. ```{r} # Get single volume file info - by setting file_location vol_file1 <- aws_iam_user_volume$get_file( location = "" ) # Get single volume file info - by setting link vol_file1 <- aws_iam_user_volume$get_file(link = "full/request/link/to/file") ``` #### Reload VolumeFile object To keep your local VolumeFile object up to date with the volume file on the platform, you can always call the `reload()` function: ```{r} vol_file1$reload() ``` #### Get VolumePrefix info Unfortunately we don't have a separate operation to fetch only prefixes on the volume, therefore, we can get its prefixes only by using the `list_contents()` operation and look for the `prefixes` field in the returned `VolumeContentCollection` object. ```{r} # List content volume_content <- aws_iam_user_volume$list_contents() # Extract prefixes volume_prefixes <- volume_content$prefixes # Select one of the volume folders to list its content volume_folder <- volume_prefixes[[1]] # Print volume prefix information volume_folder$print() ``` You can also list the content of a volume prefix/folder on the volume, by calling `list_contents()` directly on the `VolumePrefix` object. ```{r} ## Select one of the volume folders to list its content volume_folder <- volume_prefixes[[1]] # List content volume_folder_content <- volume_folder$list_contents() ``` ### List volume members In order to fetch members of one volume or a specific member by its username, you can use `list_members()` and `get_member()` operations: ```{r} # List volume members aws_iam_user_volume$list_members() # limit = 2 # Get single member aws_iam_user_volume$get_member(user = "") ``` ### Remove members Volume admins can remove volume members by providing its username or object of the `Member` class to the `remove_member()` function: ```{r} # Remove member aws_iam_user_volume$remove_member("") # Remove member using the Member object members <- aws_iam_user_volume$list_members() aws_iam_user_volume$remove_member(members$items[[3]]) ``` ### Adding new members The function for adding new members to the volume can accept a `Member` object (for example used in a project) or its username. ```{r} # Add member via username aws_iam_user_volume$add_member(user = "", permissions = list( read = TRUE, copy = TRUE, write = FALSE, admin = FALSE )) # Add member via Member object aws_iam_user_volume$add_member( user = Member$new( username = "", id = "" ), permissions = list( read = TRUE, copy = TRUE, write = FALSE, admin = FALSE ) ) ``` ### Modifying a member's permissions Users can modify specific member's permissions on the volume by providing the privileges they want to change: ```{r} # Modify member permissions aws_iam_user_volume$modify_member_permissions( user = "", permissions = list(write = TRUE) ) ``` ### Deactivate and reactivate the volume Once deactivated, you cannot import from, export to, or browse within a volume. As such, the content of the files imported from this volume will no longer be accessible on the platform. However, you can update the volume and manage members. Note that you cannot deactivate the volume if you have running imports or exports unless you force the operation using the query parameter `force = TRUE`. **Note** that to delete a volume, first you must deactivate it and delete all files which have been imported from the volume to the platform. To reactivate the volume, just use the `reactivate()` function. ```{r} # Deactivate volume aws_iam_user_volume$deactivate() # Reactivate volume aws_iam_user_volume$reactivate() ``` ### Delete volume To be able to delete a volume, you first need to deactivate it and then delete all files on the Platform that were previously imported from the volume. ```{r} # Deactivate volume aws_iam_user_volume$deactivate() # Delete volume aws_iam_user_volume$delete() ``` # Imports Creating and connecting volumes to the Platform allows you to import your files/folders from a cloud bucket to the Platform. Imports operations are related to volumes, but in the API they are separated under `/imports` endpoints, so in our library they are also grouped under imports path on the authentication object (`Imports` resource class). A single import job is represented as an `Import` class object containing information about which file/folder has been or is being imported, from which volume, to which project/folder on the platform, import start and finish time, status of the job, logs etc. ## List volume imports To preview and query all import jobs you've created use the `query()` function on `Auth$imports` path: ```{r} # List imports all_imports <- a$imports$query() # Limit results to 5 imp_limit5 <- a$imports$query(limit = 5) # Load next page of 5 results imp_limit5$next_page(advance_access = TRUE) # Load all results at once until last page imp_limit5$all(advance_access = TRUE) ``` It is possible to use some query parameters as different criteria for filtering results like `volume`, `project`, `state` etc: ```{r} # List imports with state being RUNNING or FAILED imp_states <- auth$imports$query(state = c("RUNNING", "FAILED")) # List imports to the specific project imp_project <- auth$imports$query(project = "") ``` Listing imports is also available within `Project` and `Volume` objects, where resulting imports are related to the specific project or volume where they're called from. ```{r} ## Get the volume from which you want to list all imports vol1 <- auth$volumes$get(id = "/") vol1$list_imports() ## Get the project object for which you want to list imports test_proj <- auth$projects$get("") test_proj$list_imports() ``` ## Get a single import job Similar to other resource classes, the `get()` method will return a single import job object when provided with a job id. ```{r} # Get single import imp_obj <- a$imports$get(id = "") ``` ## Get bulk import jobs Users are able to fetch details for multiple import jobs by calling one bulk action - the `bulk_get()` method. The accepted input can be a list of import job IDs or a list of import job objects (of class Import). The result will be a Collection object containing a list of (updated) import jobs. ```{r} # Get details of multiple import jobs import_jobs <- a$imports$bulk_get( imports = list("", "") ) ``` ## Submit new import - import a volume file into the project In order to import volume files into a project, users can use the `submit_import()` method from the `Auth$imports` path, or directly on the selected `VolumeFile` object (file they want to import) where this function is also available. ```{r} ## First, get the volume you want to import files from vol1 <- a$volumes$get(id = "/") ## Then, get the project object/id where you want to import files test_proj <- a$projects$get("") ## List all volume files on the volume vol1_content <- vol1$list_contents() ## Select one of the volume files volume_file_import <- vol1_content$items[[3]] ## Perform a file import imp_job1 <- a$imports$submit_import( source_location = volume_file_import$location, destination_project = test_proj, autorename = TRUE ) # Alternatively you can also call import() directly on the VolumeFile object imp_job1 <- volume_file_import$import( destination_project = test_proj, autorename = TRUE ) ``` Preview import job details with the `print()` method: ```{r} # Print Import object print(imp_job1) ``` You can also import folders from the volume into the project, with the option to preserve or not to preserve folder structure: ```{r} # Select one of the volume folders to import volume_folder_import <- vol1_content$prefixes[[1]] # Perform a folder import imp_job2 <- a$imports$submit_import( source_location = volume_folder_import$prefix, destination_project = test_proj, overwrite = TRUE, preserve_folder_structure = TRUE ) # Alternatively you can also call import() directly on the VolumePrefix object imp_job2 <- volume_folder_import$import( destination_project = test_proj, overwrite = TRUE, preserve_folder_structure = TRUE ) # Print Import object print(imp_job2) ``` ## Reload import job In order to refresh the import job object and get the up to date info about its state, you can always call the `reload()` function: ```{r} # Reload import object imp_job1$reload() ``` ## Submit bulk import - import multiple volume files or folders into a project Users are able to perform the bulk action on multiple volume files or folders and import them into a project using a single call of the `bulk_submit_import()` method from the `Auth$imports` path. The required input should be a nested list of elements for each file/folder you want to import containing specific fields: `source_volume`, `source_location`, `destination_project`, `destination_parent`, `name`, `autorename` and `perserve_folder_structure.` ```{r} ## First, get the volume you want to import files from vol1 <- a$volumes$get(id = "/") ## Then, get the project object or the ID of the project into which you want # to import files test_proj <- a$projects$get("") ## List all volume files vol1_content <- vol1$list_contents() ## Preview the content and select one VolumeFile object and two VolumePrefix ## objects (folders) for the purpose of this example volume_file_import <- vol1_content$items[[1, 2]] volume_file_import volume_folder_import <- vol1_content$prefixes[[1]] volume_folder_import ## Construct the inputs list by filling the necessary information for each # file/folder to import to_import <- list( list( source_volume = "rfranklin/my-volume", source_location = "chimeras.html.gz", destination_project = "rfranklin/my-project" ), list( source_volume = vol1, source_location = "my-folder/", destination_project = test_proj, autorename = TRUE, preserve_folder_structure = TRUE ), list( source_volume = "rfranklin/my-volume", source_location = "my-volume-folder/", destination_parent = "parent-id", name = "new-folder-name", autorename = TRUE, preserve_folder_structure = FALSE ) ) bulk_import_jobs <- a$imports$bulk_submit_import(items = to_import) # Preview the results bulk_import_jobs # Get updated status by fetching details with bulk_get() and by passing the # list of import jobs created in the previous step a$imports$bulk_get(imports = bulk_import_jobs$items) ``` As you may see from the example above, users are able to import folders from the volume into the project or a project directory, with the option to preserve or not to preserve folder structure. Moreover, you are able to pass the objects of classes Volume, Project or File (with type = 'folder') for `source_volume`, `destination_project` and `destination_parent` fields when constructing the inputs list, besides using their string IDs. In the example above, the list of items for the bulk import job was manually created. Alternatively, you can use the `prepare_items_for_bulk_import()` utility function to generate the items list. This function allows you to prepare the list of bulk import items based on the provided `VolumeFile` or `VolumePrefix` objects, filling the following fields for each item: `source_volume`, `source_location`, `destination_project` or `destination_parent`, `autorename`, `preserve_folder_structure`. Note that the same `destination_project`/`destination_parent` and `autorename` values will be applied uniformly across all items in the resulting list. The `preserve_folder_structure` parameter, if provided, applies exclusively to `VolumePrefix` items. ```{r} ## First, get the volume you want to import files from vol1 <- a$volumes$get(id = "/") ## Then, get the project object or the ID of the project into which you want # to import files test_proj <- a$projects$get("") ## List all volume files vol1_content <- vol1$list_contents() ## Select two VolumeFile objects volume_file_1_import <- vol1_content$items[[1]] volume_file_2_import <- vol1_content$items[[2]] volume_files_to_import <- list(volume_file_1_import, volume_file_2_import) ## Construct the inputs list using the prepare_items_for_bulk_import() utility # function to_import <- prepare_items_for_bulk_import( volume_items = volume_files_to_import, destination_project = test_proj ) bulk_import_jobs <- a$imports$bulk_submit_import(items = to_import) # Preview the results bulk_import_jobs # Get updated status by fetching details with bulk_get() and by passing the # list of import jobs created in the previous step a$imports$bulk_get(imports = bulk_import_jobs$items) ``` Keep in mind that `prepare_items_for_bulk_import()` is designed solely to assist in constructing the list of items for submitting a bulk import job. It operates under certain constraints; refer to the function's documentation for further details. After obtaining the function's output, you can manually adjust individual items as needed. # Exports Exports are the actions of exporting your files from the Platform into a cloud bucket represented as a volume. Export operations are also related to volumes, but in the API they are separated under `/exports` endpoints, so in our library they are also grouped under the `exports` path on the authentication object (`Exports` resource class). A single export job is represented as an `Export` class object containing information about a file has been or is being exported, from which project/folder on the Platform, to which volume, export start and finish time, status of the job, logs etc. ## List file exports to the volumes Users can preview and query all export jobs they've created for the purpose of exporting their files from the Platform into a cloud bucket using volumes. The output is a `Collection` object storing a list of exports in its `items` field and providing pagination options. ```{r} # List exports all_exports <- a$exports$query() # Limit results to 5 exp_limit5 <- a$exports$query(limit = 5) # Load next page of 5 results exp_limit5$next_page(advance_access = TRUE) # List all results until last page exp_limit5$all() ``` It is possible to use some query parameters as different criteria for filtering results like `volume`, `state` etc: ```{r} # List exports with status RUNNING or FAILED exp_states <- a$exports$query(state = c("RUNNING", "FAILED")) # List exports into a specific volume exp_volume <- a$exports$query( volume = "/" # volume object or id ) ``` Listing exports is also available within `Volume` objects, where results contain all files exported to the specific volume they're being called from. ```{r} # Get the volume for which you want to list all exports vol1 <- a$volumes$get(id = "/") # List exports vol1$list_exports() ``` ## Get a single export job Similar to other resource classes, the `get()` method will return a single export job object when provided with job id. ```{r} # Get a single export exp_obj <- auth$exports$get(id = "") ``` ## Get bulk export jobs Users are able to fetch details for multiple export jobs by calling one bulk action - the `bulk_get()` method. The accepted input can be a list of export job IDs or a list of export job objects (of class Export). The result will be a Collection object containing a list of (updated) export jobs. ```{r} # Get details of multiple export jobs export_jobs <- a$exports$bulk_get( exports = list("", "") ) ``` ## Submit a new export - export a file from the platform to a volume In order to export platform files into volumes, users can use the `submit_export()` method from the `auth$exports` path, or directly on the selected `File` object (file they want to export) where this function is also available. ```{r} # First, get the volume you want to export files to vol1 <- a$volumes$get(id = "/") # Get the File object/id you want to export from the platform test_file <- a$files$get("") # Perform a file export exp_job1 <- a$exports$submit_export( source_file = test_file, destination_volume = vol1, destination_location = "new_volume_file.txt" # new name ) ``` Preview export job details with the `print()` method: ```{r} # Print export job info print(exp_job1) ``` Bear in mind that **folders export** from the platform to volumes is **not possible** with this function. For such cases (or export of multiple files) it is better to use bulk actions that will be added to the package soon. Users can also export files into specific volume directories, by providing the `prefix` within the `location` parameter as a folder name, which will then be virtually created on the volume: ```{r} # Export file into the folder 'test_folder' exp_job2 <- a$exports$submit_export( source_file = test_file, destination_volume = vol1, destination_location = "test_folder/new_volume_file.txt" # new name ) # Print export job info print(exp_job2) ``` __Important__ : - The file selected for export must not be a public file or an alias. Aliases are objects stored in your cloud storage bucket which have been made available on a Seven Bridges environment. The volume you are exporting to must be configured for read-write access. To do this, set the `access_mode` parameter to RW when creating or modifying a volume. - If this call is successful, the original project file will become an alias to the newly exported object on the volume. The source file will be deleted from the Seven Bridges environment and, if no more copies of this file exist, it will no longer count towards your total storage price on the Seven Bridges environment. ## Reload export job In order to refresh the export job object and get the up to date info about its state, you can always call the `reload()` function: ```{r} # Reload export object exp_job1$reload() ``` ## Submit bulk export - export multiple files into a volume Users are able to perform the bulk action on multiple project files and export them into a volume using a single call of the `bulk_submit_export()` method from the `Auth$exports` path. The required input should be a nested list of elements for each file you want to export containing specific fields: `source_file`, `destination_volume`, `destination_location`, `overwrite` and `properties` that accepts a list with fields: `sse_algorithm`, `sse_aws_kms_key_id` and/or `aws_canned_acl`. ```{r} ## First, get the project and files you want to export test_proj <- a$projects$get("") proj_files <- test_proj$list_files() ## Choose the first 3 files to export files_to_export <- proj_files$items[1:3] ## Then, get the volume you want to export files into vol1 <- a$volumes$get(id = "/") ## Construct the inputs list by filling the necessary information for each # file to export to_export <- list( list( source_file = files_to_export[[1]], destination_volume = vol1, destination_location = files_to_export[[1]]$name ), list( source_file = "second-file-id", destination_volume = vol1, destination_location = "my-folder/exported_second_file.txt", overwrite = TRUE ), list( source_file = files_to_export[[3]], destination_volume = vol1, destination_location = files_to_export[[3]]$name, overwrite = FALSE, properties = list( sse_algorithm = "AES256" ) ), copy_only = FALSE ) bulk_export_jobs <- a$exports$bulk_submit_export(items = to_export) # Preview the results bulk_export_jobs # Get updated status by fetching details with bulk_get() and by passing the # list of export jobs created in the previous step a$exports$bulk_get(exports = bulk_export_jobs$items) ``` As you may see from the example above, users are able to export files into a folder on a volume. Moreover, you are able to pass the objects of classes Volume or File (with type = 'file' only, since folders can't be exported) when constructing the inputs list, in addition to using their string IDs for `source_file` and `destination_volume` fields. However, `destination_location` on the volume must be set as character field. Lastly, `copy_only` field will apply to all files being exported, which means that each file would be copied to the volume, while source location would remain on the Platform if `copy_only` is set to TRUE. In the example above, the list of items for the bulk export job was manually created. Alternatively, you can use the `prepare_items_for_bulk_export()` utility function to generate the items list. This function allows you to prepare the list of bulk export items based on the provided `File` objects, filling the following fields for each item: `source_file`, `destination_volume`, `destination_location`, `overwrite`, `properties`. Check the function's documentation and [API documentation](https://docs.sevenbridges.com/reference/start-a-bulk-export-job) for more details. ```{r} ## First, get the project and files you want to export test_proj <- a$projects$get("") proj_files <- test_proj$list_files() ## Then, get the volume you want to export files into vol1 <- a$volumes$get(id = "/") ## Select two File objects file_1_export <- proj_files[[1]] file_2_export <- proj_files[[2]] files_to_export <- list(file_1_export, file_2_export) ## Construct the inputs list using the prepare_items_for_bulk_export() utility # function to_export <- prepare_items_for_bulk_export( files = files_to_export, destination_volume = vol1, destination_location_prefix = "my-folder/" ) bulk_export_jobs <- a$exports$bulk_submit_export(items = to_export) # Preview the results bulk_export_jobs # Get updated status by fetching details with bulk_get() and by passing the # list of export jobs created in the previous step a$exports$bulk_get(exports = bulk_export_jobs$items) ``` Keep in mind that `prepare_items_for_bulk_export()` is designed solely to assist in constructing the list of items for submitting a bulk export job. It operates under certain constraints; refer to the function's documentation for further details. After obtaining the function's output, you can manually adjust individual items as needed.