Title: | R Interface to SDMX Web Services |
---|---|
Description: | Provides functions to retrieve data and metadata from providers that disseminate data by means of SDMX web services. SDMX (Statistical Data and Metadata eXchange) is a standard that has been developed with the aim of simplifying the exchange of statistical information. More about the SDMX standard and the SDMX Web Services can be found at: <https://sdmx.org>. |
Authors: | Attilio Mattiocco [aut, cre] |
Maintainer: | Attilio Mattiocco <[email protected]> |
License: | EUPL |
Version: | 3.5-0 |
Built: | 2024-11-08 10:21:54 UTC |
Source: | CRAN |
This package provides functions to extract timeseries data and structural metadata from an SDMX Provider (e.g. ECB,OECD, EUROSTAT) via SDMX Web Service
Package: | RJSDMX |
Type: | Package |
The SDMX Connectors framework (of which RJSDMX is part) aims to offer data users the means for efficiently interacting with SDMX Web Service providers from within the most popular statistical tools. The source code of the SDMX Connectors project can be found at:
https://github.com/amattioc/SDMX
Information about the R Connector can be found in the dedicated wiki page:
https://github.com/amattioc/SDMX/wiki/RJSDMX:-Connector-for-R
In particular, all information related to configuration (network, tracing, security) can be found at:
https://github.com/amattioc/SDMX/wiki/Configuration
Attilio Mattiocco, Bank of Italy [email protected]
getProviders, getTimeSeries, sdmxHelp
## Not run: my_ts = getTimeSeries('ECB','EXR.M.USD.EUR.SP00.A') ## End(Not run)
## Not run: my_ts = getTimeSeries('ECB','EXR.M.USD.EUR.SP00.A') ## End(Not run)
Configure a new data provider (only SDMX 2.1 REST providers are supported). This function can be used to configure a new (SDMX 2.1 compliant, REST based) data provider.
addProvider(name, endpoint, needsCredentials = FALSE, needsURLEncoding = FALSE, supportsCompression = TRUE, description = "", sdmxVersion = "V2", supportsAvailability = F)
addProvider(name, endpoint, needsCredentials = FALSE, needsURLEncoding = FALSE, supportsCompression = TRUE, description = "", sdmxVersion = "V2", supportsAvailability = F)
name |
the name of the provider |
endpoint |
the URL where the provider resides |
needsCredentials |
set this to TRUE if the user needs to authenticate to query the provider |
needsURLEncoding |
set this to TRUE if the provider does not handle character '+' in URLs |
supportsCompression |
set this to TRUE if the provider is able to handle compression |
description |
a brief text description of the provider |
supportsAvailability |
set this to TRUE if the provider is able to handle availability queries |
sdmxVersion |
what sdmx version this provider supports |
## Not run: addProvider('test', 'http://sdw-wsrest.ecb.europa.eu/service', FALSE) getProviders() ## End(Not run)
## Not run: addProvider('test', 'http://sdw-wsrest.ecb.europa.eu/service', FALSE) getProviders() ## End(Not run)
Extract the codes of a dimension. This function is used to retrieve the list of codes available for the input dimension and flow.
getCodes(provider, flow, dimension)
getCodes(provider, flow, dimension)
flow |
the identifier of the dataflow |
dimension |
the identifier of the dimension |
provider |
the name of the provider |
getCodes(provider, dataflow, dimension)
## Not run: codes=getCodes('ECB', 'EXR', 'FREQ') ## End(Not run)
## Not run: codes=getCodes('ECB', 'EXR', 'FREQ') ## End(Not run)
Extract the dimensions of a DataFlow. This function is used to retrieve the list of dimensions of the input dataflow
getDimensions(provider, dataflow)
getDimensions(provider, dataflow)
dataflow |
the identifier of the dataflow |
provider |
the name of the provider |
getDimensions(provider, dataflow)
## Not run: dims = getDimensions('ECB','EXR') ## End(Not run)
## Not run: dims = getDimensions('ECB','EXR') ## End(Not run)
Extract the dsd identifier of a DataFlow. This function is used to retrieve the name of the keyfamily of the input dataflow.
getDSDIdentifier(provider, dataflow)
getDSDIdentifier(provider, dataflow)
provider |
the name of the provider |
dataflow |
the identifier of the dataflow |
getDSDIdentifier(provider, dataflow)
## Not run: id = getDSDIdentifier('ECB','EXR') ## End(Not run)
## Not run: id = getDSDIdentifier('ECB','EXR') ## End(Not run)
Extract the list of DataFlows of a provider. This function is used to query the list of dataflows of the provider. A matching pattern can be provided, if needed.
getFlows(provider, pattern = "")
getFlows(provider, pattern = "")
pattern |
the pattern to match against the dataflow id or description. If a pattern is not provided, all dataflows are returned. |
provider |
the name of the provider |
getFlows(provider, pattern)
Attilio Mattiocco [email protected], Diana Nicoletti
## Not run: ## get all flows from ECB flows = getFlows('ECB') ## get all flows that contain the 'EXR flows = getFlows('ECB','*EXR*') ## End(Not run)
## Not run: ## get all flows from ECB flows = getFlows('ECB') ## get all flows that contain the 'EXR flows = getFlows('ECB','*EXR*') ## End(Not run)
Extract the list of available Data Providers. This function is used to query the list of data providers.
getProviders()
getProviders()
getProviders()
## Not run: getProviders() ## End(Not run)
## Not run: getProviders() ## End(Not run)
Extract a list of time series. This function is used to extract a list of time series identified by the parameters provided in input.
getTimeSeries(provider, id, start, end)
getTimeSeries(provider, id, start='', end='')
getTimeSeries(provider, id, start='', end='')
provider |
the name of the provider |
id |
identifier of the time series |
end |
the end time - optional |
start |
the start time - optional |
## Not run: # SDMX V2 ## get single time series: my_ts=getTimeSeries('ECB',id='EXR.A.USD.EUR.SP00.A') ## get monthly and annual frequency: my_ts=getTimeSeries('ECB',id='EXR.A+M.USD.EUR.SP00.A') ## get all available frequencies: my_ts=getTimeSeries('ECB',id='EXR..USD.EUR.SP00.A') ## End(Not run)
## Not run: # SDMX V2 ## get single time series: my_ts=getTimeSeries('ECB',id='EXR.A.USD.EUR.SP00.A') ## get monthly and annual frequency: my_ts=getTimeSeries('ECB',id='EXR.A+M.USD.EUR.SP00.A') ## get all available frequencies: my_ts=getTimeSeries('ECB',id='EXR..USD.EUR.SP00.A') ## End(Not run)
Extract a list of time series . This function is used to extract a list of time series identified by the parameters provided in input.
getTimeSeries2( provider, dataflow, key = "", filter = "", start = "", end = "", attributes = "all", measures = "all" )
getTimeSeries2( provider, dataflow, key = "", filter = "", start = "", end = "", attributes = "all", measures = "all" )
provider |
the name of the provider |
dataflow |
dataflow of the time series |
key |
timeseries key - optional |
filter |
optional filter to be applied - optional |
start |
the start time - optional |
end |
the end time - optional |
attributes |
the comma separated list of attributes to be returned - optional, default='all', 'none' for no attributes |
measures |
the comma separated list of measures to be returned - optional, default='all', 'none' for no measures |
## Not run: # SDMX V3 ## get single time series: my_ts=getTimeSeries2('ECB', dataflow='EXR', key='A.USD.EUR.SP00.A') ## get all available frequencies: my_ts=getTimeSeries2('ECB', dataflow='EXR', key='.USD.EUR.SP00.A') #or #' ## get single time series: EXR.A.USD.EUR.SP00.A my_ts=getTimeSeries2('ECB', dataflow='EXR', filter='c[FREQ]=A&c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## get monthly and annual frequency: my_ts=getTimeSeries2('ECB', dataflow='EXR', filter='c[FREQ]=A,M&c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## get all available frequencies: my_ts=getTimeSeries2('ECB', dataflow='EXR', filter='c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## End(Not run)
## Not run: # SDMX V3 ## get single time series: my_ts=getTimeSeries2('ECB', dataflow='EXR', key='A.USD.EUR.SP00.A') ## get all available frequencies: my_ts=getTimeSeries2('ECB', dataflow='EXR', key='.USD.EUR.SP00.A') #or #' ## get single time series: EXR.A.USD.EUR.SP00.A my_ts=getTimeSeries2('ECB', dataflow='EXR', filter='c[FREQ]=A&c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## get monthly and annual frequency: my_ts=getTimeSeries2('ECB', dataflow='EXR', filter='c[FREQ]=A,M&c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## get all available frequencies: my_ts=getTimeSeries2('ECB', dataflow='EXR', filter='c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## End(Not run)
Extract time series starting from a specific update time and with history of revisions. This function works as getTimeSeriesTable but the query can be narrowed to getting only observations that were updated after a specific point in time, and eventually it returns the revision history of the matching time series.
The result is packed into a data.frame
getTimeSeriesRevisions(provider, id, start = "", end = "", updatedAfter = "", includeHistory = TRUE)
getTimeSeriesRevisions(provider, id, start = "", end = "", updatedAfter = "", includeHistory = TRUE)
id |
identifier of the time series |
provider |
the name of the provider |
end |
the end time - optional |
start |
the start time - optional |
updatedAfter |
the updatedAfter time - optional. It has to be in the form: 'YYYY-MM-DD' |
includeHistory |
boolean parameter - optional. If TRUE the full list of revisions will be returned |
getTimeSeriesRevisions(provider, id, start, end, updatedAfter, includeHistory)
## Not run: # get single time series with history: my_ts=getTimeSeriesRevisions('ECB','EXR.A.USD.EUR.SP00.A', includeHistory=TRUE) # get single time series (only observations updated after january 1st 2015): my_ts=getTimeSeriesRevisions('ECB','EXR.A.USD.EUR.SP00.A', updatedAfter='2015', includeHistory=FALSE) # get single time series (full revision history starting from january 1st 2015): my_ts=getTimeSeriesRevisions('ECB','EXR.A.USD.EUR.SP00.A', updatedAfter='2015', includeHistory=TRUE) ## End(Not run)
## Not run: # get single time series with history: my_ts=getTimeSeriesRevisions('ECB','EXR.A.USD.EUR.SP00.A', includeHistory=TRUE) # get single time series (only observations updated after january 1st 2015): my_ts=getTimeSeriesRevisions('ECB','EXR.A.USD.EUR.SP00.A', updatedAfter='2015', includeHistory=FALSE) # get single time series (full revision history starting from january 1st 2015): my_ts=getTimeSeriesRevisions('ECB','EXR.A.USD.EUR.SP00.A', updatedAfter='2015', includeHistory=TRUE) ## End(Not run)
Extract a list of time series identified by the parameters provided in input, and return a data.frame as result.
getTimeSeriesTable(provider, id, start, end, gregorianTime)
getTimeSeriesTable(provider, id, start='', end='', gregorianTime=F)
getTimeSeriesTable(provider, id, start='', end='', gregorianTime=F)
id |
identifier of the time series |
provider |
the name of the provider |
end |
the end time - optional |
start |
the start time - optional |
gregorianTime |
set to true to have all daily dates - optional |
## Not run: # SDMX V2 ## get single time series: my_ts=getTimeSeriesTable('ECB',id='EXR.A.USD.EUR.SP00.A') ## get monthly and annual frequency: my_ts=getTimeSeriesTable('ECB',id='EXR.A+M.USD.EUR.SP00.A') ## get all available frequencies: my_ts=getTimeSeriesTable('ECB',id='EXR..USD.EUR.SP00.A') ## End(Not run)
## Not run: # SDMX V2 ## get single time series: my_ts=getTimeSeriesTable('ECB',id='EXR.A.USD.EUR.SP00.A') ## get monthly and annual frequency: my_ts=getTimeSeriesTable('ECB',id='EXR.A+M.USD.EUR.SP00.A') ## get all available frequencies: my_ts=getTimeSeriesTable('ECB',id='EXR..USD.EUR.SP00.A') ## End(Not run)
Extract a list of time series identified by the parameters provided in input, and return a data.frame as result.
getTimeSeriesTable2( provider, dataflow, key = "", filter = "", start = "", end = "", attributes = "all", measures = "all", updatedAfter = .jnull(), includeHistory = FALSE, gregorianTime = F )
getTimeSeriesTable2( provider, dataflow, key = "", filter = "", start = "", end = "", attributes = "all", measures = "all", updatedAfter = .jnull(), includeHistory = FALSE, gregorianTime = F )
provider |
the name of the provider |
dataflow |
dataflow of the time series |
key |
timeseries key |
filter |
optional filter to be applied |
start |
the start time - optional |
end |
the end time - optional |
attributes |
the comma separated list of attributes to be returned - optional, default='all', 'none' for no attributes |
measures |
the comma separated list of measures to be returned - optional, default='all', 'none' for no measures |
updatedAfter |
return only changes after this date - optional |
includeHistory |
include history of revisions - optional, default=false |
gregorianTime |
set to true to have all daily dates - optional |
## Not run: # SDMX V3 ## get single time series: my_ts=getTimeSeriesTable('ECB', dataflow='EXR', key='A.USD.EUR.SP00.A') ## get monthly and annual frequency: my_ts=getTimeSeriesTable('ECB', dataflow='EXR', key='A+M.USD.EUR.SP00.A') ## get all available frequencies: my_ts=getTimeSeriesTable('ECB', dataflow='EXR', key='.USD.EUR.SP00.A') #or ## get single time series: EXR.A.USD.EUR.SP00.A my_ts=getTimeSeriesTable('ECB', dataflow='EXR', filter='c[FREQ]=A&c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## get monthly and annual frequency: my_ts=getTimeSeriesTable('ECB', dataflow='EXR', filter='c[FREQ]=A,M&c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## get all available frequencies: my_ts=getTimeSeriesTable('ECB', dataflow='EXR', filter='c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## End(Not run)
## Not run: # SDMX V3 ## get single time series: my_ts=getTimeSeriesTable('ECB', dataflow='EXR', key='A.USD.EUR.SP00.A') ## get monthly and annual frequency: my_ts=getTimeSeriesTable('ECB', dataflow='EXR', key='A+M.USD.EUR.SP00.A') ## get all available frequencies: my_ts=getTimeSeriesTable('ECB', dataflow='EXR', key='.USD.EUR.SP00.A') #or ## get single time series: EXR.A.USD.EUR.SP00.A my_ts=getTimeSeriesTable('ECB', dataflow='EXR', filter='c[FREQ]=A&c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## get monthly and annual frequency: my_ts=getTimeSeriesTable('ECB', dataflow='EXR', filter='c[FREQ]=A,M&c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## get all available frequencies: my_ts=getTimeSeriesTable('ECB', dataflow='EXR', filter='c[CURRENCY]=USD&c[CURRENCY_DENOM]=EUR&c[EXR_TYPE]=SP00&c[EXR_SUFFIX]=A') ## End(Not run)
This function is used to transform the output of the getSDMX (or getTimeseries) functions from a list of time series to a data.frame. The metadata can be requested by explicitly passing the appropriate parameters.
sdmxdf(tslist, meta = FALSE, id = TRUE)
sdmxdf(tslist, meta = FALSE, id = TRUE)
tslist |
the list of time series to be converted |
meta |
set this to TRUE if you want metadata to be included (default: FALSE, as this may increase the size of the result quite a bit) |
id |
set this to FALSE if you do not want the time series id to be included (default: TRUE) |
sdmxdf()
## Not run: a=getSDMX('ECB', 'EXR.A|Q|M|D.USD.EUR.SP00.A') ddf = sdmxdf(a) ddf = sdmxdf(a, meta=TRUE) ## End(Not run)
## Not run: a=getSDMX('ECB', 'EXR.A|Q|M|D.USD.EUR.SP00.A') ddf = sdmxdf(a) ddf = sdmxdf(a, meta=TRUE) ## End(Not run)
Open a helper graphical application. This function opens a small sdmx metadata browser that can be helpful when building queries.
sdmxHelp(internalJVM = T)
sdmxHelp(internalJVM = T)
internalJVM |
TRUE (default) if the GUI has to live in the R JVM. Set this to FALSE in MAC, to avoid issue #41 |
sdmxHelp()
## Not run: #opens the helper in the R JVM sdmxHelp() #opens the helper in an external JVM sdmxHelp(FALSE) ## End(Not run)
## Not run: #opens the helper in the R JVM sdmxHelp() #opens the helper in an external JVM sdmxHelp(FALSE) ## End(Not run)
Sets the credentials for providers that need authentication
setProviderCredentials(provider, user=.jnull(), pw=.jnull())
setProviderCredentials(provider, user=.jnull(), pw=.jnull())
provider |
the name of the provider |
user |
the username |
pw |
the password |