Package: RcppCWB 0.6.10
RcppCWB: 'Rcpp' Bindings for the 'Corpus Workbench' ('CWB')
'Rcpp' Bindings for the C code of the 'Corpus Workbench' ('CWB'), an indexing and query engine to efficiently analyze large corpora (<https://cwb.sourceforge.io>). 'RcppCWB' is licensed under the GNU GPL-3, in line with the GPL-3 license of the 'CWB' (<https://www.r-project.org/Licenses/GPL-3>). The 'CWB' relies on 'pcre2' (BSD license, see <https://github.com/PCRE2Project/pcre2/blob/master/LICENCE.md>) and 'GLib' (LGPL license, see <https://www.gnu.org/licenses/lgpl-3.0.en.html>). See the file LICENSE.note for further information. The package includes modified code of the 'rcqp' package (GPL-2, see <https://cran.r-project.org/package=rcqp>). The original work of the authors of the 'rcqp' package is acknowledged with great respect, and they are listed as authors of this package. To achieve cross-platform portability (including Windows), using 'Rcpp' for wrapper code is the approach used by 'RcppCWB'.
Authors:
RcppCWB_0.6.10.tar.gz
RcppCWB_0.6.10.tar.gz(r-4.7-x86_64)RcppCWB_0.6.10.tar.gz(r-4.6-x86_64)
manual.pdf |manual.html✨
card.svg |card.png
RcppCWB/json (API)
NEWS
| # Install 'RcppCWB' in R: |
| install.packages('RcppCWB', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/polmine/rcppcwb/issues
Last updated from:f9151cccc6. Checks:3 FAIL, 3 OK. Indexed: no.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-arm64 | FAIL | 134 | ||
| linux-devel-x86_64 | OK | 154 | ||
| source / vignettes | OK | 249 | ||
| linux-release-arm64 | FAIL | 127 | ||
| linux-release-x86_64 | OK | 169 | ||
| wasm-release | FAIL | 145 |
Exports:attribute_sizecheck_corpuscheck_cposcheck_idcheck_p_attributecheck_pkg_registry_filescheck_querycheck_region_matrixcheck_registrycheck_s_attributecheck_strucscl_attribute_sizecl_charset_namecl_cpos2idcl_cpos2lboundcl_cpos2rboundcl_cpos2strcl_cpos2struccl_delete_corpuscl_find_corpuscl_id2cposcl_id2freqcl_id2strcl_lexicon_sizecl_list_corporacl_load_corpuscl_regex2idcl_str2idcl_struc_valuescl_struc2cposcl_struc2strcorpus_data_dircorpus_full_namecorpus_info_filecorpus_is_loadedcorpus_p_attributescorpus_propertiescorpus_propertycorpus_registry_dircorpus_s_attributescpos_to_idcpos_to_lboundcpos_to_rboundcpos_to_strcpos_to_struccpos2idcpos2strcqp_drop_subcorpuscqp_dump_subcorpuscqp_get_registrycqp_initializecqp_is_initializedcqp_list_corporacqp_list_subcorporacqp_load_corpuscqp_querycqp_reset_registrycqp_subcorpus_sizecqp_verbositycwb_charsetscwb_compress_rdxcwb_encodecwb_huffcodecwb_makeallcwb_versionget_cbow_matrixget_count_vectorget_pkg_registryget_region_matrixget_tmp_registryid_to_cposid_to_freqid2strids_to_count_matrixmatrix_to_subcorpusp_attrp_attr_defaultp_attr_lexicon_sizep_attr_sizeranges_to_cposregex_to_idregion_matrix_contextregion_matrix_to_count_matrixregion_matrix_to_idsregion_matrix_to_struc_matrixregion_to_strucss_attrs_attr_is_descendents_attr_is_siblings_attr_regionss_attr_relationships_attr_sizes_attribute_decodestr_to_idstruc_to_cposstruc_to_strstruc2cpossubcorpus_get_rangesuse_tmp_registry
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Rcpp Bindings for the Corpus Workbench (CWB). | RcppCWB-package RcppCWB |
| Rcpp wrappers for CWB Corpus Library functions | attribute_size cpos2id cpos2str id2str struc2cpos |
| Check Input to Rcpp Functions. | check check_corpus check_cpos check_id check_p_attribute check_query check_region_matrix check_registry check_strucs check_s_attribute |
| Check Paths in Registry Files | check_pkg_registry_files |
| Get Attribute Size (of Positional/Structural Attribute). | cl_attribute_size |
| Get charset of a corpus. | cl_charset_name |
| Drop loaded corpus. | cl_delete_corpus |
| Load corpus. | cl_find_corpus |
| Get Lexicon Size. | cl_lexicon_size |
| Show CL corpora | cl_list_corpora |
| Load corpus | cl_load_corpus |
| Low-level CL access. | cl_rework cpos_to_id cpos_to_lbound cpos_to_rbound cpos_to_str cpos_to_struc id_to_cpos id_to_freq p_attr p_attr_lexicon_size p_attr_size regex_to_id struc_to_cpos struc_to_str str_to_id s_attr s_attr_size |
| Check whether structural attribute has values | cl_struc_values |
| Using Positional Attributes. | CL: p_attributes cl_cpos2id cl_cpos2str cl_id2cpos cl_id2freq cl_id2str cl_regex2id cl_str2id |
| Using Structural Attributes. | CL: s_attributes cl_cpos2lbound cl_cpos2rbound cl_cpos2struc cl_struc2cpos cl_struc2str |
| Get information from registry file | corpus_data_dir corpus_full_name corpus_info_file corpus_properties corpus_property corpus_p_attributes corpus_registry_dir corpus_s_attributes |
| Check whether corpus is loaded | corpus_is_loaded |
| Initialize Corpus Query Processor (CQP). | cqp_get_registry cqp_initialize cqp_is_initialized cqp_load_corpus cqp_reset_registry cqp_verbosity |
| List Available CWB Corpora. | cqp_list_corpora |
| Execute CQP Query and Retrieve Results. | cqp_drop_subcorpus cqp_dump_subcorpus cqp_list_subcorpora cqp_query cqp_subcorpus_size |
| Character sets supported by CWB | cwb_charsets |
| CWB Tools for Creating Corpora | cwb_compress_rdx cwb_encode cwb_huffcode cwb_makeall |
| Get CWB version | cwb_version |
| Get CBOW Matrix. | get_cbow_matrix |
| Get Vector with Counts for Positional Attribute. | get_count_vector |
| Get Registry Directory Within Package | get_pkg_registry |
| Get Matrix with Regions for Strucs. | get_region_matrix |
| Perform Count for Vector of IDs. | ids_to_count_matrix |
| Create CWB subcorpus from matrix with regions. | matrix_to_subcorpus |
| Get default p-attribute | p_attr_default |
| Get IDs and Counts for Region Matrices. | ranges_to_cpos region_matrix_context region_matrix_ops region_matrix_to_count_matrix region_matrix_to_ids |
| Get min and max strucs of s-attribute present in region | region_matrix_to_struc_matrix region_to_strucs |
| Explore XML structure of CWB corpus | s_attr_is_descendent s_attr_is_sibling s_attr_relationship |
| Get regions defined by a structural attribute | s_attr_regions |
| Decode Structural Attribute. | s_attribute_decode |
| Get ranges of subcorpus | subcorpus_get_ranges |
| Use Temporary Registry | get_tmp_registry use_tmp_registry |
