Package: RcppCWB 0.6.5
RcppCWB: 'Rcpp' Bindings for the 'Corpus Workbench' ('CWB')
'Rcpp' Bindings for the C code of the 'Corpus Workbench' ('CWB'), an indexing and query engine to efficiently analyze large corpora (<https://cwb.sourceforge.io>). 'RcppCWB' is licensed under the GNU GPL-3, in line with the GPL-3 license of the 'CWB' (<https://www.r-project.org/Licenses/GPL-3>). The 'CWB' relies on 'pcre2' (BSD license, see <http://www.pcre.org/licence.txt>) and 'GLib' (LGPL license, see <https://www.gnu.org/licenses/lgpl-3.0.en.html>). See the file LICENSE.note for further information. The package includes modified code of the 'rcqp' package (GPL-2, see <https://cran.r-project.org/package=rcqp>). The original work of the authors of the 'rcqp' package is acknowledged with great respect, and they are listed as authors of this package. To achieve cross-platform portability (including Windows), 'RcppCWB' uses 'Rcpp' for its wrapper code.
Authors:
# Install 'RcppCWB' in R:
install.packages('RcppCWB', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org'))
Bug tracker: https://github.com/polmine/rcppcwb/issues
Last updated 2 months ago from: a5b18cc298. Checks: OK: 2. Indexed: no.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 23 2024 |
R-4.5-linux-x86_64 | OK | Nov 23 2024 |
Exports: attribute_size, check_corpus, check_cpos, check_id, check_p_attribute, check_pkg_registry_files, check_query, check_region_matrix, check_registry, check_s_attribute, check_strucs, cl_attribute_size, cl_charset_name, cl_cpos2id, cl_cpos2lbound, cl_cpos2rbound, cl_cpos2str, cl_cpos2struc, cl_delete_corpus, cl_find_corpus, cl_id2cpos, cl_id2freq, cl_id2str, cl_lexicon_size, cl_list_corpora, cl_load_corpus, cl_regex2id, cl_str2id, cl_struc_values, cl_struc2cpos, cl_struc2str, corpus_data_dir, corpus_full_name, corpus_info_file, corpus_is_loaded, corpus_p_attributes, corpus_properties, corpus_property, corpus_registry_dir, corpus_s_attributes, cpos_to_id, cpos_to_lbound, cpos_to_rbound, cpos_to_str, cpos_to_struc, cpos2id, cpos2str, cqp_drop_subcorpus, cqp_dump_subcorpus, cqp_get_registry, cqp_initialize, cqp_is_initialized, cqp_list_corpora, cqp_list_subcorpora, cqp_load_corpus, cqp_query, cqp_reset_registry, cqp_subcorpus_size, cqp_verbosity, cwb_charsets, cwb_compress_rdx, cwb_encode, cwb_huffcode, cwb_makeall, cwb_version, get_cbow_matrix, get_count_vector, get_pkg_registry, get_region_matrix, get_tmp_registry, id_to_cpos, id_to_freq, id2str, ids_to_count_matrix, matrix_to_subcorpus, p_attr, p_attr_default, p_attr_lexicon_size, p_attr_size, ranges_to_cpos, regex_to_id, region_matrix_context, region_matrix_to_count_matrix, region_matrix_to_ids, region_matrix_to_struc_matrix, region_to_strucs, s_attr, s_attr_is_descendent, s_attr_is_sibling, s_attr_regions, s_attr_relationship, s_attr_size, s_attribute_decode, str_to_id, struc_to_cpos, struc_to_str, struc2cpos, subcorpus_get_ranges, use_tmp_registry
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Rcpp Bindings for the Corpus Workbench (CWB). | RcppCWB-package RcppCWB |
Rcpp wrappers for CWB Corpus Library functions | attribute_size cpos2id cpos2str id2str struc2cpos |
Check Input to Rcpp Functions. | check check_corpus check_cpos check_id check_p_attribute check_query check_region_matrix check_registry check_strucs check_s_attribute |
Check Paths in Registry Files | check_pkg_registry_files |
Get Attribute Size (of Positional/Structural Attribute). | cl_attribute_size |
Get charset of a corpus. | cl_charset_name |
Drop loaded corpus. | cl_delete_corpus |
Load corpus. | cl_find_corpus |
Get Lexicon Size. | cl_lexicon_size |
Show CL corpora | cl_list_corpora |
Load corpus | cl_load_corpus |
Low-level CL access. | cl_rework cpos_to_id cpos_to_lbound cpos_to_rbound cpos_to_str cpos_to_struc id_to_cpos id_to_freq p_attr p_attr_lexicon_size p_attr_size regex_to_id struc_to_cpos struc_to_str str_to_id s_attr s_attr_size |
Check whether structural attribute has values | cl_struc_values |
Using Positional Attributes. | CL: p_attributes cl_cpos2id cl_cpos2str cl_id2cpos cl_id2freq cl_id2str cl_regex2id cl_str2id |
Using Structural Attributes. | CL: s_attributes cl_cpos2lbound cl_cpos2rbound cl_cpos2struc cl_struc2cpos cl_struc2str |
Get information from registry file | corpus_data_dir corpus_full_name corpus_info_file corpus_properties corpus_property corpus_p_attributes corpus_registry_dir corpus_s_attributes |
Check whether corpus is loaded | corpus_is_loaded |
Initialize Corpus Query Processor (CQP). | cqp_get_registry cqp_initialize cqp_is_initialized cqp_load_corpus cqp_reset_registry cqp_verbosity |
List Available CWB Corpora. | cqp_list_corpora |
Execute CQP Query and Retrieve Results. | cqp_drop_subcorpus cqp_dump_subcorpus cqp_list_subcorpora cqp_query cqp_subcorpus_size |
Character sets supported by CWB | cwb_charsets |
CWB Tools for Creating Corpora | cwb_compress_rdx cwb_encode cwb_huffcode cwb_makeall |
Get CWB version | cwb_version |
Get CBOW Matrix. | get_cbow_matrix |
Get Vector with Counts for Positional Attribute. | get_count_vector |
Get Registry Directory Within Package | get_pkg_registry |
Get Matrix with Regions for Strucs. | get_region_matrix |
Perform Count for Vector of IDs. | ids_to_count_matrix |
Create CWB subcorpus from matrix with regions. | matrix_to_subcorpus |
Get default p-attribute | p_attr_default |
Get IDs and Counts for Region Matrices. | ranges_to_cpos region_matrix_context region_matrix_ops region_matrix_to_count_matrix region_matrix_to_ids |
Get min and max strucs of s-attribute present in region | region_matrix_to_struc_matrix region_to_strucs |
Explore XML structure of CWB corpus | s_attr_is_descendent s_attr_is_sibling s_attr_relationship |
Get regions defined by a structural attribute | s_attr_regions |
Decode Structural Attribute. | s_attribute_decode |
Get ranges of subcorpus | subcorpus_get_ranges |
Use Temporary Registry | get_tmp_registry use_tmp_registry |