Package: discoverableresearch 0.0.1
Neal Haddaway
discoverableresearch: Checks Title, Abstract and Keywords to Optimise Discoverability
A suite of tools are provided here to support authors in making their research more discoverable. check_keywords() - this function checks the keywords to assess whether they are already represented in the title and abstract. check_fields() - this function compares terminology used across the title, abstract and keywords to assess where terminological diversity (i.e. the use of synonyms) could increase the likelihood of the record being identified in a search. The function looks for terms in the title and abstract that also exist in other fields and highlights these as needing attention. suggest_keywords() - this function takes a full text document and produces a list of unigrams, bigrams and trigrams (1-, 2- or 2-word phrases) present in the full text after removing stop words (words with a low utility in natural language processing) that do not occur in the title or abstract that may be suitable candidates for keywords. suggest_title() - this function takes a full text document and produces a list of the most frequently used unigrams, bigrams and trigrams after removing stop words that do not occur in the abstract or keywords that may be suitable candidates for title words. check_title() - this function carries out a number of sub tasks: 1) it compares the length (number of words) of the title with the mean length of titles in major bibliographic databases to assess whether the title is likely to be too short; 2) it assesses the proportion of stop words in the title to highlight titles with low utility in search engines that strip out stop words; 3) it compares the title with a given sample of record titles from an .ris import and calculates a similarity score based on phrase overlap. This highlights the level of uniqueness of the title. This version of the package also contains functions currently in a non-CRAN package called 'litsearchr' <https://github.com/elizagrames/litsearchr>.
Authors:
discoverableresearch_0.0.1.tar.gz
discoverableresearch_0.0.1.tar.gz(r-4.5-noble)discoverableresearch_0.0.1.tar.gz(r-4.4-noble)
discoverableresearch_0.0.1.tgz(r-4.4-emscripten)discoverableresearch_0.0.1.tgz(r-4.3-emscripten)
discoverableresearch.pdf |discoverableresearch.html✨
discoverableresearch/json (API)
# Install 'discoverableresearch' in R: |
install.packages('discoverableresearch', repos = c('https://cran.r-universe.dev', 'https://cloud.r-project.org')) |
- possible_langs - Languages codes synthesisr can recognize
This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.
Last updated 4 years agofrom:6ebe2d53f5. Checks:OK: 1 NOTE: 1. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Dec 04 2024 |
R-4.5-linux | NOTE | Dec 04 2024 |
Exports:check_fieldscheck_keywordscheck_titlecheck_title_lengthfakerakeformat_keywordsget_ngramsget_stopwordsget_tokenslanguage_coderemove_punctuationsuggest_keywordssuggest_title
Dependencies:BHbitbit64clicliprcpp11crayondplyrfansigenericsgluehmsISOcodeslifecyclemagrittrngramNLPpillarpkgconfigprettyunitsprogressR6Rcppreadrrlangslamstopwordsstringdiststringisynthesisrtibbletidyselecttmtzdbutf8vctrsvroomwithrxml2
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Check all field suitability | check_fields |
Check keyword suitability | check_keywords |
Check title with those from a test set | check_title |
Check title suitability | check_title_length |
Functions from litsearchr (not yet on CRAN) Quick keyword extraction | fakerake |
Format input keywords | format_keywords |
Extract n-grams from text | get_ngrams |
Retrieve stop words for a given language | get_stopwords |
Remove stopwords from text | get_tokens |
Get short language codes | language_code |
Languages codes synthesisr can recognize | possible_langs |
Remove punctuation from text | remove_punctuation |
Suggest keywords | suggest_keywords |
Suggest title words | suggest_title |