NEWS
semanticfa 0.1.1 (2026-07-03)
- Bundled data upgrade:
data(big5) now ships 50 x 4096 Qwen3-Embedding-8B
item embeddings (rounded to 4 decimal places), replacing the 50 x 384
all-MiniLM-L6-v2 sentence-BERT embeddings of 0.1.0. Analyses run on the
bundled data will differ from 0.1.0.
sfa_item_fit() now compares candidates against unflipped (topical)
construct centroids, matching sfa_anchor() and sfa_simplify(). The
previous behavior sign-flipped reverse-keyed reference items into anti-topic
vectors, which depressed the item-similarity profile of constructs with many
reverse-keyed items and could misassign candidates. reverse_key = TRUE
still flips the candidate itself.
sfa_congruence()'s disattenuated metric now returns NA with a warning
when either similarity matrix's split-half reliability is not positive
(e.g., the checkerboard sign pattern of atomic_reversed), instead of
failing with an unhelpful error.
sfa_nli_matrix() reads the entailment/contradiction label order from the
cross-encoder's model config instead of assuming the
cross-encoder/nli-* order, so non-default NLI models score correctly (a
warning falls back to the default order when the config is unavailable).
sfa_parallel() now applies Horn's sequential retention rule (count leading
eigenvalues until the first falls below its null percentile) instead of
counting all eigenvalues above their pointwise percentiles, and its
documentation cites the embedding-benchmark precedent (Garrido et al.).
sfa_dimselect() default encoding is now "atomic", matching sfa().
- Fitted objects store the encoded item vectors as
$transformed_embeddings
(previously $embeddings, which was easy to confuse with
$input_embeddings).
digest moved from Suggests to Imports: embedding cache keys are now always
SHA-256 (the previous fallback hash could collide).
- Documentation fixes:
sfa_anchor() help no longer claims items are
sign-aligned before anchoring (they never were in this release line, by
design), the big5 item-code range reads E1--O50, the README encoding table
marks atomic as keying-free, and README/vignette references to the bundled
data name the Qwen3 embeddings.
semanticfa 0.1.0 (2026-06-15)
- Initial release.
- Core
sfa() function for semantic factor analysis.
- Encoding methods:
atomic_reversed, atomic, squid, mean_centered_pearson.
- Embedding-adapted parallel analysis (
sfa_parallel()).
- Unified retention diagnostics (
sfa_nfactors()).
- Fit diagnostics: KMO, TEFI, RMSR, CAF, McDonald's omega, DAAL.
- Comparison metrics: Tucker phi, NMI, ARI, Frobenius, disattenuated (
sfa_congruence()).
- Embedding backends: sentence-BERT, OpenAI, custom functions, precomputed.
- Bundled IPIP Big Five 50-item dataset with sentence-BERT embeddings.