NEWS

semanticfa 0.1.1 (2026-07-03)

Bundled data upgrade: data(big5) now ships 50 x 4096 Qwen3-Embedding-8B item embeddings (rounded to 4 decimal places), replacing the 50 x 384 all-MiniLM-L6-v2 sentence-BERT embeddings of 0.1.0. Analyses run on the bundled data will differ from 0.1.0.
sfa_item_fit() now compares candidates against unflipped (topical) construct centroids, matching sfa_anchor() and sfa_simplify(). The previous behavior sign-flipped reverse-keyed reference items into anti-topic vectors, which depressed the item-similarity profile of constructs with many reverse-keyed items and could misassign candidates. reverse_key = TRUE still flips the candidate itself.
sfa_congruence()'s disattenuated metric now returns NA with a warning when either similarity matrix's split-half reliability is not positive (e.g., the checkerboard sign pattern of atomic_reversed), instead of failing with an unhelpful error.
sfa_nli_matrix() reads the entailment/contradiction label order from the cross-encoder's model config instead of assuming the cross-encoder/nli-* order, so non-default NLI models score correctly (a warning falls back to the default order when the config is unavailable).
sfa_parallel() now applies Horn's sequential retention rule (count leading eigenvalues until the first falls below its null percentile) instead of counting all eigenvalues above their pointwise percentiles, and its documentation cites the embedding-benchmark precedent (Garrido et al.).
sfa_dimselect() default encoding is now "atomic", matching sfa().
Fitted objects store the encoded item vectors as $transformed_embeddings (previously $embeddings, which was easy to confuse with $input_embeddings).
digest moved from Suggests to Imports: embedding cache keys are now always SHA-256 (the previous fallback hash could collide).
Documentation fixes: sfa_anchor() help no longer claims items are sign-aligned before anchoring (they never were in this release line, by design), the big5 item-code range reads E1--O50, the README encoding table marks atomic as keying-free, and README/vignette references to the bundled data name the Qwen3 embeddings.

semanticfa 0.1.0 (2026-06-15)

Initial release.
Core sfa() function for semantic factor analysis.
Encoding methods: atomic_reversed, atomic, squid, mean_centered_pearson.
Embedding-adapted parallel analysis (sfa_parallel()).
Unified retention diagnostics (sfa_nfactors()).
Fit diagnostics: KMO, TEFI, RMSR, CAF, McDonald's omega, DAAL.
Comparison metrics: Tucker phi, NMI, ARI, Frobenius, disattenuated (sfa_congruence()).
Embedding backends: sentence-BERT, OpenAI, custom functions, precomputed.
Bundled IPIP Big Five 50-item dataset with sentence-BERT embeddings.