Cross-dict entry-coverage divergence
Which headwords does each dictionary treat very differently from its peers? This page ranks lemmas by the spread in how many main entries each of the 7 core dictionaries devotes to the same headword — a structural proxy for editorial disagreement. High divergence = worth inspecting before citing across dictionaries.
Deep comparison uses validated feature adapters only. Broad coverage/overlap covers eligible local Sanskrit/BHS headwords; missing deep markup is not counted as zero evidence.
Trust Block
- Evidence:
data/lexico/sense_divergence.json, generated fromdata/lexico/microstructure_profile.csvbypython scripts/lexico/build_divergence_map.py. Core dicts: MW, AP, PWG, PWK, WIL, VCP, SKD. - Metric: entry-count divergence = max(entries in any dict) − min(entries in any dict) for a given headword. Structural proxy only — true semantic sense divergence requires per-lemma sense-unit computation (deferred; see
docs/RESEARCH_LAYER_ROADMAP.md). - Limitations: only covers lemmas present in ≥2 of the 7 core dicts. Divergence = 0 does not mean the dicts agree semantically; it means they both give the headword exactly 1 entry. Headword splitting (homonym indices) varies by dict and inflates apparent divergence.
- Validation:
python scripts/lexico/build_divergence_map.pyregenerates; checked bynpm run build. - Owner repo:
csl-atlas. - Next use: inspect highlighted rows, then open exact dictionary source records before citing the pattern.
Generated by python scripts/lexico/build_divergence_map.py from data/lexico/microstructure_profile.csv. CC-BY-SA-4.0.