H2/H3R — sense survival & drift on inheritance edges

A 28-noun panel across three measured inheritance edges (WIL→SHS, WIL→YAT, AP90→AP). H2 tests whether cited ancestor senses survive more often than uncited ones. H3R measures sense-unit drift (copy, condense, revise) along each edge using mean senses per lemma and gloss overlap. Generated from data/lexico/r2_h2h3.json; see docs/R2_FINDINGS.md.

Trust Block. Generated from data/lexico/r2_h2h3.json (H2: cited 0.762 vs uncited 0.705; archived 0.7/0.54). Limitations: panel reconstructed from nouns in all 5 dicts (documented drift in R2_REBUILD_CONTRACT.md); SHS/YAT senses split by inline `N.` markers. Validation: npm test; all unit tests pass. Owner repo: csl-atlas.

H2 — Citation-survival (Supported)

0%25%50%75%100%76%71%Citedn=84Uncitedn=723restored (csl-orig)archivedsense-survival rate+0.06 gap

H3R — Sense-drift per edge

04812mean sense-units per lemmaWilson 1832 → Śabda-Sāgara 1900overlap 0.91Wilson 1832 → Yates 1846overlap 0.26Apte 1890 → Apte 1957overlap 0.56copycondensationrevision

Data

EdgeSenses anc→desDriftGloss overlapPatternArchived
Wilson 1832 → Śabda-Sāgara 19009→900.91near-verbatim copy7.9→8.5 (overlap 0.82)
Wilson 1832 → Yates 18469→5.679-3.3210.26condensation (semicolon-aware count, #126 promotion: YAT abridges ~9→5.7, not the artifactual 9→1)7.9→1.1 (overlap 0.15)
Apte 1890 → Apte 195710.821→7.75-3.0710.56revision, no expansion15.5→11 (overlap 0.61)

Summary

H2 supported: Cited senses survive at 76% (n=84) vs uncited at 71% (n=723); gap = 0.06. Well-sourced senses are stickier.

H3R not supported (no net-addition): Derivatives copy or condense. Wilson 1832 → Śabda-Sāgara 1900: near-verbatim copy (overlap 0.91). Wilson 1832 → Yates 1846: condensation (semicolon-aware count, #126 promotion: YAT abridges ~9→5.7, not the artifactual 9→1) (overlap 0.26). Apte 1890 → Apte 1957: revision, no expansion (overlap 0.56).

Robustness — the citation effect is edge-concentrated

The H2 chart above is the unadjusted gap, and it is fragile. Cited ancestor senses are not spread across the panel: Wilson 1832 carries almost no <ls> citations, so 82 of 84 cited senses sit on a single edge — Apte 1890 → 1957 (the other two edges have one cited sense each). That concentration makes the pooled controlled regression unreliable: refitting survived ~ cited + centrality controls + edge fixed effects (lemma-cluster-robust) gives a citation OR of 3.0 (p = 0.011), but the estimate moves with unrelated parsing choices — the YAT semicolon promotion (#126) alone shifted it from ≈1.75 to ≈3.0 without touching a single cited sense. The pooled OR is therefore not a trustworthy citation effect.

The clean test is within the one citation-bearing edge, where parsing of the others is irrelevant. On Apte 1890 → 1957: cited senses survive at 0.768 (n = 82), uncited at 0.661 (n = 221) — a two-proportion z = 1.80, p = 0.072, not significant. So citation co-varies with survival but is not established as an independent predictor; a larger, multi-edge panel with citations on more than one edge is needed. (h2Controlled.withinPrimaryEdge + citedByEdge in data/lexico/r2_h2h3.json.)

Threshold sensitivity (referee item M4). The pooled OR is significant at every Jaccard cutoff from 0.10 to 0.25 — but that stability just inherits the edge-concentration fragility above; it is not evidence for H2. (Note the naive gap itself collapsed to ~0.03–0.06 once YAT is counted properly, from ~0.17 before.)

Jaccard cutoffcited rateuncited ratenaive gappooled ORpooled p
0.100.7980.7400.0582.320.038
0.1250.7860.7300.0552.640.028
0.15 (ref)0.7620.7050.0573.040.011
0.1750.7380.6960.0422.580.025
0.200.7380.6940.0442.840.017
0.250.7140.6860.0282.700.024

Verdict. Treat the pooled OR as unreliable (edge-concentrated); the honest H2 signal is the within-Apte-edge test — suggestive, not significant (z = 1.80, p = 0.072). Generators: h2Controlled / h2ThresholdSensitivity in data/lexico/r2_h2h3.json. (The generated "H2 supported" summary above reflects the unadjusted pooled rates only.)