MW — Monier-Williams Sanskrit-English Dictionary (1899)

Chapter authored per Decision 29 Tier A. Position 1 in the atlas ordering — the framework's home dictionary.

Trust Block

1. Overview

The standard single-volume Sanskrit–English reference, edited by Monier Monier-Williams (Boden Professor of Sanskrit at Oxford 1860–88) with contributions from Ernst Leumann and Carl Cappeller. Published 1899 (Clarendon Press, Oxford) as a substantial revision of the 1872 first edition. 286,561 records in the CDSL digital edition (compared to ~180,000 entries in PWG, the proximate German predecessor; ~44,000 in WIL 1832). Single-volume format, ~1,333 pages, alphabetical macrostructure. MW is the reference dictionary for this atlas — all microanalytic constructs (block, slot, profile, hedge, infrastructure) were developed on MW first; the other 8 chapters test how far each construct generalises.

Records 286,561
Volumes 1 (single-volume)
Year 1899 (2nd edn; 1st edn 1872)
Editor(s) Monier-Williams; with Leumann + Cappeller (2nd edn)
Source language Sanskrit (SLP1-encoded in mw.txt)
Target language English
Genre Structured bilingual scholarly dictionary
Modal blocks/entry 6 (digital edition; 5 in print, sans F17 <info>)
<ls> citations total 311,932
<ls> citations/record 1.09
License CC-BY-SA-4.0
Repo sanskrit-lexicon/MWS
Source file csl-orig/v02/mw/mw.txt (48.9 MB)
CDSL display sanskrit-lexicon.uni-koeln.de/scans/MWScan/2020

2. Profile table (8+1 primary types × 18 blocks)

The full 8+1 primary-type profile for MW (per the refactored typology in PAPER.md §5):

Primary type Count % corpus Avg blocks % lex-hedged % Vedic-acc.
Root 750 0.26% 9.73 7% 8%
Nominal (m/f/n) ≈ 37,700 ≈ 13.2% 6.05–6.43 12–21% 18–24%
Adjective 12,240 4.27% 6.25 4% 11%
Indeclinable 1,929 0.67% 6.39 2% 9%
Compound sub-entry 126,360 44.10% 6.02 13% 14%
Derived form 72,119 25.17% 5.73 16% 19%
Continuation sub-entry 9,294 3.24% 4.76 21% 12%
Encyclopedic — botanical 8,059 2.81% 7.28 72% 6%
Encyclopedic — biographical 346 0.12% 7.58 65% 16%
Verbal lemma (9th type, added 2026-05-27 per D18 audit) 7,502 2.77%

Reading: compounds and derived forms together = 69 % of the corpus and define MW's surface; roots are 0.26 % of entries but receive 9.7 blocks of editorial apparatus (the most elaborate profile); encyclopedic entries are 65–72 % lex-hedged (kosha-only).

Block-presence by name (cross-dict-common subset):

3. Citation density and apparatus

1.09 <ls> per record (311,932 citations across 286,561 records) — lowest density of any structured bilingual CDSL dict (cross-dict audit), and ~4× sparser than PWG (4.63 per record). The sparseness is a single-volume print-economic constraint, not a sign of poor coverage.

Top 12 <ls> sigla:

Tag Source Count
<ls>L.</ls> Generic lexicographer hedge (see §4) 40,212
<ls>MBh.</ls> Mahābhārata 22,990
<ls>ib.</ls> ibidem (editorial back-reference) 10,094
<ls>RV.</ls> Ṛgveda 9,707
<ls>R.</ls> Rāmāyaṇa (Vālmīki) 9,049
<ls>W.</ls> Wilson 1832 (editorial back-cite) 8,285
<ls>BhP.</ls> Bhāgavata-purāṇa 6,979
<ls>Kathās.</ls> Kathāsaritsāgara 5,926
<ls>MW.</ls> MW's own earlier editorial entry 5,710
<ls>Suśr.</ls> Suśrutasaṃhitā 5,690
<ls>ŚBr.</ls> Śatapatha-brāhmaṇa 5,493
<ls>Cat.</ls> Catalogues (kosha back-cite) 5,302

The 40,212 <ls>L.</ls> hedges (12.9 % of all citations) are the single largest tag and the structural-distinguishing feature of MW within CDSL — see §4. After the hedge, the apparatus is concentrated on three corpora: Sanskrit epics (MBh., R., 32k), Vedic-corpus (RV., ŚBr., ~15k), and editorial-internal back-references (ib., W., MW., Cat., ~29k). Locator coordinates appear in 15.1 % of <ls> tags; the rest are bare work-citations.

4. Hedge analysis — the <ls>L.</ls> construct

The hedge is the entry-level evidentiary marker absent from PWG, PWK, WIL, BEN, CAE, SKD, VCP, and present 1× in AP. In MW it appears 40,212 times across primary types:

Primary type % entries lex-hedged
Encyclopedic — botanical 71.5 %
Encyclopedic — biographical 64.7 %
Continuation sub-entry 21.1 %
Derived form 16.2 %
Compound sub-entry 13.1 %
Nominal (m / f / n) 18.9 / 20.8 / 12.2 %
Adjective 4.4 %
Indeclinable 2.2 %
Root 7.2 %

72 % of botanical entries and 65 % of biographical entries carry the hedge — MW's botanical and onomastic identifications are predominantly kosha-only, a structural fact about the dictionary that the original MW prefaces do not advertise.

Three-stage lineage (per the 2026-05-27 print-preface read):

Year Source What Scale
1866 Benfey for "no authoritative references" (weaker, methodological) ~900
1872 MW 1st edn Declares L.-convention in Section II of own preface preface only; ≈ 0 in body
1891 Cappeller * for "word taught only by grammarians or lexicographers" (first systematic typographic) 1,370
1899 MW 2nd edn (w/ Cappeller as co-editor) <ls>L.</ls> (first systematic tagged) 40,212

None of the three stages is fully derivative; each adds something. The 40,212 hedges are the high-leverage editorial target — resolving any of them to a named kosha (ARMH, ABCH, ACPH, ACSJ) lifts citation coverage and tightens evidence.

5. Lineage statement

MW occupies the synthesis position in the European Sanskrit-lexicography lineage: it draws on WIL 1832 via the indigenous-kosha tradition (Wilson's base was the Calcutta College's Amarakośa-derived word-list); on PWG 1855–75 via the German philological apparatus and the bulk of the lexical material; and on Cappeller 1891 via the systematic typographic discipline that made the 40,212-instance <ls>L.</ls> apparatus possible. MW's successors are mostly indirect: AP 1957 (Apte) explicitly leans on MW for its primary lemma list and adds its own editorial layer; modern Sanskrit-English dictionaries derived from CDSL reuse MW data wholesale.

Full lineage narrative in DICT_PROFILE.md § Lineage.

6. Cross-references — divergence/convergence with adjacent chapters

Adjacent chapter Convergence Divergence
← prior: (none — MW is position 1) MW is the framework's home; cross-dict facts emerge in chapters 2–9
next →: PWG Same kernel morphology (small modal + long tail); PWG is the immediate source of much MW lexical material PWG is ~4× more citation-dense (4.63 vs 1.09 <ls>/record); multi-volume vs single-volume; PWG has 0 hedges (uses named-kosha apparatus instead); PWG type-citation spread 0.4 pts vs MW's 11.3 pts

7. Decisions log

Per-dict editorial choices documented in this chapter:

8. Data dictionary + reproducibility manifest

See also (tools)


Source: CDSL mw.txt 2026-05-23 · MWS docs-pass commit reflects audit pipeline as of 2026-05-27 · CC-BY-SA-4.0