Skip to main content

CDSL 2022 Year in Review

2022 was the first year with a full run of monthly newsletters, and it reflected a project at full pace: a new dictionary was added, Monier-Williams received major manual accent corrections, Boehtlingk's Indische Sprüche proofreading was completed, and high-quality color scans replaced earlier grayscale pages.

LRV (L.R. Vaidya) added to CDSL

The L.R. Vaidya Sanskrit-English Dictionary (LRV) was integrated into the CDSL in October 2022: the source data was added to csl-orig, feminine and neuter headwords were derived programmatically, scans were uploaded, and the dictionary was made searchable on the Cologne website. This was a meaningful expansion of the corpus, adding a widely used mid-century Sanskrit reference work.

Monier-Williams accent corrections

Jim Funderburk undertook an extensive manual correction of accent marks throughout the MW dictionary from October through December 2022 — one of the most painstaking data quality efforts in the project's history. Hundreds of accent errors were identified and corrected against the print edition, and the mwauth.txt file was retired in favour of the more complete tooltip.txt as the primary source for MW authority abbreviations. New colored scan pages for MW replaced the earlier grayscale images.

Indische Sprüche proofreading completed

Proofreading of all three volumes of Boehtlingk's Indische Sprüche (BOESP) was completed during 2022. The effort involved Sampada, Andhrabharati, Thomas, and Jim Funderburk working through the text systematically. Links to BOESP entries from PW and PWG were added, making the Indische Sprüche cross-references in Böhtlingk's dictionaries clickable for the first time.

New scans and data additions

High-quality new scans were added for SKD (Śabdakalpadruma) and for several MW volumes. Literary source corrections were made throughout PW and PWG, including corrections for Spr. (II) entries and missing literary sources. User-submitted corrections from across the year were installed in batches, covering issues 985–1057 in the csl-orig tracker.

PyCDSL: third-party Python library

The third-party PyCDSL library (by Hrishikesh Terdalkar) matured this year, providing a Python interface to the Cologne API and data. It became a reference in the project's tooling documentation as a recommended way to access CDSL data programmatically from Python applications.


To receive future editions by email, subscribe here.