Scandinavian dictionary crosslinker

Finds shared entries in the sitemaps of linguistically related dictionaries and builds a mapping of relations that lets each dictionary crosslink to related sources. Having the same word in multiple dictionaries of different languages is usually not very helpful, but the Scandinavian languages from the 8th to the 16th century are closely enough related that such cross-references are useful.

Parses sources from the following dictionary projects:

The parser finds over 1 000 entries that are present in all four dictionaries. There are also over 20 000 entries that appear in at least two different dictionaries, making them worth a crosslink.
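The counting described above can be illustrated with a minimal Go sketch. It is not the project's actual implementation: the function name `buildCrosslinks`, the dictionary labels, and the sample words are all hypothetical, and the sketch assumes each dictionary has already been reduced to a list of headwords.

```go
package main

import (
	"fmt"
	"sort"
)

// buildCrosslinks (hypothetical helper) maps each headword to the sorted
// names of the dictionaries that contain it, keeping only headwords found
// in at least minSources dictionaries.
func buildCrosslinks(sources map[string][]string, minSources int) map[string][]string {
	// seen[word] is the set of dictionaries in which the word appears.
	seen := make(map[string]map[string]bool)
	for dict, words := range sources {
		for _, w := range words {
			if seen[w] == nil {
				seen[w] = make(map[string]bool)
			}
			seen[w][dict] = true
		}
	}

	links := make(map[string][]string)
	for w, dicts := range seen {
		if len(dicts) < minSources {
			continue // not shared widely enough to be worth a crosslink
		}
		names := make([]string, 0, len(dicts))
		for d := range dicts {
			names = append(names, d)
		}
		sort.Strings(names) // deterministic output order
		links[w] = names
	}
	return links
}

func main() {
	// Made-up sample data; the real input comes from the downloaded sitemaps.
	sources := map[string][]string{
		"dict-a": {"hestr", "konungr", "skip"},
		"dict-b": {"hestr", "skip"},
		"dict-c": {"hestr"},
	}
	links := buildCrosslinks(sources, 2)
	fmt.Println(len(links), links["hestr"], links["skip"])
	// prints: 2 [dict-a dict-b dict-c] [dict-a dict-b]
}
```

With `minSources` set to the number of dictionaries, the same function would yield only the entries present in every source.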

Install

yarn add scandinavian-dictionary-crosslinker

Download sitemaps

Run `cargo run` in the downloader folder. This downloads the latest XML sitemaps to the resources folder.

Generate crosslinks

Run `go run *.go` in the crosslinks folder. This generates the crosslinks JSON in the resources folder.
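As a rough illustration of the input to this step, the sketch below reads `<loc>` URLs from a standard XML sitemap using Go's `encoding/xml` and takes the last path segment of each URL as the entry. The function name `entrySlugs`, the example.com URLs, and the assumption that the last path segment identifies the entry are all hypothetical; the real dictionaries may structure their URLs differently.

```go
package main

import (
	"encoding/xml"
	"fmt"
	"net/url"
	"path"
	"strings"
)

// urlSet mirrors the parts of a standard sitemap that matter here:
// <urlset><url><loc>...</loc></url>...</urlset>
type urlSet struct {
	URLs []struct {
		Loc string `xml:"loc"`
	} `xml:"url"`
}

// entrySlugs (hypothetical helper) returns the last path segment of every
// <loc> URL in the sitemap, skipping URLs without a usable path.
func entrySlugs(sitemap []byte) ([]string, error) {
	var set urlSet
	if err := xml.Unmarshal(sitemap, &set); err != nil {
		return nil, err
	}
	slugs := make([]string, 0, len(set.URLs))
	for _, u := range set.URLs {
		parsed, err := url.Parse(strings.TrimSpace(u.Loc))
		if err != nil {
			continue // ignore malformed URLs
		}
		slug := path.Base(parsed.Path)
		if slug == "." || slug == "/" {
			continue // site root or empty path, not an entry page
		}
		slugs = append(slugs, slug)
	}
	return slugs, nil
}

// sample is made-up data in the standard sitemap format.
const sample = `<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/word/hestr</loc></url>
  <url><loc>https://example.com/word/konungr</loc></url>
  <url><loc>https://example.com/</loc></url>
</urlset>`

func main() {
	slugs, err := entrySlugs([]byte(sample))
	if err != nil {
		panic(err)
	}
	fmt.Println(slugs) // prints: [hestr konungr]
}
```

Slug lists produced this way, one per dictionary, are the kind of input from which a shared-entry mapping can be built.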

Minify output

Run `nimble build` and then `./minifier` in the minifier folder. This generates minified and gzipped JSON outputs.

Update data in NPM module

Run `go run main.go` in the root folder to update the JSON and readme in the NPM module.