Talk:WiktionaryDumps to words: Difference between revisions

other languages and linux packages
(other languages and linux packages)
Line 23:
::: --[[User:Thundergnat|Thundergnat]] ([[User talk:Thundergnat|talk]]) 21:57, 9 December 2020 (UTC)
:::: Because we can not use the DOM method to parse 800MB of XML, we need to use the SAX method then. Most languages provide 2 different API for SAX and DOM XML parsing, but maybe not all. [[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 09:18, 10 December 2020 (UTC)
 
==A common task==
 
You can see on this post that some people are wondering how to do this task:<br>
https://unix.stackexchange.com/questions/48939/add-new-language-to-usr-share-dict-words
The '''wordlist''' package in Debian don't seem to provide that many languages:<br>
https://packages.debian.org/fr/sid/wordlist
If we modify the ocaml script replacing "==French==" by "==Indonesian==" we can produce the word list for the Indonesian language quite easily.<br>
-- [[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 13:10, 10 December 2020 (UTC)