Rosetta Code:Village Pump/WiktionaryDumps: Difference between revisions

From Rosetta Code
Content added Content deleted
(added Vptopic wiki code)
(fixed dead link)
 
(One intermediate revision by the same user not shown)
Line 5: Line 5:


I think it would be nice to do some tasks with the wiktionary dumps:<br>
I think it would be nice to do some tasks with the wiktionary dumps:<br>
[https://dumps.wikimedia.org/enwiktionary/20190420/ https://dumps.wikimedia.org/enwiktionary/20190420/]<br>
[https://dumps.wikimedia.org/enwiktionary/latest/ https://dumps.wikimedia.org/enwiktionary/latest/]<br>
The complete dump is a big bz2'ed XML file of about 800MB so it's quite interesting to see how to handle it!<br>
The complete dump is a big bz2'ed XML file of about 800MB so it's quite interesting to see how to handle it!<br>
but I lack imagination right now to find an interesting task, so I post here in hope to get more inspired people :)<br>
but I lack imagination right now to find an interesting task, so I post here in hope to get more inspired people :)<br>
[[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 17:44, 30 April 2019 (UTC)
Maybe a simple translation program, create a database with English and French words for example and a utility to get translations from a given word.<br>
:Maybe a simple translation program, create a database with English and French words for example and a utility to get translations from a given word.<br>
[[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]])
:[[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 17:44, 30 April 2019 (UTC)

:Maybe recreate the kind of files like "/usr/share/dict/french" or [https://users.cs.duke.edu/~ola/ap/linuxwords linuxwords]
:[[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 17:44, 30 April 2019 (UTC)

Latest revision as of 05:03, 1 November 2019

WiktionaryDumps
This is a particular discussion thread among many which consider Rosetta Code.

Summary

Trying to do something with Wiktionary dumps

Discussion

I think it would be nice to do some tasks with the wiktionary dumps:
https://dumps.wikimedia.org/enwiktionary/latest/
The complete dump is a big bz2'ed XML file of about 800MB so it's quite interesting to see how to handle it!
but I lack imagination right now to find an interesting task, so I post here in hope to get more inspired people :)
Blue Prawn (talk) 17:44, 30 April 2019 (UTC)

Maybe a simple translation program, create a database with English and French words for example and a utility to get translations from a given word.
Blue Prawn (talk) 17:44, 30 April 2019 (UTC)
Maybe recreate the kind of files like "/usr/share/dict/french" or linuxwords
Blue Prawn (talk) 17:44, 30 April 2019 (UTC)