Rosetta Code:Village Pump/WiktionaryDumps: Difference between revisions
Latest revision as of 05:03, 1 November 2019
WiktionaryDumps
This is a particular discussion thread among many which consider Rosetta Code.
Summary
Trying to do something with Wiktionary dumps
Discussion
I think it would be nice to do some tasks with the Wiktionary dumps:
https://dumps.wikimedia.org/enwiktionary/latest/
The complete dump is a big bzip2-compressed XML file of about 800 MB, so it would be quite interesting to see how to handle it!
But I lack the imagination right now to come up with an interesting task, so I'm posting here in the hope of hearing from more inspired people :)
Blue Prawn (talk) 17:44, 30 April 2019 (UTC)
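One way to handle a compressed XML file of that size is to stream it rather than load it whole. The following is only a rough sketch under some assumptions: that the dump follows the usual MediaWiki export layout (`<page>` elements containing a `<title>` and a revision `<text>`), and that discarding finished pages is enough to keep memory flat. The names `iter_pages` and `local_name` are made up for illustration.

```python
import bz2
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(source):
    """Yield (title, wikitext) for each <page> in a bz2-compressed dump.

    `source` may be a file path or a binary file object.
    Assumes the MediaWiki export layout: <page><title>..<revision><text>..
    """
    with bz2.open(source, "rb") as stream:
        context = ET.iterparse(stream, events=("start", "end"))
        _event, root = next(context)  # first event is the start of the root
        for event, elem in context:
            if event != "end" or local_name(elem.tag) != "page":
                continue
            title, text = None, ""
            for child in elem.iter():
                name = local_name(child.tag)
                if name == "title":
                    title = child.text
                elif name == "text":
                    text = child.text or ""
            yield title, text
            root.clear()  # discard pages already handled so memory stays flat
```

Decompression and parsing both happen incrementally here, so the caller can loop over `iter_pages("some-dump.xml.bz2")` (a placeholder file name) without the uncompressed XML ever being held in memory at once.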
- Maybe a simple translation program: create a database of English and French words, for example, and a utility to look up the translations of a given word.
- Blue Prawn (talk) 17:44, 30 April 2019 (UTC)
- Maybe recreate word-list files like "/usr/share/dict/french" or linuxwords (https://users.cs.duke.edu/~ola/ap/linuxwords)
- Blue Prawn (talk) 17:44, 30 April 2019 (UTC)
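Both suggestions could start from something like the sketch below, which works on an iterable of (title, wikitext) pairs. It rests on assumptions about how English Wiktionary entries are written: that each language gets a level-2 heading such as `==French==`, and that translations appear in templates of the form `{{t|fr|word}}` or `{{t+|fr|word}}`. Other template variants are ignored, and the function names are hypothetical.

```python
import re

# Level-2 heading such as "==French==" (assumed entry layout);
# deeper headings like "===Noun===" are deliberately not matched.
LANGUAGE_HEADING = re.compile(r"^==\s*([^=].*?)\s*==\s*$", re.MULTILINE)


def words_in_language(pages, language):
    """Return the sorted titles that have a section for `language`."""
    words = set()
    for title, text in pages:
        if language in LANGUAGE_HEADING.findall(text):
            words.add(title)
    return sorted(words)


def translations(pages, lang_code):
    """Map each title to the words found in {{t|code|...}} / {{t+|code|...}}.

    Assumption: only these two template names are recognised.
    """
    pattern = re.compile(r"\{\{t\+?\|" + re.escape(lang_code) + r"\|([^|}]+)")
    table = {}
    for title, text in pages:
        found = pattern.findall(text)
        if found:
            table[title] = found
    return table
```

Writing the result of `words_in_language(pages, "French")` one word per line would give a file in the spirit of "/usr/share/dict/french", while the dictionary returned by `translations(pages, "fr")` could be stored in whatever database the task ends up choosing.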