Talk:WiktionaryDumps to words: Difference between revisions

From Rosetta Code
Content added Content deleted
(add a comment)
(some of my own questions)
Line 7: Line 7:
: The task, as explained, is to create a file equivalent than [http://manpages.ubuntu.com/manpages/bionic/man5/french.5.html "/usr/share/dict/french"] (output), using the [https://dumps.wikimedia.org/enwiktionary/latest/enwiktionary-latest-pages-articles.xml.bz2 wiktionary dump] as input. [[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 19:27, 9 December 2020 (UTC)
: The task, as explained, is to create a file equivalent than [http://manpages.ubuntu.com/manpages/bionic/man5/french.5.html "/usr/share/dict/french"] (output), using the [https://dumps.wikimedia.org/enwiktionary/latest/enwiktionary-latest-pages-articles.xml.bz2 wiktionary dump] as input. [[User:Blue Prawn|Blue Prawn]] ([[User talk:Blue Prawn|talk]]) 19:27, 9 December 2020 (UTC)
:: I have no desire to download an 800 megabyte compressed file for a Rosetta Code task that is who-knows-how-large uncompressed. Surely the task doesn't need to use a file that large. --[[User:Chunes|Chunes]] ([[User talk:Chunes|talk]]) 20:41, 9 December 2020 (UTC)
:: I have no desire to download an 800 megabyte compressed file for a Rosetta Code task that is who-knows-how-large uncompressed. Surely the task doesn't need to use a file that large. --[[User:Chunes|Chunes]] ([[User talk:Chunes|talk]]) 20:41, 9 December 2020 (UTC)

::: I too have some questions.
::# What does wiktionary have to do with the task? Would any XML encoded word list do? If so, why does the task name include wiktionary?
::# Is the task supposed to show how to download and extract a large file in your particular language? The reference implementation just shells out and uses other tools.
::# If the task is just extract a certain group of entries from an XML file, how does it differ significantly from [[XML/XPath]]?
::: --[[User:Thundergnat|Thundergnat]] ([[User talk:Thundergnat|talk]]) 21:57, 9 December 2020 (UTC)

Revision as of 21:57, 9 December 2020

Too vague

"Demonstrate how your language can handle this dump"? How?

You need to write a task where all examples are doing one shared thing that is comparable as a feature of those languages implementation of the task. If you mean to highlight one type of XML handling over another then this doesn't do it, for example. --Paddy3118 (talk) 10:00, 9 December 2020 (UTC)

The task, as explained, is to create a file equivalent than "/usr/share/dict/french" (output), using the wiktionary dump as input. Blue Prawn (talk) 19:27, 9 December 2020 (UTC)
I have no desire to download an 800 megabyte compressed file for a Rosetta Code task that is who-knows-how-large uncompressed. Surely the task doesn't need to use a file that large. --Chunes (talk) 20:41, 9 December 2020 (UTC)
I too have some questions.
  1. What does wiktionary have to do with the task? Would any XML encoded word list do? If so, why does the task name include wiktionary?
  2. Is the task supposed to show how to download and extract a large file in your particular language? The reference implementation just shells out and uses other tools.
  3. If the task is just extract a certain group of entries from an XML file, how does it differ significantly from XML/XPath?
--Thundergnat (talk) 21:57, 9 December 2020 (UTC)