Talk:XML/Input: Difference between revisions

→‎Interpreting XML?: Handling HTML numeric codes should be a separate task.
::This task should definitely require structured XML parsing. We already have [[Web Scraping]] for more ad-hoc methods. To aid this, I would change the XML to something less trivial. --[[User:IanOsgood|IanOsgood]] 19:04, 1 June 2009 (UTC)
:::I added a <del>entity</del> numeric character reference, since XML processors in general need to be able to handle &amp; and the full character set. --[[User:Kevin Reid|Kevin Reid]] 00:44, 2 June 2009 (UTC)
::::Are you suggesting that the program should convert HTML entities and numeric references into some character encoding? I think that should be a separate task. And, AFAIK, it is HTML-specific, not XML. --[[User:PauliKL|PauliKL]] 09:03, 2 June 2009 (UTC)
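:::::For reference on the point under discussion, here is a minimal sketch of what a structured parser does with such input. It assumes Python's standard <code>xml.etree.ElementTree</code>; the sample document, the attribute names, and the helper <code>student_names</code> are made up for illustration and are not the task's actual data. Numeric character references and the predefined <code>&amp;amp;</code> entity are part of XML itself, so a conforming parser resolves them without any extra decoding step:

```python
import xml.etree.ElementTree as ET

# Hypothetical sample input (not the task's real data): the Name attribute
# uses a numeric character reference, the Pet attribute uses &amp;.
SAMPLE = (
    '<Students>'
    '<Student Name="&#x00C9;mily" Pet="Tom &amp; Jerry" />'
    '</Students>'
)


def student_names(xml_text):
    """Return the Name attribute of every Student element.

    The parser resolves character references while parsing, so the
    returned strings already contain the decoded characters.
    """
    root = ET.fromstring(xml_text)
    return [student.get("Name") for student in root.iter("Student")]


if __name__ == "__main__":
    for name in student_names(SAMPLE):
        print(name)
```

:::::Named HTML entities such as <code>&amp;eacute;</code> are a different matter: they are not predefined in XML, so this sketch does not attempt to handle them.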