Web scraping: Difference between revisions

Content added Content deleted
(→‎Robust: Avoid obsolete ! notation for opening command pipe.)
(→‎Naive: Likewise.)
Line 1,706: Line 1,706:
Skip stuff until a line beginning with <code>&lt;BR&gt;</code> has some stuff before "UTC", and capture that stuff:
Skip stuff until a line beginning with <code>&lt;BR&gt;</code> has some stuff before "UTC", and capture that stuff:


<lang txr>@(next `!wget -c http://tycho.usno.navy.mil/cgi-bin/timer.pl -O - 2> /dev/null`)
<lang txr>@(next @(open-command "wget -c http://tycho.usno.navy.mil/cgi-bin/timer.pl -O - 2> /dev/null"))
@(skip)
@(skip)
<BR>@time@\ UTC@(skip)
<BR>@time@\ UTC@(skip)