Re: [News] Linux Reciprocity is a Major Merit

Subject: Re: [News] Linux Reciprocity is a Major Merit
From: "[H]omer" <spam@xxxxxxx>
Date: Fri, 16 Mar 2007 16:34:51 +0000
In-reply-to: <dnlqc4-21o.ln1@ellandroad.demon.co.uk>
Newsgroups: comp.os.linux.advocacy
Organization: Slated.org
References: <2179247.mT9dLdKHgq@schestowitz.com> <7q8oc4-165.ln1@ellandroad.demon.co.uk> <1663547.NfZRGtPRV2@schestowitz.com> <qo0qc4-28p.ln1@ellandroad.demon.co.uk> <34548891.OefCp5DZZk@schestowitz.com> <dnlqc4-21o.ln1@ellandroad.demon.co.uk>
User-agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.8.0.10) Gecko/20070302 Fedora/1.5.0.10-1.fc6 pango-text Thunderbird/1.5.0.10 Mnenhy/0.7.4.666
Xref: ellandroad.demon.co.uk comp.os.linux.advocacy:505499

Verily I say unto thee, that Mark Kent spake thusly:

> Has anyone tried to do anything like this already and perhaps has
> solutions for these issues?

How about running this on a leafnode spool?

######
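#!/usr/bin/perl
# parse-urls.pl - find URIs in the input and dump each page's source via lynx

use strict;
use warnings;
use URI::Find;

my @uris;

# The callback records each URI found and returns the original text,
# so the scanned line itself is left unchanged.
my $finder = URI::Find->new(
  sub {
    my($uri, $orig_uri) = @_;
    push @uris, "$uri";
    return $orig_uri;
  });

while (my $text = <>) {
  $finder->find(\$text);
}

# system() rather than exec(), so the script carries on after the first
# fetch; the list form keeps the URI away from the shell.
for my $uri (@uris) {
  system('lynx', '-source', $uri) == 0
    or warn "lynx failed for $uri\n";
}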
######

 - http://search.cpan.org/dist/URI-Find/

I'll play around with this, and see about adding URI verification.
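
As a rough idea of what that verification might look like, here is a minimal sketch using only core Perl. The helper name looks_fetchable() is made up for illustration, and the rule it applies (absolute http/https only, plain hostname, no whitespace or quoting characters) is just one conservative assumption, not a full URI validator:

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not part of URI::Find): accept only absolute
# http/https URIs with a plain hostname, an optional port, and a
# path/query free of whitespace and quoting characters.
sub looks_fetchable {
  my ($uri) = @_;
  return 0 unless defined $uri;
  return $uri =~ m{\A https?:// [A-Za-z0-9.-]+ (?: :\d+ )?
                   (?: [/?] [^\s<>"'`]* )? \z}xi ? 1 : 0;
}

# Small demonstration on a fixed list of candidates.
for my $candidate ('http://slated.org/',
                   'ftp://example.org/file',
                   'http://bad host/') {
  printf "%-26s %s\n", $candidate,
    looks_fetchable($candidate) ? 'ok' : 'skip';
}
```

Anything rejected is simply skipped rather than handed to lynx; a stricter check (actually probing the URI, say) would sit on top of this.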

Also IMHO the final output should be something like:

Article Name: <html title>
Archive Date: <date fetched>
Article URI : <orig_uri>
Article Body: <output from parse-urls.pl>
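
A minimal sketch of producing that record, assuming the page source has already been fetched into a string. The names html_title() and format_record() are hypothetical, the title is pulled out with a naive regex rather than a real HTML parser, and the example URI is a placeholder:

```perl
#!/usr/bin/perl
use strict;
use warnings;
use POSIX qw(strftime);

# Hypothetical helper: pull the <title> text out of an HTML string.
# A naive regex, good enough for a sketch; returns undef if none found.
sub html_title {
  my ($html) = @_;
  return $html =~ m{<title[^>]*>\s*(.*?)\s*</title>}si ? $1 : undef;
}

# Hypothetical helper: lay out the four fields in the format above,
# with labels padded to a common width.
sub format_record {
  my (%f) = @_;
  return join '',
    map { sprintf "%-12s: %s\n", $_->[0], $f{ $_->[1] } // '(unknown)' }
      [ 'Article Name', 'title' ],
      [ 'Archive Date', 'date'  ],
      [ 'Article URI',  'uri'   ],
      [ 'Article Body', 'body'  ];
}

my $html = '<html><head><title>Example page</title></head>'
         . '<body>...</body></html>';

print format_record(
  title => html_title($html),
  date  => strftime('%Y-%m-%d', gmtime),   # date fetched, in UTC
  uri   => 'http://example.org/article',   # placeholder URI
  body  => $html,
);
```

Missing fields come out as "(unknown)", which leaves room for the author and real posting date later on.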

Getting the *real* posting date for an upstream article is a more
difficult proposition, since that info is not always available.

Also, for a proper citation, the upstream article *author* should be
included, where possible.

-- 
K.
http://slated.org - Slated, Rated & Blogged

.----
| "Future archaeologists will be able to identify a 'Vista Upgrade
| Layer' when they go through our landfill sites" - Sian Berry, the
| Green Party.
`----

Fedora Core release 5 (Bordeaux) on sky, running kernel 2.6.19-1.2288.fc5
 16:32:02 up 25 days,  3:57,  3 users,  load average: 0.50, 0.73, 0.74

