Home Messages Index
[Date Prev][Date Next][Thread Prev][Thread Next]
Author IndexDate IndexThread Index

Re: Save an entire web site

  • Subject: Re: Save an entire web site
  • From: Roy Schestowitz <newsgroups@xxxxxxxxxxxxxxx>
  • Date: Mon, 06 Mar 2006 11:28:17 +0000
  • Newsgroups: uk.comp.os.linux
  • Organization: schestowitz.com / MCC / Manchester University
  • References: <dueitd$oir$1@nwrdmz02.dmz.ncs.ea.ibs-infra.bt.com> <440acf25$0$21368$db0fefd9@news.zen.co.uk> <duejut$7ce$1@nwrdmz03.dmz.ncs.ea.ibs-infra.bt.com> <7hj0YXD+VBDEFwGO@davidm.demon.co.uk>
  • Reply-to: newsgroups@xxxxxxxxxxxxxxx
  • User-agent: KNode/0.7.2
__/ [ David Morris ] on Monday 06 March 2006 10:57 \__

> Pete Bradbury once wrote in <duejut$7ce$1@xxxxxxxxxxxxxxxxxxxxxxxx
> infra.bt.com>...
>>Dave Pickles wrote:
>>> Pete Bradbury wrote:
>>>
>>>>Is there a linux utility to save an entire web site to disc?
>>>>
>>>>I've found a few commercial 'doze applications, but not for linux?
>>>   man wget.
> 
> You may need to change the browser identification that wget uses. Some
> sites prevent wget in its default configuration from slurping whole
> sites.


The nice way:

wget -r -l99 -t99 -N http://[<username>:<password>@]<site>[:<port>]

The devilish way:

wget -r -l99 -t99 -N --user-agent=<agent-string> -erobots=off
http://[<username>:<password>@]<site>[:<port>]


Hope it helps,

Roy

-- 
Roy S. Schestowitz
http://Schestowitz.com  |    SuSE Linux     |     PGP-Key: 0x74572E8E
 11:20am  up 5 days  6:58,  9 users,  load average: 0.90, 0.68, 0.68
      http://iuron.com - Open Source knowledge engine project

[Date Prev][Date Next][Thread Prev][Thread Next]
Author IndexDate IndexThread Index