Home Messages Index
[Date Prev][Date Next][Thread Prev][Thread Next]
Author IndexDate IndexThread Index

Re: How to find out how many pages a site has using Google?

  • Subject: Re: How to find out how many pages a site has using Google?
  • From: Roy Schestowitz <newsgroups@schestowitz.com>
  • Date: Sun, 20 Feb 2005 02:07:45 +0000
  • Newsgroups: alt.internet.search-engines
  • References: <opsmem6z0br3xrds@cinza> <1108747418.399549.195260@o13g2000cwo.googlegroups.com> <cv58ln$2nig$1@godfrey.mcc.ac.uk> <Xns96017940813E7castleamber@130.133.1.4> <mxrRd.250$DW.175@newssvr17.news.prodigy.com> <797d115rlkshocd4dha2asnbgk3k1q7kqf@4ax.com> <Xns9602B71771EFcastleamber@130.133.1.4> <ubud11d7pbev9cfo54o00hntlvisvqp4pm@4ax.com> <Xns96027B096CD01castleamber@130.133.1.4>
  • User-agent: KNode/0.7.2
John Bokma wrote:

> Big Bill wrote:
> 
>> On 19 Feb 2005 07:07:29 GMT, John Bokma <postmaster@castleamber.com>
>> wrote:
>> 
>>>Big Bill wrote:
>>>
>>>> On Fri, 18 Feb 2005 19:41:06 GMT, "music" <krill@fishfood4everXYZ.org>
>>>> wrote:
>>>
>>>[ snip ]
>>>
>>>>>I use a tool like Xenu to scan an entire site. IF all pages have a
>>>>>link route from the index, it catches them all.
>>>>>
>>>>>Mike
>>>> 
>>>> How does it find orphan files then?
>>>
>>>Not, as Mike already wrote (IF etc.)
>> 
>> Then it won't scan entire sites.
> 
> Depends on the definition of an entire site :-D. Since it's impossible to
> locate all orphan pages there simply doesn't exists a program that can do
> this using HTTP.
> 
> If you own the site, use ls.

ls -la -R >tmp; wc tmp

Would give a quick approximation.

-- 
Roy Schestowitz
http://schestowitz.com

[Date Prev][Date Next][Thread Prev][Thread Next]
Author IndexDate IndexThread Index