Home Messages Index
[Date Prev][Date Next][Thread Prev][Thread Next]
Author IndexDate IndexThread Index

Re: testing depth of indeing of page

  • Subject: Re: testing depth of indeing of page
  • From: Roy Schestowitz <newsgroups@xxxxxxxxxxxxxxx>
  • Date: Fri, 05 May 2006 07:17:34 +0100
  • Newsgroups: alt.internet.search-engines
  • Organization: schestowitz.com / MCC / Manchester University
  • References: <m84l52dmpsm3p0n4f4d68l7bncacedtcg0@4ax.com> <Xns97B9EEF7DC251castleamber@130.133.1.4>
  • Reply-to: newsgroups@xxxxxxxxxxxxxxx
  • User-agent: KNode/0.7.2
__/ [ John Bokma ] on Friday 05 May 2006 05:29 \__

> Big Bill <kruse@xxxxxxxxxxxxxxx> wrote:
> 
>> http://www.sitepoint.com/article/indexing-limits-where-bots-stop
> 
> Thanks. Did this question pop up here recently? I knew it was well over
> 300K (Google), which is more then sufficient IMO.

That's intersting indeed. Thanks, Bill. I have always imagined it would be
somewhere  around  100KB  since,  beyond  this  stage,  the  user  can  be
dissatisfied  with  the  referrals. There remain many CMS's that  fail  to
apply paging to comments (it's being worked on in WordPress, as a plug-in,
I  believe).  Have you ever opened a popular Digg or Slashdot  page?  Even
with  comment  folding, these pages become monsters that are hard to  open
within  a reasonable amount of time. They also devour memory.  Thresholds,
if  comment  moderation  exists,  do not help much either;  not  in  their
default setup anyway.

Other  things  to  ponder: how many links in a page can  be  honoured  and
therefore  followed?  How  are  long pages valued? How  does  that  affect
keyword density? E.g. is it normalised by the total amount of words?...


Roy

-- 
Roy S. Schestowitz      | "World ends in five minutes - please log out"
http://Schestowitz.com  |     GNU/Linux     ¦     PGP-Key: 0x74572E8E
  7:10am  up 7 days 14:07,  13 users,  load average: 0.59, 0.67, 0.64
      http://iuron.com - next generation of search paradigms

[Date Prev][Date Next][Thread Prev][Thread Next]
Author IndexDate IndexThread Index