Introduction About Site Map

XML
RSS 2 Feed RSS 2 Feed
Navigation

Main Page | Blog Index

Archive for the ‘General’ Category

The Demise of PageRank

PageRank versus traffic
The number of sites with PageRank 10 is tiny when compared to the number of sites with PageRank 0. Conversely, traffic is largely centralised in sites with a high PR (more details)

A little tour around Google has led me to a questionably out-of-date article from the Register. This Article is 2 years old, but it seems more true than ever these days because scraping and link-related spam/attacks are constantly on the rise.

Google has made no secret of its goal to “understand” the web, an acknowledgement that its current brute-force text index produces search results with little or no context. The popularity of Teoma demonstrates that even a small index can produce superior results for certain kind of searches. Teoma leans on existing classification systems.

While Google relied on PageRankâ„¢ to provide context, all was well. But PageRank is now widely acknowledged to be broken, so new, smarter tricks are required.

Regarded as heresy when we raised the issue last spring, now some of Google’s warmest admirers, MetaFilter’s Matt Haughey and web designer Jason Kottke have acknowledged the problem.

As Gary Stock noted here last May, Google “didn’t foresee a tightly-bound body of wirers. They presumed that technicians at USC would link to the best papers from MIT, to the best local sites from a land trust or a river study – rather than a clique, a small group of people writing about each other constantly. They obviously bump the rankings system in a way for which it wasn’t prepared.”

The intersting fact is that Google themselves acknowledge the problem and I am sure difficulties have intensified, if anything, in the past 2 years. The specific reference to bloggers proves that very point as a new blog is set up every second these days.

Regarding the point about pages being indexed rather than learned from or understood, that is one of the catalysts that led me to starting Iuron. That site has attracted tremendous levels of interest since the idea had been conceived on the night on October 9th. I set up the Web site and made an official announcement the following day. Yesterday I finished a 1-page formal proposal and I contacted the person who is perceived by some as the father of the Semantic Web. He was once my lecturer.

Under Attack (Again)

Referrer spam

Referrer spam count in October logs (click image to see it full-sized)

SOME decent-scale DoS attacks, carried out by hijacked Windows workstations, have returned to the site. This comes after presistent attempts to take down the server, spamming it with URL‘s in the process.

This recent wave of attacks is one which can better be coped with due to experience acquired the last time. Sadly, I suspect that the attackers, whoever they may be, are reading my outcries in the blog and are getting a real kick out of it. Therefore, this will probably be my last post on the topic, however compelling the need to mention it becomes.

Windows XP in CSS

Windows AJAX
Windows XP in AJAX

IF Windows is ever to be used, it should be standards-compliant and perhaps, just perhaps, CSS-based. Have a look at the link. It’s worthwhile, I promise you. Try clicking the “Start” button, then navigating through the menus.

Penguin swaysThat site is an invaluable resource by all means. 5 years down the line when dual-boot no longer is a necessity to most, we will still be able to recall the days when a different operating system prevailed in this world. Since it’s Web-based, no bytes will be consumed on local storage either.

Google Earth and Area 51

Area 51
Area 51 – used to be kept secret

IT was rather interesting to discover that high-resolution aerial footage of Area 51 had been made available via Google Maps (and its sibling toy — Google Earth). You can review a little ‘photographic summary’ from those who have had a prolonged tour from up above. Do so if you do not wish to explore yourself.

In other news, related items were flagged to for me last night: “Google Satellite Photos Worry India Leader

The governments of South Korea and Thailand and lawmakers in the Netherlands have expressed similar concerns.

South Korean newspapers said Google Earth provides images of the presidential Blue House and military bases in the country, which remains technically at war with communist North Korea. The North’s main nuclear facility at Yongbyon is among sites in that country displayed on the service.

UPDATE: I am noticing a similar CNN story, which has just been cited by Slashdot.

“Google takes governmental concerns about Google Earth and Google Maps very seriously. Google welcomes dialogue with governments, and we will be happy to talk to Indian authorities about any concerns they may have,” Frost said in an e-mail statement Saturday.

Permissive Web Hosts

Computer shell
The command-line is the front line

NOT many Linux hosts permit SSH access to one’s own Webspace. There are concerns that the extra freedom and flexibility of the *NIX shell might lead to servers going down. If servers are shared, the problem is somewhat magnified and the threat is just not worth it.

There are, however, workarounds if you use cPanel as I revealed hack. I have also recently been told about PHP Console by its author. That shell can be installed on your Webspace and allow you to do what you otherwise could not.

It is neither the time nor the place to post promotional links, but I am very satisfied with my host that helped me combat the recent zombie attack on 2 of my sites. The attacks have now ended fully and I removed all my defences, namely filtering through re-directions. If you experienced some locational oddities recently, this must have been the cause.

Think, Let the PDA Do the Rest

The post is addressed to readers who are not using a PDA yet. I would like to briefly describe the advantages of life management that is assisted by pocket-sized computing.

The acquisition of a first PDA, may it be a Palm, a Pocket PC or a Zaurus (there are many more), involves transformation of long-standing habits. It is an evolving process — an adaptation if you like — of what you have become accustomed to since childhood. Tasks are being managed and flagged by a machines rather than the brain. Then again, many of us use notes to manage and organise our lives, so the transition may be primarily a ‘digitisation’ of existing information management.

The key idea of using a PDA properly is this: program your tasks as you see fit, then obey in accordance with reminders from the PDA. It is as if the PDA commands its user what to do and when, relying on and benefiting from cyclic events and ensuring that nothing is ever forgotten or becomes a mental burden. The mind is left available to think and expand its horizons rather than record information. Let us face it, machines do a better job at registering without ever ‘forgetting’. It has been their very basic nature, since the early days of computers.

SD Card

All your knowledge (and far more) in less than 1 gram

I can think of many advantages of using a PDA when compared with paper-based ‘life organisation’:

  • Automatic archival
  • Easy search
  • Easier re-use of data
  • Natural hierarchical abilities (not linear lists)
  • Less clutter (e.g. no crossed-out items)
  • Cyclic behaviour simplified, e.g. annual birthday notifications
  • Vocal reminders/alarms
  • Simple backups and duplication to prevent data loss
  • Collaborative exchange of data, e.g. Wi-Fi, synchronisation, IR port

I could think of a few more, but these should be enough to convince you why the popular Hipster PDA (famous paper-based equivalent) can never be matched. It better fits technology paranoids.

Undoing Selection/Deselection

Thinking dentistA long time ago I mentioned top software design/usability bugs. It occurred to me yesterday that there is yet another common deficiency, which is the inability to undo selection and deselection of files, entries and the like. This should become a very fundamental functionality in my opinion.

How many times in the past did you use the CTRL or SHIFT keys to establish and highlight a collection of files? One wrong click and the entire selection is gone, leading to a habit whereby files are handled in subsets, i.e. in smaller batches. In all operating systems I have come across, a selection of files, for example, forms a newline-separated list of the files with their full path. All of this is stored in the clipboard, so implementation of an undo stack should be trivial and incur no efficiency penalty.

Retrieval statistics: 21 queries taking a total of 0.385 seconds • Please report low bandwidth using the feedback form
Original styles created by Ian Main (all acknowledgements) • PHP scripts and styles later modified by Roy Schestowitz • Help yourself to a GPL'd copy
|— Proudly powered by W o r d P r e s s — based on a heavily-hacked version 1.2.1 (Mingus) installation —|