Robert Scoble kicked off a discussion today with his prediction that Mahalo, Techmeme and Facebook will beat Google. In the videos, he seems to be pretty convinced that Mahalo can beat Google. I disagree with Robert’s take on this. I had a few back to back tweets with Robert on the topic. In one of the tweets he asked whether several Mahalos can get together to beat Google. In this post, I am going to disagree with the thesis that “Mahalo can beat Google”. I am using Mahalo here but it applies to any human powered search engine with Mahalo’s approach to search.
Before I offer my views on the topic, I want to make the following clear.
- I completely agree with Robert that SEMs (Search Engine Marketers) are a pain and we need better search results.
- If, by any chance, Robert had meant that Google should be beaten in the search game, I am in full agreement with it too. We need to stop Google’s possible search monopoly and there should be heavy competition to force Google to continue innovating.
Having said that, I want to categorically state that Mahalo, in its current form, can’t beat Google. The reasons are many. Some of them include
- Human powered search cannot scale like machine based search
- Human powered search has built in bias in the system
- Human powered search will definitely lack in freshness
- Human powered search doesn’t mean much in non text search world like Image Search, Video Search, Book Search, etc.
- Human powered search is a ridiculous thing when it comes to desktop search.
There are many other reasons where Mahalo and other similar search engines will fail. However, these are the few reasons that just popped out immediately after watching Scoble’s video. Let us dig a little deeper on these reasons.
Human powered search engine in the Mahalo mould cannot scale like an algorithm based search like Google. Even if we assume that Mahalo grows like how Robert expects in the next four years, with 100K “volunteers”, it still cannot cover all topics in the world, it still cannot cover all languages in the world, it still cannot cover all the specifics in a particular topic, it cannot cover all the variations of a particular query, etc.. It might cover a search query “HDTV” much better than Google. But I searched Mahalo with queries “Dent on HDTV” and “Scratch on HDTV”, an hour back. It returned back saying there are no matching results. Even though it offers better results for HDTV, it fails miserably when you get into specifics on HDTV. Imagine the variations in queries on HDTV from all over the world, “HDTV with a spot”, “HDTV with a broken edge”, “HDTV and Bestbuy problem”, “HDTV with loose switch”, “HDTV’s remote not working”, “I just peed on HDTV”, etc. There is no way Mahalo can scale to take care of such specifics on even a single topic, along with everything else. Comparatively, HDTV is an easy problem to solve. There are much more difficult problems in this world which Mahalo cannot solve or may not bother to solve. Let us say I make a web page about a topic HDTV that contains a sentence like “not gone triple ribbon belt mother cook with mould acumen”. If someone searches for HDTV, my page may show up after the first 50 pages of Google results and not at all on Mahalo. But if someone searches for the sentence I mentioned above with double quotes, it will show up in the Google results and there is almost zero chance that it will show up in Mahalo (unless if my content is a killer content except for that particular sentence I quoted above). Now imagine endless new queries. Ask.com CEO claimed that 60% of their searches are unique. Even Google seems to have claimed that 20-25% of their total queries were not made before. Mahalo cannot even attempt to solve the search engine needs of all the internet users in this world, even if it scales like what Scoble hopes. It is like trying to build a 200 storey building without involving any machines and just with one brick at a time. It is just not humanly possible.
Scaling problems will ensure that any attempts to create a completely human powered search engine is a failure.
To highlight my arguments, I am adding the picture of the result I got in Mahalo when I queried “mod_security rules for SQL injection attack”. You can see how badly it failed for this query. The scaling of Mahalo in the next 4 years, which Robert describes, may answer this particular query well. But, there are billions of other queries which may throw up similar results even after four years.
Human powered search like Mahalo has inbuilt bias which is difficult to manage. Jason feels that this bias can be managed with stuff like transparency, ethics codes, hiring people well, firing bad people quickly, early warning systems, etc. He claims that they take a Neutral Point of View (NPOV) on stuff like abortion, 9/11 and George Bush. I agree with him that with stuff like transparency, ethics code, etc., you can take a NPOV on few things like the queries he has quoted. Bring in diversity of queries and the different points of view attached with it. Now bring scale these number of queries. There is no way you can maintain NPOV consistently on all the topics and various queries associated with these topics. Jason even suggested the idea of Mahalo Ombudsman and included Jeff Jarvis as one possible candidate, which he rejected immediately. Even if he makes it ombudsmen instead of ombudsman, the scaling problem will ensure that the bias will stay in the system. Add to this the issues like embargo. For example, the embargo on Iran might prevent Jason to hire guides from Iran and this may result in inferior Mahalo results as far as topics related to Iran are concerned. But, an algorithmic engine like Google can just crawl the websites in Iran and have better results on the topics related to Iran.
Human powered search engine like Mahalo cannot keep the content fresh. Google crawls the web more frequently to keep its contents fresh. Now, I think, we can even inform Google about how often it should update our websites and this helps Google to update its results with fresh contents. There is no way Mahalo can keep the content fresh on all topics and related to all queries. The scalability problem described above, will make sure that the Mahalo content is stale except for a smaller percentage of terms.
I think it is pretty obvious that human powered engines like Mahalo cannot do much in the areas like Image search, Video search, Book search, etc. Add to this, the search in other languages, reverse phone number look up, etc. I don’t even have to talk about how ridiculous it will be to consider human powered search when it comes to desktop search.
Robert ignores the longtail concept when he talks about human powered search engines replacing the algorithm based ones. The very concept of longtail came into existence due to the phenomenon called internet. Under such a scenario, betting the future of internet on something that uses the hits and ignores the longtail (remember his discussion on social fabric? He talks about bumping the results up based on the number of people who trust a particular person (Michael Arringtons and Robert Scobles of the world)). I do agree that Google uses pagerank which can, in some ways, be likened to the hits in the longtail concept. However, a well structured query in Google will also get results from the longtail of web pages whereas it will not be fetched in the social fabric approach of Mahalo. This is a very important point and I just hope I have put it forward clearly.
So what is the solution to the search engine problem? Is Mahalo doomed for failure? What about 100s of Mahalos? Well, I am not an expert in search engines to offer a solution to the problems, in the current day search engines. But I strongly believe that human based search engines like Mahalo cannot replace (or beat) algorithm based search engines like Google. My solution is still algorithmic based. My hunch feeling is that the future search engine will be a collection of vertical search engines which may use Robert Scoble’s trust concept to optimize the search results. As I told Robert in our twitter conversation, Mahalo can, at the very best, be a Google optimizer but not a Google beater. Vertical search engines, with Mahalos in the front for optimization, may offer the much needed solution to our search engine problems. Humans cannot replace machines in the search engine world. Mahalo may serve well for a niche market but it is not a Google beater. At best, it can be termed as a Google optimizer.
What is your take on Scoble’s thesis that Mahalo and other trust based human search engines will beat Google? Do you agree with him? If not, why do you think human search engines cannot beat Google? What do you think will be the Google killer?
PS: It is 2:15 AM. I will correct any mistakes in the morning.