3 Most Useful Discovery Engines: Find Similar Pages

We all know how Web search works: you provide relevant terms that describe the concept or topic you are interested in, click “Search”, and the search engine generates a list of results ranked by popularity.

But what if you don’t exactly know how to describe the concept or the topic you are interested in? What if you just want “something of the kind”?

In this case, you need to try discovery search engines: these tools rank the Web by similarity (not by popularity). They allow you to discover more pages based on the one you found most relevant.

Here are three useful, advanced discovery search engine tools.



Google’s Related: Operator

I have mentioned this neat search operator before, when listing Google tricks for when you don’t know what to search for. I also reviewed TouchGraph, a visualization tool based on this operator, which can be used as a discovery tool as well.

The index:

Obviously, the operator uses Google’s own database (which is huge). However, for most searches you are likely to see no more than 30-50 results, which suggests that Google shows only a fraction of the possible results.

The algorithm:

The basic algorithm behind the search operator is co-citation, which, in simple terms, works like this: if a web page A links to both page B and page C, then B and C are likely to be related. Of course, it is not really that simple and straightforward (it never is with Google), but that is the basic idea.
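To make the co-citation idea concrete, here is a minimal sketch in Python. It is only an illustration of the basic principle, not Google's actual implementation: the toy link graph and the function names (`cocitation_counts`, `related`) are made up for this example.

```python
from collections import defaultdict
from itertools import combinations


def cocitation_counts(outlinks):
    """For each pair of pages, count how many pages link to both of them."""
    counts = defaultdict(int)
    for source, targets in outlinks.items():
        # Every linking page "votes" once for each pair of pages it cites together.
        for a, b in combinations(sorted(set(targets)), 2):
            counts[(a, b)] += 1
    return counts


def related(page, outlinks, top=5):
    """Return the pages most often co-cited with `page`, best match first."""
    scores = defaultdict(int)
    for (a, b), n in cocitation_counts(outlinks).items():
        if a == page:
            scores[b] += n
        elif b == page:
            scores[a] += n
    # Sort by co-citation count (descending), then by name for stable output.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top]


# Hypothetical toy link graph: page -> pages it links to.
links = {
    "A": ["B", "C"],
    "D": ["B", "C", "E"],
    "F": ["B", "E"],
}

print(related("B", links))
```

In this toy graph, pages A and D both link to B and C, so C is reported as related to B even though B and C never link to each other. A real system would have to deal with spam links, weighting and scale, none of which is attempted here.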

Google Search Results Support?

Yes, you can access the list of related sites right from the Google search results page by clicking the “Similar” link.

Drawbacks:

It is hard to find fault with the mighty Google. The only drawback that comes to mind is that it is still Google: if you want to try an alternative user experience and get alternative results (not ranked by Google), you should try other tools as well.


Similar Pages

Similar Pages is a standalone tool that uses its own technology and claims to let users dig into the “hidden” parts of the web, those that you wouldn’t be able to find using Google alone. Whereas “ordinary” search engines rank results by popularity (thus preventing us from seeing less popular pages), SimilarPages ranks pages by similarity.

The index:

The tool uses its own database, which is claimed to contain more than 3.2 billion pages. The Firefox add-on is said to access 200 million sites.

The algorithm:

The tool uses “PageAffinity”, which takes into account both the content of pages and the linking structure of the web to determine the level of similarity between web pages.
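How PageAffinity actually combines these signals is not public, so the following Python sketch is purely an assumption made for illustration. It blends a content score (word overlap) with a link score (overlap of the pages linking in) using Jaccard similarity; the function names, the `words` and `inlinks` fields and the `content_weight` parameter are all hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity of two collections: |intersection| / |union|."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def blended_similarity(page_a, page_b, content_weight=0.5):
    """Blend content overlap and link overlap into one score between 0 and 1.

    This is NOT the real PageAffinity formula, only a toy stand-in.
    """
    content = jaccard(page_a["words"], page_b["words"])
    links = jaccard(page_a["inlinks"], page_b["inlinks"])
    return content_weight * content + (1 - content_weight) * links


# Hypothetical pages, each described by its words and by the pages linking to it.
page_a = {"words": {"search", "discovery", "engine"}, "inlinks": {"x", "y"}}
page_b = {"words": {"search", "engine", "tools"}, "inlinks": {"y", "z"}}

print(round(blended_similarity(page_a, page_b), 3))
```

Raising `content_weight` makes the score lean on what the pages say, while lowering it makes the score lean on who links to them.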

Google Search Results Support?

Yes, with their add-on installed, you can view similar pages right within the search results.

Drawbacks?

The tool has worked surprisingly well and suggested really good matches, but it seems to be somewhat biased towards home pages.


Similar Sites

SimilarSites (and its Firefox add-on, Similar Web) works similarly to the tool above. We have reviewed SimilarSites previously.

The index:

The developers seem to be very secretive about which technology they use and how many sites they crawl. All I have been able to find out from external sources is that they have mapped “millions” of sites and are “adding tens of thousands daily”.

The algorithm:

Like the two tools above, this one seems to use page content and linking structure, but the unique part is that it also analyzes user input (votes) as well as user browsing trends.

Google Search Support?

Yes, with their Firefox add-on you can access similar sites right from the Google search results (this works only for site home pages that you come across when searching Google).

Drawbacks?

As the name suggests (and as the screenshot above shows), the tool works only at the domain level. So no matter what the current page is about, the tool will only find sites similar to the current site (home pages). In other words, if you were testing it on this page (which is about search discovery), the tool would list sites about generic web tools, desktop tools and hacks (which is what MUO is generally about).

Besides that, the tool inserts “sponsored” results throughout its search results (they are marked as sponsored but may still be distracting).

Are you aware of any other great discovery search engines? Please share them in the comments!

Image credit: VJ_fliks

Comments (15)
  • conspiro.net

    Awesome job!!

  • Eric

    Hello Ann! Have you forgotten about http://www.similarsitesearch.com/? Have a try. It is based on clustering algorithm and considers more factors like country and language. It’s the best!

  • klu9

    re Similarpages, the article says it’s a stand-alone tool but when I go to their website and click download it makes clear it’s an extension for Firefox (so no go for me).

    Re finding similar sites versus finding similar pages, here are the results of putting in the URL for this article in several searches:
    – Google related: lists sites similar to Makeuseof (but not specific pages on this issue)
    – Similarsites: lists sites similar to Makeuseof (but not specific pages on this issue)
    – Xmarks: lists nothing
    – Moreofit: lists nothing

    • Ann Smarty

      re: Similar pages. Sorry, I meant that it has its own database, independent of Google’s. Now that I come to think of it, that does sound confusing.

      re: Google related: it does both. If the page is very new, it seems to search for similar sites. For more established pages, it does page-related search. try related:http://www.makeuseof.com/podcasts/ for example

    • klu9

      Of course, age. Both Google related and Xmarks did *much* better on an older article I tried (but SimilarSites and Moreofit still nothing).

Affiliate Disclaimer

This review may contain affiliate links, which pay us a small compensation if you do decide to make a purchase based on our recommendation. Our judgement is in no way biased, and our recommendations are always based on the merits of the items.

For more details, please read our disclosure.