3 Most Useful Discovery Engines: Find Similar Pages

discovery search enginesWe all know how the Web search works: all you need is to provide relevant terms that describe the concept or the topic you are interested in, click “Search” and the search engine will generate the list of results ranked by popularity.

But what if you don’t exactly know how to describe the concept or the topic you are interested in? What if you just want “something of the kind”?

In this case, you need to try discovery search engines: these tools rank the Web by similarity (not by popularity). They allow you to discover more pages based on the one you found most relevant.

Here are the three useful and advanced discovery search engine tools.



Google’s Related: Operator

I have mentioned this neat search operator when listing google tricks when you don’t know what to search for. I also reviewed the visualization tool based on this operator called TouchGraph which can be used as discovery tool as well.

discovery search engines

The index:

Obviously, the operator uses Google’s own database (which is huge). However, for most searches you run, you are most likely to see no more than 30-50 results (which looks as if Google generates only the fraction of possible results).

The algorithm:

The basic algorithm behind the search operator is co-citation, which, in simple words, works like this: if a web page A links to both page B and page C, the latter pages (B and C) are likely to be related. Of course, not everything is that easy and straightforward (it never is with Google) but the basic algorithm is like that.

Google Search Results Support?

Yes, you can access the list of related sites right from Google search results page by clicking the “Similar” link:

discovery search

Drawbacks:

It is hard to find faults with the mighty Google. The only drawback that comes to mind is that it is still Google and if you really need to test alternative user experience and get alternative results (ranked not by Google), you should try out other tools as well.


Similar Pages

Similar Pages is a standalone tool that uses its own technology and claims to let users dig into the “hidden” parts of the web – those that you wouldn’t be able to find using Google only: whereas “ordinary” search engines rank results by popularity (preventing us thus from seeing less popular pages), SimilarPages rank pages by similarity.

discovery search

The index:

The tool uses their own database which is claimed to contain more than 3.2 billion pages. The FireFox addon is said to access 200 million sites.

The algorithm:

The tool uses “PageAffinity” that takes into account both the content of pages as well as the linking structure of the web to determine the level of similarity between web pages.

Google Search Results Support?

Yes, with their addon installed, you are able to view similar pages right within search results:

discovery search

Drawbacks?

The tool has worked surprisingly well and suggested really good matches but it seems to be somewhat biased to home pages.


Similar Sites

SimilarSites (and its FireFox addon Similar Web) works similarly to the above one. We have written a review of SimilarSites previously.

discovery searching

The index:

The developers seem to be very secretive as to which technology they use and how many sites they crawl. All that I have been able to find out using the external sources is that they have mapped “millions” of sites, and “adding tens of thousands daily”.

The algorithm:

Like the above two tools, this seems to be using page content and linking structure but the unique part is that they also analyze users’ input (votes) as well as user browsing trends.

Google Search Support?

Yes, with their FireFox addon you access similar sites right from the Google search results (works only for site home pages that you come across when searching Google):

discovery search engines

Drawbacks?

As the name suggests (and as we have seen from the above screenshot), the tool works only on the domain level. So no matter what the current page is about, the tool will only find similar sites to the current site (home pages). In other words, if you were testing it on this page (which is about search discovery), the tool would list sites about generic web tools and desktop tools and hacks (which is MUO is generally about)

Besides that, the tool inserts “sponsored” results throughout their search results (which are marked as sponsored but may still seem disturbing).

Any other great discovery search engines you are aware of? Please share them in the comments!

Image credit: VJ_fliks


MakeUseOf Recommends

Ann Smarty

Ann Smarty is an SEO Consultant at seosmarty.com, Internet marketing blogger and active social media user. Please follow Ann on Twitter as seosmarty

The comments were closed because the article is more than 180 days old.

If you have any questions related to stuff mentioned in the article or need help with any computer issue, just ask it on MakeUseOf Answers.

Hide 14 Comments

  • Nafis Naim February 15, 2011
    0 likes

    http://www.moreofit.com/ is also a good website to find similar pages.

    | Like
  • Nafis Naim February 15, 2011
    0 likes

    http://www.moreofit.com/ is also a good website to find similar pages.

    | Like
    • Ann Smarty February 15, 2011
      0 likes

      Thanks! I’ll make sure to play with the tool

      | Like
  • skforussia.ru February 15, 2011
    0 likes

    Very impressive stuff. thanks for sharing

    | Like
  • klu9 February 22, 2011
    0 likes

    For ages I’ve used http://www.xmarks.com/

    I even created a search in Opera all I have to do is go to the address bar and type x space in front of the current URL to get similar websites.

    | Like
  • klu9 February 22, 2011
    0 likes

    For ages I’ve used http://www.xmarks.com/

    I even created a search in Opera all I have to do is go to the address bar and type x space in front of the current URL to get similar websites.

    | Like
    • Aibek February 23, 2011
      0 likes

      thanks for heads up, I had no idea they had this feature

      | Like
  • klu9 February 23, 2011
    0 likes

    re Similarpages, the article says it’s a stand-alone tool but when I go to their website and click download it makes clear it’s an extension for Firefox (so no go for me).

    Re finding similar sites versus finding similar pages, here are the results of putting in the URL for this article in several searches:
    - Google related: lists sites similar to Makeuseof (but not specific pages on this issue)
    - Similarsites: lists sites similar to Makeuseof (but not specific pages on this issue)
    - Xmarks: lists nothing
    - Moreofit: lists nothing

    | Like
  • klu9 February 23, 2011
    0 likes

    re Similarpages, the article says it’s a stand-alone tool but when I go to their website and click download it makes clear it’s an extension for Firefox (so no go for me).

    Re finding similar sites versus finding similar pages, here are the results of putting in the URL for this article in several searches:
    - Google related: lists sites similar to Makeuseof (but not specific pages on this issue)
    - Similarsites: lists sites similar to Makeuseof (but not specific pages on this issue)
    - Xmarks: lists nothing
    - Moreofit: lists nothing

    | Like
    • Ann Smarty February 24, 2011
      0 likes

      re: Similar pages. Sorry, I meant it having independent of Google database. Now that I come to think of that, it does sound confusing.

      re: Google related: it does both. If the page is very new, it seems to search for similar sites. For more established pages, it does page-related search. try related:http://www.makeuseof.com/podcasts/ for example

      | Like
      • klu9 February 25, 2011
        0 likes

        Of course, age. Both Google related and Xmarks did *much* better on an older article I tried (but SimilarSites and Moreofit still nothing).

        | Like
  • Ann Smarty February 24, 2011
    0 likes

    re: Similar pages. Sorry, I meant it having independent of Google database. Now that I come to think of that, it does sound confusing.

    re: Google related: it does both. If the page is very new, it seems to search for similar sites. For more established pages, it does page-related search. try related:http://www.makeuseof.com/podca… for example

    | Like
  • Eric May 3, 2011
    0 likes

    Hello Ann! Have you forgotten about http://www.similarsitesearch.com/? Have a try. It is based on clustering algorithm and considers more factors like country and language. It’s the best!

    | Like
    • Ann Smarty May 4, 2011
      0 likes

      Thanks! I’ll take a look!

      | Like