
Think of your favorite open databases.

I’m sure Wikipedia and IMDb instantly spring to mind, but you might not always need the sum of all knowledge, or a comprehensive database of all things entertainment. Sometimes you need a bit of VLDB (Very Large Data Base) flavor. Something to spice up your data analysis. Something to put the “big” in your big data. Whelp, good person, you’re in the right place.

Here are 15 massive online databases you can access and analyze for free, or just peruse at your leisure.

1000 Genomes

The 2003 completion of the Human Genome Project (HGP) was just the beginning. Since then, advances in sequencing technology have vastly reduced the per-person cost, allowing the work of the HGP to expand from its initial research base of twenty university labs into a sprawling, globalized network of interconnected genome mapping facilities.

You can download part of the 1000 Genomes Project, containing sequencing information for over 2,600 people from 26 populations around the world. This is a 200TB dataset, so be prepared. We would suggest using it in conjunction with a powerful cloud computing platform.
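To get a feel for what 200TB means in practice, here is a rough back-of-the-envelope sketch. The link speeds are hypothetical examples, and the calculation assumes decimal terabytes and a perfectly sustained connection, which real downloads rarely manage.

```python
def transfer_days(size_terabytes: float, megabits_per_second: float) -> float:
    """Days needed to move size_terabytes over a sustained link speed."""
    bits = size_terabytes * 1e12 * 8              # decimal terabytes -> bits
    seconds = bits / (megabits_per_second * 1e6)  # Mbit/s -> bit/s
    return seconds / 86_400                       # 86,400 seconds per day


# Hypothetical link speeds in Mbit/s, chosen only for illustration.
for speed in (100, 1_000, 10_000):
    print(f"{speed:>6} Mbit/s: {transfer_days(200, speed):,.1f} days")
```

At 100 Mbit/s the transfer alone would take roughly 185 days, which is why analyzing the data where it already lives tends to be more practical than pulling it down to a desktop.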


See also: The Animal Genome Size Database for genome data relating to 5,635 species.

Airliners

A planespotter’s heaven. A massive image database featuring 2,532,457 photos of all manner of aircraft, from the smallest individual craft to hulking great flying fortresses.

Airliners also features an extensive aircraft data and history section, kept up to date in cooperation with Aerospace Publications to ensure factual accuracy. This has made it one of the most detailed aircraft databases on the Internet.

See also: Planespotters.net for a different range of images, or SeatGuru for airplane seating schematics.

The Internet Archive

The Internet Archive has gone through a massive redesign. The site’s look hadn’t changed much since around 2002, even though the archive itself has done a great deal of growing since those early days.

Archiving everything on the Internet, the site gives you free access to digital media including books, music, games, videos, and much more. The collection is currently estimated at around 10 petabytes, and as its web crawlers keep crawling, it will continue to grow.

Freebase

Freebase is “a community-curated database of well-known people, places, and things,” stored in a data structure called a graph. A graph is composed of nodes connected by edges, which allowed Freebase to rapidly expand its content without disrupting existing records.
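If the idea of a graph sounds abstract, the following minimal sketch may help. It is not Freebase’s actual schema or API; the `Graph` class, its method names, and the edge labels are all made up for illustration. It only shows the general principle that a new fact becomes a new edge, leaving existing records alone.

```python
from collections import defaultdict


class Graph:
    """A tiny directed, labeled graph: nodes connected by named edges."""

    def __init__(self):
        # Maps each node to a list of (edge_label, target_node) pairs.
        self.edges = defaultdict(list)

    def add_edge(self, source, label, target):
        """Connect two nodes; unknown nodes are created on the fly."""
        self.edges[source].append((label, target))
        self.edges[target]  # touch the target so it is registered as a node

    def neighbors(self, node, label=None):
        """Return nodes reachable from `node`, optionally filtered by edge label."""
        return [
            target
            for edge_label, target in self.edges.get(node, [])
            if label is None or edge_label == label
        ]


g = Graph()
g.add_edge("Roger Waters", "member_of", "Pink Floyd")
g.add_edge("Pink Floyd", "genre", "Progressive rock")

# Adding a fact later just appends an edge; nothing already stored is rewritten.
g.add_edge("Roger Waters", "instrument", "Bass guitar")

print(g.neighbors("Roger Waters"))
print(g.neighbors("Roger Waters", label="member_of"))
```

Compare this with a fixed-column table, where recording a new kind of fact usually means altering the table layout for every existing row.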

Unfortunately, Freebase, owned by Google, switched to read-only mode early this year, ahead of the transfer of the standalone database to the Wikimedia Foundation for integration into the Wikidata project (end of June, 2015). Developers can still access Freebase using existing APIs for now, but once the switch is made, they will have to use Wikimedia APIs to access the data.

Find a Grave

From the home base of an Internet knowledge dream team of Google and Wikimedia, we move to the morbid. Find a Grave is a massive database of 121 million burial records from around the globe.

The most comprehensive records come from the US, but some smaller countries have sizable collections too. Entries come complete with photos, interesting monuments, and a number of notable epitaphs, in case you need inspiration.

GameRankings

A database maintained by the ever-present reviewing team at GameSpot, GameRankings gives a well-rounded portrayal of a game’s popularity by covering online and offline gaming reviews from reputable sources.

The Big Cartoon Database

In a similar vein to the massive IMDb, The Big Cartoon Database focuses exclusively on all things animated: cartoons, films, television shows, adverts, and more. If it is an animation, you’ll find it here, and if not, you can sign up as a contributor to this ever-growing database.

The Big Cartoon Database has a sister site in The Big Comic Database, home to a further 100,000 or more comic book records, spanning some 5,000 series, with over 35,000 cover scans. It also offers a comprehensive search function and a comic book price guide detailing current resale values at the various grading levels.

See also: The Grand Comics Database, a non-commercial enterprise database of comics worldwide.

CiteSeerX

An invaluable tool for students and academics alike, CiteSeerX is a public search engine and digital library of academic and scientific papers. Often considered the first automated citation indexing system, it was the inspiration for Google Scholar and Microsoft Academic Search, though the latter has since been integrated into the Bing search engine.

CiteSeerX focuses on indexing public scholarly documents. If your research paper is openly distributed, it has a higher chance of appearing within the search engine. CiteSeerX is an excellent example of the power of shared knowledge made available to a much wider audience.

See also: Google Scholar for a different range of books and citations.

WorldCat

Unfortunately, this is not a database of every cat picture on the Internet. Now that would be something! WorldCat is much more useful than that. The reference site documents the collections of over 72,000 libraries around the world, covering 170 countries and territories. This is useful if you’re researching in a foreign country, or just have a desire to read rare books in person.

The only downside is the update method. WorldCat uses a batch processing model rather than allowing users to access the data in real time. As a result, WorldCat does not indicate the loan status of catalogued books, whether a library owns multiple copies of one book, or whether the book in question is directly accessible to those wishing to visit. It is still a very useful tool, especially when used in conjunction with CiteSeerX.

The Simpsons Archive

“The Internet’s clearinghouse of Simpsons guides, news, and information.” I couldn’t have put it better myself. The long-standing fan favorite began way back in 1994, and is still going strong even without any interactive multimedia, if only to escape the watchful eye of Fox’s legal department.

WinCustomize

You will find one of the largest databases of customization tools for Windows here, spanning from XP up to Windows 8.1. I’m sure it won’t take long for Windows 10 customizations to begin making the rounds. The site’s vast popularity stems from a combination of forces. Owner Stardock subsidizes the site, meaning there are few to no advertisements. It also benefits from the number of visitors funneled to the site from Stardock.

Ultimate Guitar Archive

Ah, a trip down nostalgia lane to a database reminding me I was never to be Roger Waters. In fact, I can still barely play, but that’s another story.

The Ultimate Guitar Archive, or just Ultimate-Guitar (UG), has over 1,500,000 registered members around the world, overseeing a ridiculously large amount of community content. It is almost mind-boggling how much guitar-related information flows out from a single source. The community doesn’t just maintain a massive database; its members also frequently collaborate with one another to create sprawling music projects.

Plants for a Future

Plants for a Future documents ecologically sustainable horticulture. It has a big hand in spreading knowledge on species diversity and the importance of permaculture. What started as a small project in the depths of Cornwall has slowly grown into a worldwide database.

Growth is somewhat slow, and the database largely focuses on permaculture in the UK and EU, but many of the records can be adapted to specific locations in the US once you have the species details.

Quandl

Power up Excel with the Quandl add-in to process and analyze data. The main Quandl site acts as a database search, locating databases from around the world that match your search terms. Try it if you need some extra data in a hurry, or just like playing with large datasets (honestly, who doesn’t?!).

See also: The Enigma database search engine.

Tiny Images

The Tiny Images dataset acts as a visual dictionary. Click anywhere within the image and a search term pops up with extra information. You can also use specific terms to sift through 80 million images.

The database is part of a wider machine learning project focused upon teaching computers to “see” and “read” semantic fields within images.

Bonus Source: /r/datasets

Reddit, the “front page of the Internet,” is a solid home for data mining enthusiasts around the globe. There are subreddits dedicated to machine learning, data mining, text to data, and datasets. If you need something specific, make a request. New datasets appear every week.

Watch out for interesting datasets as they are posted, like the Immunization Levels in Child Care and Schools for California.

Do You Use The Wealth?

The Internet has created the single clearest opportunity for individuals to come together and concentrate their knowledge into a single database. We are valiantly trying to document everything about anything. Some of these databases are for perusing, others are for learning, but we hope you enjoy them all.

What are your favorite databases? Are there any open massive reference sources I should have included in this list?

Image Credits: network server via Shutterstock, library via Shutterstock
