Monday, May 24, 2010

Public Domain Books in the Millions from Google Books


The first time I heard about public domain books was from a gentleman called Robert G. Allen. Robert by the way is the author of the book "One Minute Millionaire", and "Cash in a Flash". So what is public domain? 

According to Wiktionary, public domain is the feature of intellectual property not protected under patent or copyright, i.e. no person or legal entity can establish proprietary interests. Another definition from the University of Alberta library states that a work in the public domain is free for everyone to use without asking for permission or paying royalties. The phrase "public domain" is a copyright term referring to works that belong to the public. 

The University of Alberta go on to state that: works can be in the public domain for a variety of reasons: because the term of copyright protection has expired; because the work was not eligible for copyright protection in the first place; or because the copyright owner has given the copyright in the work to the public domain. The owner must specifically license all or certain uses of the work. This is done by stating on the work what uses are permitted such as, for example, that the work may be reproduced, communicated, or performed for educational purposes without permission or payment (A Guide to Copyrights 5).

As many might have heard by now, there are many works especially the classics in the public domain. This means that you have free access to the works and can do just about anything you want with the works. However, in the digital information age, there are corporate bodies trying their damndest to ensure that we the public do not have access to these bodies of works. They (the corporate) are doing everything to change the laws in their favour of course. If we are not careful, we (the public) might find that we are left out of the argument altogether. For a more detailed discussion, please read the blog piece before this one.
Enough old chap, I hear you say; end of rant. Let us get back to what we were discussing – a million books on Google. I have a piece from the Inside Google Books blog titled - Download over a Million Public Domain Books from Google Books in the Open EPUB Format. This was written by Brandon Badger, Product Manager for Google Books, dated August 26, 2009, and here is the where you can get the original piece: http://booksearch.blogspot.com/2009/08/download-over-million-public-domain.html.  Please enjoy the piece and let me know what you think with your comments.

Over the years, we've heard a lot from people who've unearthed hidden treasures in Google Books: a crafter who uncovered a forgotten knitting technique, a family historian who discovered her ancestor once traveled the country with a dancing, roller-skating bear. The books they found were out of copyright and in the public domain, which meant they could read the full text and even download a PDF version of the book.

I'm excited to announce that starting today, Google Books will offer free downloads of these and more than one million more public domain books in an additional format, EPUB. By adding support for EPUB downloads, we're hoping to make these books more accessible by helping people around the world to find and read them in more places. More people are turning to new reading devices to access digital books, and many such phones, netbooks, and e-ink readers have smaller screens that don't readily render image-based PDF versions of the books we've scanned. EPUB is a lightweight text-based digital book format that allows the text to automatically conform (or "reflow") to these smaller screens. And because EPUB is a free, open standard supported by a growing ecosystem of digital reading devices, works you download from Google Books as EPUBs won't be tied to or locked into a particular device. We'll also continue to make available these books in the popular PDF format so you can see images of the pages just as they appear in the printed book.

To get started, just find any public domain book on 
Google Books and click on the Download button in the toolbar.


Of course, these public domain books weren't born in EPUB format--or even in digital format at all. Let's say you download a free EPUB copy of Treasure Island. You're taking a final step in a long process that takes a physical copy of Robert Louis Stevenson's book and transforms it into something you can download for your iPhone. The process begins with a book that has been preserved by one of our library partners from around the world. Google borrows the book from one of our library partners, much like you can from your local library. Before returning the book in undamaged form, we take photographs of the pages. Those images are then stitched together and processed in order to create a digital version of the classic book. This includes the difficult task of performing Optical Character Recognition on the page image in order to extract a text layer we can transform into HTML, or other text-based file formats like EPUB (if you're interested, you can read more about this process here).

Digitizing books allows us to provide more access to great literature for a wider set of the world's population. Before physical books were invented, thoughts were constrained by both space and time. It was difficult for humans to share their thoughts and feelings with a set of people too far from their physical location. Printed books changed that by allowing authors to record their experiences in a medium that could be shipped around the world. Similarly, the words written down could be preserved through time. The result was an explosion in collaboration and creativity. Via printed books, a 17th century physicist in Great Britain could build on the work of a 16th century Italian scholar.

Of course, it can be difficult and costly to reproduce and transport the information that older physical books contain. Some can't afford these works. Others who might be able to afford to purchase them can't unless they can find a physical copy available for sale or loan. Some important books are so limited in quantity that one must fly around the world to find a copy. Access to other works is only available to those who attend certain universities or belong to certain organizations.

Once we convert atoms from physical books into digital bits, we can begin to change some of that. While atoms remain fairly expensive, digital bits are on a trend where they become ever cheaper to produce, transport, and store. For example, providing every student in a school district with a paper copy of Shakespeare's Hamlet might cost thousands of dollars. Yet if those same students already have cell phones, laptops, or access to the Internet, then they can access a digital copy of Hamlet for just a fraction of the cost. Often times, public domain texts in digital form are more affordable and accessible to the public than their physical parents.

All of this of course assumes that a digital version of the book exists. I love going into work each morning knowing that we're working to convert atoms into bits and that by doing so, we hope to make knowledge more accessible. In a world where educational opportunities are often disproportionately allocated, it's exciting to think that today anyone with an Internet connection can download any of over one million free public domain books from Google Books. Who knows. Maybe some kid will read Treasure Island on their phone and be inspired to write their own great novel some day.

No comments:

Post a Comment