Google Book Search and Einstein Digital Reprint
- From: Jose Menendez <ebooks@[redacted]>
- Subject: Google Book Search and Einstein Digital Reprint
- Date: Tue, 02 May 2006 14:35:01 EDT
Since Google Book Search has been the subject of a lot of
discussion on this list at various times, I thought some members
might be interested in a recent experience I had with the Google
Book Search Team.
A few years ago, I made a PDF version of Albert Einstein's
"Relativity: the Special and General Theory" (1920 edition). Like
most ebooks, it didn't reproduce the pagination or line breaks of
the original paper book. For that reason, I didn't include the
paper book's index, since the page numbers in it wouldn't match
the ebook's pagination.
Five or six weeks ago, I realized that the "Copyright
Information" in the ebook was outdated. It warned that the
original book was still under copyright in life + 50 countries.
Since Einstein died in 1955, however, his works passed into the
public domain in those countries on January 1st of this year.
Rather than just update and repost the original PDF, I decided to
make a "digital reprint" like the ones for "My Antonia" and
"Books and Culture" that have been mentioned on this list.
Checking Google Book Search, I found a Library Project copy of
the 1920 edition published by Henry Holt and Company. The quality
of the page images was pretty good, but 6 pages (116-118 and
155-157) were missing. (I relied on a paper reprint of the Holt
edition for them.) I quickly made a digital reprint, and since
the pagination now matched the paper book's, I could include a
fully functional (clickable) copy of the original index. And like
my "Books and Culture" reprint, the page numbers in the PDF were
linked to the corresponding Google page images.
The day I was going to upload the ebook to my website, I
discovered that Google's copy had vanished; every link to it
returned a "Not Found" error page. After waiting about 3 weeks
for it to reappear, I made a new version without any mention of,
or links to, Google's copy and uploaded it to my website the day
before Easter, replacing my old PDF from a few years ago.
The day after Easter I emailed the Google Book Search Team. I
told them that I'd made a "digital reprint" of Einstein's book,
and included a link to my PDF "reprint" of "Books and Culture" so
that they could see what I meant. Then I cited this part of the
Google Book Search Help page:
http://books.google.com/intl/en/googlebooks/help.html#17
which asks people who reuse Google's public domain books to
"maintain the attribution to Google." The last paragraph of my
email to them read as follows:
> In closing, let me point out that if you're going to ask people to
"maintain the attribution to Google," you shouldn't remove books without
any explanation. If I'd put my first version online without noticing
that Google's was missing, I might have gotten a number of emails,
complaining that my ebook was full of broken links. When you remove
books, you might want to consider substituting a page which reads
something like this: "This book has been temporarily removed . . ."
How did Google respond? Did they tell me not to reuse any of
their public domain books? No. Did they tell me to buy my own
copies or get them from libraries? No. Did they warn me not to
scrape their scans? No. Did they tell me not to link to their
page images because they don't want people reading their books?
Of course not.
Their response was very courteous. First, they thanked me for
reporting the problem. Then they explained that books may be
removed "temporarily" for a "variety" of reasons. For instance, a
book with quality problems might get removed for "reprocessing."
Finally, they "assured" me that they were "working as quickly as
possible" to get the book back online. The last line in their
reply was, "Your continued patience and understanding are much
appreciated." My only disappointment was that they didn't comment
on my suggestion about "substituting a page" when books are removed.
Yesterday I checked, and Einstein's book is finally back in
Google Book Search, so I've uploaded my original digital reprint
which links to Google's page images. But first I added this word
of warning to the "Note on the Text": "Google's images may not be
available to people outside the U.S. and may be unavailable to
U.S. residents at times."
http://www.ibiblio.org/ebooks/Einstein/Einstein_Relativity.pdf
Note: the original book is still under copyright in countries
that have copyright terms longer than life + 50.
Jose Menendez
P.S. Out of the 6 pages that were missing from Google's copy,
155-157 are there now, but 116-118 are still missing. In place of
116 and 117, duplicate images of pages 126 and 127 come up. So I
wouldn't be surprised if it disappears again for more reprocessing.