04-25-2017, 05:29 PM
|
#1
|
That Crazy Guy at the Bus Stop
Join Date: Jun 2010
Location: Springfield Penitentiary
|
Google has 25 million books scanned but nobody is allowed to read them
Good read.
https://www.theatlantic.com/technolo...m_source=1-2-2
Quote:
You were going to get one-click access to the full text of nearly every book that’s ever been published. Books still in print you’d have to pay for, but everything else—a collection slated to grow larger than the holdings at the Library of Congress, Harvard, the University of Michigan, at any of the great national libraries of Europe—would have been available for free at terminals that were going to be placed in every local library that wanted one.
At the terminal you were going to be able to search tens of millions of books and read every page of any book you found. You’d be able to highlight passages and make annotations and share them; for the first time, you’d be able to pinpoint an idea somewhere inside the vastness of the printed record, and send somebody straight to it with a link. Books would become as instantly available, searchable, copy-pasteable—as alive in the digital world—as web pages.
|
Quote:
By 2004, Google had started scanning. In just over a decade, after making deals with Michigan, Harvard, Stanford, Oxford, the New York Public Library, and dozens of other library systems, the company, outpacing Page’s prediction, had scanned about 25 million books. It cost them an estimated $400 million.
|
|
|
|
The Following 8 Users Say Thank You to Cecil Terwilliger For This Useful Post:
|
|
04-25-2017, 06:12 PM
|
#2
|
Lifetime Suspension
|
This is important.
Quote:
On March 22 of that year, however, the legal agreement that would have unlocked a century’s worth of books and peppered the country with access terminals to a universal library was rejected under Rule 23(e)(2) of the Federal Rules of Civil Procedure by the U.S. District Court for the Southern District of New York.
|
|
|
|
04-26-2017, 08:24 AM
|
#3
|
Norm!
|
I'm glad they've chosen to keep the Necronomicon and the Darkhold out of normal people's hands.
__________________
My name is Ozymandias, King of Kings;
Look on my Works, ye Mighty, and despair!
|
|
|
04-26-2017, 08:51 AM
|
#4
|
#1 Goaltender
Join Date: Nov 2005
Location: An all-inclusive.
|
That's not fair.
|
|
|
The Following 5 Users Say Thank You to Kybosh For This Useful Post:
|
|
04-26-2017, 09:16 AM
|
#5
|
CP Gamemaster
Join Date: Feb 2010
Location: The Gary
|
And to think one of the main thrusts of the opposition to this was that other companies wouldn't be able to do the same thing. Would every company really want to scan millions of old out-of-print books on the off chance they actually make some money?
The argument about science journals being outrageously priced is true, though - there would need to be something to prevent that from happening.
|
|
|
04-26-2017, 09:17 AM
|
#6
|
Crash and Bang Winger
|
If Google scanned books that are still under copyright (I assume that exists for books) without paying the owners, why would anyone think they would be allowed to release that for free?
Older books, transcripts, maps, etc. should be fine, but for books you can still buy on Amazon, why would it be OK to release them for free?
I like Google's idea, but it's a bit nasty to think a corporation would be able to circumvent that by sheer force of will. I guess Google, Facebook, and Twitter have been doing that lately with no seeming hope of antitrust lawsuits, but wow.
|
|
|
04-26-2017, 09:21 AM
|
#7
|
Franchise Player
|
Did you actually read the article? They weren't going to release the things for free. If you searched for a passage that appeared in a book you'd get a small excerpt of a few sentences on either side of it, like a card catalogue. Their argument is that this is totally different from you being able to read the book, which is true.
__________________
"The great promise of the Internet was that more information would automatically yield better decisions. The great disappointment is that more information actually yields more possibilities to confirm what you already believed anyway." - Brian Eno
|
|
|
04-26-2017, 11:09 AM
|
#8
|
First Line Centre
Join Date: Aug 2009
Location: Coquitlam, BC
|
I watched a TED Talk a while ago about how filling out a Captcha helps in the effort by Google and others to digitize books.
Essentially, the random characters in a Captcha are not random: each is a tiny part of a scanned book page that needs verification by humans because computers have trouble deciphering that bit with OCR.
https://www.ted.com/talks/luis_von_a...ation#t-205693
|
|
|
The Following 2 Users Say Thank You to BloodFetish For This Useful Post:
|
|
04-26-2017, 11:23 AM
|
#9
|
CP Gamemaster
Join Date: Feb 2010
Location: The Gary
|
Quote:
Originally Posted by temple5
If Google scanned books that are still under copyright (I assume that exists for books) without paying the owners, why would anyone think they would be allowed to release that for free?
Older books, transcripts, maps, etc. should be fine, but for books you can still buy on Amazon, why would it be OK to release them for free?
|
The article said that books still for sale aren't part of this. The article also said that Google scanning first and asking for forgiveness after is what led to the proposed settlement of the class-action lawsuit - where Google would offer the entire library to institutions, charge a nominal fee for out-of-print books, and keep the money in escrow until the author steps up to claim what they were owed from the sales.
The reason it fell through was that some people felt it gave Google too much of a monopoly on books (even though the Kindle store on Amazon has the lion's share of the ebook market), some felt Google was going to overcharge institutions for access in the future, and some felt that Google shouldn't be profiting at all from its efforts.
|
|
|
04-26-2017, 11:24 AM
|
#10
|
Franchise Player
Join Date: Aug 2007
Location: Vancouver
|
Quote:
Originally Posted by BloodFetish
I watched a TED Talk a while ago about how filling out a Captcha helps in the effort by Google and others to digitize books.
Essentially, the random characters in a Captcha are not random: each is a tiny part of a scanned book page that needs verification by humans because computers have trouble deciphering that bit with OCR.
https://www.ted.com/talks/luis_von_a...ation#t-205693
|
Watched this as well. They're also doing the same thing with Duolingo, translating the internet into multiple languages.
|
|
|
All times are GMT -6. The time now is 10:50 PM.
|
|