Category Archives: Bulk Access

Google Summer of Code 2020: Adoption by Book Lovers

by Tabish Shaikh & Mek OpenLibrary.org,the world’s best-kept library secret: Let’s make it easier for book lovers to discover and get started with Open Library. Hi, my name is Tabish Shaikh and this summer I participated in the Google Summer of Code program with Open Library to develop improvements which will help book lovers discover […]

Also posted in Community, Google Summer of Code (GSoC), Open Source | Leave a comment

Bulk Access to OCR for 1 Million Books

The Internet Archive provides bulk access to the over one million public domain books in its Texts Collection. The entire collection is over 0.5 petabytes, which includes raw camera images, cropped and skewed images, PDFs, and raw OCR data. The Internet Archive is scanning 1000 books/day, so this collection is always growing.   The OCR data […]

Also posted in OCR | Tagged , , | Comments closed
  • open library logo
  • follow us on twitter

  • Recent Posts

  • Archives