Skip to content

A chrome/firefox extension that download books from Internet Archive(archive.org) and HathiTrust Digital Library (hathitrust.org)

License

Notifications You must be signed in to change notification settings

elementdavv/internet_archive_downloader

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

79 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Internet Archive Downloader

In Internet Archive, Some books can be borrowed to read online for a limited period. The extension will try to download these books.

In HathiTrust Digital Library. books with full view permission is also supported.

The extension works by fetching every leaf of a book, constructing a PDF stream on the fly targeting to disk storage. So it can handle books of thousands of leafs easily with little memory.

What It Can Do

  • Download a book as a PDF file with text embedded
  • Download a book as a collection of image files (JPEG/PNG) one for each leaf, and the text of the book
  • Optional leaf range
  • Optional leaf quality
  • Download multiple books in parallel

Install manually

Install as the following directions supports automatic updates.

Chrome extension (Works on Chromium family)

  • Navigate to releases page, find and download the latest iadownloader-x.x.x.crx.
  • Drag and Drop the CRX file from file manager to Extensions page on browser (developer mode must be enabled).
  • Due to the install restriction of Chrome with extensions which are not from Chrome WebStore, take the following steps:
    • Windows: Download registry file windows.reg. Double click to run it. The file will allow Internet Archive Downloader extension CRX file to install in Chrome, Edge, Brave and Vivaldi browser.
    • Mac OS: Download policy file for Chrome / Edge / Brave. Double click to run it. Then from System Preferences (or System Settings) find the item of "Profile" (it may be in current page, or inside "General", "Security and Privacy" item, dependent of Mac OS version). In the "Profile" setting page, click "install" button to activate it. The profiles will allow Internet Archive Downloader extension CRX file to install in Chrome, Edge and Brave browser.
    • Linux: Supported by default.

Edge extension (Works on Edge only)

  • Navigate to releases page, find and download the latest iadownloader-edge-x.x.x.crx.
  • Drag and Drop the CRX file from file manager to Extensions page on Edge (developer mode must be enabled).

Firefox extension

  • With Firefox, nvigate to releases page, find and click the latest iadownloader-x.x.x.xpi to install.

Install automatically

Configuration

For Firefox, permissions of access to relative websites must be granted in the "Permissions" tab of the extension detail page.

Usage

Internet Archive:

In archive.org, the availability of books lies in three categories:

  1. Lending Library
  2. Borrow 14 days
  3. Always Available

The extension works with category 1 and 2.

After borrowing a book, two new buttons, "Quality" and "Download", appear under the book viewer alongside the "Favorite" button.

  • To get the current book as a PDF file, press the "Download" button.
  • To get each leaf as a JPEG file, press the "Download" button while holding Ctrl key (Command key on Mac).
  • To get only a range of leafs, press the "Download" button while holding Alt key (Option key on Mac), then input a range.

On successful downloads, the book will be returned automatically for availability to other users.

HathiTrust:

No login, no borrows required. In hathitrust.org, for books with full view permission, a new section, "Ayesha", appears above the "Download" section on the left-hand side of the page. The section contains three buttons, "Quality", "Tasks" and "Download".

  • To get the current book as a PDF file, press the "Download" button.
  • To get each leaf as a JPEG/PNG file, press the "Download" button while holding Ctrl key (Command key on Mac).
  • To get only a range of leafs, press the "Download" button while holding Alt key (Option key on Mac), then input a range.

The download process may take some breaks due to server constraints.

Introduction in Youtube

Internet Archive Downloader

Availability

  • Chromium family(Chromium, Chrome, Edge, Brave, Vivaldi, Opera, Yandex, Kiwi, etc) version 90+ supported
  • Firefox version 115+ supported

Privacy

Please read carefully the Extension Privacy Policy.

Disclaimer

The project is for study purpose only. Users should recognize that downloading borrowed limited period books is prohibited. All books downloaded using this project should be deleted within 48 hours. The project is provided on an "as is" basis. Use of the project is at user's own risk. The owner does not guarantee or assume responsibility for any damages to user's computer system, mobile device, loss of data or legal risks that results from using the project.

Contacts

Donation

If you want to support my work you could donate by donate

About

A chrome/firefox extension that download books from Internet Archive(archive.org) and HathiTrust Digital Library (hathitrust.org)

Resources

License

Stars

Watchers

Forks