Commit Graph

66 Commits

Author SHA1 Message Date
Nemo 5fd0056d77 Dependency and version bump 2021-06-04 13:56:51 +05:30
Nemo 1e57857a4e Version Bump (1.3.0) 2020-07-01 18:29:44 +05:30
Nemo ba0a47038d Remove input-pdf from README and help 2020-07-01 18:29:22 +05:30
Nemo a05a1253db Keep going with next issue 2020-07-01 18:26:48 +05:30
Nemo 03fccde754 Adds support for final journal downloads 2020-06-30 18:36:01 +05:30
Nemo 3a2d45fb6e Adds a skip-open-access flag 2020-06-30 18:09:38 +05:30
Nemo 62e6a21c84 Finishes support for downloading complete issues 2020-06-30 17:36:44 +05:30
Nemo 38db0dd000 Adds tests for page detection 2020-06-30 16:50:49 +05:30
Nemo 919c8ac43f Fixes parser for issue HTML. This also adds .journal_title as an attribute to the Issue object 2020-06-30 15:19:12 +05:30
Nemo 870ed3080d Modular code in fetch to support both chapters and articles 2020-06-30 14:47:51 +05:30
Nemo f04e9b799e Removes input_pdf and initial work on article download 2020-06-30 14:18:19 +05:30
Nemo 04a2fe52ec Minor fixes, parse contents for issues 2020-06-30 14:08:28 +05:30
Nemo aa392eaa64 Adds support for parsing title to volume/number/date of a journal issue 2020-06-16 19:27:11 +05:30
Nemo 7b48731afe Parse title and publisher for issues 2020-06-16 18:52:29 +05:30
Nemo 6b278531fd Infobox is parsing for an issue now 2020-06-16 18:52:29 +05:30
Nemo f11f64b9d5 Adds webmock 2020-06-16 18:52:29 +05:30
Nemo ff225b12c6 Fix filenames with double-quotes 2020-06-16 18:52:29 +05:30
Nemo 4a358d0cb0 Journal parser now parses all issues 2020-06-16 18:52:29 +05:30
Nemo d8702b2fcb Initial work on parsing the journal page 2020-06-16 18:52:29 +05:30
Nemo fcc4f0c48b Clear out the Producer/Creator on the PDF 2020-06-16 18:52:28 +05:30
Nemo 487b222d79 Adds support for --dont-strip-first-page 2020-05-14 01:04:15 +05:30
Nemo c3722430e1 Adds a check for rate-limit 2020-04-22 18:31:37 +05:30
Prad Nelluru 5e5158fe1c Don't backoff for more than 256 seconds (~4 min) (#13) 2020-04-21 17:56:25 +05:30
Nemo ebf1b57e22 Merge pull request #12 from pradn/better-errors: Improve error handling 2020-04-20 03:23:24 +05:30
Prad Nelluru 2206c41228 Use response.body, not response.body_io, which is nil when you pass in HTTPClient for some reason. 2020-04-19 17:50:06 -04:00
Prad Nelluru 4e435dd3ab Add 60s timeout to downloads. Do backoff for all errors. 2020-04-19 17:44:21 -04:00
Prad Nelluru 9659c0ef5e Trim chapter titles to ensure bookmarks are valid in PDF (#11) 2020-04-20 02:03:30 +05:30
Prad Nelluru 762164e223 more descriptive error messages 2020-04-19 15:18:05 -04:00
Prad Nelluru 77201bda85 Fix download issue - revert to using body_io 2020-04-19 15:00:59 -04:00
Prad Nelluru db2d86c1a8 Also add exception message to top-level rescue 2020-04-19 14:49:41 -04:00
Prad Nelluru 1d2f53bad0 forgot to git-add new error files 2020-04-19 14:46:26 -04:00
Prad Nelluru 5d9d951c9a Write backtrace in top-level rescue blocks. 2020-04-19 02:24:09 -04:00
Prad Nelluru 483f838d24 Report pdftk and download errors. Add exponential backoff to downloading after download failures. Add top-level rescue block to improve forward progress. 2020-04-19 01:58:20 -04:00
Nemo d52b06377d Version bump (1.1.2) 2020-04-05 18:58:28 +05:30
Nemo b7aad7a3c2 Add link to download message 2020-04-05 18:58:02 +05:30
Nemo 380f1f03f8 Put URL when skipping a file 2020-04-05 18:57:24 +05:30
Nemo 449be5e554 Version bump 2020-04-05 02:55:35 +05:30
Nemo c08b8b7284 Show version in help 2020-04-05 02:55:19 +05:30
Nemo 1d95cce3f8 Catch another PDF error 2020-04-05 02:14:50 +05:30
Nemo b7cae52cb4 Version Bump 2020-04-04 03:44:08 +05:30
Nemo 82ce97cbc8 Adds fix for corrupt PDF error from MUSE 2020-04-04 01:24:35 +05:30
Nemo ae2a35bb7b Version Bump 2020-04-04 01:19:59 +05:30
Nemo 00c80bf8e4 Fixes #6 2020-04-04 01:17:31 +05:30
Nemo 1a5086b2d7 Catch books with missing dates 2020-04-04 01:10:11 +05:30
Nemo 0b241d6a47 Fix broken binary issue 2020-04-01 17:51:57 +05:30
Nemo 18a051e20f [ci] Fixes CI Build 2020-04-01 02:04:15 +05:30
Nemo f4cf8ce97c Release 1.0.0 2020-04-01 01:37:47 +05:30
Nemo f3c757984a Skip books that don't support PDF 2020-04-01 01:36:53 +05:30
Nemo 2a78be1022 Adds support for .formats and fixes tests 2020-04-01 01:33:54 +05:30
Nemo 955aec3a1b Adds support for cookies 2020-03-30 02:33:55 +05:30