mirror of synced 2024-07-01 04:20:55 +12:00
Commit graph

725 commits

Author SHA1 Message Date
Cristian 8cfad64271 feat: Add specific logic for archive_org icon 2020-11-12 11:09:34 -05:00
Cristian e594e6a75a feat: WARC link points to the first warc result in target path 2020-11-12 10:57:31 -05:00
Cristian b237e412df feat: Finish reversal. Add ArchiveResults that are not found in the index.json 2020-11-12 10:30:41 -05:00
Cristian f7f0bebdcc feat: Modify migration reverse function to restore index (WIP) 2020-11-11 15:26:54 -05:00
Cristian 508a0bb06e refactor: Unpack extractors tuple instead of using the index to access the relevant information 2020-11-10 12:38:29 -05:00
Cristian 71655220ad feat: Add warc to list and limit check to succeeded archive results 2020-11-05 07:54:40 -05:00
Cristian 33182fd53c fix: Add missing assignation 2020-11-04 15:07:45 -05:00
Cristian d064a3eeff fix: Handle case when update tries to re-add a link that is not in the sql index 2020-11-04 15:02:54 -05:00
Cristian f292cface2 fix: Add condition for oneshot when archiving links 2020-11-04 14:40:44 -05:00
Cristian 4484491fb7 feat: Create ArchiveResult after finishing an extractor process 2020-11-04 11:22:55 -05:00
Cristian b3e0400bc0 feat: initial functional version with icons calculated based on archive results 2020-11-04 10:31:20 -05:00
Cristian 309a87e8fe feat: Add extractor field to the database 2020-11-04 07:28:02 -05:00
Cristian 8f3c03a0f9 feat: Initial (and naive) ArchiveResult model 2020-11-03 09:54:02 -05:00
Cristian ac0ec160d1 lint: Fix warnings in master branch 2020-11-02 08:51:48 -05:00
Nick Sweeting 7d4738a674 fix intermittent BrokenPipe error on macOS when SHOW_PROGRESS=True 2020-10-31 19:38:54 -04:00
Nick Sweeting 9c6ff5036c add suppress output helper 2020-10-31 19:33:17 -04:00
Nick Sweeting 22fb9c2ad7 tweak icons 2020-10-31 19:32:43 -04:00
Nick Sweeting cafe35c595 show pending in light font 2020-10-31 16:33:31 -04:00
Nick Sweeting 5cae05ae76 tweak tags css and add tags to navbar 2020-10-31 07:57:11 -04:00
Nick Sweeting c47398851b nicer timeout hints 2020-10-31 07:57:11 -04:00
Nick Sweeting 651d6c4447 bold snapshots over 50MB 2020-10-31 07:57:11 -04:00
Nick Sweeting b8bbb75f9c logarithmic progress bars woohoo 2020-10-31 07:57:11 -04:00
Nick Sweeting ac9e0e356d config fixes 2020-10-31 07:57:11 -04:00
Nick Sweeting 79051ca15b new package build 2020-10-31 03:08:41 -04:00
Nick Sweeting 18355dc2c6 clean up config loading in settings and config file layout 2020-10-31 03:08:03 -04:00
Cristian e7e33ea7a5 tests: Add tests for several different ways to extract the title 2020-10-30 08:04:26 -05:00
Nick Sweeting aede134ab3 temporarily disable icon highlighting in favor of performance 2020-10-30 05:12:33 -04:00
Nick Sweeting f727ece7b3 add regex fallback back to title parser 2020-10-30 04:57:31 -04:00
Nick Sweeting 79bef1384e Merge pull request #493 from ttimasdf/feat-ogtitle (Feature: add og:title metadata as alternative title) 2020-10-30 04:51:14 -04:00
Nick Sweeting cac3912439 small type fixes 2020-10-30 04:50:14 -04:00
Nick Sweeting 1e5fbf4bd2 Update archivebox/config/__init__.py 2020-10-29 13:46:03 -04:00
Cristian 81dd626b85 fix: CHROME_USER_DATA_DIR was causing an error after the update to posix paths 2020-10-29 11:09:18 -05:00
Nick Sweeting 5faadee7d1 workaround for mercury version output 2020-10-24 22:59:09 -04:00
Nick Sweeting e727af6f22 allow Path args to get_dir_size and copy_and_overwrite 2020-10-24 22:47:18 -04:00
Cristian f330e6428b lint: Remove unused imports from utils 2020-10-23 06:45:56 -05:00
Cristian f397634dd2 feat: Rename old indexes at the end of init process 2020-10-23 06:45:56 -05:00
Cristian 7fc9b7d456 refactor: Update mentions of the html index in the logs 2020-10-23 06:45:56 -05:00
Cristian 572b46cecf lint: Remove unused imports 2020-10-23 06:45:56 -05:00
Cristian ae1484b8bf feat: Remove index.json and index.html generation from the regular process 2020-10-23 06:45:56 -05:00
Nick Sweeting 494af5f2e1 Merge pull request #507 from ehainry/master (Add parser for Wallabag Atom feeds) 2020-10-22 14:04:57 -04:00
Cristian 14f56a868a refactor: Change typing for new stubs 2020-10-22 08:46:16 -05:00
Cristian c12fe0e3d7 feat: Use CURL_ARGS on title extractor 2020-10-22 08:46:16 -05:00
Cristian 563d0f94ec feat: Use CURL_ARGS in favicon extractor 2020-10-22 08:46:16 -05:00
Cristian 2e1cdca789 feat: Use CURL_ARGS on header extractor 2020-10-22 08:46:16 -05:00
Cristian 972d57bd08 feat: Add CURL_ARGS to control curl arguments 2020-10-22 08:46:16 -05:00
Cristian 24e7a74855 feat: Add WGET_ARGS to control wget arguments 2020-10-22 08:46:16 -05:00
Cristian 65530e1e5b refactor: Use json.loads instead of split for list arguments 2020-10-22 08:46:16 -05:00
Cristian bc02e0ffe3 feat: Add config for youtubedl (YOUTUBEDL_ARGS) 2020-10-22 08:46:16 -05:00
Cristian Vargas a850b4a9d9 Merge branch 'master' into tags 2020-10-20 08:23:25 -05:00
Emmanuel Hainry aebc83659d Add parser for Wallabag Atom feeds 2020-10-18 11:20:07 +02:00