| Nick Sweeting | 3eaf580fc0 | Merge branch 'dev' into link-removal2 | 2021-02-01 02:46:57 -05:00 |
| Nick Sweeting | 7d0f5653c3 | fix lgtm alerts | 2021-02-01 02:27:24 -05:00 |
| Nick Sweeting | 04c951cdd5 | fix alerts | 2021-02-01 02:22:02 -05:00 |
| Nick Sweeting | 846c966c4d | use globbing to find wget output path | 2021-01-30 22:02:39 -05:00 |
| Nick Sweeting | e6fa16e13a | only chmod wget output if it exists | 2021-01-30 22:02:11 -05:00 |
| Nick Sweeting | 385daf9af8 | save the url as title for staticfiles or non html files | 2021-01-30 22:01:49 -05:00 |
| Nick Sweeting | b9b1c3d9e8 | fix singlefile output path not relative | 2021-01-30 20:44:49 -05:00 |
| Nick Sweeting | d6de04a83a | fix lgtm errors | 2021-01-30 06:07:35 -05:00 |
| Nick Sweeting | 1fe95474c2 | Merge branch 'dev' into link-removal2 | 2021-01-30 03:43:46 -05:00 |
| Nick Sweeting | c2aaa41c76 | fix missing str path | 2021-01-30 01:25:08 -05:00 |
| Nick Sweeting | 15e58bd366 | fix using os.path calls on pathlib paths | 2021-01-27 11:27:40 -05:00 |
| Nick Sweeting | 9764a8ed9b | check for non html files from wget | 2021-01-25 18:15:16 -05:00 |
| Dan Arnfield | 5420903102 | Refactor should_save_extractor methods to accept overwrite parameter | 2021-01-21 15:56:32 -06:00 |
| Nick Sweeting | ef7711ffa0 | fix cookies file arg is path | 2021-01-20 19:13:53 -05:00 |
| Cristian | 91f543ff8d | lint: Remove unused imports around the app | 2021-01-19 12:18:48 -05:00 |
| Cristian | ab311d86e1 | fix: Wget issue when calculating output path | 2021-01-19 10:59:50 -05:00 |
| Cristian | 30c886d4d4 | fix: Overwrite(add command) was using snapshots without timestamps (and in memory), causing issues with some extractors | 2021-01-16 14:11:45 -05:00 |
| Cristian | a36f36866f | fix: post-rebase issues | 2021-01-11 09:32:10 -05:00 |
| Cristian | 075d310a9f | fix: History is now correctly being generated from ArchiveResult. Removed old divergent logic. | 2021-01-11 09:08:33 -05:00 |
| Cristian | 973f8b6abc | refactor: status command is functional | 2021-01-11 09:08:33 -05:00 |
| Cristian | d92083b928 | refactor: update command is functional | 2021-01-11 09:08:33 -05:00 |
| Cristian | de3c82730c | refactor: archive_org uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | 5ea1b9e39f | refactor: dom uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | eac29e3a7a | refactor: favicon uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | d1326c3660 | refactor: git uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | b9489c971c | refactor: headers uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | 29ec48a35f | refactor: media uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | 48aa9d5c08 | refactor: mercury uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | 756b7fc76d | refactor: pdf uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | 5cf9ca0e2c | refactor: readability uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | 6230984cb3 | refactor: screenshot uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | ca7f48042b | refactor: singlefile uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | f6152ded44 | refactor: wget uses snapshot instead of link | 2021-01-11 09:08:33 -05:00 |
| Cristian | 8c4ae73d65 | refactor: Initial and dirty refactor to replace link with snapshot. Barely functional add command | 2021-01-11 09:08:33 -05:00 |
| Cristian | 6031ffa3b2 | fix: Mercury extractor error was incorrectly initialized | 2021-01-07 09:22:46 -05:00 |
| Cristian | e9e4adfc34 | fix: wget_output_path failing on some extractors. Add a new condition | 2021-01-07 09:07:29 -05:00 |
| Cristian | 81d766aba1 | refactor: Remove setup_django from title.py | 2020-12-11 16:03:50 -05:00 |
| Cristian | 275ad22db7 | refactor: Remove skip_index from archive related functions | 2020-12-08 18:42:25 -05:00 |
| Cristian | f6c73f9aeb | fix: Issue with oneshot command | 2020-12-08 18:42:25 -05:00 |
| JDC | 7903db6dfb | Add ArchiveResult Manager and sorted indexable filter | 2020-12-06 01:13:39 +02:00 |
| JDC | b1f70b2197 | Initial implementation | 2020-12-06 01:12:45 +02:00 |
| Cristian | 33182fd53c | fix: Add missing assignation | 2020-11-04 15:07:45 -05:00 |
| Cristian | d064a3eeff | fix: Handle case when update tries to re-add a link that is not in the sql index | 2020-11-04 15:02:54 -05:00 |
| Cristian | f292cface2 | fix: Add condition for oneshot when archiving links | 2020-11-04 14:40:44 -05:00 |
| Cristian | 4484491fb7 | feat: Create ArchiveResult after finishing an extractor process | 2020-11-04 11:22:55 -05:00 |
| Cristian | ac0ec160d1 | lint: Fix warnings in master branch | 2020-11-02 08:51:48 -05:00 |
| Nick Sweeting | ac9e0e356d | config fixes | 2020-10-31 07:57:11 -04:00 |
| Nick Sweeting | 18355dc2c6 | clean up config loading in settings and config file layout | 2020-10-31 03:08:03 -04:00 |
| Cristian | e7e33ea7a5 | tests: Add tests for several different ways to extract the title | 2020-10-30 08:04:26 -05:00 |
| Nick Sweeting | f727ece7b3 | add regex fallback back to title parser | 2020-10-30 04:57:31 -04:00 |