1
0
Fork 0
mirror of synced 2024-09-12 15:38:48 +12:00
ArchiveBox/archivebox/extractors
2024-01-03 20:31:46 -08:00
..
__init__.py config.py lint fixes 2023-11-14 02:07:35 -08:00
archive_org.py
dom.py After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file. 2023-08-28 17:27:03 +02:00
favicon.py Add FAVICON_PROVIDER option for custom favicon service 2023-05-05 20:42:36 -05:00
git.py
headers.py
htmltotext.py Add htmltotext extractor 2023-10-23 21:42:32 -04:00
media.py
mercury.py
pdf.py After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file. 2023-08-28 17:27:03 +02:00
readability.py tag URLs immediately once added instead of waiting until archival completes 2024-01-03 20:31:46 -08:00
screenshot.py After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file. 2023-08-28 17:27:03 +02:00
singlefile.py add CHROME_TIMEOUT args 2023-03-14 20:29:41 +09:00
title.py prefer dom dump to singlefile for generating readability output 2024-01-03 20:11:06 -08:00
wget.py