Commit Graph

49 Commits (ba7b6b0f8b6f7b483f39b5d23fe7088fe3b4499a)

Author SHA1 Message Date
dgtlmoon 9e08f326be
Chrome/Webdriver support for Javascript websites (#114)
3 years ago
dgtlmoon e2304b2ce0
Re #154 Ldjson extract parse (#158)
3 years ago
Richard Schwab b008269a70
Partially revert 47e5a7cf09 (#138)
3 years ago
dgtlmoon 83daa6f630 Re #132 - Make a list of the JSONpath results instead of using only the first value
4 years ago
dgtlmoon 655a350f50 Re #117 - dont re-encode single value types, looks better in the diff
4 years ago
dgtlmoon e073521f4d
Re #117 Jsonpath based JSON change detection filter (#125)
4 years ago
dgtlmoon 25185e6d00
Auto extract html title as title (#102)
4 years ago
dgtlmoon f215adbbe5 CSS Filter - Smarter is to just extract the HTML blob and continue with inscriptus, so we have almost the same output as not using the filter
4 years ago
dgtlmoon 8d59ef2e10 CSS Filter - restore nicer linefeeds
4 years ago
dgtlmoon e3a9847f74 @todo Comment - BS4's element.get_text() seems to lose the indentation format no-matter what
4 years ago
dgtlmoon 47f7698b32 CSS Filter - strip text of whitespacing, preserve new lines where applicable, remove extra newlines
4 years ago
dgtlmoon 854520005d
#81 - Regex support (#90)
4 years ago
Leonardo Brondani Schenkel cec45a7ad7
Strip surrounding whitespace from elements (#89)
4 years ago
dgtlmoon 2346b42ef2
CSS selector filter (#73)
4 years ago
Leigh Morresi e0578acca2 Tidy up thread logic and version check
4 years ago
Leigh Morresi 47fcb8b4f8 Move logic
4 years ago
Leigh Morresi f1da8f96b6 When new ignore text is specified, reprocess the checksum
4 years ago
Leigh Morresi 468184bc3a Issue #14 - Tweaks to edit, create ignore text, tests for ignore text, integrate ignore text
4 years ago
Leigh Morresi 96221598e7 Tidy up return logic
4 years ago
Leigh Morresi e200cd3289 Fixing a few more easy lint wins
4 years ago
Leigh Morresi 63eea2d6db Linting fixups
4 years ago
Leigh Morresi b0c5dbd88e Just use the current/previous md5
4 years ago
Leigh Morresi 1718e2e86f Finalse pytest methods
4 years ago
Leigh Morresi 87f4347fe5 hack of pytest implementation - doesnt work yet
4 years ago
Leigh Morresi 93ee65fe53 Tidy up a few broken datastore paths
4 years ago
Leigh Morresi 9f964b6d3f WIP, separate out the Flask from everything else, get pytest working
4 years ago
Leigh Morresi 47e5a7cf09 Avoid accidently using Python's objects that are copied - but land as a 'soft reference', need to use a better dict struct in the future #6
4 years ago
Leigh Morresi d07cf53a07 Minor fix to 'last changed' field, simplify template and logic
4 years ago
Leigh Morresi 5e31ae86d0 Use a thread locker and cleaner separation of concerns between main thread and site status fetch
4 years ago
Leigh Morresi 07f41782c0 Adding SEND_FILE_MAX_AGE_DEFAULT to ensure backups etc dont get old
4 years ago
Leigh Morresi f1c2ece32f Use a pool of thread workers, better for huge lists of watchers
4 years ago
Leigh Morresi eecc620386 https://github.com/psf/requests/issues/4525 - brotli compression is not yet supported in requests, be sure that users cant accidently use this content type encoding in the headers
4 years ago
Leigh Morresi 81534d9367 Add [diff] mechanism
4 years ago
Leigh Morresi 43c7ccb3fe Use a single thread for writing the sync json
4 years ago
Leigh Morresi bfcb17ca24 Remove import for old lib
4 years ago
Leigh Morresi 98f6f4619f Switch to inscriptis
4 years ago
Leigh Morresi fbe20d45cc Support for custom headers per watch
4 years ago
Leigh Morresi 324c54fe46 Use requests's r.text so we dont have to deal with charsets
4 years ago
Leigh Morresi b7a0c2dbcd Add edit UI
4 years ago
Leigh Morresi 9c0c8bf6aa Remove actual :// links, dont consider these as part of the changes, often they include variables/trackingscript ref etc
4 years ago
Leigh Morresi b574a28f1f Tweak comments
4 years ago
Leigh Morresi 01359e4811 Store a history of changes, used for future lookup/diff/explore changes UI
4 years ago
Leigh Morresi 93562afb02 Adding README amd docker info
4 years ago
Leigh Morresi f455f14efd Primitive support for extra headers
4 years ago
Leigh Morresi a4f1f6ab69 Handle titles and links
4 years ago
Leigh Morresi 1968d400fe Store the html2text version too
4 years ago
Leigh Morresi 0515aca7dd small fixes
4 years ago
Leigh Morresi 646a54945a Handle errors better, use the plaintext output
4 years ago
Leigh Morresi 2f018ac04c Workon threads
4 years ago