Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
1. As mentioned in review comments, `test_page_should_be_valid_html` no longer needs to test whether a page should be validated, since its input is now guaranteed to be filtered. 2. `should_validate()` now performs both filtering checks: A blacklisted page should not be validated, and a non-HTML page should not be validated. 3. With the above changes, `crawl_only_html()` can now be refactored to a one-liner.
- Loading branch information