- fix: handle empty strings in levenshtein_similarity to avoid division by zero
- feat(parser): include zero and one pixel elements in hidden checks
- feat(parser): add comment removal functionality to article parser
- feat(htmlutil): add text content improvement function
- fix: remove 'media' from negative regex to prevent image removal in Readability.js