text-splitters[patch]: delete unused html_chunks_with_headers.xslt #29340

Merged
ccurme merged 2 commits into langchain-ai:master from html-splitter-cleaning
Jan 23, 2025

Conversation

AhmedTammaa (Contributor)

This pull request removes the now-unused html_chunks_with_headers.xslt file from the codebase. A previous update (PR #27678) refactored the HTMLHeaderTextSplitter class to use BeautifulSoup instead of lxml and XSLT for HTML processing, so the file is no longer needed and can be safely deleted to keep the codebase clean and avoid confusion.

Issue: N/A

Dependencies: N/A
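
For reviewers who want to double-check that the file really is unreferenced before deleting it, a minimal sketch along these lines could be used. It is not part of this PR: the helper name `find_references`, the list of file suffixes, and the example directory in the trailing comment are all assumptions made for illustration.

```python
from pathlib import Path


def find_references(root, needle, suffixes=(".py", ".toml", ".cfg", ".in", ".txt")):
    """Return sorted paths under `root` whose text mentions `needle`.

    Hypothetical helper: only files with one of `suffixes` are scanned,
    so the .xslt file itself is never counted as a reference.
    """
    hits = []
    for path in Path(root).rglob("*"):
        if not path.is_file() or path.suffix not in suffixes:
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except (UnicodeDecodeError, OSError):
            # Skip files that cannot be read as UTF-8 text.
            continue
        if needle in text:
            hits.append(path)
    return sorted(hits)


# Example call (the directory name is an assumption about the repo layout):
#   find_references("libs/text-splitters", "html_chunks_with_headers.xslt")
# An empty list would suggest nothing in the scanned files still names the file.
```

An empty result only covers the suffixes scanned, so packaging manifests or docs in other formats would need a separate check.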

@dosubot (bot) added the label "size:L" (This PR changes 100-499 lines, ignoring generated files.) Jan 21, 2025
@dosubot (bot) added the labels "Ɑ: text splitters" (Related to text splitters package) and "🤖:nit" (Small modifications/deletions, fixes, deps or improvements to existing code or docs) Jan 21, 2025
@AhmedTammaa marked this pull request as draft January 21, 2025 14:30
@AhmedTammaa force-pushed the html-splitter-cleaning branch from aafba64 to 4b9dd4c January 21, 2025 14:39
@AhmedTammaa marked this pull request as ready for review January 21, 2025 14:44
AhmedTammaa (Contributor, Author) commented Jan 21, 2025

Hey @eyurtsev,

This is a follow-up to PR #27678.

I just removed the unused file as code cleanup.

I will do the suggested simplification in a different PR to keep things simple.

@dosubot (bot) added the label "lgtm" (PR looks good. Use to confirm that a PR is ready for merging.) Jan 23, 2025
@ccurme changed the title from "Removing the unused html_chunks_with_headers.xslt" to "text-splitters[patch]: delete unused html_chunks_with_headers.xslt" Jan 23, 2025
@ccurme merged commit d5b8aab into langchain-ai:master Jan 23, 2025
45 checks passed
@AhmedTammaa deleted the html-splitter-cleaning branch January 23, 2025 17:13
Labels
lgtm: PR looks good. Use to confirm that a PR is ready for merging.
🤖:nit: Small modifications/deletions, fixes, deps or improvements to existing code or docs
size:L: This PR changes 100-499 lines, ignoring generated files.
Ɑ: text splitters: Related to text splitters package
Projects
Status: Done
2 participants