Overview
Integration details
| Class | Package | Local | Serializable | JS support |
|---|---|---|---|---|
| BSHTMLLoader | langchain-community | ✅ | ❌ | ❌ |
Loader features
| Source | Document Lazy Loading | Native Async Support |
|---|---|---|
| BSHTMLLoader | ✅ | ❌ |
Setup
To access the BSHTMLLoader document loader you’ll need to install the langchain-community integration package and the bs4 Python package.
Credentials
No credentials are needed to use the BSHTMLLoader class.
To enable automated tracing of your model calls, set your LangSmith API key:
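As a minimal sketch, tracing can be switched on by setting two environment variables before any LangChain code runs. The helper name enable_langsmith_tracing and the placeholder key are hypothetical and used only for illustration. LANGSMITH_API_KEY and LANGSMITH_TRACING are the variable names LangSmith reads.

```python
import os


def enable_langsmith_tracing(api_key: str) -> None:
    """Set the environment variables that LangSmith reads to enable tracing."""
    os.environ["LANGSMITH_API_KEY"] = api_key
    os.environ["LANGSMITH_TRACING"] = "true"


# Placeholder value (an assumption for this example). In practice, read the key
# from a secret store or prompt for it with getpass.getpass() instead of
# hard-coding it.
enable_langsmith_tracing("your-langsmith-api-key")
```

Tracing is optional. The loader itself works without any of these variables set.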
Installation
Install langchain-community and bs4 (for example, with pip install -U langchain-community bs4).
Initialization
Now we can instantiate our loader object and load documents.
Load
Lazy Load
Adding separator to BS4
We can also pass a separator to use when calling get_text on the soup.
API reference
For detailed documentation of all BSHTMLLoader features and configurations, head to the API reference: python.langchain.com/api_reference/community/document_loaders/langchain_community.document_loaders.html_bs.BSHTMLLoader.html