BeautifulSoupCrawler
crawlee.beautifulsoup_crawler._beautifulsoup_crawler.BeautifulSoupCrawler
Constructors
__init__
Initialize the BeautifulSoupCrawler.
Parameters
parser: Literal['html.parser', 'lxml', 'xml', 'html5lib'] = 'lxml' (keyword-only)
additional_http_error_status_codes: Iterable[int] = () (keyword-only)
ignore_http_error_status_codes: Iterable[int] = () (keyword-only)
kwargs: Unpack[BasicCrawlerOptions[BeautifulSoupCrawlingContext]]
Returns None
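This page does not say how the two status-code parameters interact. The sketch below shows one plausible reading of their names, written in plain Python with no Crawlee imports. The helper `is_error_status` is hypothetical. The default rule (only 5xx responses count as errors) and the precedence (an "additional" code wins over an "ignore" code) are both assumptions, not documented behavior.

```python
from collections.abc import Iterable


def is_error_status(
    status_code: int,
    additional: Iterable[int] = (),
    ignore: Iterable[int] = (),
) -> bool:
    """Hypothetical helper: decide whether a response status counts as an error.

    Assumptions (not confirmed by this page):
    - by default only 5xx codes are treated as errors;
    - codes in `additional` are always errors;
    - codes in `ignore` are never errors unless also in `additional`.
    """
    if status_code in set(additional):
        return True
    if status_code in set(ignore):
        return False
    return 500 <= status_code <= 599
```

Under these assumptions, passing `additional=[403]` would flag a 403 response as an error, while `ignore=[503]` would let a 503 response through to the request handler.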
A crawler that fetches the request URL using httpx and parses the result with BeautifulSoup.
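A minimal usage sketch may help tie the constructor parameters to a running crawler. The parameter names and the import path come from this page. The handler registration (`crawler.router.default_handler`), the context attributes (`context.soup`, `context.request.url`) and `crawler.run` are recalled from the wider Crawlee API and should be checked against the installed version. The helper `build_crawler_options`, the chosen status codes and the URL `https://example.com` are illustrative assumptions.

```python
import asyncio
from typing import Any


def build_crawler_options() -> dict[str, Any]:
    """Hypothetical helper: collect the keyword-only constructor arguments."""
    return {
        'parser': 'html.parser',  # one of the four literals listed above
        'additional_http_error_status_codes': [403],  # illustrative choice
        'ignore_http_error_status_codes': [404],  # illustrative choice
    }


async def main() -> None:
    # Imported lazily so this module still loads where crawlee is not installed.
    from crawlee.beautifulsoup_crawler import (
        BeautifulSoupCrawler,
        BeautifulSoupCrawlingContext,
    )

    crawler = BeautifulSoupCrawler(**build_crawler_options())

    @crawler.router.default_handler
    async def handle(context: BeautifulSoupCrawlingContext) -> None:
        # Assumption: the parsed document is exposed as `context.soup`.
        title = context.soup.title
        print(context.request.url, title.string if title else None)

    # Illustrative start URL, not taken from this page.
    await crawler.run(['https://example.com'])
```

To run the crawl, call `asyncio.run(main())`. Choosing `'html.parser'` avoids the extra dependency that the default `'lxml'` parser needs, at some cost in parsing speed.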