BeautifulSoupParser
Hierarchy
- AbstractHttpParser
- BeautifulSoupParser
Index
Methods
__init__
Parameters
parser: BeautifulSoupParserType = 'lxml' (optional, keyword-only)
Returns None
find_links
Find all links in the parsed result using the given selector.
Parameters
parsed_content: TParseResult (optional, keyword-only)
Parsed HTTP response. Result of the parse method.
selector: str (optional, keyword-only)
String used to define the matching pattern for finding links.
Returns Iterable[str]
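As a rough illustration of selector-based link finding, the sketch below uses BeautifulSoup directly rather than this class. The helper name find_links_sketch and the sample HTML are hypothetical, and the library's actual implementation may differ.

```python
from collections.abc import Iterable

from bs4 import BeautifulSoup


def find_links_sketch(parsed_content: BeautifulSoup, selector: str) -> Iterable[str]:
    """Hypothetical helper: collect href values of elements matching a CSS selector."""
    urls: list[str] = []
    for element in parsed_content.select(selector):
        href = element.get('href')
        # Skip matches without a usable href, e.g. <a name="top">.
        if isinstance(href, str) and href.strip():
            urls.append(href.strip())
    return urls


html = '<a href="/docs">Docs</a><a name="top">Top</a><a href=" /blog ">Blog</a>'
soup = BeautifulSoup(html, 'html.parser')
links = list(find_links_sketch(soup, 'a'))
```

The selector is an ordinary CSS selector here, so a narrower pattern such as 'nav a' would restrict the result to links inside a nav element.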
is_blocked
Detect whether the response indicates blocking and return a BlockedInfo with additional information.
The default implementation expects the abstract is_matching_selector method to be implemented. Override this method if your parser detects blocking in a different way.
Parameters
parsed_content: TParseResult (optional, keyword-only)
Parsed HTTP response. Result of the parse method.
Returns BlockedInfo
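One way a default blocking check could be built on top of is_matching_selector is sketched below. This is an assumption-laden illustration, not the library's code: BlockedInfoSketch is a hypothetical stand-in for BlockedInfo, and the BLOCK_SELECTORS list is invented for the example.

```python
from dataclasses import dataclass

from bs4 import BeautifulSoup


@dataclass
class BlockedInfoSketch:
    """Hypothetical stand-in for BlockedInfo; the real class may differ."""

    reason: str


# Assumed example selectors; not taken from the library.
BLOCK_SELECTORS = ['iframe[src*="captcha"]', 'div#access-denied']


def is_matching_selector(parsed_content: BeautifulSoup, selector: str) -> bool:
    # True when at least one element matches the CSS selector.
    return parsed_content.select_one(selector) is not None


def is_blocked_sketch(parsed_content: BeautifulSoup) -> BlockedInfoSketch:
    # Collect every known "blocked" selector that matches the page.
    matched = [s for s in BLOCK_SELECTORS if is_matching_selector(parsed_content, s)]
    reason = f"matched selectors: {'; '.join(matched)}" if matched else ''
    return BlockedInfoSketch(reason=reason)


blocked_page = BeautifulSoup('<div id="access-denied">Denied</div>', 'html.parser')
normal_page = BeautifulSoup('<p>Hello</p>', 'html.parser')
blocked = is_blocked_sketch(blocked_page)
not_blocked = is_blocked_sketch(normal_page)
```

In this sketch an empty reason means no blocking was detected. A parser with its own detection logic would replace is_blocked_sketch rather than the selector check.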
is_matching_selector
Check whether the selector matches anything in the parsed content.
Parameters
parsed_content: TParseResult (optional, keyword-only)
Parsed HTTP response. Result of the parse method.
selector: str (optional, keyword-only)
String used to define the matching pattern.
Returns bool
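For intuition, a selector match check on a BeautifulSoup document can be expressed with select_one, as in the minimal sketch below. It calls BeautifulSoup directly with made-up HTML and is not necessarily how this method is implemented.

```python
from bs4 import BeautifulSoup

# Hypothetical sample document.
soup = BeautifulSoup(
    '<div class="product"><span class="price">10</span></div>',
    'html.parser',
)

# select_one returns the first matching element, or None when nothing matches.
has_price = soup.select_one('div.product > span.price') is not None
has_title = soup.select_one('h1.title') is not None
```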
parse
Parse the HTTP response.
Parameters
response: HttpResponse (optional, keyword-only)
HTTP response to be parsed.
Returns TParseResult
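The sketch below shows, under stated assumptions, what parsing a response body with BeautifulSoup amounts to. The body bytes are a hypothetical stand-in for the content of an HttpResponse, and the example uses 'html.parser' instead of the documented default 'lxml' so that it runs without the lxml package.

```python
from bs4 import BeautifulSoup

# Hypothetical response body; in practice this would come from the HTTP response.
body = b'<html><head><title>Example</title></head><body><p>Hi</p></body></html>'

# The parser name selects the underlying parsing backend.
soup = BeautifulSoup(body, features='html.parser')
title = soup.title.string
paragraphs = [p.get_text() for p in soup.find_all('p')]
```

The resulting object is what the other methods on this page would receive as parsed_content.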
Parser for parsing HTTP responses using BeautifulSoup.