NoParser
Hierarchy
- AbstractHttpParser
- NoParser
Index
Methods
find_links
Find all links in result using selector.
Parameters
parsed_content: TParseResult
Parsed HTTP response. Result of
parse
method.selector: str
String used to define matching pattern for finding links.
Returns Iterable[str]
is_blocked
Detect if blocked and return BlockedInfo with additional information.
Default implementation that expects
is_matching_selector
abstract method to be implemented. Override this method if your parser has different way of blockage detection.Parameters
parsed_content: TParseResult
Parsed HTTP response. Result of
parse
method.
Returns BlockedInfo
is_matching_selector
Find if selector has match in parsed content.
Parameters
parsed_content: TParseResult
Parsed HTTP response. Result of
parse
method.selector: str
String used to define matching pattern.
Returns bool
parse
Parse HTTP response.
Parameters
response: HttpResponse
HTTP response to be parsed.
Returns TParseResult
parse_text
Parse text containing html.
Parameters
text: str
String containing html.
Returns TParseResult
select
Use css selector to select page element and return it.
Parameters
parsed_content: TParseResult
Content where the page element will be located.
selector: str
Css selector used to locate desired html element.
Returns Sequence[TSelectResult]
A no-op parser that returns raw response content without any processing.
This is useful when you only need the raw response data and don't require HTML parsing, link extraction, or content selection functionality.