Skip to main content

NoParser

A no-op parser that returns raw response content without any processing.

This is useful when you only need the raw response data and don't require HTML parsing, link extraction, or content selection functionality.

Hierarchy

AbstractHttpParser
- NoParser

Index

Methods

Methods

find_links

find_links(parsed_content, selector): Iterable[str]

Overrides AbstractHttpParser.find_links
Find all links in result using selector.
Parameters
- parsed_content: TParseResult
  Parsed HTTP response. Result of parse method.
- selector: str
  String used to define matching pattern for finding links.
Returns Iterable[str]

is_blocked

is_blocked(parsed_content): BlockedInfo

Inherited from AbstractHttpParser.is_blocked
Detect if blocked and return BlockedInfo with additional information.

Default implementation that expects is_matching_selector abstract method to be implemented. Override this method if your parser has different way of blockage detection.
Parameters
- parsed_content: TParseResult
  Parsed HTTP response. Result of parse method.
Returns BlockedInfo

is_matching_selector

is_matching_selector(parsed_content, selector): bool

Overrides AbstractHttpParser.is_matching_selector
Find if selector has match in parsed content.
Parameters
- parsed_content: TParseResult
  Parsed HTTP response. Result of parse method.
- selector: str
  String used to define matching pattern.
Returns bool

parse

async parse(response): TParseResult

Overrides AbstractHttpParser.parse
Parse HTTP response.
Parameters
- response: HttpResponse
  HTTP response to be parsed.
Returns TParseResult

parse_text

async parse_text(text): TParseResult

Overrides AbstractHttpParser.parse_text
Parse text containing html.
Parameters
- text: str
  String containing html.
Returns TParseResult

select

async select(parsed_content, selector): Sequence[TSelectResult]

Overrides AbstractHttpParser.select
Use css selector to select page element and return it.
Parameters
- parsed_content: TParseResult
  Content where the page element will be located.
- selector: str
  Css selector used to locate desired html element.
Returns Sequence[TSelectResult]

Page Options

Hide Inherited

find_links
is_blocked
is_matching_selector
parse
parse_text
select