Version: Next

PlaywrightPreNavCrawlingContext

The pre navigation crawling context used by the PlaywrightCrawler.

It provides access to the Page object, before the navigation to the URL is performed.
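As a rough illustration of what a hook receiving this context might do, the sketch below sets an extra HTTP header on the page before navigation starts. The function name is hypothetical, and the `crawlee.crawlers` import path is assumed from recent Crawlee for Python releases; `Page.set_extra_http_headers` is a standard Playwright method.

```python
from __future__ import annotations

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Import path assumed from recent Crawlee for Python releases.
    from crawlee.crawlers import PlaywrightPreNavCrawlingContext


async def set_language_header(context: PlaywrightPreNavCrawlingContext) -> None:
    """Hypothetical hook: runs before the page navigates to the request URL."""
    # The page exists but has not loaded the target URL yet,
    # so headers set here apply to the upcoming navigation.
    await context.page.set_extra_http_headers({'Accept-Language': 'en-US'})
    context.log.info(f'About to navigate to {context.request.url}')
```

In recent releases a function like this is typically registered on a `PlaywrightCrawler` instance with the `pre_navigation_hook` decorator; check the crawler's own reference page for the exact registration API.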

Methods

__hash__

  • __hash__(): int

create_modified_copy

  • create_modified_copy(push_data, add_requests, get_key_value_store): Self
  • Create a modified copy of the crawling context with specified changes.


    Parameters

    • optional push_data: PushDataFunction | None = None
    • optional add_requests: AddRequestsFunction | None = None
    • optional get_key_value_store: GetKeyValueStoreFromRequestHandlerFunction | None = None

    Returns Self
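The copy semantics can be sketched with a small stand-in class. `StandInContext` below is hypothetical and is not the library's implementation; it assumes that parameters left as `None` keep the original helper, which the `None` defaults suggest. Only `push_data` is modeled to keep the sketch short.

```python
from __future__ import annotations

import asyncio
from collections.abc import Awaitable, Callable
from dataclasses import dataclass, replace
from typing import Any

PushData = Callable[[dict[str, Any]], Awaitable[None]]


@dataclass(frozen=True)
class StandInContext:
    """Hypothetical stand-in with a single helper; not the library's class."""

    push_data: PushData

    def create_modified_copy(self, push_data: PushData | None = None) -> StandInContext:
        # Assumption: only helpers that were explicitly passed are replaced.
        changes = {} if push_data is None else {'push_data': push_data}
        return replace(self, **changes)


async def demo() -> list[dict[str, Any]]:
    stored: list[dict[str, Any]] = []

    async def original_push(data: dict[str, Any]) -> None:
        stored.append(data)

    async def tagged_push(data: dict[str, Any]) -> None:
        # Wrap the original helper and add a field to every item.
        await original_push({**data, 'source': 'pre-nav'})

    context = StandInContext(push_data=original_push)
    modified = context.create_modified_copy(push_data=tagged_push)
    await modified.push_data({'url': 'https://example.com'})
    return stored


if __name__ == '__main__':
    print(asyncio.run(demo()))
```

The original context is left untouched, so code holding a reference to it keeps the unwrapped helper.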

get_snapshot

Properties

add_requests

add_requests: AddRequestsFunction

Helper function on the crawling context for adding requests.
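A minimal sketch of calling this helper from a hook, assuming it is awaitable and accepts a list of URL strings. The function name and the idea of enqueuing the site's home page are illustrative only.

```python
from __future__ import annotations

from typing import TYPE_CHECKING
from urllib.parse import urlsplit

if TYPE_CHECKING:
    # Import path assumed from recent Crawlee for Python releases.
    from crawlee.crawlers import PlaywrightPreNavCrawlingContext


async def enqueue_home_page(context: PlaywrightPreNavCrawlingContext) -> None:
    """Hypothetical hook: also enqueue the home page of the current site."""
    parts = urlsplit(context.request.url)
    # Assumption: add_requests accepts plain URL strings.
    await context.add_requests([f'{parts.scheme}://{parts.netloc}/'])
```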

block_requests

block_requests: BlockRequestsFunction

Blocks network requests matching specified URL patterns.
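Blocking is most useful before navigation, since the patterns then apply to the page load itself. The sketch below assumes the helper is awaitable and takes the pattern list through a `url_patterns` keyword; the function name and the chosen patterns are illustrative.

```python
from __future__ import annotations

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Import path assumed from recent Crawlee for Python releases.
    from crawlee.crawlers import PlaywrightPreNavCrawlingContext


async def block_heavy_assets(context: PlaywrightPreNavCrawlingContext) -> None:
    """Hypothetical hook: skip images and fonts to speed up page loads."""
    # Assumption: `url_patterns` is the keyword for the list of URL patterns.
    await context.block_requests(url_patterns=['.png', '.jpg', '.woff2'])
```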

get_key_value_store

get_key_value_store: GetKeyValueStoreFromRequestHandlerFunction

Helper function on the crawling context for getting a key-value store.
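A minimal sketch of using this helper, assuming that calling it without arguments returns the default store and that the store exposes an awaitable `set_value(key, value)` method. The function name and the key are hypothetical.

```python
from __future__ import annotations

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Import path assumed from recent Crawlee for Python releases.
    from crawlee.crawlers import PlaywrightPreNavCrawlingContext


async def remember_last_url(context: PlaywrightPreNavCrawlingContext) -> None:
    """Hypothetical hook: record the URL that is about to be visited."""
    # Assumption: no arguments selects the default key-value store.
    store = await context.get_key_value_store()
    await store.set_value('last-url', context.request.url)
```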

goto_options

goto_options: GotoOptions

Additional options to pass to Playwright's Page.goto() method. The timeout option is not supported.
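Because the hook runs before navigation, it is a natural place to adjust these options per request. The sketch assumes `GotoOptions` behaves like a dictionary whose keys mirror the keyword arguments of `Page.goto()`, such as `wait_until` and `referer`; the function name is hypothetical.

```python
from __future__ import annotations

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Import path assumed from recent Crawlee for Python releases.
    from crawlee.crawlers import PlaywrightPreNavCrawlingContext


async def relax_wait_condition(context: PlaywrightPreNavCrawlingContext) -> None:
    """Hypothetical hook: stop waiting once the DOM content has loaded."""
    # Assumption: goto_options is dict-like. `timeout` is deliberately
    # not set here because it is documented as unsupported.
    context.goto_options['wait_until'] = 'domcontentloaded'
    context.goto_options['referer'] = 'https://example.com/'
```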

log

log: logging.Logger

Logger instance.

page

page: Page

The Playwright Page object for the current page.

proxy_info

proxy_info: ProxyInfo | None

Proxy information for the current page being processed.
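Since the value may be `None`, code that reads it needs a guard. The sketch below assumes `ProxyInfo` exposes `hostname` and `port` attributes, and it logs only those so that any credentials in the proxy URL stay out of the logs. Both function names are hypothetical.

```python
from __future__ import annotations

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Import path assumed from recent Crawlee for Python releases.
    from crawlee.crawlers import PlaywrightPreNavCrawlingContext


def describe_proxy(context: PlaywrightPreNavCrawlingContext) -> str:
    """Return a short, credential-free description of the proxy in use."""
    if context.proxy_info is None:
        return 'no proxy'
    # Assumption: ProxyInfo has `hostname` and `port` attributes.
    return f'proxy {context.proxy_info.hostname}:{context.proxy_info.port}'


async def log_proxy(context: PlaywrightPreNavCrawlingContext) -> None:
    """Hypothetical hook: log which proxy the upcoming navigation will use."""
    context.log.info(f'{context.request.url}: {describe_proxy(context)}')
```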

push_data

push_data: PushDataFunction

Helper function on the crawling context for pushing data.
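A minimal sketch, assuming the helper is awaitable and accepts a dictionary. The function name and the fields of the pushed item are illustrative.

```python
from __future__ import annotations

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Import path assumed from recent Crawlee for Python releases.
    from crawlee.crawlers import PlaywrightPreNavCrawlingContext


async def record_attempt(context: PlaywrightPreNavCrawlingContext) -> None:
    """Hypothetical hook: store one item per navigation attempt."""
    # Assumption: push_data accepts a single dict item.
    await context.push_data({'url': context.request.url, 'stage': 'pre-navigation'})
```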

register_deferred_cleanup

register_deferred_cleanup: Callable[[DeferredCleanupCallback], None]

Register an async callback to be called after request processing completes (including error handlers).
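One possible use is timing a request from the pre-navigation stage to the end of processing. The sketch assumes the callback is an async function that takes no arguments; the function names are hypothetical.

```python
from __future__ import annotations

import time
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Import path assumed from recent Crawlee for Python releases.
    from crawlee.crawlers import PlaywrightPreNavCrawlingContext


async def time_request(context: PlaywrightPreNavCrawlingContext) -> None:
    """Hypothetical hook: log how long the whole request took."""
    started = time.monotonic()

    async def report_duration() -> None:
        # Assumption: the cleanup callback is called without arguments.
        elapsed = time.monotonic() - started
        context.log.info(f'{context.request.url} took {elapsed:.2f}s')

    # The callback is only registered here; it runs after request
    # processing completes, including error handlers.
    context.register_deferred_cleanup(report_duration)
```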

request

request: Request

Request object for the current page being processed.

send_request

send_request: SendRequestFunction

Helper function on the crawling context for sending requests.

session

session: Session | None

Session object for the current page being processed.

use_state

use_state: UseStateFunction

Helper function on the crawling context for working with state.
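A minimal sketch of a counter kept in this state, assuming the helper is awaitable, takes the initial value through a `default_value` keyword, and returns a mutable dictionary whose changes are kept between calls. The function name and the `navigations` key are hypothetical.

```python
from __future__ import annotations

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Import path assumed from recent Crawlee for Python releases.
    from crawlee.crawlers import PlaywrightPreNavCrawlingContext


async def count_navigations(context: PlaywrightPreNavCrawlingContext) -> None:
    """Hypothetical hook: count how many navigations have been started."""
    # Assumption: `default_value` is used only when no state exists yet.
    state = await context.use_state(default_value={'navigations': 0})
    state['navigations'] += 1
```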