PlaywrightPreNavCrawlingContext
Hierarchy
- BasicCrawlingContext
- PlaywrightPreNavCrawlingContext
Index
Methods
__hash__
Return hash of the context. Each context is considered unique.
Returns int
create_modified_copy
Create a modified copy of the crawling context with specified changes.
Parameters
optionalpush_data: PushDataFunction | None = None
optionaladd_requests: AddRequestsFunction | None = None
optionalget_key_value_store: GetKeyValueStoreFromRequestHandlerFunction | None = None
Returns Self
get_snapshot
Get snapshot of crawled page.
Returns PageSnapshot
Properties
add_requests
Add requests crawling context helper function.
block_requests
Blocks network requests matching specified URL patterns.
get_key_value_store
Get key-value store crawling context helper function.
goto_options
Additional options to pass to Playwright's Page.goto() method. The timeout option is not supported.
log
Logger instance.
page
The Playwright Page object for the current page.
proxy_info
Proxy information for the current page being processed.
push_data
Push data crawling context helper function.
request
Request object for the current page being processed.
send_request
Send request crawling context helper function.
session
Session object for the current page being processed.
use_state
Use state crawling context helper function.
The pre navigation crawling context used by the
PlaywrightCrawler.It provides access to the
Pageobject, before the navigation to the URL is performed.