BasicCrawlingContext
Hierarchy
Index
Methods
__hash__
Return hash of the context. Each context is considered unique.
Returns int
create_modified_copy
Create a modified copy of the crawling context with specified changes.
Parameters
optionalpush_data: PushDataFunction | None = None
optionaladd_requests: AddRequestsFunction | None = None
optionalget_key_value_store: GetKeyValueStoreFromRequestHandlerFunction | None = None
Returns Self
get_snapshot
Get snapshot of crawled page.
Returns PageSnapshot
Properties
add_requests
Add requests crawling context helper function.
get_key_value_store
Get key-value store crawling context helper function.
log
Logger instance.
proxy_info
Proxy information for the current page being processed.
push_data
Push data crawling context helper function.
request
Request object for the current page being processed.
send_request
Send request crawling context helper function.
session
Session object for the current page being processed.
use_state
Use state crawling context helper function.
Basic crawling context.
It represents the fundamental crawling context used by the
BasicCrawler. It is extended by more specific crawlers to provide additional functionality.