
RequestManagerTandem

Implements tandem behaviour for a pair consisting of a RequestLoader and a RequestManager.

In this scenario, the contents of the "loader" are transferred into the "manager", which allows processing requests from both sources and also enqueueing new requests (not possible with a plain RequestLoader).
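For example, a minimal sketch of constructing a tandem (assuming Crawlee for Python's `crawlee.request_loaders` and `crawlee.storages` module layout; the URLs are placeholders):

```python
import asyncio

from crawlee.request_loaders import RequestList, RequestManagerTandem
from crawlee.storages import RequestQueue


async def main() -> None:
    # A static, read-only loader with the initial URLs.
    request_list = RequestList(['https://crawlee.dev', 'https://apify.com'])

    # A full-featured manager backed by the default request queue.
    request_queue = await RequestQueue.open()

    # The tandem drains the loader into the manager, so requests from both
    # sources get processed and new requests can still be enqueued.
    tandem = RequestManagerTandem(
        request_loader=request_list,
        request_manager=request_queue,
    )

    # Enqueueing works, unlike with a plain RequestLoader.
    await tandem.add_request(request='https://example.com')


asyncio.run(main())
```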

Methods

__init__

  • __init__(*, request_loader, request_manager): None

add_request

  • async add_request(*, request, forefront): ProcessedRequest
  • Add a single request to the manager and store it in the underlying resource client.


    Parameters

    • request: str | Request (optional, keyword-only)

      The request object (or its string representation) to be added to the manager.

    • forefront: bool = False (optional, keyword-only)

      Determines whether the request should be added to the beginning (if True) or the end (if False) of the manager.

    Returns ProcessedRequest
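A usage sketch (assumes `tandem` is a `RequestManagerTandem` instance as constructed above):

```python
# forefront=True places the request at the head of the manager,
# so it is fetched before requests added to the end.
processed = await tandem.add_request(
    request='https://example.com/priority',
    forefront=True,
)
# ProcessedRequest reports, among other things, whether the request
# was already present in the manager (assumed attribute name).
print(processed.was_already_present)
```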

add_requests_batched

  • async add_requests_batched(*, requests, batch_size, wait_time_between_batches, wait_for_all_requests_to_be_added, wait_for_all_requests_to_be_added_timeout): None
  • Add requests to the manager in batches.


    Parameters

    • requests: Sequence[str | Request] (optional, keyword-only)

      Requests to enqueue.

    • batch_size: int = 1000 (optional, keyword-only)

      The number of requests to add in one batch.

    • wait_time_between_batches: timedelta = timedelta(seconds=1) (optional, keyword-only)

      Time to wait between adding batches.

    • wait_for_all_requests_to_be_added: bool = False (optional, keyword-only)

      If True, wait for all requests to be added before returning.

    • wait_for_all_requests_to_be_added_timeout: timedelta | None = None (optional, keyword-only)

      Timeout for waiting for all requests to be added.

    Returns None
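A sketch of batched enqueueing using the parameters above (`tandem` as before; the URLs are placeholders):

```python
from datetime import timedelta

urls = [f'https://example.com/page/{i}' for i in range(5000)]

# Add 5000 requests in batches of 1000, pausing between batches, and
# block until everything is added or the timeout elapses.
await tandem.add_requests_batched(
    requests=urls,
    batch_size=1000,
    wait_time_between_batches=timedelta(seconds=1),
    wait_for_all_requests_to_be_added=True,
    wait_for_all_requests_to_be_added_timeout=timedelta(minutes=5),
)
```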

drop

  • async drop(): None
  • Removes persistent state either from the Apify Cloud storage or from the local database.


    Returns None

fetch_next_request

  • async fetch_next_request(): Request | None
  • Returns the next request to be processed, or None if there are no more pending requests.


    Returns Request | None

get_handled_count

  • async get_handled_count(): int
  • Returns the number of handled requests.


    Returns int

get_total_count

  • async get_total_count(): int
  • Returns an offline approximation of the total number of requests in the source (i.e. pending + handled).


    Returns int
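The two counters together give a rough progress indicator, e.g.:

```python
handled = await tandem.get_handled_count()
total = await tandem.get_total_count()  # offline approximation
print(f'Handled {handled} of ~{total} requests.')
```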

is_empty

  • async is_empty(): bool
  • Returns True if there are no more requests in the source (there might still be unfinished requests).


    Returns bool

is_finished

  • async is_finished(): bool
  • Returns True if all requests have been handled.


    Returns bool

mark_request_as_handled

  • async mark_request_as_handled(*, request): ProcessedRequest | None
  • Marks a request as handled after successful processing (or after giving up on retries).


    Parameters

    • request: Request (optional, keyword-only)

    Returns ProcessedRequest | None

reclaim_request

  • async reclaim_request(*, request, forefront): ProcessedRequest | None
  • Reclaims a failed request back to the source, so that it can be returned for processing again later.

    It is possible to modify the request data by supplying an updated request as a parameter.


    Parameters

    • request: Request (optional, keyword-only)
    • forefront: bool = False (optional, keyword-only)

    Returns ProcessedRequest | None
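Together, fetch_next_request, mark_request_as_handled and reclaim_request support a manual consumption loop. A minimal sketch (`process` is a hypothetical user-supplied coroutine; `tandem` as before):

```python
while not await tandem.is_finished():
    request = await tandem.fetch_next_request()
    if request is None:
        # No pending requests right now, but some may still be in flight.
        continue

    try:
        await process(request)  # hypothetical processing coroutine
    except Exception:
        # Return the failed request to the source for a later retry;
        # forefront=True would schedule the retry sooner.
        await tandem.reclaim_request(request=request)
    else:
        await tandem.mark_request_as_handled(request=request)
```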

to_tandem

  • async to_tandem(*, request_manager): RequestManagerTandem
  • Combine the loader with a request manager to support adding and reclaiming requests.


    Parameters

    • request_manager: RequestManager | None = None (optional, keyword-only)

      Request manager to combine the loader with. If None is given, the default request queue is used.

    Returns RequestManagerTandem
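A sketch of this shorthand (assumes a `RequestList` loader; with no argument, the default request queue is used as the manager):

```python
from crawlee.request_loaders import RequestList

request_list = RequestList(['https://crawlee.dev'])

# Equivalent to constructing RequestManagerTandem manually with the
# default request queue as the manager.
tandem = await request_list.to_tandem()
```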