RequestManager

Base class that extends RequestLoader with the capability to enqueue new requests and reclaim failed ones.

Hierarchy

RequestLoader
- RequestManager
  - RequestQueue
  - RequestManagerTandem

Methods

add_request

async add_request(request, *, forefront): ProcessedRequest

Overrides RequestManager.add_request
Add a single request to the manager and store it in the underlying resource client.
Parameters
- request: str | Request
  The request object (or its string representation) to be added to the manager.
- forefront: bool = False (optional, keyword-only)
  Determines whether the request should be added to the beginning (if True) or the end (if False) of the manager.
Returns ProcessedRequest
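
The effect of forefront can be illustrated with a minimal sketch. InMemoryManager below is a hypothetical stand-in written for this example, not the library class: it keeps plain URL strings in a deque, returns nothing instead of a ProcessedRequest, and only mimics the documented ordering behaviour.

```python
from __future__ import annotations

import asyncio
from collections import deque


class InMemoryManager:
    """Hypothetical stand-in (not the library class); mimics only the `forefront` flag."""

    def __init__(self) -> None:
        self._pending: deque[str] = deque()

    async def add_request(self, request: str, *, forefront: bool = False) -> None:
        # forefront=True places the request at the beginning, otherwise at the end.
        if forefront:
            self._pending.appendleft(request)
        else:
            self._pending.append(request)

    def pending(self) -> list[str]:
        return list(self._pending)


async def demo_add_request() -> list[str]:
    manager = InMemoryManager()
    await manager.add_request("https://example.com/a")
    await manager.add_request("https://example.com/b")
    await manager.add_request("https://example.com/urgent", forefront=True)
    return manager.pending()


order = asyncio.run(demo_add_request())
print(order)
```

In this sketch the request added with forefront=True is the first one in the pending list, ahead of the two added earlier.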

add_requests

async add_requests(requests, *, forefront, batch_size, wait_time_between_batches, wait_for_all_requests_to_be_added, wait_for_all_requests_to_be_added_timeout): None

Overrides RequestManager.add_requests
Add requests to the manager in batches.
Parameters
- requests: Sequence[str | Request]
  Requests to enqueue.
- forefront: bool = False (optional, keyword-only)
  If True, add requests to the beginning of the queue.
- batch_size: int = 1000 (optional, keyword-only)
  The number of requests to add in one batch.
- wait_time_between_batches: timedelta = timedelta(seconds=1) (optional, keyword-only)
  Time to wait between adding batches.
- wait_for_all_requests_to_be_added: bool = False (optional, keyword-only)
  If True, wait for all requests to be added before returning.
- wait_for_all_requests_to_be_added_timeout: timedelta | None = None (optional, keyword-only)
  Timeout for waiting for all requests to be added.
Returns None
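
How batch_size and wait_time_between_batches might interact can be sketched as follows. The helper add_in_batches is hypothetical and is not part of the library; it assumes batches are contiguous slices of the input and that the wait happens only between batches. The actual implementation may differ.

```python
from __future__ import annotations

import asyncio
from collections.abc import Sequence
from datetime import timedelta


async def add_in_batches(
    requests: Sequence[str],
    *,
    batch_size: int = 1000,
    wait_time_between_batches: timedelta = timedelta(seconds=1),
) -> list[list[str]]:
    """Hypothetical helper: returns the batches it would have submitted."""
    sent: list[list[str]] = []
    for start in range(0, len(requests), batch_size):
        batch = list(requests[start:start + batch_size])
        sent.append(batch)  # a real manager would store the batch here
        # Pause only if another batch follows (assumption of this sketch).
        if start + batch_size < len(requests):
            await asyncio.sleep(wait_time_between_batches.total_seconds())
    return sent


urls = [f"https://example.com/page/{i}" for i in range(5)]
batches = asyncio.run(
    add_in_batches(
        urls,
        batch_size=2,
        wait_time_between_batches=timedelta(milliseconds=10),
    )
)
print([len(batch) for batch in batches])
```

With five requests and batch_size=2, the sketch produces two full batches and one final batch holding the remaining request.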

drop

async drop(): None

Overrides Storage.drop
Remove persistent state either from the Apify Cloud storage or from the local database.
Returns None

fetch_next_request

async fetch_next_request(): Request | None

Overrides RequestManager.fetch_next_request
Return the next request to be processed, or None if there are no more pending requests.

The method should return None if and only if is_finished would return True. In other cases, the method should wait until a request appears.
Returns Request | None

get_handled_count

async get_handled_count(): int

Overrides RequestManager.get_handled_count
Get the number of requests in the loader that have been handled.
Returns int

get_total_count

async get_total_count(): int

Overrides RequestManager.get_total_count
Get an offline approximation of the total number of requests in the loader (i.e. pending + handled).
Returns int

is_empty

async is_empty(): bool

Overrides RequestManager.is_empty
Return True if there are no more requests in the loader (there might still be unfinished requests).
Returns bool

is_finished

async is_finished(): bool

Overrides RequestManager.is_finished
Return True if all requests have been handled.
Returns bool

mark_request_as_handled

async mark_request_as_handled(request): ProcessedRequest | None

Overrides RequestManager.mark_request_as_handled
Mark a request as handled after a successful processing (or after giving up retrying).
Parameters
- request: Request
Returns ProcessedRequest | None
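
The difference between is_empty and is_finished is easiest to see in a processing loop. The class below is a hypothetical single-consumer stand-in, not the library class: it uses plain strings as requests, returns nothing from mark_request_as_handled, and returns None from fetch_next_request as soon as nothing is pending instead of waiting for new requests.

```python
from __future__ import annotations

import asyncio
from collections import deque


class InMemoryManager:
    """Hypothetical single-consumer stand-in; it never waits for new requests."""

    def __init__(self, urls: list[str]) -> None:
        self._pending: deque[str] = deque(urls)
        self._in_progress: set[str] = set()
        self._handled = 0

    async def fetch_next_request(self) -> str | None:
        if not self._pending:
            return None
        request = self._pending.popleft()
        self._in_progress.add(request)
        return request

    async def mark_request_as_handled(self, request: str) -> None:
        self._in_progress.discard(request)
        self._handled += 1

    async def is_empty(self) -> bool:
        # Nothing left to fetch; some requests may still be in progress.
        return not self._pending

    async def is_finished(self) -> bool:
        # Nothing left to fetch and nothing in progress.
        return not self._pending and not self._in_progress

    async def get_handled_count(self) -> int:
        return self._handled


async def demo_loop() -> tuple[list[tuple[str, bool, bool]], int, bool]:
    manager = InMemoryManager(["a", "b"])
    states = []
    while (request := await manager.fetch_next_request()) is not None:
        # Record (request, is_empty, is_finished) while the request is in progress.
        states.append((request, await manager.is_empty(), await manager.is_finished()))
        await manager.mark_request_as_handled(request)
    return states, await manager.get_handled_count(), await manager.is_finished()


states, handled, finished = asyncio.run(demo_loop())
print(states, handled, finished)
```

While the last request is still being processed, the sketch reports is_empty as True but is_finished as False; is_finished becomes True only after that request is marked as handled.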

reclaim_request

async reclaim_request(request, *, forefront): ProcessedRequest | None

Overrides RequestManager.reclaim_request
Reclaim a failed request back to the source so that it can be returned for processing again later.

It is possible to modify the request data by supplying an updated request as a parameter.
Parameters
- request: Request
- forefront: bool = False (optional, keyword-only)
Returns ProcessedRequest | None
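
A retry flow built on reclaim_request might look like the sketch below. InMemoryManager is again a hypothetical stand-in, not the library class: requests are plain strings, nothing is returned in place of ProcessedRequest, and the failure of the first attempt on "flaky" is simulated.

```python
from __future__ import annotations

import asyncio
from collections import deque


class InMemoryManager:
    """Hypothetical stand-in (not the library class) for a reclaim/retry flow."""

    def __init__(self, urls: list[str]) -> None:
        self._pending: deque[str] = deque(urls)
        self.handled: list[str] = []

    async def fetch_next_request(self) -> str | None:
        return self._pending.popleft() if self._pending else None

    async def reclaim_request(self, request: str, *, forefront: bool = False) -> None:
        # Return the failed request to the pending collection for a later retry.
        if forefront:
            self._pending.appendleft(request)
        else:
            self._pending.append(request)

    async def mark_request_as_handled(self, request: str) -> None:
        self.handled.append(request)


async def demo_reclaim() -> tuple[list[str], list[str]]:
    manager = InMemoryManager(["flaky", "stable"])
    attempts: dict[str, int] = {}
    fetch_order: list[str] = []
    while (request := await manager.fetch_next_request()) is not None:
        attempts[request] = attempts.get(request, 0) + 1
        fetch_order.append(request)
        if request == "flaky" and attempts[request] == 1:
            # Simulated failure on the first attempt: reclaim instead of marking handled.
            await manager.reclaim_request(request)
            continue
        await manager.mark_request_as_handled(request)
    return fetch_order, manager.handled


fetch_order, handled_order = asyncio.run(demo_reclaim())
print(fetch_order, handled_order)
```

Because the reclaim uses the default forefront=False, the failed request goes to the end and is retried after "stable"; passing forefront=True would retry it immediately.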

to_tandem

async to_tandem(request_manager): RequestManagerTandem

Inherited from RequestLoader.to_tandem
Combine the loader with a request manager to support adding and reclaiming requests.
Parameters
- request_manager: RequestManager | None = None (optional)
  Request manager to combine the loader with. If None is given, the default request queue is used.
Returns RequestManagerTandem
