Configuration
Index
Methods
- get_global_configuration
Properties
- available_memory_ratio
- chrome_executable_path
- default_browser_path
- default_dataset_id
- default_key_value_store_id
- default_request_queue_id
- disable_browser_sandbox
- headless
- internal_timeout
- log_level
- max_used_cpu_ratio
- memory_mbytes
- model_config
- persist_state_interval
- persist_storage
- purge_on_start
- storage_dir
- system_info_interval
- verbose_log
- write_metadata
- xvfb
Methods
get_global_configuration
Retrieve the global instance of the configuration.
Returns Self
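As an illustration of what "global instance" means here, the sketch below uses a hypothetical stand-in class, `SketchConfiguration`, whose class method always returns the same shared object. This is not Crawlee's implementation: the lazy creation on first call and the private `_global_instance` attribute are assumptions made only for the example.

```python
from typing import Optional


class SketchConfiguration:
    """Hypothetical stand-in used only to illustrate a shared global instance."""

    # Assumption: the shared instance is created lazily and cached on the class.
    _global_instance: Optional["SketchConfiguration"] = None

    def __init__(self, storage_dir: str = "./storage") -> None:
        # "./storage" is a placeholder default, not a documented value.
        self.storage_dir = storage_dir

    @classmethod
    def get_global_configuration(cls) -> "SketchConfiguration":
        """Return the shared instance, creating it on the first call."""
        if cls._global_instance is None:
            cls._global_instance = cls()
        return cls._global_instance


first = SketchConfiguration.get_global_configuration()
second = SketchConfiguration.get_global_configuration()
same_object = first is second  # both names refer to one shared object
```

Because every caller receives the same object in this sketch, a change made through one reference is visible through all the others.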
Properties
available_memory_ratio
The ratio of system memory to use when memory_mbytes is not specified. The Snapshotter.available_memory_ratio is set to this value.
chrome_executable_path
This setting is currently unused. For more details, see https://github.com/apify/crawlee-python/issues/670.
default_browser_path
This setting is currently unused. For more details, see https://github.com/apify/crawlee-python/issues/670.
default_dataset_id
The default dataset ID.
default_key_value_store_id
The default key-value store ID.
default_request_queue_id
The default request queue ID.
disable_browser_sandbox
This setting is currently unused. For more details, see https://github.com/apify/crawlee-python/issues/670.
headless
This setting is currently unused. For more details, see https://github.com/apify/crawlee-python/issues/670.
internal_timeout
Timeout for the internal asynchronous operations.
log_level
The logging level.
max_used_cpu_ratio
This setting is currently unused. For more details, see https://github.com/apify/crawlee-python/issues/670.
memory_mbytes
The maximum memory in megabytes. The Snapshotter.max_memory_size is set to this value.
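The relationship between memory_mbytes and available_memory_ratio can be sketched as a simple fallback: an explicit megabyte limit wins, and otherwise the ratio is applied to the total system memory. The helper below is hypothetical and is not taken from the Snapshotter. Its name, its parameters, and the use of 1024 * 1024 bytes per megabyte are assumptions for illustration.

```python
from typing import Optional

BYTES_PER_MBYTE = 1024 * 1024  # assumption: binary megabytes


def effective_max_memory_bytes(
    total_system_bytes: int,
    memory_mbytes: Optional[int],
    available_memory_ratio: float,
) -> int:
    """Hypothetical helper: choose a memory limit in bytes."""
    if memory_mbytes is not None:
        # An explicit limit takes precedence over the ratio.
        return memory_mbytes * BYTES_PER_MBYTE
    # No explicit limit: use a fraction of the total system memory.
    return int(total_system_bytes * available_memory_ratio)


eight_gib = 8 * 1024**3
from_ratio = effective_max_memory_bytes(eight_gib, None, 0.25)
from_explicit = effective_max_memory_bytes(eight_gib, 512, 0.25)
```

With 8 GiB of system memory and a ratio of 0.25, the fallback yields 2 GiB, while an explicit value of 512 yields 512 MiB regardless of the ratio.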
model_config
persist_state_interval
This setting is currently unused. For more details, see https://github.com/apify/crawlee-python/issues/670.
persist_storage
Whether to persist the storage.
purge_on_start
Whether to purge the storage on start.
storage_dir
The path to the storage directory.
system_info_interval
This setting is currently unused. For more details, see https://github.com/apify/crawlee-python/issues/670.
verbose_log
Whether to enable verbose logging.
write_metadata
Whether to write the storage metadata.
xvfb
This setting is currently unused. For more details, see https://github.com/apify/crawlee-python/issues/670.
Configuration settings for the Crawlee project.
This class stores common configurable parameters for Crawlee. Default values are provided for all settings, so typically, no adjustments are necessary. However, you may modify settings for specific use cases, such as changing the default storage directory, the default storage IDs, the timeout for internal operations, and more.
Settings can also be configured via environment variables, prefixed with CRAWLEE_.
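To make the environment-variable mechanism concrete, here is a minimal standard-library sketch of how prefixed variables could override defaults. It does not use Crawlee itself. The `SketchSettings` class, its default values, the boolean parsing rules, and the naming rule (prefix plus the upper-cased property name, for example CRAWLEE_STORAGE_DIR) are all assumptions for illustration.

```python
import os
from dataclasses import dataclass, fields
from typing import Mapping

ENV_PREFIX = "CRAWLEE_"  # the prefix named in the text above


@dataclass
class SketchSettings:
    """Hypothetical stand-in for a few Configuration properties."""

    storage_dir: str = "./storage"  # placeholder default
    purge_on_start: bool = True     # placeholder default
    persist_storage: bool = True    # placeholder default


def _parse(raw: str, target_type: type):
    """Convert a raw environment string to the type of the field's default."""
    if target_type is bool:
        return raw.strip().lower() in {"1", "true", "yes", "on"}
    return target_type(raw)


def load_settings(environ: Mapping[str, str] = os.environ) -> SketchSettings:
    """Build settings, letting prefixed environment variables override defaults."""
    overrides = {}
    for field in fields(SketchSettings):
        key = ENV_PREFIX + field.name.upper()  # assumed naming rule
        if key in environ:
            overrides[field.name] = _parse(environ[key], type(field.default))
    return SketchSettings(**overrides)


# Usage: pass a plain dict instead of os.environ to keep the example isolated.
settings = load_settings(
    {"CRAWLEE_STORAGE_DIR": "/tmp/crawl-data", "CRAWLEE_PURGE_ON_START": "false"}
)
```

In this sketch, any property without a matching variable keeps its default, which mirrors the statement above that defaults exist for all settings.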