CSS selectors for elements that should trigger a retry, as the crawler is likely getting blocked.
Content of proxy errors that should trigger a retry, as the proxy is likely getting blocked / is malfunctioning.
Default regular expression to match URLs in a string that may be plain text, JSON, CSV or other. It supports common URL characters and does not support URLs containing commas or spaces. The URLs also may contain Unicode letters (not symbols).
Regular expression that, in addition to the default regular expression
URL_NO_COMMAS_REGEX, supports matching commas in URL path and query.
Note, however, that this may prevent parsing URLs from comma delimited lists, or the URLs may become malformed.