autoextract_poet.pages.AutoExtractItemWebPage

class AutoExtractItemWebPage(response: autoextract_poet.page_inputs.AutoExtractHtml)[source]

Bases: autoextract_poet.pages.AutoExtractWebPage, web_poet.pages.ItemPage

AutoExtractWebPage that requires the to_item() method to be implemented.

__init__(response: autoextract_poet.page_inputs.AutoExtractHtml) None

Method generated by attrs for class AutoExtractItemWebPage.

Methods

__init__(response)

Method generated by attrs for class AutoExtractItemWebPage.

css(query)

Run a CSS query on a response, using parsel.Selector.

to_item()

Extract an item from a web page

urljoin(url)

Convert url to absolute, taking in account url and baseurl of the response

xpath(query, **kwargs)

Run an XPath query on a response, using parsel.Selector.

Attributes

base_url

Return the base url of the given response

html

Shortcut to HTML Response's content.

selector

parsel.Selector instance for the HTML Response.

url

Shortcut to HTML Response's URL.