Extract and download images

Give the template a page URL, and it downloads the images it discovers on that page during a Notte browser session.

Run this template

Clone just this template, configure Notte, and start the run.

Before running

File workflow

  • Browser-context downloads: page.context.request.get() inherits session cookies, proxy, and headers.
  • Structured URL extraction: tries Notte scrape() first, then falls back to deterministic Playwright DOM extraction.
  • Format-aware naming: uses the Content-Type response header to choose file extensions.
  • Organized output: images are saved to ./images/<hostname>/.
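The naming and output-layout steps above can be sketched with the standard library alone. This is a minimal illustration, not the template's actual code: the helper names `extension_for` and `output_path`, the MIME-to-extension table, the `.bin` fallback, and the `image_001` naming pattern are all assumptions, as is the choice to take `<hostname>` from the page URL.

```python
from pathlib import Path
from urllib.parse import urlparse

# Assumed mapping for illustration; extend it for other formats you expect.
EXTENSIONS = {
    "image/jpeg": ".jpg",
    "image/png": ".png",
    "image/gif": ".gif",
    "image/webp": ".webp",
    "image/svg+xml": ".svg",
    "image/avif": ".avif",
}


def extension_for(content_type: str, default: str = ".bin") -> str:
    """Map a Content-Type header value to a file extension."""
    # Drop parameters such as "; charset=utf-8" and normalize case.
    mime = content_type.split(";", 1)[0].strip().lower()
    return EXTENSIONS.get(mime, default)


def output_path(page_url: str, index: int, content_type: str,
                output_dir: str = "./images") -> Path:
    """Build <output_dir>/<hostname>/image_NNN.<ext> for one image."""
    # Assumption: the directory is named after the page's hostname.
    hostname = urlparse(page_url).hostname or "unknown-host"
    filename = f"image_{index:03d}{extension_for(content_type)}"
    return Path(output_dir) / hostname / filename
```

Choosing the extension from the response header rather than from the URL helps when image URLs have no suffix or a misleading one, which is common behind CDNs and resizing services.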

Download controls

  • Scrape: Notte structured extraction from the current page.
  • Browser context request: Playwright API request made from the active browser context.
  • MAX_IMAGES: configurable cap on how many images to download per run.
  • OUTPUT_DIR: configurable output directory, defaulting to ./images.
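As a rough sketch of how these controls could fit together, the function below caps the URL list at `MAX_IMAGES` and fetches each image through the browser context. It assumes `page` is a Playwright sync-API `Page` taken from the Notte session, that `MAX_IMAGES` and `OUTPUT_DIR` are read from environment variables, and that the default cap is 20; none of these details are confirmed by the template, and `download_images` is a hypothetical helper name.

```python
import os
from pathlib import Path

# Assumption: both controls come from environment variables.
MAX_IMAGES = int(os.environ.get("MAX_IMAGES", "20"))  # default of 20 is assumed
OUTPUT_DIR = os.environ.get("OUTPUT_DIR", "./images")


def download_images(page, urls, output_dir=OUTPUT_DIR, max_images=MAX_IMAGES):
    """Fetch up to max_images URLs through the page's browser context."""
    target = Path(output_dir)
    target.mkdir(parents=True, exist_ok=True)
    saved = []
    for index, url in enumerate(urls[:max_images], start=1):
        # The context-level request shares cookies with the browser session.
        response = page.context.request.get(url)
        if not response.ok:
            continue  # skip failed responses (non-2xx status)
        content_type = response.headers.get("content-type", "")
        if not content_type.startswith("image/"):
            continue  # skip HTML error pages and other non-image bodies
        path = target / f"image_{index:03d}"
        path.write_bytes(response.body())
        saved.append(path)
    return saved
```

A call might look like `download_images(page, image_urls)`, where `image_urls` is the list produced by the extraction step. The cap is applied before any request is made, so a page with hundreds of images costs at most `MAX_IMAGES` requests.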

Other downloads

  • Asset archiving from pages you own or have permission to scrape.
  • Visual regression fixture collection.
  • Authenticated media downloads after logging into a session.