Downloads
Extract and download images
Give the template a page URL and download the images discovered during the Notte browser session.
Run this template
Clone just this template, configure Notte, and start the run.
Before running
- Have
NOTTE_API_KEYready. Generate an API key.
Need help? Join the Notte Slack.
File workflow
- Browser-context downloads: page.context.request.get() inherits session cookies, proxy, and headers.
- Structured URL extraction: tries Notte scrape() first, then falls back to deterministic Playwright DOM extraction.
- Format-aware naming: uses the Content-Type response header to choose file extensions.
- Organized output: images are saved to ./images/<hostname>/.
Download controls
- Scrape: Notte structured extraction from the current page.
- Browser context request: Playwright API request made from the active browser context.
- MAX_IMAGES: configurable cap on how many images to download per run.
- OUTPUT_DIR: configurable output directory, defaulting to ./images.
Other downloads
- Asset archiving from pages you own or have permission to scrape.
- Visual regression fixture collection.
- Authenticated media downloads after logging into a session.