Skip to main content
Templates
Public API

Scrape calendar events

Extract structured event names, dates, times, and links from public calendar pages with a browser-backed probe.

Run this template

Clone just this template, configure Notte, and start the run.

Before running

API and browser path

  • Defaults to https://www.loc.gov/events/, but accepts any loc.gov event listing URL.
  • Uses a Notte browser session for traceability when NOTTE_API_KEY is set, then extracts from the LOC JSON listing endpoint.
  • Parameterized by listing URL and result limit from the CLI or .env.
  • Original listing URL and derived LOC JSON URL.

Probe inputs

  • NOTTE_API_KEY: optional; enables the Notte browser probe when set.
  • LOC_EVENTS_URL: default event listing URL when --url is omitted.
  • LOC_EVENT_LIMIT: default number of events when --limit is omitted.
  • USE_PROXY: whether the Notte browser session uses proxies. Defaults to true.