Find and download arXiv papers

Search arXiv, choose a result, and download the selected paper PDF with metadata for the run.

Run this template

Clone just this template, configure Notte, and start the run.

Before running

File workflow

  • Default query: openai.
  • Parameterized by search query, result limit, selected result index, and local download directory.
  • Uses arXiv Advanced Search because the default query, openai, returns a smaller result set there than in the broad homepage search.
  • Avoids scrape() for speed; instead, it uses evaluate_js to read only the selected search result and the article metadata from the DOM.
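The parameterization described above can be sketched in plain Python. This is an illustration only, not the template's actual code: the function names build_search_url and pick_result are hypothetical, and the query-string fields (terms-0-term, terms-0-field, size) are assumptions based on how arXiv Advanced Search URLs commonly look, so check them against a live search URL before relying on them.

```python
from urllib.parse import urlencode

# Assumed base URL for arXiv Advanced Search.
ADVANCED_SEARCH_URL = "https://arxiv.org/search/advanced"


def build_search_url(query: str = "openai", page_size: int = 50) -> str:
    """Build an Advanced Search URL for a single all-fields term.

    The parameter names below are assumptions, not taken from the template.
    """
    params = {
        "advanced": "",
        "terms-0-operator": "AND",
        "terms-0-term": query,
        "terms-0-field": "all",
        "size": page_size,
    }
    return f"{ADVANCED_SEARCH_URL}?{urlencode(params)}"


def pick_result(results: list, result_index: int):
    """Return the entry at a 1-based index, rejecting out-of-range values."""
    if not 1 <= result_index <= len(results):
        raise IndexError(
            f"result index {result_index} is outside 1..{len(results)}"
        )
    return results[result_index - 1]
```

Keeping the 1-based conversion in one helper avoids off-by-one mistakes when the selected index is later used to open and download a result.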

Download controls

  • NOTTE_API_KEY: Notte API key used by the Python SDK.
  • SEARCH_QUERY: default query when no command-line query is provided.
  • RESULT_LIMIT: default number of results to extract. Must be between 1 and 50.
  • RESULT_INDEX: default 1-based result index to open and download.
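A minimal sketch of how these variables might be read and validated before a run. The DownloadConfig class and load_config function are hypothetical names, the fallback values 10 and 1 are assumptions (the page only states that the default query is openai), and the rule that RESULT_INDEX must not exceed RESULT_LIMIT is also an assumption rather than documented behavior.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class DownloadConfig:
    api_key: str
    search_query: str
    result_limit: int
    result_index: int


def load_config(env=None) -> DownloadConfig:
    """Read the download controls from a mapping (os.environ by default)."""
    env = os.environ if env is None else env

    api_key = env.get("NOTTE_API_KEY", "")
    if not api_key:
        raise ValueError("NOTTE_API_KEY is required")

    search_query = env.get("SEARCH_QUERY", "openai")
    result_limit = int(env.get("RESULT_LIMIT", "10"))  # fallback 10 is an assumption
    result_index = int(env.get("RESULT_INDEX", "1"))   # fallback 1 is an assumption

    # Documented constraint: the limit must be between 1 and 50.
    if not 1 <= result_limit <= 50:
        raise ValueError("RESULT_LIMIT must be between 1 and 50")
    # Assumed constraint: the 1-based index must point at an extracted result.
    if not 1 <= result_index <= result_limit:
        raise ValueError("RESULT_INDEX must be between 1 and RESULT_LIMIT")

    return DownloadConfig(api_key, search_query, result_limit, result_index)
```

Passing a plain dict instead of os.environ, for example load_config({"NOTTE_API_KEY": "..."}), makes the validation easy to exercise without touching the real environment.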

Other downloads

  • exploration-notes.md records the Notte CLI session details and selectors.
  • exported_workflow.py keeps, as a reference, the workflow code exported by notte sessions workflow-code --session-id ...