Find and download arXiv papers

Templates

Downloads

Find and download arXiv papers

Search arXiv, choose a result, and download the selected paper PDF with metadata for the run.

Run this template

Clone just this template, configure Notte, and start the run.

Before running

Have NOTTE_API_KEY ready. Generate an API key.

Need help? Join the Notte Slack.

File workflow

Default query: openai.
Parameterized by search query, result limit, selected result index, and local download directory.
Uses arXiv Advanced Search because openai returns a smaller result set there than the broad homepage search.
Avoids scrape() for speed; it uses evaluate_js to read only the selected search result and article metadata from the DOM.

Download controls

NOTTE_API_KEY: Notte API key used by the Python SDK.
SEARCH_QUERY: default query when no command-line query is provided.
RESULT_LIMIT: default number of results to extract. Must be between 1 and 50.
RESULT_INDEX: default 1-based result index to open and download.

Other downloads

exploration-notes.md records the Notte CLI session details and selectors.
exported_workflow.py keeps the exported workflow code from notte sessions workflow-code --session-id ... as a reference.