arXiv MCP server
A popular MCP server that lets agents search arXiv, download papers, and read their full text as markdown for research workflows.
arXiv MCP Server is a widely used, actively maintained community server that connects an AI agent to arXiv's open-access repository of scientific papers. It lets a model do real literature work instead of relying on memorized abstracts. The agent can search arXiv with category, date, and boolean filters; download a paper by its arXiv ID (fetching HTML first and falling back to PDF); list the papers already pulled into a local collection; and read a downloaded paper's full text rendered as clean markdown. That covers the core loop of finding, retrieving, and reasoning over current research in machine learning, physics, math, and other fields.
A paid pro tier adds heavier research tooling: semantic search across your local collection, a citation graph built from Semantic Scholar, and topic watches with alert polling for newly published papers. The server is published as the arxiv-mcp-server Python package, installed with uv tool install, and runs locally over stdio by default; set TRANSPORT=http (or streamable-http) with HOST and PORT to expose it as an HTTP service. It needs no API key for the core search-and-read tools and also ships research prompts such as deep-paper-analysis to guide thorough reviews. It is Apache-2.0 licensed and one of the most-starred academic MCP servers.
Quick install
Install the server with uv tool install arxiv-mcp-server, then add it to your MCP client's configuration.
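A minimal sketch of what a client entry for this server might look like, written as Python that emits the JSON. The "mcpServers" key follows a convention used by several MCP clients, and the assumption that the installed package exposes an arxiv-mcp-server executable on PATH is not confirmed by this page; check your client's documentation and the server's README for the exact command and arguments.

```python
import json


def build_client_config(command="arxiv-mcp-server", args=None, env=None):
    """Build a client config dict with a single stdio server entry.

    The command name and the "mcpServers" layout are assumptions for
    illustration, not values confirmed by this page.
    """
    entry = {"command": command, "args": list(args or [])}
    if env:
        # Only include "env" when there is something to pass through.
        entry["env"] = dict(env)
    return {"mcpServers": {"arxiv": entry}}


if __name__ == "__main__":
    # Print JSON that could be pasted into a client's config file.
    print(json.dumps(build_client_config(), indent=2))
```

Generating the entry from a function keeps the command, arguments, and environment in one place if you maintain configs for several clients.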
Available tools
| Tool | Description |
|---|---|
| search_papers | Searches arXiv with optional category, date, and boolean filters. |
| download_paper | Downloads a paper by arXiv ID, fetching HTML first and falling back to PDF. |
| list_papers | Lists the papers currently downloaded to the local collection. |
| read_paper | Returns the full text of a downloaded paper rendered as markdown. |
| semantic_search | Finds similar papers within the local collection (requires the pro tier). |
| citation_graph | Fetches a paper's references and citing papers via Semantic Scholar (requires the pro tier). |
| watch_topic | Registers a topic watch for research alerts (requires the pro tier). |
| check_alerts | Polls for newly published papers matching saved topic watches (requires the pro tier). |
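For orientation, MCP clients invoke tools like these with a JSON-RPC "tools/call" request. The sketch below builds such a request by hand. The tool names come from the table above, but the argument names (query, categories, date_from, max_results, paper_id) and the paper ID are placeholders assumed for illustration; the server's published tool schemas are the authority on the real parameters.

```python
import json


def tool_call(name, arguments, request_id):
    """Build a JSON-RPC 2.0 request that asks an MCP server to run a tool."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


if __name__ == "__main__":
    # Argument names below are assumed, not taken from the server's schema.
    search = tool_call(
        "search_papers",
        {
            "query": "diffusion models",
            "categories": ["cs.LG"],
            "date_from": "2024-01-01",
            "max_results": 5,
        },
        request_id=1,
    )
    # "0000.00000" is a placeholder, not a real arXiv ID.
    read = tool_call("read_paper", {"paper_id": "0000.00000"}, request_id=2)
    print(json.dumps(search, indent=2))
    print(json.dumps(read, indent=2))
```

In practice an MCP client library builds and sends these messages for you; the sketch only shows the shape of what travels over the transport.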
Required configuration
- TRANSPORT (optional)
Transport mode: stdio (default), http, or streamable-http.
- PORT (optional)
Port to bind when running over HTTP (default 8000).
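The way these variables combine can be sketched as a small resolver. This is not the server's own code: resolve_transport is a hypothetical helper that only mirrors the documented values (stdio by default, PORT defaulting to 8000). This page gives no default for HOST, so the sketch leaves it as None when unset.

```python
import os

VALID_TRANSPORTS = {"stdio", "http", "streamable-http"}


def resolve_transport(env):
    """Turn TRANSPORT/HOST/PORT settings into a plain dict.

    Hypothetical helper for illustration; the real server may parse
    these variables differently.
    """
    transport = env.get("TRANSPORT", "stdio").strip().lower()
    if transport not in VALID_TRANSPORTS:
        raise ValueError(f"unsupported TRANSPORT: {transport!r}")
    if transport == "stdio":
        # HOST and PORT are irrelevant when talking over stdin/stdout.
        return {"transport": "stdio"}
    port = int(env.get("PORT", "8000"))
    if not 1 <= port <= 65535:
        raise ValueError(f"PORT out of range: {port}")
    # HOST has no documented default here, so None means "not set".
    return {"transport": transport, "host": env.get("HOST"), "port": port}


if __name__ == "__main__":
    print(resolve_transport(os.environ))
```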
What you can do with it
Run a literature review
The agent searches arXiv with category and date filters, downloads the most relevant papers, and reads their full text as markdown to synthesize the state of the art on a topic.
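The search, download, and read loop described above can be sketched as one function that takes a call_tool callable, so it stays independent of any particular MCP client. Everything about the data shapes here is assumed for illustration: the argument names, the "id" field on search hits, and read_paper returning a plain markdown string. fake_call_tool is an offline stand-in with placeholder IDs, not the real server.

```python
def review_topic(call_tool, query, categories, date_from, top_n=3):
    """Search, download the top hits, and return {paper_id: markdown_text}."""
    hits = call_tool(
        "search_papers",
        {"query": query, "categories": categories, "date_from": date_from},
    )
    texts = {}
    for hit in hits[:top_n]:
        paper_id = hit["id"]  # assumed field name on a search hit
        # A paper must be downloaded before its text can be read.
        call_tool("download_paper", {"paper_id": paper_id})
        texts[paper_id] = call_tool("read_paper", {"paper_id": paper_id})
    return texts


def fake_call_tool(name, arguments):
    """Offline stand-in for a real MCP client; returns canned data."""
    if name == "search_papers":
        return [{"id": "0000.00001"}, {"id": "0000.00002"}]
    if name == "download_paper":
        return {"status": "ok"}
    if name == "read_paper":
        return "# Placeholder text for " + arguments["paper_id"]
    raise ValueError(f"unknown tool: {name}")


if __name__ == "__main__":
    papers = review_topic(fake_call_tool, "graph neural networks", ["cs.LG"], "2024-01-01")
    for paper_id, text in papers.items():
        print(paper_id, len(text))
```

Passing the client in as a callable makes the loop easy to test offline and to reuse with whichever MCP client library the agent runs on.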
Ground a draft in real papers
While writing or reviewing, the agent retrieves the exact paper it needs and reads specific sections, so claims are anchored to a citable source rather than a hallucinated reference.
FAQ
- Is it free?
- The server is open source under the Apache-2.0 license and the core search, download, list, and read tools are free with no API key, since arXiv is open access. Semantic search, citation graphs, and topic watches are gated behind a paid pro tier.
- Does it support remote/OAuth?
- It runs locally over stdio by default and can be exposed as an HTTP service by setting TRANSPORT=http with HOST and PORT. There is no hosted OAuth endpoint; the core tools need no authentication.
- Where are downloaded papers stored?
- Papers are saved to a local collection on disk, which list_papers enumerates and read_paper renders as markdown. This lets the agent work over a persistent local corpus across sessions.