arXiv for academic research
arXiv is the open preprint archive for physics, math, CS, and more, and this popular server is the top pick for academic research. It wins because it does the core literature job end to end: search the archive, download a paper, and read its full text as clean markdown, so an agent works from the actual source rather than abstracts or snippets a general search returns.
The two siblings here, Wikipedia and MediaWiki, ground definitions and background. arXiv is the one that reaches primary research, which is why it leads a research workflow that turns on reading and tracing real papers.
How arXiv fits
The free core covers most of the work: search_papers queries arXiv with category, date, and boolean filters, download_paper fetches a paper by ID (trying HTML first, falling back to PDF), list_papers shows what is in the local collection, and read_paper returns a downloaded paper's full text rendered as markdown an agent can actually reason over. Beyond that, semantic_search finds similar papers in the local collection, citation_graph pulls a paper's references and citing papers via Semantic Scholar, and watch_topic plus check_alerts track newly published work on a saved topic.
The honest limits are two. First, the literature-graph features (semantic_search, citation_graph, watch_topic, check_alerts) require the pro tier, so on the free tier you get search, download, and read but not similarity or citation tracing. Second, the server is community-maintained (by Joseph Blazick), not an official arXiv build, and its scope is arXiv, so it does not cover journals or paywalled venues outside the archive. The siblings fill the gaps around it: Wikipedia anchors definitions and background context, and MediaWiki reads domain-specific wiki knowledge bases. Use arXiv as the engine for preprints and full-text reading, and add the wiki servers when you need grounding rather than primary research.
Tools you would use
| Tool | What it does |
|---|---|
| search_papers | Searches arXiv with optional category, date, and boolean filters. |
| download_paper | Downloads a paper by arXiv ID, fetching HTML first and falling back to PDF. |
| list_papers | Lists the papers currently downloaded to the local collection. |
| read_paper | Returns the full text of a downloaded paper rendered as markdown. |
| semantic_search | Finds similar papers within the local collection (requires the pro tier). |
| citation_graph | Fetches a paper's references and citing papers via Semantic Scholar (requires the pro tier). |
| watch_topic | Registers a topic watch for research alerts (requires the pro tier). |
| check_alerts | Polls for newly published papers matching saved topic watches (requires the pro tier). |
FAQ
- Can the arXiv server read a paper's full text, or only the abstract?
- Yes. download_paper fetches the paper (HTML first, PDF as fallback) and read_paper returns its full text as markdown, so an agent reasons over the complete document rather than an abstract.
- Do citation graphs and semantic search work on the free tier?
- No. semantic_search, citation_graph, watch_topic, and check_alerts require the pro tier. The free tier covers search_papers, download_paper, list_papers, and read_paper.
- Is this an official arXiv server?
- No. It is a popular community server by Joseph Blazick. It covers arXiv well, but it is not an official release and its scope is the arXiv archive, not paywalled journals.