Skip to content

Commit 4bea8b9

Browse files
add mcp-for-research blog post (#3021)
* add mcp-for-research blog post * Update mcp-for-research.md Co-authored-by: Aritra Roy Gosthipaty <[email protected]> * move embedded space down * incorporated suggestions --------- Co-authored-by: Aritra Roy Gosthipaty <[email protected]>
1 parent eeeb9a6 commit 4bea8b9

File tree

4 files changed

+131
-0
lines changed

4 files changed

+131
-0
lines changed

_blog.yml

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6542,3 +6542,13 @@
65426542
- llm
65436543
- evaluation
65446544
- agents
6545+
6546+
- local: mcp-for-research
6547+
title: "MCP for Research: How to Connect AI to Research Tools"
6548+
author: dylanebert
6549+
thumbnail: /blog/assets/mcp-for-research/thumbnail.png
6550+
date: Aug 18, 2025
6551+
tags:
6552+
- mcp
6553+
- research
6554+
- guide

assets/mcp-for-research/demo.gif

497 KB
Loading

assets/mcp-for-research/thumbnail.png

89.7 KB
Loading

mcp-for-research.md

Lines changed: 121 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,121 @@
1+
---
2+
title: "MCP for Research: How to Connect AI to Research Tools"
3+
thumbnail: /blog/assets/mcp-for-research/thumbnail.png
4+
authors:
5+
- user: dylanebert
6+
---
7+
8+
# MCP for Research: How to Connect AI to Research Tools
9+
10+
Academic research involves frequent **research discovery**: finding papers, code, related models and datasets. This typically means switching between platforms like [arXiv](https://arxiv.org/), [GitHub](https://github.com/), and [Hugging Face](https://huggingface.co/), manually piecing together connections.
11+
12+
The [Model Context Protocol (MCP)](https://huggingface.co/learn/mcp-course/unit0/introduction) is a standard that allows agentic models to communicate with external tools and data sources. For research discovery, this means AI can use research tools through natural language requests, automating platform switching and cross-referencing.
13+
14+
![Research Tracker MCP in action](./assets/mcp-for-research/demo.gif)
15+
16+
## Research Discovery: Three Layers of Abstraction
17+
18+
Much like software development, research discovery can be framed in terms of layers of abstraction.
19+
20+
### 1. Manual Research
21+
22+
At the lowest level of abstraction, researchers search manually and cross-reference by hand.
23+
24+
```bash
25+
# Typical workflow:
26+
1. Find paper on arXiv
27+
2. Search GitHub for implementations
28+
3. Check Hugging Face for models/datasets
29+
4. Cross-reference authors and citations
30+
5. Organize findings manually
31+
```
32+
33+
This manual approach becomes inefficient when tracking multiple research threads or conducting systematic literature reviews. The repetitive nature of searching across platforms, extracting metadata, and cross-referencing information naturally leads to automation through scripting.
34+
35+
### 2. Scripted Tools
36+
37+
Python scripts automate research discovery by handling web requests, parsing responses, and organizing results.
38+
39+
```python
40+
# research_tracker.py
41+
def gather_research_info(paper_url):
42+
paper_data = scrape_arxiv(paper_url)
43+
github_repos = search_github(paper_data['title'])
44+
hf_models = search_huggingface(paper_data['authors'])
45+
return consolidate_results(paper_data, github_repos, hf_models)
46+
47+
# Run for each paper you want to investigate
48+
results = gather_research_info("https://arxiv.org/abs/2103.00020")
49+
```
50+
51+
The [research tracker](https://huggingface.co/spaces/dylanebert/research-tracker) demonstrates systematic research discovery built from these types of scripts.
52+
53+
While scripts are faster than manual research, they often fail to automatically collect data due to changing APIs, rate limits, or parsing errors. Without human oversight, scripts may miss relevant results or return incomplete information.
54+
55+
### 3. MCP Integration
56+
57+
MCP makes these same Python tools accessible to AI systems through natural language.
58+
59+
```markdown
60+
# Example research directive
61+
Find recent transformer architecture papers published in the last 6 months:
62+
- Must have available implementation code
63+
- Focus on papers with pretrained models
64+
- Include performance benchmarks when available
65+
```
66+
67+
The AI orchestrates multiple tools, fills information gaps, and reasons about results:
68+
69+
```python
70+
# AI workflow:
71+
# 1. Use research tracker tools
72+
# 2. Search for missing information
73+
# 3. Cross-reference with other MCP servers
74+
# 4. Evaluate relevance to research goals
75+
76+
user: "Find all relevant information (code, models, etc.) on this paper: https://huggingface.co/papers/2010.11929"
77+
ai: # Combines multiple tools to gather complete information
78+
```
79+
80+
This can be viewed as an additional layer of abstraction above scripting, where the "programming language" is natural language. This follows the [Software 3.0 Analogy](https://youtu.be/LCEmiRjPEtQ?si=J7elM86eW9XCkMFj), where the natural language research direction is the software implementation.
81+
82+
This comes with the same caveats as scripting:
83+
84+
- Faster than manual research, but error-prone without human guidance
85+
- Quality depends on the implementation
86+
- Understanding the lower layers (both manual and scripted) leads to better implementations
87+
88+
## Setup and Usage
89+
90+
### Quick Setup
91+
92+
The easiest way to add the Research Tracker MCP is through [Hugging Face MCP Settings](https://huggingface.co/settings/mcp):
93+
94+
1. Visit [huggingface.co/settings/mcp](https://huggingface.co/settings/mcp)
95+
2. Search for "research-tracker-mcp" in the available tools
96+
3. Click to add it to your tools
97+
4. Follow the provided setup instructions for your specific client (Claude Desktop, Cursor, Claude Code, VS Code, etc.)
98+
99+
This workflow leverages the Hugging Face MCP server, which is the standard way to use Hugging Face Spaces as MCP tools. The settings page provides client-specific configuration that's automatically generated and always up-to-date.
100+
101+
<script
102+
type="module"
103+
src="https://gradio.s3-us-west-2.amazonaws.com/4.36.1/gradio.js"
104+
></script>
105+
106+
<gradio-app theme_mode="light" space="dylanebert/research-tracker-mcp"></gradio-app>
107+
108+
## Learn More
109+
110+
**Get Started:**
111+
- [Hugging Face MCP Course](https://huggingface.co/learn/mcp-course/en/unit1/introduction) - Complete guide from basics to building your own tools
112+
- [MCP Official Documentation](https://modelcontextprotocol.io) - Protocol specifications and architecture
113+
114+
**Build Your Own:**
115+
- [Gradio MCP Guide](https://www.gradio.app/guides/building-mcp-server-with-gradio) - Turn Python functions into MCP tools
116+
- [Building the Hugging Face MCP Server](https://huggingface.co/blog/building-hf-mcp) - Production implementation case study
117+
118+
**Community:**
119+
- [Hugging Face Discord](https://hf.co/join/discord) - MCP development discussions
120+
121+
Ready to automate your research discovery? Try the [Research Tracker MCP](https://huggingface.co/settings/mcp) or build your own research tools with the resources above.

0 commit comments

Comments
 (0)