# ios-interact-mcp

Control iOS simulators through the Model Context Protocol (MCP).
> **NOTE:** This is AI slop; there are dragons all over the place. I think the majority of the tests are fake or otherwise nonsensical. I have manually tested everything, and the only bug I've found is in the complex gestures Claude made.
## Features

- **Click Actions**: Click on UI elements by text or coordinates using OCR
- **App Control**: Launch and terminate iOS applications
- **Screenshots**: Capture simulator screenshots with OCR support
- **Text Finding**: Find and interact with text elements using OCR
- **Deep Linking**: Open URLs in the simulator
- **Hardware Buttons**: Simulate hardware button presses
- **Window Management**: List and control simulator windows
## Requirements

- macOS with Xcode installed
- Python 3.10 or higher
- iOS Simulator
- MCP-compatible client (e.g., Claude Desktop)
## Installation

### From PyPI

```bash
pip install ios-interact-mcp
```

### From source

```bash
git clone https://github.com/AFDudley/ios-interact-mcp.git
cd ios-interact-mcp
pip install -e .
```

## Configuration

### Claude Desktop

Add to your Claude Desktop configuration (`~/Library/Application Support/Claude/claude_desktop_config.json`):
```json
{
  "mcpServers": {
    "ios-interact": {
      "command": "ios-interact-mcp"
    }
  }
}
```

## Running the Server

```bash
# Run with stdio transport (default)
ios-interact-mcp

# Run with SSE transport for debugging
ios-interact-mcp --transport sse
```

## Using with Claude Code

To use the MCP server with Claude Code, start it with SSE transport and then connect Claude Code to it:
1. Start the SSE server:

   ```bash
   # Start the server on port 8000 (default)
   ios-interact-mcp --transport sse

   # Or specify a custom port
   ios-interact-mcp --transport sse --port 37849
   ```

2. Connect Claude Code to the server:

   ```bash
   # Add the MCP server to Claude Code
   claude mcp add -t sse ios-interact http://localhost:8000/sse

   # Or if using a custom port
   claude mcp add -t sse ios-interact http://localhost:37849/sse
   ```

3. Verify the connection:

   ```bash
   # List configured MCP servers
   claude mcp list

   # Get details about the ios-interact server
   claude mcp get ios-interact
   ```

4. Remove the server (when done):

   ```bash
   claude mcp remove ios-interact -s local
   ```
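If you register and tear down the server often, the `claude mcp` commands above are easy to script. A small helper sketch (the subcommands are the ones shown above; the helper function itself is hypothetical):

```python
def claude_mcp_commands(port: int = 8000, name: str = "ios-interact") -> list[str]:
    """Return the claude mcp shell commands for registering an SSE server."""
    url = f"http://localhost:{port}/sse"
    return [
        f"claude mcp add -t sse {name} {url}",  # register the server
        f"claude mcp get {name}",               # verify the connection
        f"claude mcp remove {name} -s local",   # clean up when done
    ]

print(claude_mcp_commands(37849)[0])
# claude mcp add -t sse ios-interact http://localhost:37849/sse
```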
## Available Tools

### click_text

Click on text found in the simulator using OCR.

```
click_text(text: string, occurrence?: number, simulator_name?: string)
```

### click_at_coordinates

Click at specific screen coordinates.

```
click_at_coordinates(x: number, y: number, coordinate_space?: "screen")
```

### launch_app

Launch an iOS application.

```
launch_app(bundle_id: string)
```

### terminate_app

Terminate a running iOS application.

```
terminate_app(bundle_id: string)
```

### screenshot

Take a screenshot of the simulator.

```
screenshot(filename?: string, return_path?: boolean)
```

### find_text_in_simulator

Find text elements in the simulator using OCR.

```
find_text_in_simulator(search_text?: string, simulator_name?: string)
```

### list_apps

List all installed applications.

```
list_apps()
```

### open_url

Open a URL in the simulator (for deep linking).

```
open_url(url: string)
```

### press_button

Press a hardware button.

```
press_button(button_name: "home" | "lock" | "volume_up" | "volume_down")
```

### list_simulator_windows

List all simulator windows with their positions and sizes.

```
list_simulator_windows()
```

## Usage Examples

### Navigating Settings

```python
# Click on Settings app
await click_text("Settings")

# Navigate to General
await click_text("General")

# Take a screenshot
await screenshot("general_settings.png")
```

### Testing Your App

```python
# Launch your app
await launch_app("com.yourcompany.yourapp")

# Click on UI elements
await click_text("Login")

# Enter deep link
await open_url("yourapp://profile")

# Capture state
await screenshot("profile_screen.png")
```

## Accessibility Permissions

For OCR functionality to work properly, you need to grant accessibility permissions:
- Go to System Preferences > Security & Privacy > Accessibility
- Add Terminal (or your IDE) to the allowed applications
- Restart the application if needed
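Apple's Vision framework (which ocrmac wraps) reports text bounding boxes in normalized 0..1 coordinates with a bottom-left origin, while clicks need absolute, top-left-origin screen points. A sketch of that conversion; the function and parameter names are illustrative, not the project's actual code:

```python
def bbox_center_to_screen(
    bbox: tuple[float, float, float, float],  # (x, y, width, height), normalized, bottom-left origin
    window_origin: tuple[float, float],       # simulator window's top-left corner on screen
    window_size: tuple[float, float],         # simulator window's width and height in points
) -> tuple[float, float]:
    """Map the center of a normalized Vision bounding box to screen coordinates."""
    x, y, w, h = bbox
    cx = x + w / 2  # horizontal center, still normalized
    cy = y + h / 2  # vertical center, bottom-left origin
    screen_x = window_origin[0] + cx * window_size[0]
    # Flip the vertical axis: screen coordinates grow downward from the top
    screen_y = window_origin[1] + (1.0 - cy) * window_size[1]
    return screen_x, screen_y

# A box covering the top-left quarter of a 400x800 window placed at (100, 50):
print(bbox_center_to_screen((0.0, 0.75, 0.25, 0.25), (100, 50), (400, 800)))
# (150.0, 150.0)
```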
## Development

### Setup

```bash
# Clone the repository
git clone https://github.com/AFDudley/ios-interact-mcp.git
cd ios-interact-mcp

# Install in development mode with dev dependencies
pip install -e ".[dev]"

# Install pre-commit hooks
pre-commit install
```

### Running Tests

```bash
# Run all tests
python -m pytest tests/

# Run a specific test file
python -m pytest tests/test_ocr_controller.py
```

### Code Quality

The project uses:
- Black for code formatting
- Flake8 for linting
- Pyright for type checking
These are automatically run on commit via pre-commit hooks.
## Troubleshooting

### OCR not working

- Ensure you have granted accessibility permissions to Terminal/your IDE
- Check that the simulator window is visible and not minimized
- Verify ocrmac is installed:

  ```bash
  pip install ocrmac
  ```

### Clicks not registering

- Verify the simulator is in focus
- Ensure the target text is visible on screen
- Try using `find_text_in_simulator` first to verify OCR is working

### Simulator not found

Make sure iOS Simulator is running:

```bash
open -a Simulator
```

Grant necessary permissions in System Preferences > Security & Privacy > Accessibility.
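Several of the tools above map naturally onto `xcrun simctl` subcommands (`launch`, `terminate`, `openurl`, `list`), which is also handy for diagnosing simulator state by hand. A sketch of those invocations from Python; the helper functions are illustrative, and the inlined sample mirrors the shape of `xcrun simctl list devices -j` output:

```python
import json

def simctl(*args: str) -> list[str]:
    """Build an xcrun simctl argument vector (pass to subprocess.run on macOS)."""
    return ["xcrun", "simctl", *args]

def booted_devices(simctl_json: str) -> list[str]:
    """Extract names of booted simulators from `simctl list devices -j` output."""
    data = json.loads(simctl_json)
    return [
        device["name"]
        for runtime in data["devices"].values()
        for device in runtime
        if device["state"] == "Booted"
    ]

# Command vectors behind app control and deep linking:
print(simctl("launch", "booted", "com.yourcompany.yourapp"))
print(simctl("openurl", "booted", "yourapp://profile"))

# Sample payload in the shape simctl emits (trimmed for illustration):
sample = json.dumps({"devices": {
    "com.apple.CoreSimulator.SimRuntime.iOS-17-0": [
        {"name": "iPhone 15", "state": "Booted", "udid": "ABC-123"},
        {"name": "iPad Air", "state": "Shutdown", "udid": "DEF-456"},
    ]
}})
print(booted_devices(sample))  # ['iPhone 15']
```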
## Contributing

1. Fork the repository
2. Create a feature branch (`git checkout -b feature/amazing-feature`)
3. Commit your changes (`git commit -m 'Add amazing feature'`)
4. Push to the branch (`git push origin feature/amazing-feature`)
5. Open a Pull Request
## License

This project is licensed under the MIT License - see the LICENSE file for details.
## Acknowledgments

- Built on the Model Context Protocol
- Uses ocrmac for OCR functionality
- Powered by Apple's Vision framework and xcrun tools