Capability
5 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →via “mouse-cursor-movement-and-clicking”
Computer Use MCP Server
Unique: Abstracts OS-specific input APIs (Xdotool, CGEvent, SendInput) behind a unified MCP interface, allowing agents to perform mouse interactions without knowledge of underlying platform; includes configurable movement curves and timing to simulate human-like interaction patterns
vs others: Provides cross-platform mouse automation in a single MCP tool without requiring separate platform-specific libraries, and integrates directly into agent decision loops unlike standalone automation frameworks
via “mouse control with absolute positioning”
Computer Use MCP Server
Unique: Exposes mouse control as discrete MCP tools (move, click) with absolute coordinate parameters, allowing agents to compose clicks with screenshot analysis in a tight perception-action loop. No gesture or drag abstractions — forces explicit coordinate calculation.
vs others: More granular than high-level UI automation frameworks (Selenium, Playwright) because it operates at raw input level; more flexible for non-web UIs but requires agent to handle coordinate math
via “mouse movement and click control via mcp”
Zero-dependency macOS desktop automation for AI agents. Screenshot, mouse, keyboard, clipboard, and window control via MCP. 18 tools, macOS 13+, one command: npx mac-use-mcp.
Unique: Integrates mouse control directly into MCP tool schema with coordinate-based targeting, allowing agents to chain screenshot analysis → coordinate extraction → click execution in a single agent loop without external tool dependencies or subprocess management
vs others: More direct than PyAutoGUI or xdotool because it uses native macOS CGEvent APIs with MCP protocol binding, eliminating subprocess overhead and enabling real-time feedback loops between vision analysis and mouse actions
via “mouse-control-with-coordinate-targeting”
MCP server exposing desktop computer-use as an MCP tool
Unique: Exposes raw coordinate-based mouse control through MCP protocol, allowing clients to implement their own coordinate detection strategies (vision models, OCR, element detection) rather than bundling a specific vision system, enabling flexibility in how coordinates are determined.
vs others: More flexible than vision-integrated automation tools because it decouples coordinate detection from mouse control, allowing clients to use any vision model or coordinate source while maintaining a simple, stateless MCP interface.
via “mouse click simulation with button variants”
** - Programmatic control over Windows system operations including mouse, keyboard, window management, and screen capture using nut.js.
Unique: Provides high-level click API that abstracts mouse button event sequencing (down/up pairs) and timing, allowing developers to specify click type and count without managing low-level input event details
vs others: More intuitive than raw mouse button events because it handles down/up sequencing automatically; more flexible than image-recognition-based clicking because it uses direct coordinate targeting
Building an AI tool with “Mouse Cursor Movement And Clicking”?
Submit your artifact →curl unfragile.ai/agents.md | sh© 2026 Unfragile. The platform for software for agents.