Capability
6 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →via “mouse-cursor-movement-and-clicking”
Computer Use MCP Server
Unique: Abstracts OS-specific input APIs (Xdotool, CGEvent, SendInput) behind a unified MCP interface, allowing agents to perform mouse interactions without knowledge of underlying platform; includes configurable movement curves and timing to simulate human-like interaction patterns
vs others: Provides cross-platform mouse automation in a single MCP tool without requiring separate platform-specific libraries, and integrates directly into agent decision loops unlike standalone automation frameworks
via “mouse control with absolute positioning”
Computer Use MCP Server
Unique: Exposes mouse control as discrete MCP tools (move, click) with absolute coordinate parameters, allowing agents to compose clicks with screenshot analysis in a tight perception-action loop. No gesture or drag abstractions — forces explicit coordinate calculation.
vs others: More granular than high-level UI automation frameworks (Selenium, Playwright) because it operates at raw input level; more flexible for non-web UIs but requires agent to handle coordinate math
via “mouse movement and click control via mcp”
Zero-dependency macOS desktop automation for AI agents. Screenshot, mouse, keyboard, clipboard, and window control via MCP. 18 tools, macOS 13+, one command: npx mac-use-mcp.
Unique: Integrates mouse control directly into MCP tool schema with coordinate-based targeting, allowing agents to chain screenshot analysis → coordinate extraction → click execution in a single agent loop without external tool dependencies or subprocess management
vs others: More direct than PyAutoGUI or xdotool because it uses native macOS CGEvent APIs with MCP protocol binding, eliminating subprocess overhead and enabling real-time feedback loops between vision analysis and mouse actions
via “mouse-control-with-coordinate-targeting”
MCP server exposing desktop computer-use as an MCP tool
Unique: Exposes raw coordinate-based mouse control through MCP protocol, allowing clients to implement their own coordinate detection strategies (vision models, OCR, element detection) rather than bundling a specific vision system, enabling flexibility in how coordinates are determined.
vs others: More flexible than vision-integrated automation tools because it decouples coordinate detection from mouse control, allowing clients to use any vision model or coordinate source while maintaining a simple, stateless MCP interface.
via “programmatic mouse control with pixel-level positioning”
** - Programmatic control over Windows system operations including mouse, keyboard, window management, and screen capture using nut.js.
Unique: Uses nut.js's abstraction over Windows native input APIs (SendInput) rather than simulating raw hardware events, enabling reliable cross-application mouse control that respects Windows input queuing and cursor acceleration
vs others: More reliable than raw Win32 SendInput calls because nut.js handles platform-specific quirks; faster than image-recognition-based automation because it uses direct coordinate targeting rather than screen analysis
via “coordinate-based interaction targeting with sub-pixel precision”
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...
Unique: Trained on diverse UI layouts to predict interaction coordinates with high precision, using visual context (element size, shape, text) to determine the optimal click target rather than simple center-of-bounding-box heuristics.
vs others: More accurate than simple bounding box center calculations because it understands UI semantics and can identify the actual clickable region, and more robust than OCR-based coordinate detection because it works on non-text elements.
Building an AI tool with “Mouse Control With Coordinate Targeting”?
Submit your artifact →curl unfragile.ai/agents.md | sh© 2026 Unfragile. The platform for software for agents.