Capability
4 artifacts provide this capability.
via “mouse-cursor-movement-and-clicking”
Computer Use MCP Server
Unique: Abstracts OS-specific input APIs (xdotool, CGEvent, SendInput) behind a unified MCP interface, so agents can perform mouse interactions without knowing the underlying platform. Includes configurable movement curves and timing to simulate human-like interaction patterns.
vs others: Provides cross-platform mouse automation in a single MCP tool without requiring separate platform-specific libraries, and integrates directly into agent decision loops, unlike standalone automation frameworks.
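To illustrate the two ideas in this entry, here is a minimal sketch of a platform dispatch and an eased movement path. It is not the server's actual code: the function names `backend_for` and `movement_path`, the `steps` parameter, and the choice of a smoothstep easing curve are all assumptions made for illustration.

```python
from typing import List, Tuple

Point = Tuple[int, int]

# Mapping from Python's sys.platform values to the input APIs named above.
# The mapping itself is an illustrative assumption, not the server's table.
BACKENDS = {"linux": "xdotool", "darwin": "CGEvent", "win32": "SendInput"}


def backend_for(platform: str) -> str:
    """Pick the native input API for a platform string such as sys.platform."""
    try:
        return BACKENDS[platform]
    except KeyError:
        raise ValueError(f"unsupported platform: {platform}")


def smoothstep(t: float) -> float:
    """Ease-in-out curve on [0, 1]: slow start, faster middle, slow finish."""
    return t * t * (3.0 - 2.0 * t)


def movement_path(start: Point, end: Point, steps: int = 20) -> List[Point]:
    """Return intermediate cursor positions from start to end (end included)."""
    if steps < 1:
        raise ValueError("steps must be at least 1")
    path = []
    for i in range(1, steps + 1):
        eased = smoothstep(i / steps)
        x = round(start[0] + (end[0] - start[0]) * eased)
        y = round(start[1] + (end[1] - start[1]) * eased)
        path.append((x, y))
    return path
```

A caller would feed each point of `movement_path` to the selected backend with a short delay between points; the delay length would be the "timing" half of the configuration.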
via “mouse control with absolute positioning”
Computer Use MCP Server
Unique: Exposes mouse control as discrete MCP tools (move, click) with absolute coordinate parameters, allowing agents to compose clicks with screenshot analysis in a tight perception-action loop. No gesture or drag abstractions are provided, so the agent must calculate coordinates explicitly.
vs others: More granular than high-level UI automation frameworks (Selenium, Playwright) because it operates at the raw input level; more flexible for non-web UIs, but requires the agent to handle the coordinate math.
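The coordinate math left to the agent might look like the sketch below, which assumes the agent has found a bounding box in a screenshot that may have been downscaled relative to the real screen. The function name `bbox_center_to_screen` and its parameters are hypothetical and not part of the server's interface.

```python
from typing import Tuple


def bbox_center_to_screen(
    bbox: Tuple[float, float, float, float],
    screenshot_size: Tuple[int, int],
    screen_size: Tuple[int, int],
) -> Tuple[int, int]:
    """Map the center of a screenshot bounding box to absolute screen pixels.

    bbox is (left, top, right, bottom) in screenshot pixels.
    """
    left, top, right, bottom = bbox
    shot_w, shot_h = screenshot_size
    screen_w, screen_h = screen_size

    # Center of the detected element, still in screenshot space.
    cx = (left + right) / 2
    cy = (top + bottom) / 2

    # Scale to screen space in case the screenshot was resized.
    x = int(cx * screen_w / shot_w)
    y = int(cy * screen_h / shot_h)

    # Clamp so the result is always a valid pixel on the screen.
    x = min(max(x, 0), screen_w - 1)
    y = min(max(y, 0), screen_h - 1)
    return x, y
```

The returned pair is what an agent would pass as the absolute coordinate parameters of the move or click tool.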
via “mouse-control-with-coordinate-targeting”
MCP server exposing desktop computer-use as an MCP tool
Unique: Exposes raw coordinate-based mouse control through MCP and does not bundle a specific vision system. Clients implement their own coordinate detection strategy (vision models, OCR, element detection), which keeps the choice of how coordinates are determined flexible.
vs others: More flexible than vision-integrated automation tools because it decouples coordinate detection from mouse control, allowing clients to use any vision model or coordinate source while maintaining a simple, stateless MCP interface.
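The decoupling described here can be sketched as a pluggable coordinate source feeding a stateless request builder. The `tools/call` method and the JSON-RPC envelope follow the MCP specification; the tool name `mouse_click`, its argument names, and the helpers `click_target` and `lookup_table_source` are assumptions for illustration only.

```python
import json
from typing import Callable, Tuple

# Any callable that turns a target description into (x, y) can be plugged in:
# a vision model, an OCR pass, or an element detector.
CoordinateSource = Callable[[str], Tuple[int, int]]


def build_click_request(request_id: int, x: int, y: int, button: str = "left") -> str:
    """Serialize an MCP tools/call request for a hypothetical mouse_click tool."""
    return json.dumps({
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "mouse_click",  # assumed tool name
            "arguments": {"x": x, "y": y, "button": button},
        },
    })


def click_target(locate: CoordinateSource, target: str, request_id: int = 1) -> str:
    """Resolve a target with the supplied source, then build the click request."""
    x, y = locate(target)
    return build_click_request(request_id, x, y)


def lookup_table_source(target: str) -> Tuple[int, int]:
    """Stand-in coordinate source; a real client would call a vision model."""
    known = {"ok_button": (640, 480)}
    return known[target]
```

Swapping `lookup_table_source` for another callable changes how coordinates are found without touching the request-building code, which is the flexibility the entry claims.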
via “programmatic mouse control with pixel-level positioning”
Programmatic control over Windows system operations including mouse, keyboard, window management, and screen capture using nut.js.
Unique: Uses nut.js's abstraction over Windows native input APIs (SendInput) rather than simulating raw hardware events, enabling reliable cross-application mouse control that respects Windows input queuing and cursor acceleration.
vs others: More reliable than raw Win32 SendInput calls because nut.js handles platform-specific quirks; faster than image-recognition-based automation because it uses direct coordinate targeting rather than screen analysis.
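As one example of the kind of platform detail a caller of raw SendInput has to handle, absolute mouse input (the `MOUSEEVENTF_ABSOLUTE` flag) expects coordinates normalized to the range 0 to 65535 and not pixels. The sketch below shows that conversion only. It does not describe nut.js internals, and the function name `to_sendinput_absolute` is hypothetical.

```python
from typing import Tuple

# Upper bound of the normalized coordinate range used by absolute mouse input.
NORMALIZED_MAX = 65535


def to_sendinput_absolute(x: int, y: int, screen_w: int, screen_h: int) -> Tuple[int, int]:
    """Convert a pixel position to normalized absolute coordinates."""
    if screen_w < 2 or screen_h < 2:
        raise ValueError("screen dimensions must be at least 2 pixels")
    if not (0 <= x < screen_w and 0 <= y < screen_h):
        raise ValueError("pixel position is outside the screen")
    # Map pixel 0 to 0 and the last pixel to NORMALIZED_MAX.
    nx = round(x * NORMALIZED_MAX / (screen_w - 1))
    ny = round(y * NORMALIZED_MAX / (screen_h - 1))
    return nx, ny
```

With pixel-level targeting through a wrapper library, the caller passes plain pixel coordinates and never sees this normalized range.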
© 2026 Unfragile. The platform for software for agents.