Capability
5 artifacts provide this capability.
via “application-window-enumeration-and-focus-control”
Computer Use MCP Server
Unique: Provides unified window enumeration and focus control across Windows/macOS/Linux, abstracting platform-specific window manager APIs (wmctrl, NSWindow, Windows API) behind a single interface
vs others: Combines window enumeration and focus control in a single MCP tool, whereas most automation frameworks require separate window management libraries or platform-specific code
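A minimal sketch of what a single enumerate-and-focus interface over swappable backends could look like. Every name here (`WindowInfo`, `WindowBackend`, `InMemoryBackend`, `focus_by_title`) is hypothetical and not taken from the server's code; the in-memory backend only stands in for a platform-specific one, which is not shown.

```python
from __future__ import annotations

from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass(frozen=True)
class WindowInfo:
    window_id: int
    title: str
    app: str


class WindowBackend(ABC):
    """Hypothetical common interface; one subclass per platform."""

    @abstractmethod
    def list_windows(self) -> list[WindowInfo]: ...

    @abstractmethod
    def focus(self, window_id: int) -> bool: ...


class InMemoryBackend(WindowBackend):
    """Stand-in backend for illustration; it talks to no real window manager."""

    def __init__(self, windows: list[WindowInfo]) -> None:
        self._windows = {w.window_id: w for w in windows}
        self.focused_id: int | None = None

    def list_windows(self) -> list[WindowInfo]:
        return list(self._windows.values())

    def focus(self, window_id: int) -> bool:
        if window_id not in self._windows:
            return False
        self.focused_id = window_id
        return True


def focus_by_title(backend: WindowBackend, fragment: str) -> WindowInfo | None:
    """Enumerate windows, then focus the first whose title contains `fragment`."""
    for window in backend.list_windows():
        if fragment.lower() in window.title.lower():
            return window if backend.focus(window.window_id) else None
    return None
```

Because callers only see `WindowBackend`, the same `focus_by_title` helper would work unchanged against any platform backend that implements the two methods.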
via “multi-window-and-application-context-management”
I've been building computer-use tools for a while, and I quietly launched this about a month ago (122 Stars on GH). I figured it was worth sharing here. Over the last few months, a lot of computer-use agents have come out: Codex, Claude Code, CUA, and others. Most of them seem to work roughly li
Unique: Maintains a persistent window registry and focus state rather than treating each window interaction independently, which lets agents reason about application context and coordinate actions across multiple windows
vs others: More sophisticated than simple window switching because it tracks window state and properties, enabling agents to make intelligent decisions about which window to target based on application context
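To illustrate the idea of a persistent registry with focus state, here is a small sketch. It assumes windows are identified by an integer id and that the caller reports each enumeration and focus change; `TrackedWindow` and `WindowRegistry` are hypothetical names, not the project's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrackedWindow:
    window_id: int
    app: str
    title: str


class WindowRegistry:
    """Keeps known windows and a focus history across interactions."""

    def __init__(self):
        self._windows = {}
        self._focus_history = []  # window ids, most recently focused last

    def sync(self, snapshot):
        """Adopt the latest enumeration and forget windows that have closed."""
        self._windows = {w.window_id: w for w in snapshot}
        self._focus_history = [
            wid for wid in self._focus_history if wid in self._windows
        ]

    def record_focus(self, window_id):
        if window_id not in self._windows:
            raise KeyError(window_id)
        if window_id in self._focus_history:
            self._focus_history.remove(window_id)
        self._focus_history.append(window_id)

    @property
    def focused(self):
        if not self._focus_history:
            return None
        return self._windows[self._focus_history[-1]]

    def last_focused_for_app(self, app):
        """Most recently focused window belonging to `app`, if any."""
        for wid in reversed(self._focus_history):
            if self._windows[wid].app == app:
                return self._windows[wid]
        return None
```

With state like this, an agent can answer "which browser window was I last using?" without re-deriving it from a fresh screenshot on every step.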
via “window management and focus control via mcp”
Zero-dependency macOS desktop automation for AI agents. Screenshot, mouse, keyboard, clipboard, and window control via MCP. 18 tools, macOS 13+, one command: npx mac-use-mcp.
Unique: Provides unified window enumeration and control through MCP by querying macOS Accessibility API (AXUIElement), enabling agents to discover and manage windows without parsing window manager output or using AppleScript, with direct focus control for multi-window workflows
vs others: More reliable than AppleScript window management because it uses native Accessibility APIs with structured data output, enabling agents to reliably identify windows by multiple attributes (title, app, PID) and chain window operations with screenshot context
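Identifying a window by several attributes at once can be sketched as below. This is an illustration over plain records, assuming each window exposes a title, an app name and a PID as the description states; `WindowRecord` and `match_windows` are hypothetical, and no Accessibility API call is made here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WindowRecord:
    title: str
    app: str
    pid: int


def match_windows(windows, title=None, app=None, pid=None):
    """Return windows that satisfy every criterion that was supplied.

    `title` is a case-insensitive substring match; `app` and `pid` are exact.
    Criteria left as None are ignored.
    """
    def matches(w):
        return (
            (title is None or title.lower() in w.title.lower())
            and (app is None or w.app == app)
            and (pid is None or w.pid == pid)
        )

    return [w for w in windows if matches(w)]
```

Combining criteria narrows ambiguous matches: two apps may both have an "Inbox" window, but adding `app` or `pid` singles one out.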
via “window and space management with heuristic-based window selection”
A macOS-only MCP server that enables AI agents to capture screenshots of applications or the entire system.
Unique: Heuristic-based window selection system that ranks windows by relevance (foreground status, recent focus, window type) rather than simple first-match; includes specialized handling for multi-window applications and edge cases like hidden/minimized windows
vs others: More intelligent than simple window enumeration because it uses heuristics to select the most relevant window when an application has multiple windows; more robust than coordinate-based window targeting because it uses semantic window references
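One way to picture ranking by relevance instead of taking the first match is a simple additive score. The signals below follow the ones named above (foreground status, recent focus, window type, minimized state), but the weights, the `Candidate` type and the scoring formula are assumptions made up for this sketch, not the server's actual heuristics.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    title: str
    is_foreground: bool
    seconds_since_focus: float
    kind: str  # "normal", "dialog" or "utility"
    minimized: bool = False


# Hypothetical weights: ordinary windows outrank dialogs and tool palettes.
KIND_WEIGHT = {"normal": 2.0, "dialog": 1.0, "utility": 0.0}


def score(window):
    total = 0.0
    if window.is_foreground:
        total += 5.0
    # Recency bonus decays from 3.0 toward 0 as time since focus grows.
    total += 3.0 / (1.0 + window.seconds_since_focus / 60.0)
    total += KIND_WEIGHT.get(window.kind, 0.0)
    if window.minimized:
        total -= 4.0
    return total


def pick_window(candidates):
    """Return the highest-scoring candidate, or None if there are none."""
    return max(candidates, key=score) if candidates else None
```

A scored approach degrades gracefully: a minimized window can still win if it is the only candidate, whereas a hard filter would return nothing.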
via “window enumeration and metadata retrieval”
Programmatic control over Windows system operations including mouse, keyboard, window management, and screen capture using nut.js.
Unique: Provides a JavaScript-friendly abstraction over Windows EnumWindows API, returning structured window objects with bounds and metadata rather than raw window handles, enabling filter-and-find patterns without low-level Win32 knowledge
vs others: More efficient than polling for window changes because enumeration is a single system call; more reliable than title-based lookup in AutoHotkey because it returns structured metadata enabling multi-criteria filtering
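The filter-and-find pattern over structured window objects can be sketched as follows. The sketch is in Python for illustration only and does not reproduce the project's JavaScript API; `Bounds`, `Window` and `find_window` are hypothetical names, and the window list is supplied by the caller instead of coming from a real enumeration call.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Bounds:
    x: int
    y: int
    width: int
    height: int

    def contains(self, px, py):
        """True if the point lies inside the rectangle (right/bottom edges excluded)."""
        return (
            self.x <= px < self.x + self.width
            and self.y <= py < self.y + self.height
        )


@dataclass(frozen=True)
class Window:
    handle: int
    title: str
    bounds: Bounds


def find_window(windows, predicate):
    """Return the first window for which `predicate` is true, else None."""
    return next((w for w in windows if predicate(w)), None)
```

Because each object carries its bounds and title, a caller can combine criteria in one predicate, for example `find_window(windows, lambda w: w.title == "Editor" and w.bounds.width >= 800)`.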
© 2026 Unfragile. The platform for software for agents.