Back to list
dev_to 2026年3月7日

PowerSkills: PowerShell を使ってWindowsとAIボットをコントロール

PowerSkills: Giving AI Agents Control Over Windows with PowerShell

Translated: 2026/3/7 9:12:20

Japanese Translation

あなたがAIボットを作成している場合、おそらくすでに気付いたと思います:多くのツールニングはLinuxやmacOSを使用します。その一方で、Windowsの自動化は、後付けなのです。

Original Content

If you're building AI agents that need to interact with Windows, you've probably noticed: most agent tooling assumes Linux or macOS. Windows automation is an afterthought. But enterprise work happens on Windows. Outlook holds the emails. Edge holds the browser sessions. PowerShell is the automation backbone. PowerSkills bridges this gap. PowerSkills is an open-source PowerShell toolkit that gives AI agents structured control over Windows - Outlook email, Edge browser, desktop windows, and system operations. Every action returns clean, parseable JSON. Outlook - Read inbox, search emails, send messages, access calendar events via COM automation Browser - Control Edge via Chrome DevTools Protocol (CDP) - list tabs, navigate, take screenshots, interact with the DOM Desktop - Manage windows, capture screenshots, read/write clipboard, send keystrokes via Win32 API System - Query system info, manage processes, execute commands, read environment variables Every action returns a consistent envelope. No more parsing free-text output: { "status": "success", "exit_code": 0, "data": { "hostname": "WORKSTATION-01", "os": "Microsoft Windows 11 Pro", "memory_gb": 32 }, "timestamp": "2026-03-06T17:30:00Z" } Agents check status, extract data, handle errors - no regex needed. # Dispatcher mode .\powerskills.ps1 system info .\powerskills.ps1 outlook inbox --limit 5 .\powerskills.ps1 browser tabs .\powerskills.ps1 desktop screenshot --path C:\temp\screen.png # Standalone mode .\skills\system\system.ps1 info .\skills\outlook\outlook.ps1 inbox --limit 5 Each skill includes a SKILL.md file with structured metadata - name, description, available actions, and parameters. AI agents can discover and understand capabilities without hardcoded instructions. No package manager, no installer. Just PowerShell 5.1+ and Windows 10/11: Clone or download the repository Run: .\powerskills.ps1 list For browser skills: launch Edge with --remote-debugging-port=9222 Note: If scripts are blocked, set the execution policy: Set-ExecutionPolicy -Scope CurrentUser -ExecutionPolicy RemoteSigned PowerSkills is MIT licensed. Contributions, issues, and stars are welcome: github.com/aloth/PowerSkills If you're building agents that need to work with Windows, I'd love to hear how you're approaching the problem. What other Windows capabilities would be useful for your agent workflows?