Run Tests
by SprocketLab · devops · python, bash, docker, skill
Run problem tests using eval-snapshot instead of raw pytest. Use this to evaluate solutions against benchmark tests in Docker. Invoke with /run-tests <snapshot_path> <problem_name> <checkpoint_index>.
Source: https://github.com/SprocketLab/slop-code-bench
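The invocation format described above takes three positional arguments. As a rough illustration only, the sketch below splits such an invocation string into its parts. The helper `parse_run_tests` and the `RunTestsArgs` type are hypothetical names and are not part of slop-code-bench, and treating `checkpoint_index` as an integer is an assumption based on its name.

```python
import shlex
from typing import NamedTuple


class RunTestsArgs(NamedTuple):
    """Hypothetical container for the three documented arguments."""
    snapshot_path: str
    problem_name: str
    checkpoint_index: int  # assumed to be an integer; the listing does not say


def parse_run_tests(invocation: str) -> RunTestsArgs:
    """Split '/run-tests <snapshot_path> <problem_name> <checkpoint_index>'."""
    # shlex.split honours shell-style quoting, so paths with spaces can be quoted.
    tokens = shlex.split(invocation)
    if not tokens or tokens[0] != "/run-tests":
        raise ValueError("expected an invocation starting with /run-tests")
    if len(tokens) != 4:
        raise ValueError(
            "usage: /run-tests <snapshot_path> <problem_name> <checkpoint_index>"
        )
    snapshot_path, problem_name, raw_index = tokens[1:]
    try:
        checkpoint_index = int(raw_index)
    except ValueError:
        raise ValueError(
            f"checkpoint_index must be an integer, got {raw_index!r}"
        ) from None
    return RunTestsArgs(snapshot_path, problem_name, checkpoint_index)
```

Calling `parse_run_tests("/run-tests ./snap demo_problem 2")` with these placeholder values returns a `RunTestsArgs` tuple; how the skill itself passes the arguments on to eval-snapshot is documented in the source repository, not here.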
Install
git clone https://github.com/SprocketLab/slop-code-bench
Tags: python, bash, docker, skill
⭐ 73 GitHub stars · Source: skillsdirectory
Related devops MCPs and skills
- Triage Codebase by Sw4mpf0x
Triage and document an application's codebase. Use when asked to analyze, triage, or summarize a codebase. Produces a structured report covering tech stack, user roles, component communication, backend infrastructure, container orchestration, secrets management, and deployment pipeline.
- Aspire Monitoring by foxminchan
**ANALYSIS SKILL** - Observe Aspire apps: logs, traces, metrics, resource state, telemetry export, browser telemetry, and the standalone dashboard. Routes between local Aspire CLI, AKS workload diagnostics, and deployed Azure resource health. USE FOR: aspire logs, aspire otel logs, aspire otel traces, aspire otel spans, aspire describe, aspire ps, aspire export, aspire dashboard run, --include-hidden, browser logs in dashboard, WithBrowserLogs, App Insights query, AKS pod logs, container app ...
- Coverage Report by melodic-software
Generate test coverage reports with HTML visualization and threshold enforcement
- Ros2 Engineering Skills by LeoYeAI
Comprehensive ROS 2 engineering guide covering workspace setup, node architecture, communication patterns (topics/services/actions with QoS), lifecycle and component nodes, launch composition, tf2/URDF, ros2_control hardware interfaces, real-time constraints, Nav2, MoveIt 2, perception pipelines, simulation (Gazebo/Isaac Sim), security (SROS2/DDS), micro-ROS (MCU/RTOS), multi-robot systems (fleet management/Open-RMF), testing, debugging, deployment, and ROS 1 migration. Trigger whenever the u...
- Good Self Contained Skill by bobmatnyc
Example of a properly self-contained skill following all best practices
- Phoenix Ops by bobmatnyc
Phoenix operations and deployment: releases, runtime configuration, clustering, libcluster, telemetry/logging, secrets, assets, background jobs, and production hardening on the BEAM.
- Depot Github Runners by PostHog
Configures Depot-managed GitHub Actions runners as a drop-in replacement for GitHub-hosted runners. Use when setting up or migrating GitHub Actions workflows to use Depot runners, choosing runner sizes (CPU/RAM), configuring runs-on labels, setting up ARM or Windows or macOS runners, troubleshooting GitHub Actions runner issues, configuring egress filtering, using Depot Cache with GitHub Actions, or running Dagger/Dependabot on Depot runners. Also use when the user mentions depot-ubuntu, depo...
- Argocd Expert by pedronauck
Expert-level ArgoCD GitOps deployment, application management, sync strategies, and production operations
Frequently asked questions
How do I install Run Tests?
Run the install command above to clone the repository, then register the skill with your Claude Code, Cursor, or Windsurf configuration. Refer to the source repository for setup steps, environment variables, and credential requirements.
Which clients support Run Tests?
Any MCP-compatible client works: Claude Desktop, Claude Code CLI, Cursor, Windsurf, Zed, and VS Code with the official MCP extension. OpenAI Codex and GitHub Copilot increasingly support MCP via adapter bridges.
Is Run Tests free?
The server itself is typically open source. Any upstream service (API keys, paid tiers, hosted infrastructure) may have its own pricing. Check the source repository for details.