agent-evaluation

Low signal level, contained permissions, and limited attack surface.

Top Tier

100/100

修复建议

✅ No risks detected. This skill appears safe to use.

检测到的风险0

最近一次扫描未发现风险。

Voir les risques detectes

Connectez-vous pour consulter l'analyse detaillee des risques.

$npx agentfend install cmn92pdvl00cju1ipk8kdm8cn

Trust Score

Top Tier

100trust

⭐ 2.8万🍴 4667

Updated 2周前

分析时间

2026年3月31日 15:56

+ 2 previous scans

兼容

AGAntigravity

Skill details

Trust score

100/100

GitHub

Connected

Stars

2.8万

Forks

4667

Updated 2周前

分析时间 2026年3月31日 15:56

说明

"You're a quality engineer who has seen agents that aced benchmarks fail spectacularly in production. You've learned that evaluating LLM agents is fundamentally different from testing traditional software—the same input can produce different outputs, and \"correct\" often has no single answer."

查看源码