360
pts
SkillsBench: Benchmarking how well agent skills work across diverse tasks
AI Summary
SkillsBench is a framework designed to evaluate the performance of agent skills across a variety of tasks, aiming to identify strengths and weaknesses in agent capabilities. By benchmarking skills in diverse contexts, it provides insights that can help enhance agent performance and adapt them to specific needs more effectively.