hugging-face-evaluation

Name: hugging-face-evaluation
Availability: InStock
Author: huggingface

Add and manage evaluation results in Hugging Face model cards. Supports extracting eval tables from README content, importing scores from Artificial Analysis…

view source

installs

stars

karma

SKILL.md

Overview

This skill provides tools to add structured evaluation results to Hugging Face model cards. It supports multiple methods for adding evaluation data:

Extracting existing evaluation tables from README content

Importing benchmark scores from Artificial Analysis

Running custom model evaluations with vLLM or accelerate backends (lighteval/inspect-ai)

Integration with HF Ecosystem

Model Cards: Updates model-index metadata for leaderboard integration

Artificial Analysis: Direct API integration for benchmark imports

Papers with Code: Compatible with their model-index specification

Jobs: Run evaluations directly on Hugging Face Jobs with uv integration

vLLM: Efficient GPU inference for custom model evaluation

lighteval: HuggingFace's evaluation library with vLLM/accelerate backends

inspect-ai: UK AI Safety Institute's evaluation framework

Version

1.3.0

Dependencies

related skills

semantically similar in the cross-vendor index

skills.sh

63% match

huggingface-community-evals

Run evaluations for Hugging Face Hub models using inspect-ai and lighteval on local hardware. Use for backend selection, local GPU evals, and choosing between…

don't have the plugin yet? install it then click "run inline in claude" again.