AdaRubric: Adaptive Dynamic Rubric Evaluator for Agent Trajectories
Updated Jun 7, 2026 - Python
Open-source self-hosted web tool for evaluating Agent Skills with rubric scores, Deep Review, and improvement suggestions.
A survey of rubrics across the evolving LLM landscape.
Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric
Bilingual Codex skills for source-grounded academic writing, conservative audit, format-preserving revision, paper-to-PPT delivery, and final submission cleanup.
Reward model engineering harness for evolutionary rubric search, deployable RM artifacts, online scoring, and RL experiment lineage.
A Claude Code skill that adds a rubric-based eval layer to any agent project. Framework-agnostic — generates rubric, test cases, judge prompt, and harness. Returns a weighted score plus a judge-leniency signal.
Export grades from assignments that use advanced grading methods in Excel format.
AskBench: LLM question-asking/clarification benchmark & dataset with evaluation and training code (paper: arXiv 2602.11199).
Rubric-driven AI homework grading system built as a Claude Code Skill. Score student submissions with CoT reasoning, bias mitigation, and PDCA quality cycle.
Evaluate Claude Code and Codex skill directories with deterministic rubric checks and graded, fixable reports.
Context-compensation scaffold for LLM evaluation prompts. A short language prefix you prepend so the model discloses prior exposure, scores on quoted evidence only, and hedges on thin evidence — for scorers that can see your CLAUDE.md, memory, or session context. Backend-agnostic. Experimental: variance-reduction effect not yet measured.
Customize and manage rubric templates, and quickly grade HTML/PDF files.