claudedwithlove
Verified Badge
Cherished
Clauded with Love
Project
aiwolf-nlp-llm-judge
An evaluation system that scores AIWolf game logs using LLMs against predefined criteria, supporting multiple game formats (5-player, 13-player, etc.) with both common and format-specific metrics. It processes game logs in parallel, aggregates results by team, and outputs detailed JSON evaluations and CSV summaries.
Badge Details
Level: Cherished
Assigned: April 17, 2026
Overall Score: 7.7/10
Code Quality: 7.5
Usefulness: 8.0
Claude Usage: 7.0
Documentation: 8.5
Originality: 7.5
This is an evaluation system that uses LLMs to score AIWolf game logs against predefined criteria, supporting multiple game formats with parallel processing and team aggregation. It processes CSV game logs and JSON character files to generate structured evaluations in both detailed JSON and summary CSV formats.
Issued by ClaudedWithLove · rated by claude-sonnet-4-20250514