Host transparent, reproducible AI debates to evaluate models, train teams, and surface explainable outcomes — all in one lightweight platform.
Aiduel helps teams and educators run structured debates between language models or agent configurations. We focus on explainable outcomes, reproducible setups, and easy-to-understand reports so decisions about model selection are defensible.
Define duel presets, seed data, and judge criteria so experiments can be shared and re-run with exact settings (an illustrative preset sketch follows below).
Automated and human-in-the-loop evaluation with structured rationales and scoring breakdowns for every round.
Visualize comparative performance, track regressions, and export data for publications or classroom use.
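To make the reproducibility claim concrete, a duel preset could be captured as plain, versionable data. This is a minimal sketch under that assumption; the field names (models, rubric, judge, rounds, seed) are illustrative, not Aiduel's actual schema.

```python
# Hypothetical duel preset: field names are illustrative, not Aiduel's real schema.
# Keeping the preset as plain data (easily serialized to JSON or YAML) is what
# lets an experiment be shared and re-run with exactly the same settings.
duel_preset = {
    "name": "summarization-duel-v1",
    "models": ["model-a", "model-b"],   # the two debaters
    "prompt": "Which summary better preserves the source's key claims?",
    "rubric": {                         # scoring criteria and their weights
        "factuality": 0.4,
        "clarity": 0.3,
        "coverage": 0.3,
    },
    "judge": "automated",               # or "human-in-the-loop"
    "rounds": 3,
    "seed": 42,                         # fixed seed so re-runs match exactly
}
```

Because the preset is just data, it can be checked into version control and distributed alongside transcripts and scores.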
Select models, prompts, scoring rubrics, and any constraints, then save them as a reusable preset.
Execute head-to-head debates with streaming transcripts, paired comparison views, and optional human judging.
View explainable rationales, export CSVs, and generate shareable reports for teams or students (a brief run-and-export sketch follows below).
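For a sense of how these steps could fit together programmatically, here is a minimal, self-contained sketch. The AiduelClient class, its run_duel method, and the result fields are assumptions made for illustration only, not a documented Aiduel API.

```python
# Purely illustrative: the AiduelClient class and its method names are
# assumptions for this sketch, not a documented API.
import csv

class AiduelClient:
    """Stand-in for a hypothetical Aiduel client."""

    def run_duel(self, preset: dict) -> list[dict]:
        # A real platform would stream transcripts and judge each round;
        # this stub just returns placeholder per-round scores.
        return [
            {"round": i + 1, "model_a": 0.0, "model_b": 0.0, "rationale": ""}
            for i in range(preset.get("rounds", 1))
        ]

client = AiduelClient()
rounds = client.run_duel({"models": ["model-a", "model-b"], "rounds": 3})

# Export per-round scores to CSV for a shareable report.
with open("duel_scores.csv", "w", newline="") as f:
    writer = csv.DictWriter(f, fieldnames=["round", "model_a", "model_b", "rationale"])
    writer.writeheader()
    writer.writerows(rounds)
```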
Ideal for classrooms and experimentation.
For researchers and small teams.
Dedicated instances, on-prem options, and SLAs.
"Aiduel gave us a reliable, auditable way to compare model variants and document why we selected our final model for deployment."
"Our students learned critical evaluation skills by moderating model debates. The transcripts and scoring rubrics are superb for grading."
"The side-by-side transcripts and automated scoring cut review time in half and made model tradeoffs transparent to stakeholders."
Join our newsletter for product updates, research highlights, and classroom resources.