読み込み中...
読み込み中...
Complete AGI Capability Measurement with 4-Tier Structure
AGI Olympics V3 is a new benchmark framework for comprehensively measuring AGI (Artificial General Intelligence) capabilities. Through a 4-Tier structure, it systematically evaluates all capabilities necessary for AGI, from self-awareness to core capabilities, consciousness, and long-term memory.
Measures the ability of AGI systems to recognize themselves and continuously improve
Evaluates fundamental AGI capabilities such as abstract reasoning, multi-domain knowledge, and code generation
Measures conscious experience, qualia, and philosophical reasoning capabilities
Evaluates long-term memory retention and information integration capabilities. Demonstrates long context ≠ true memory
Framework details and evaluation results for Tier 1 (Self-Awareness & Self-Improvement) and Tier 4 (Long-Term Memory)
Detailed evaluation results of Project A.L.I.C.E. on ARC, MMLU, and HumanEval
Comprehensive evaluation of philosophical reasoning, consciousness consistency, and qualia detection
Sakamoto, M. (2025). AGI Olympics V3: Comprehensive AGI Capability Evaluation Framework - Proposal and Public Release. Extoria Research. https://extoria.co.jp/research/benchmarks/agi-olympics-v3/