AI Scores Well on Chart Comparison Test, But the Scoring Method May Be Flawed — Markdown | type0 | type0