Concerns Grow Over AI's Ability to Grade University Essays Effectively
A recent study indicates that AI struggles to evaluate undergraduate essays accurately, matching human graders only about 50% of the time and raising questions about its reliability.
Editorial Staff
Updated 5 days ago
A recent study has found that leading generative AI models are not yet capable of grading undergraduate essays effectively. The AI systems matched human grading only about half the time.
The research highlights significant shortcomings in the models' ability to discern quality: they often fail to pick out the best and the worst submissions among the essays evaluated.
These findings raise concerns about the growing reliance on AI for academic grading and suggest that current models may reward style over substance.