Automatic short answer grading at scale using Large Language Models