pseudo-randomize grading order #57
As discussed in #57, graders tend to get harsher and less helpful over time. Sorting exams by instance means these effects add up systematically, because the same exams end up late in the queue for every problem. The proposed changes use a different order for each problem. To be honest, I'm not 100% convinced that this is a good idea.
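
For illustration only, here is a minimal sketch of what "a different order for each problem" could look like. It is not the code in this PR: the function name `grading_order` and the parameters `exam_ids` and `problem_id` are hypothetical, and the sketch assumes that exams and problems are identified by simple, sortable ids.

```python
import random


def grading_order(exam_ids, problem_id):
    """Return exam_ids in a pseudo-random order that is stable per problem.

    Hypothetical helper: names and signature are assumptions, not project API.
    """
    # Seed a private generator with the problem id, so the order is
    # reproducible for one problem but generally differs between problems.
    rng = random.Random(f"grading-order-{problem_id}")
    # Start from a canonical order so the result does not depend on
    # how the caller happened to list the exams.
    order = sorted(exam_ids)
    rng.shuffle(order)
    return order


if __name__ == "__main__":
    exams = list(range(1, 11))
    for problem in (1, 2, 3):
        print(problem, grading_order(exams, problem))
```

Seeding with the problem id keeps the order the same every time a grader reopens a problem, so nobody loses their place, while no exam is systematically graded last across all problems.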