Test the problem against the model in batches of parallel attempts, tuning the problem difficulty until the agent only succeeds in a small number of attempts
Once you're happy with the task, and it scores within range, the task goes to a senior reviewer in your subfield. They will provide feedback to ensure task quality is high
Degree in Electrical Engineering or related field;
...