Why AI researchers are questioning whether reward maximization is the wrong objective — type0 | type0