Discussion about this post

Shane P

Internalize the revision process and the judge, not the student and the artifact.

Write the rubric, then the C student's report, then the A student's report, then the grader's review.

Then the second revisions from each student and the second review from the grader.

Don't manufacture answer machines; manufacture revision machines that refine crap into gold. Don't write my paper for me; write the entire history of my paper and its complete 20-revision cycle.

My paper is already world class? Great, rewrite it without the letter e, or in the style of C.S. Lewis or Hemingway. Don't just do it: tell me how to do it, then do it, then tell me how to further improve it.

People are still so scarcity-minded. Follow the gradient; don't stop yet.

Srikanth Vidapanakal

RL also learns from process rewards, not just from binary outcome rewards, does it not?
