<p><span class="h-card" translate="no"><a href="https://mastodon.social/@whitequark" class="u-url mention">@<span>whitequark</span></a></span> Maybe put differently, the same kinds of arguments and excuses used to justify A/B testing are now used to justify tweaking LLMs &quot;based on user feedback.&quot; OpenAI has even admitted to that recent versions of 4o encouraged people to think of themselves as religious prophets because of how they interpreted user feedback.</p><p>I posit that understanding why A/B testing can both be useful and harmful is itself useful in deflating OpenAI&#39;s arguments.</p>
Reply