Friday, May 15

News Feed

Category Added in a WPeMatico Campaign

Chatbot Arena - if you’ve felt that Claude 3 Opus still holds a slight edge over the new GPT-4 Turbo, we now understand why
News Feed, Reddit

Chatbot Arena – if you’ve felt that Claude 3 Opus still holds a slight edge over the new GPT-4 Turbo, we now understand why

If we exclude the refusals (e.g., "I cannot answer") ,and only tally votes for actual responses, Claude 3 Opus continues to be marginally superior to the new GPT-4 Turbo. Yes, you might think it’s pure bias on my part, but if you’re looking to compare the chatbots based on the quality of their responses when they do provide an answer, then excluding refusals might be a reasonable approach. This could give you a clearer picture of how well each chatbot performs when it is able to engage in a conversation. https://preview.redd.it/868h3r3p5quc1.png?width=1801&format=png&auto=webp&s=35b52face87ff90d405b85db74fc92e048f0e657 ​ submitted by /u/ok373737 [link] [comments]
The AI Report