VeryFrugal@sh.itjust.works to A Boring Dystopia@lemmy.worldEnglish · edit-220 days agoPeople got mad after they lost their AI boyfriend after GPT-4o deprecationwww.reddit.comexternal-linkmessage-square59linkfedilinkarrow-up1333file-text
arrow-up1333external-linkPeople got mad after they lost their AI boyfriend after GPT-4o deprecationwww.reddit.comVeryFrugal@sh.itjust.works to A Boring Dystopia@lemmy.worldEnglish · edit-220 days agomessage-square59linkfedilinkfile-text
minus-squarebrucethemoose@lemmy.worldlinkfedilinkarrow-up8·edit-219 days agoThis is a common pattern unfortunately. Big LLMs are benchmaxxing coding and one shot answers, and multi turn conversation is taking a nosedive. https://arxiv.org/abs/2504.04717 Restructure your prompts, or better yet try non-OpenAI LLMs. I’d suggest z.ai, Jamba, and Gemini Pro for multi turn. Maybe Qwen Code, though it’s pretty deep fried too.
This is a common pattern unfortunately. Big LLMs are benchmaxxing coding and one shot answers, and multi turn conversation is taking a nosedive.
https://arxiv.org/abs/2504.04717
Restructure your prompts, or better yet try non-OpenAI LLMs. I’d suggest z.ai, Jamba, and Gemini Pro for multi turn. Maybe Qwen Code, though it’s pretty deep fried too.