goosethe@lemmy.sdf.orgM to math@lemmy.sdf.org · 3 years agoTaming AI Bots: Prevent LLMs from entering "bad" states using continuous guidance from the LLM ("is this good? bad?") to avoid bad states.arxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkTaming AI Bots: Prevent LLMs from entering "bad" states using continuous guidance from the LLM ("is this good? bad?") to avoid bad states.arxiv.orggoosethe@lemmy.sdf.orgM to math@lemmy.sdf.org · 3 years agomessage-square0linkfedilink