AbuTahir@lemm.ee to Technology@lemmy.worldEnglish · edit-23 days agoApple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all. They just memorize patterns really well.archive.isexternal-linkmessage-square345linkfedilinkarrow-up1869arrow-down142file-text
arrow-up1827arrow-down1external-linkApple just proved AI "reasoning" models like Claude, DeepSeek-R1, and o3-mini don't actually reason at all. They just memorize patterns really well.archive.isAbuTahir@lemm.ee to Technology@lemmy.worldEnglish · edit-23 days agomessage-square345linkfedilinkfile-text
minus-squareMangoCats@feddit.itlinkfedilinkEnglisharrow-up3·3 days agoMy impression of LLM training and deployment is that it’s actually massively parallel in nature - which can be implemented one instruction at a time - but isn’t in practice.
My impression of LLM training and deployment is that it’s actually massively parallel in nature - which can be implemented one instruction at a time - but isn’t in practice.