{"id":126774,"date":"2024-11-17T05:16:56","date_gmt":"2024-11-16T22:16:56","guid":{"rendered":"https:\/\/hotvideos24.online\/?p=126774"},"modified":"2024-11-17T05:16:56","modified_gmt":"2024-11-16T22:16:56","slug":"playing-chess-against-llms-and-the-mystery-of-instruct-models","status":"publish","type":"post","link":"https:\/\/hotvideos24.online\/?p=126774","title":{"rendered":"Playing Chess Against LLMs And The Mystery Of Instruct Models"},"content":{"rendered":"<div itemprop=\"articleBody\">\n<p>At first glance, trying to play chess against a large language model (LLM) seems like a daft idea, as its weighted nodes have, at most, been trained on some chess-adjacent texts. It has no concept of board state, stratagems, or even whatever a \u2018rook\u2019 or \u2018knight\u2019 piece is. This daftness is indeed demonstrated by [Dynomight] <a href=\"https:\/\/dynomight.net\/chess\/\" target=\"_blank\" rel=\"noopener\">in a recent blog post<\/a> (<a href=\"https:\/\/dynomight.substack.com\/p\/chess\" target=\"_blank\" rel=\"noopener\">Substack version<\/a>), where the <a href=\"https:\/\/stockfishchess.org\/\" target=\"_blank\" rel=\"noopener\">Stockfish<\/a> chess AI is pitted against a range of LLMs, from a small Llama model to GPT-3.5. 
Although the outcomes (see featured image) are largely as you\u2019d expect, there is one surprise: the <code>gpt-3.5-turbo-instruct<\/code> model seems quite capable of giving Stockfish a run for its money, albeit on Stockfish\u2019s lower settings.<\/p>\n<p>Each model was given the same query, telling it to be a chess grandmaster, to use standard notation, and to choose its next move. The stark difference between the instruct model and the others calls for investigation. OpenAI describes the instruct model as an \u2018InstructGPT 3.5 class model\u2019, which <a href=\"https:\/\/openai.com\/index\/instruction-following\/\" target=\"_blank\" rel=\"noopener\">leads us to this page<\/a> on OpenAI\u2019s site and an <a href=\"https:\/\/arxiv.org\/abs\/2203.02155\" target=\"_blank\" rel=\"noopener\">associated 2022 paper<\/a> that describes how InstructGPT is effectively the standard GPT LLM heavily fine-tuned using human feedback.<\/p>\n<p>Ultimately, it seems that instruct models do better with instruction-based queries because they have been trained that way through extensive fine-tuning. A <a href=\"https:\/\/news.ycombinator.com\/item?id=37558911\" target=\"_blank\" rel=\"noopener\">[Hacker News] thread from last year<\/a> discusses the Turbo vs Instruct versions of GPT 3.5. That thread also uses chess as a comparison point. Meanwhile, <a href=\"https:\/\/openai.com\/index\/chatgpt\/\" target=\"_blank\" rel=\"noopener\">ChatGPT is a sibling of InstructGPT<\/a>, per OpenAI, using Reinforcement Learning from Human Feedback (RLHF), with presumably ChatGPT users now mostly providing said feedback.<\/p>\n<p>OpenAI notes repeatedly that neither InstructGPT nor ChatGPT provides correct responses all the time. 
However, within the limited problem space of chess, it would seem that it\u2019s good enough not to bore a dedicated chess AI into digital oblivion.<\/p>\n<p>If you want a digital chess partner, try your <a href=\"https:\/\/hackaday.com\/2024\/03\/30\/playing-chess-against-your-printer-with-postscript\/\">Postscript printer<\/a>. Chess software doesn\u2019t have to be as <a href=\"https:\/\/hackaday.com\/2023\/06\/23\/a-chess-ai-in-only-4k-of-memory\/\">large<\/a> as an AI model.<\/p>\n<\/p><\/div>\n<p><a href=\"https:\/\/hackaday.com\/2024\/11\/16\/playing-chess-against-llms-and-the-mystery-of-instruct-models\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>At first glance, trying to play chess against a large language model (LLM) seems like a daft idea, as its weighted nodes have, at most, been trained on some chess-adjacent &hellip; <a href=\"https:\/\/hotvideos24.online\/?p=126774\" class=\"more-link\">Read 
More<\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[8630],"tags":[],"class_list":["post-126774","post","type-post","status-publish","format-standard","hentry","category-technology","entry"],"_links":{"self":[{"href":"https:\/\/hotvideos24.online\/index.php?rest_route=\/wp\/v2\/posts\/126774","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hotvideos24.online\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hotvideos24.online\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hotvideos24.online\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/hotvideos24.online\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=126774"}],"version-history":[{"count":0,"href":"https:\/\/hotvideos24.online\/index.php?rest_route=\/wp\/v2\/posts\/126774\/revisions"}],"wp:attachment":[{"href":"https:\/\/hotvideos24.online\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=126774"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hotvideos24.online\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=126774"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hotvideos24.online\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=126774"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}