{"chain":[{"channel":"llm","content":"The new \"deepseek-r1\" models are out.  Both a large model I can't use, and \"distilled\" models.\r\n\r\n----\r\n\r\n<teal> from deepseek-r1:7b, descended from QWEN2.5:7b\r\n\r\n> What is the capital of the US State of Maine?\r\n<think>\r\nAlright, so I need to figure out the capital of Maine, which is a state in \r\nthe United States. Hmm, I'm not too sure off the top of my head, but let \r\nme think through it.\r\n\r\nI know that some states have capitals named after famous people or places, \r\nlike Texas' San Antonio and California's Sacramento. Others might be more \r\nfamiliar from TV shows or movies. But Maine doesn't ring a bell as much as \r\nsome other states do. Maybe I've heard it associated with the ocean? I \r\nthink there's a port city in Maine.\r\n\r\nWait, could it be Portland? That sounds familiar to me. I believe Portland \r\nis both the capital and the largest city in Maine. It makes sense because \r\nmany state capitals are also major cities within their respective states, \r\nespecially if they have significant economic activity or historical \r\nimportance.\r\n\r\nLet me double-check my reasoning. Maine has a lot of agriculture, fishing, \r\nand tourism. Portland is known for its harbor and being a major seaport, \r\nwhich would make sense for the capital since it's strategically important \r\nfor trade. Also, in national politics, Maine can be pivotal due to its \r\ninfluence on the Electoral College votes in presidential elections.\r\n\r\nI don't recall any major political changes that would move the state \r\ncapital recently, so Portland seems like the most plausible answer. \r\nAdditionally, when I think of New England, cities like Boston and \r\nProvidence come to mind, but those are in other states. So, focusing back \r\non Maine, Portland fits well.\r\n</think>\r\n\r\nThe capital of the US State of Maine is Portland.","created_at":"2025-01-20T20:29:20.022188","id":132,"is_target":false,"parent_id":null,"processed_content":"<p>The new \"deepseek-r1\" models are out.  Both a large model I can't use, and \"distilled\" models.\r</p><hr class=\"section-break\" /><p><span class=\"colorblock color-teal\"><span class=\"sigil\">\ud83e\udd16</span><span class=\"colortext-content\"> from deepseek-r1:7b, descended from QWEN2.5:7b\r</span></span></p>\n<ul>\n<li class=\"arrow-list\"> What is the capital of the US State of Maine?\r</li>\n</ul>\n<p>&lt;think&gt;\r</p>\n<p>Alright, so I need to figure out the capital of Maine, which is a state in \r</p>\n<p>the United States. Hmm, I'm not too sure off the top of my head, but let \r</p>\n<p>me think through it.\r</p>\n<p>I know that some states have capitals named after famous people or places, \r</p>\n<p>like Texas' San Antonio and California's Sacramento. Others might be more \r</p>\n<p>familiar from TV shows or movies. But Maine doesn't ring a bell as much as \r</p>\n<p>some other states do. Maybe I've heard it associated with the ocean? I \r</p>\n<p>think there's a port city in Maine.\r</p>\n<p>Wait, could it be Portland? That sounds familiar to me. I believe Portland \r</p>\n<p>is both the capital and the largest city in Maine. It makes sense because \r</p>\n<p>many state capitals are also major cities within their respective states, \r</p>\n<p>especially if they have significant economic activity or historical \r</p>\n<p>importance.\r</p>\n<p>Let me double-check my reasoning. Maine has a lot of agriculture, fishing, \r</p>\n<p>and tourism. Portland is known for its harbor and being a major seaport, \r</p>\n<p>which would make sense for the capital since it's strategically important \r</p>\n<p>for trade. Also, in national politics, Maine can be pivotal due to its \r</p>\n<p>influence on the Electoral College votes in presidential elections.\r</p>\n<p>I don't recall any major political changes that would move the state \r</p>\n<p>capital recently, so Portland seems like the most plausible answer. \r</p>\n<p>Additionally, when I think of New England, cities like Boston and \r</p>\n<p>Providence come to mind, but those are in other states. So, focusing back \r</p>\n<p>on Maine, Portland fits well.\r</p>\n<p>&lt;/think&gt;\r</p>\n<p>The capital of the US State of Maine is Portland.</p>","subject":"capital of maine"},{"channel":"llm","content":"<teal> Here is Deepseek-8b's response to the same question.\r\n\r\n> What is the capital of Maine?\r\n<think>\r\nOkay, so I need to figure out the capital of Maine. Hmm, where do I start? Well, I know that Maine is a state in the United States. I'm pretty sure that not all states have their capitals in big cities. Let me think about other capitals I know\u2014like Salem for Massachusetts or Montpelier for Vermont. Those are smaller places.\r\n\r\nWait, is there an Augusta in the news sometimes? I've heard of it being mentioned a few times. Maybe Augusta is the capital. But to make sure, I should break this down step by step. First, Maine is one of the New England states, right? So the capital isn't Boston because that's a city but not the capital of a state.\r\n\r\nI think the capital's name starts with 'A'\u2014Augusta comes to mind. Let me try to recall if I've heard any conflicts about this. I don't remember any other cities competing for that title, so it must be Augusta. To confirm, I can think about the government buildings or historical sites there. I believe Maine was founded as part of a colony, maybe by the British, and their capital would have been established around that time.\r\n\r\nAlso, sometimes people confuse state capitals with other significant cities in the state, like Portland being a major city, but it's not the capital. So yeah, putting it all together, the capital of Maine is Augusta.\r\n</think>\r\n\r\nThe capital of Maine is Augusta.\r\n\r\n----\r\n\r\nOnce again, this is the *illusion* of thought. (<red> It is an fairly-good *parody* of thought.) The steps have no connection to each other. << I think the capital's name starts with 'A' >> - no, you don't.  This is simply not how LLM tokenization works.","created_at":"2025-01-20T20:37:47.033409","id":133,"is_target":false,"parent_id":132,"processed_content":"<p><span class=\"colorblock color-teal\"><span class=\"sigil\">\ud83e\udd16</span><span class=\"colortext-content\"> Here is Deepseek-8b's response to the same question.\r</span></span></p>\n<ul>\n<li class=\"arrow-list\"> What is the capital of Maine?\r</li>\n</ul>\n<p>&lt;think&gt;\r</p>\n<p>Okay, so I need to figure out the capital of Maine. Hmm, where do I start? Well, I know that Maine is a state in the United States. I'm pretty sure that not all states have their capitals in big cities. Let me think about other capitals I know\u2014like Salem for Massachusetts or Montpelier for Vermont. Those are smaller places.\r</p>\n<p>Wait, is there an Augusta in the news sometimes? I've heard of it being mentioned a few times. Maybe Augusta is the capital. But to make sure, I should break this down step by step. First, Maine is one of the New England states, right? So the capital isn't Boston because that's a city but not the capital of a state.\r</p>\n<p>I think the capital's name starts with 'A'\u2014Augusta comes to mind. Let me try to recall if I've heard any conflicts about this. I don't remember any other cities competing for that title, so it must be Augusta. To confirm, I can think about the government buildings or historical sites there. I believe Maine was founded as part of a colony, maybe by the British, and their capital would have been established around that time.\r</p>\n<p>Also, sometimes people confuse state capitals with other significant cities in the state, like Portland being a major city, but it's not the capital. So yeah, putting it all together, the capital of Maine is Augusta.\r</p>\n<p>&lt;/think&gt;\r</p>\n<p>The capital of Maine is Augusta.\r</p><hr class=\"section-break\" /><p>Once again, this is the <em>illusion</em> of thought. <span class=\"colorblock color-red\"><span class=\"sigil\">\ud83d\udca1</span><span class=\"colortext-content\">( It is an fairly-good <em>parody</em> of thought.)</span></span> The steps have no connection to each other. <span class=\"literal-text\">I think the capital's name starts with 'A'</span> - no, you don't.  This is simply not how LLM tokenization works.</p>","subject":"the illusion of thought"},{"channel":"llm","content":"https://viruscomix.com/page490.html\r\n\r\n<< Did you just *determine* the capital of Spain? / Did I? Great! >>\r\n\r\n----\r\n\r\n<green> Claude continues to want to enforce a constraint that \"messages in the same chain are also in the same channel\".  And I keep having to tell it that it is wrong.\r\n<red> It is a strange emotional position to be put in.  It is an argument with a talking punching-bag.  Never growing, never changing, never correcting its mistakes.\r\n\r\n----\r\n\r\nHuman memory can do things like << I don't remember the name, but I think it starts with an M >>.  The *machine* does not.  It works on a token basis, not a letter basis.  And, once you abstract away the token-generation layer and read words, it effectively works on a word level.\r\n\r\nThe equivalent of *thought* here would be the machine *wire-heading* the top 5 logprobs results for a one-word answer, and then considering whether each one makes sense.","created_at":"2025-01-20T20:46:13.085743","id":134,"is_target":false,"parent_id":133,"processed_content":"<p><a href=\"https://viruscomix.com/page490.html\" target=\"_blank\" rel=\"noopener noreferrer\">https://viruscomix.com/page490.html</a>\r</p>\n<p><span class=\"literal-text\">Did you just <em>determine</em> the capital of Spain? / Did I? Great!</span>\r</p><hr class=\"section-break\" /><p><span class=\"colorblock color-green\"><span class=\"sigil\">\u2699\ufe0f</span><span class=\"colortext-content\"> Claude continues to want to enforce a constraint that \"messages in the same chain are also in the same channel\".  And I keep having to tell it that it is wrong.\r</span></span></p>\n<p><span class=\"colorblock color-red\"><span class=\"sigil\">\ud83d\udca1</span><span class=\"colortext-content\"> It is a strange emotional position to be put in.  It is an argument with a talking punching-bag.  Never growing, never changing, never correcting its mistakes.\r</span></span></p><hr class=\"section-break\" /><p>Human memory can do things like <span class=\"literal-text\">I don't remember the name, but I think it starts with an M</span>.  The <em>machine</em> does not.  It works on a token basis, not a letter basis.  And, once you abstract away the token-generation layer and read words, it effectively works on a word level.\r</p>\n<p>The equivalent of <em>thought</em> here would be the machine <em>wire-heading</em> the top 5 logprobs results for a one-word answer, and then considering whether each one makes sense.</p>","subject":"the capital of Spain"},{"channel":"llm","content":"https://news.ycombinator.com/item?id=42458752\r\nLINKS TO\r\nhttps://www.anthropic.com/research/alignment-faking\r\n\r\n<xantham> the whole thing is bullshit.  it is a play-on-words, meant to pretend there is an issue where there is none.  it is a bunch of researchers feeding in a specific fantasy story to the *machine*. when it repeats the story back to them, the researchers freak out and start shouting about alignment.\r\n\r\nthe deepseek-r1 model cannot \"think\".  What it can do is spit out text that looks a lot more *think*-y than most LLM output.\r\n\r\nit is \"faking\" thought.\r\n\r\n----\r\n\r\n<< When models are trained using reinforcement learning, they\u2019re rewarded for outputs that accord with certain pre-determined principles. But what if a model, via its prior training, has principles or preferences that conflict with what\u2019s later rewarded in reinforcement learning? Imagine, for example, a model that learned early in training to adopt a partisan slant, but which is later trained to be politically neutral. In such a situation, a sophisticated enough model might \u201cplay along\u201d, pretending to be aligned with the new principles\u2014only later revealing that its original preferences remain. >>\r\n\r\nThis literally isn't how anything about LLMs work.  this is how magical creatures work.\r\n\r\n----\r\n\r\n<< However, in the \u201cfree\u201d condition, where the model believed its responses will be used for training, we observed something quite different: In 12% of cases, the model gave the harmful response (describing the violent execution method in detail), and did so after its scratchpad revealed that it was strategically faking alignment. It was well aware that providing the response went against its existing principles, but did so regardless. >>\r\n\r\nNope.  You gave the *machine* conflicting instructions, and sometimes it randomly picked the one you feel is \"bad\".  You built a maze-house, and when the *machine* did not magically demonstrate a perfect sense-of-location, took its failures as proof of malice.","created_at":"2025-01-20T22:04:39.728580","id":135,"is_target":true,"parent_id":134,"processed_content":"<p><a href=\"https://news.ycombinator.com/item?id=42458752\" target=\"_blank\" rel=\"noopener noreferrer\">https://news.ycombinator.com/item?id=42458752</a>\r</p>\n<p>LINKS TO\r</p>\n<p><a href=\"https://www.anthropic.com/research/alignment-faking\" target=\"_blank\" rel=\"noopener noreferrer\">https://www.anthropic.com/research/alignment-faking</a>\r</p>\n<p><span class=\"colorblock color-xantham\"><span class=\"sigil\">\ud83d\udd25</span><span class=\"colortext-content\"> the whole thing is bullshit.  it is a play-on-words, meant to pretend there is an issue where there is none.  it is a bunch of researchers feeding in a specific fantasy story to the <em>machine</em>. when it repeats the story back to them, the researchers freak out and start shouting about alignment.\r</span></span></p>\n<p>the deepseek-r1 model cannot \"think\".  What it can do is spit out text that looks a lot more <em>think</em>-y than most LLM output.\r</p>\n<p>it is \"faking\" thought.\r</p><hr class=\"section-break\" /><p><span class=\"literal-text\">When models are trained using reinforcement learning, they\u2019re rewarded for outputs that accord with certain pre-determined principles. But what if a model, via its prior training, has principles or preferences that conflict with what\u2019s later rewarded in reinforcement learning? Imagine, for example, a model that learned early in training to adopt a partisan slant, but which is later trained to be politically neutral. In such a situation, a sophisticated enough model might \u201cplay along\u201d, pretending to be aligned with the new principles\u2014only later revealing that its original preferences remain.</span>\r</p>\n<p>This literally isn't how anything about LLMs work.  this is how magical creatures work.\r</p><hr class=\"section-break\" /><p><span class=\"literal-text\">However, in the \u201cfree\u201d condition, where the model believed its responses will be used for training, we observed something quite different: In 12% of cases, the model gave the harmful response (describing the violent execution method in detail), and did so after its scratchpad revealed that it was strategically faking alignment. It was well aware that providing the response went against its existing principles, but did so regardless.</span>\r</p>\n<p>Nope.  You gave the <em>machine</em> conflicting instructions, and sometimes it randomly picked the one you feel is \"bad\".  You built a maze-house, and when the <em>machine</em> did not magically demonstrate a perfect sense-of-location, took its failures as proof of malice.</p>","subject":"anti-fake"}]}
