{"channel":"llm","content":"I have been trying to run some very simple API calls using Claude to operate the new Barsukas HTTP API.\r\n\r\nThe flow is a 20-word prompt, reading 3-4 files from disk, and making a 4-line Python script to call the API.\r\n\r\nThis costs 22 cents. (<green> which feels like too much)\r\n\r\nWhy so much?  There is something like 8 KB of << system prompt >> and 14 KB of << system tools >>.  All of which should be cached/built-in to the model, but is instead just charged to the user.  And several rounds of \"cached reads\" as the tool pauses for inputs.\r\n\r\nFortunately, the $20/month subscription gives something like $300/month of credits.  When the cost is inflated by a factor of 5, it helps that the currency is also inflated.\r\n\r\n----\r\n\r\nOn the other hand, the sheer volume of struggles Claude Opus is having with this task (simple directions like \"only read the api/ dir\" get ignored about 1/4 of the time) does make me want to look for a different harness, which can pivot between model providers.  (<green> it's not as bad as it was six weeks ago, but it is still very disappointing for the \"recommended/expensive model\" that it can't do this well.)","created_at":"2026-05-18T16:12:57.798135","id":799,"llm_annotations":{},"parent_id":null,"processed_content":"<p>I have been trying to run some very simple API calls using Claude to operate the new Barsukas HTTP API.\r</p>\n<p>The flow is a 20-word prompt, reading 3-4 files from disk, and making a 4-line Python script to call the API.\r</p>\n<p>This costs 22 cents. <span class=\"colorblock color-green\"><span class=\"sigil\">\u2699\ufe0f</span><span class=\"colortext-content\"> which feels like too much</span></span>\r</p>\n<p>Why so much?  There is something like 8 KB of <span class=\"literal-text\">system prompt</span> and 14 KB of <span class=\"literal-text\">system tools</span>.  All of which should be cached/built-in to the model, but is instead just charged to the user.  And several rounds of \"cached reads\" as the tool pauses for inputs.\r</p>\n<p>Fortunately, the $20/month subscription gives something like $300/month of credits.  When the cost is inflated by a factor of 5, it helps that the currency is also inflated.\r</p>\n<hr class=\"section-break\" />\n<p>On the other hand, the sheer volume of struggles Claude Opus is having with this task (simple directions like \"only read the api/ dir\" get ignored about 1/4 of the time) does make me want to look for a different harness, which can pivot between model providers.  <span class=\"colorblock color-green\"><span class=\"sigil\">\u2699\ufe0f</span><span class=\"colortext-content\"> it's not as bad as it was six weeks ago, but it is still very disappointing for the \"recommended/expensive model\" that it can't do this well.</span></span></p>","quotes":[],"subject":"dead-weight cost"}
