{"channel":"llm","content":"it would be a competitor to Wikipedia.\r\n\r\ninstead of << the encyclopedia anyone can edit >>, it is << the encyclopedia that only the *machine* can edit >>.  but, people can give the *machine* directions on how to edit.\r\n\r\n----\r\n\r\nThere are about 6 million articles on Wikipedia.  But the number of \"interesting\" articles is under 100k.  The rest are in a bunch of \"collections\". (<red> the collections need managing, but are less work)\r\n> geographic locations\r\n> sports people/statistics\r\n> politics people/statistics\r\n> entertainment people/statistics\r\n> films, songs, books, etc.\r\n\r\n... there are probably 10-20 more \"collections\" of a similar size.\r\n\r\n----\r\n\r\nThere is also \"news\".  Which is largely an intractable problem.\r\n\r\n<quote> News is the first rough draft of history.  --Phil Graham\r\n\r\nThe barrier is less a \"technical\" one (although the acrobatics of writing accurate information based on incomplete-and-biased reporting is a technical challenge) and more of a \"practical\" one.  If you have too much news, you cannibalize your sources.  (<mogue> and, you attract the attention of the people who make the news)\r\n\r\nFortunately, as we can keep editors out, it should be easier to *not* have those articles.\r\n\r\n----\r\n\r\nSome \"historical\" topics are more contentious than others.  Initially, there can be \"competing narratives\".\r\n<red> do we allow Tartaria or Fomenko-ism as a competing narrative?  the answer must be the same as << do we allow creationism as a competing narrative >>.\r\n<green> there is the practical answer of << this is how we keep it out of the main article >>.  but, once again, as we can keep editors *out*, this is not a controlling requirement.\r\n\r\n----\r\n\r\nthis leaves the question of << who is *evaluating* the content >> and << who is *reading* the content >>. (<xantham> what if you build it and nobody comes?)\r\n\r\nother than << sell your soul to some ideological movement >>, i haven't come up with any plausible answers here. (<red> there are several *new* LLM-related ideological movements.) (<violet> the Great LDS Encyclopedia sounds more palatable)\r\n\r\n----\r\n\r\n<xantham> in the future the machine will know the answers.  the encyclopedia will just be routing and caching.\r\n<green> routing is \"knowing which expert to ask about Sammy Sosa or John Culver\".  caching is \"sometimes it does make sense to have a saved version on-hand, rather than asking the machine only after someone asks for the answer\"\r\n\r\n----\r\n\r\nthere is, also, the problem of << the machine probably can't write a high-quality article about Pentecost just yet >>.  but that is 2 years out, at most.","created_at":"2025-01-27T16:29:00.083343","id":158,"llm_annotations":{},"parent_id":null,"processed_content":"<p>it would be a competitor to Wikipedia.\r</p>\n<p>instead of <span class=\"literal-text\">the encyclopedia anyone can edit</span>, it is <span class=\"literal-text\">the encyclopedia that only the <em>machine</em> can edit</span>.  but, people can give the <em>machine</em> directions on how to edit.\r</p><hr class=\"section-break\" /><p>There are about 6 million articles on Wikipedia.  But the number of \"interesting\" articles is under 100k.  The rest are in a bunch of \"collections\". <span class=\"colorblock color-red\"><span class=\"sigil\">\ud83d\udca1</span><span class=\"colortext-content\">( the collections need managing, but are less work)</span></span>\r</p>\n<ul>\n<li class=\"arrow-list\"> geographic locations\r</li>\n<li class=\"arrow-list\"> sports people/statistics\r</li>\n<li class=\"arrow-list\"> politics people/statistics\r</li>\n<li class=\"arrow-list\"> entertainment people/statistics\r</li>\n<li class=\"arrow-list\"> films, songs, books, etc.\r</li>\n</ul>\n<p>... there are probably 10-20 more \"collections\" of a similar size.\r</p><hr class=\"section-break\" /><p>There is also \"news\".  Which is largely an intractable problem.\r</p>\n<p><span class=\"colorblock color-quote\"><span class=\"sigil\">\ud83d\udcac</span><span class=\"colortext-content\"> News is the first rough draft of history.  --Phil Graham\r</span></span></p>\n<p>The barrier is less a \"technical\" one (although the acrobatics of writing accurate information based on incomplete-and-biased reporting is a technical challenge) and more of a \"practical\" one.  If you have too much news, you cannibalize your sources.  <span class=\"colorblock color-mogue\"><span class=\"sigil\">\ud83c\udf0e</span><span class=\"colortext-content\">( and, you attract the attention of the people who make the news)</span></span>\r</p>\n<p>Fortunately, as we can keep editors out, it should be easier to <em>not</em> have those articles.\r</p><hr class=\"section-break\" /><p>Some \"historical\" topics are more contentious than others.  Initially, there can be \"competing narratives\".\r</p>\n<p><span class=\"colorblock color-red\"><span class=\"sigil\">\ud83d\udca1</span><span class=\"colortext-content\"> do we allow Tartaria or Fomenko-ism as a competing narrative?  the answer must be the same as <span class=\"literal-text\">do we allow creationism as a competing narrative</span>.\r</span></span></p>\n<p><span class=\"colorblock color-green\"><span class=\"sigil\">\u2699\ufe0f</span><span class=\"colortext-content\"> there is the practical answer of <span class=\"literal-text\">this is how we keep it out of the main article</span>.  but, once again, as we can keep editors <em>out</em>, this is not a controlling requirement.\r</span></span></p><hr class=\"section-break\" /><p>this leaves the question of <span class=\"literal-text\">who is <em>evaluating</em> the content</span> and <span class=\"literal-text\">who is <em>reading</em> the content</span>. <span class=\"colorblock color-xantham\"><span class=\"sigil\">\ud83d\udd25</span><span class=\"colortext-content\">( what if you build it and nobody comes?)</span></span>\r</p>\n<p>other than <span class=\"literal-text\">sell your soul to some ideological movement</span>, i haven't come up with any plausible answers here. <span class=\"colorblock color-red\"><span class=\"sigil\">\ud83d\udca1</span><span class=\"colortext-content\">( there are several <em>new</em> LLM-related ideological movements.)</span></span> <span class=\"colorblock color-violet\"><span class=\"sigil\">\ud83d\udce3</span><span class=\"colortext-content\">( the Great LDS Encyclopedia sounds more palatable)</span></span>\r</p><hr class=\"section-break\" /><p><span class=\"colorblock color-xantham\"><span class=\"sigil\">\ud83d\udd25</span><span class=\"colortext-content\"> in the future the machine will know the answers.  the encyclopedia will just be routing and caching.\r</span></span></p>\n<p><span class=\"colorblock color-green\"><span class=\"sigil\">\u2699\ufe0f</span><span class=\"colortext-content\"> routing is \"knowing which expert to ask about Sammy Sosa or John Culver\".  caching is \"sometimes it does make sense to have a saved version on-hand, rather than asking the machine only after someone asks for the answer\"\r</span></span></p><hr class=\"section-break\" /><p>there is, also, the problem of <span class=\"literal-text\">the machine probably can't write a high-quality article about Pentecost just yet</span>.  but that is 2 years out, at most.</p>","quotes":[{"text":"News is the first rough draft of history.  --Phil Graham","type":"reference"}],"subject":"thoughts on the LLM-powered encyclopedia"}
