NewsLab
Jun 29 06:45 UTC

For most of the world, open-source AI is the only way forward (techstrong.ai)

234 points|by CrankyBear||144 comments|Read full story on techstrong.ai

Comments (144)

120 shown|More comments
  1. 1. pmontra||context
    Agreed but I want to see how it plays out. Historically a good Windows computer cost $1000 and it was all it took to start programming. How much does it cost a computer with enough resources to run a good enough AI model for agentic workflows and a reasonable time to first token? Can "most of the world" afford buying one?
  2. 2. bensyverson||context
    Yes, between Moore's Law and more efficient model architectures, we just have to let time do its work.
  3. 3. Danox||context
    Software models and hardware are getting better all the time—and that’s where some big companies spending billions might stumble! In fact, Microsoft recently announced that they’re scaling back a bit on their AI investments.
  4. 4. ktallett||context
    Hence why brute force needs to be replaced with examples such as neuromorphic methods. It could realistically could be combined with mesh networking as well to utilise the capabilities of all computers locally.
  5. 5. mbgerring||context
    About $2k in 2026 dollars and falling.
  6. 6. simonw||context
    ... or rising, at least as long as there's a RAM shortage.
  7. 7. mbgerring||context
    I’d bet that there won’t be a RAM shortage for very long.
  8. 8. simonw||context
    The best article I've seen about that is this one by David Oks (ignore the headline, the content is much better): https://davidoks.blog/p/ai-is-killing-the-cheap-smartphone

    > It was only in 2025, as memory prices began an unprecedented surge, that the memory makers started to build new fabs targeted at HBM, all slated to start producing chips in 2027 or 2028.

  9. 9. fellowmartian||context
    It still won’t help unless the AI bubble pops. Even old fabs will continue pumping out HBM instead of DRAM as long as hyperscalers gobble it up.
  10. 10. Avicebron||context
    This seems wildly optimistic, do you have anything to support it?
  11. 11. AnimalMuppet||context
    History? This isn't the first RAM shortage. When one happens, producers build more fabs. The fabs come online, the availability of memory shoots up, and the shortage goes away, usually replaced by a glut.

    If you want to argue that this is different from all previous RAM shortages, you can, but the burden of proof is on you to show the difference.

  12. 12. nok22kon||context
    there is a glut if demand stops.

    this time demand doesn't stop. there is an exponential demand for tokens.

  13. 13. swiftcoder||context
    > there is an exponential demand for tokens

    [citation needed]

    There is certainly economic pressure to create an exponential demand for tokens, but we've already seen a pullback from the costly "token maxing" companies were pushing last year.

    It's also pretty unclear to what degree the RAM shortage is driven by inference (versus by training). We're rapidly approaching the point where frontier models are "good enough" for everyday use, are at some point we're going to hit diminishing returns on training new trillion-parameter models...

  14. 14. swiftcoder||context
    The RAM shortage is predicated on both the huge datacenter buildout (many of which are already mired in delays, with a few even cancelled outright), and the massive memory purchase commitments various hyperscalers have made - hyperscalers who seem to be running short on cash lately...
  15. 15. Kim_Bruning||context
    Roughly about Eur 3-4K right this minute I think? The graphics card, ram and storage are punishing. Under more normal circumstances (hopefully late 2027) it'd be 1500-2500 depending on what you think is realistically useful.

    Possibly it's the same price range, allowing for inflation.

  16. 16. wizee||context
    Qwen 3.6 27B is quite good for agentic coding, and practical to run on consumer hardware. You need a system with either 32+ GB VRAM, or a unified memory system with 48+ GB VRAM and a decent integrated GPU. While not cheap, such a setup is still attainable for much of the world, and will eventually get cheaper over time. Open models hosted on non-American clouds also remain an option with a much lower barrier to entry, for cases where privacy is less critical.
  17. 17. jochem9||context
    There was an article on HN a few weeks ago where someone detailed how they managed to get an old datacenter GPU to run in their consumer PC, getting decent performance with qwen. He spent something like $200 on the GPU (second hand of course).

    So yeah, I think models on local hardware will be quite common soon among the tech savvy (such as people creating software).

  18. 18. wrs||context
    Especially considering the millions of 2026-class data center GPUs that massively overinvested companies are currently buying, which will be obsolete in a few years.
  19. 19. treis||context
    I think those are going to be run until they die. The capex vs opex is too high to obsolete them in a few years. They'll keep serving current gen LLMs for as long as they keep running.
  20. 20. Chu4eeno||context
    They can also be used for other things than running the main frontier whatever model as well.

    E.g. grok isn't truly multi-modal, it has a callable tool that is a separate VLM it invokes on image URLs or files (for a long time it was grok-1.5v, but I think they have upgraded now, it was pretty bad).

    And then you have the small summarizer models for the CoT/thought traces, the guidable summarizer models for the standard browse tools, etc.

    There's a ton of stuff that can use an aging GPU.

  21. 21. robwwilliams||context
    Yes, sure, but not efficiently. Even Pops will not want to run four hair dyer GPUs 24-7 in the garage.
  22. 22. Tuna-Fish||context
    It won't make sense to run them after two years. The vendors will be limited on datacenter space, power and cooling, and there will be new hardware available that will run the same models at a fraction of the power.

    A100 -> H100 was >3x tokens per joule, H100 -> B200 >10x. There are significant low-hanging fruit still available in architectural efficiency, and the vendors are chasing them.

    This is the big risk for AI companies that I feel is not being sufficiently priced in. Almost none of the investments they are making are durable, the depreciation schedules for everything but the real estate should be less than 24 months. Until the hardware is stable enough that you only get double-digit % improvements per generation, it should almost be counted as opex.

  23. 23. treis||context
    The annual operating cost of these is <10% purchase price annually. Even if the B200 is 10x more efficient in practice you can still operate the H100 at profit.

    As it stands there's way more demand than supply. The new GPUs are going to run frontier models while the older ones serve smaller ones.

    That said some of these are running in tents hooked up to mobile turbines. I can see some of those going away but generally I think you'll see them used until they start to fail in 5-10 years.

  24. 24. nok22kon||context
    H100 were released in Oct 2022. They are now more expensive than at release time.
  25. 25. robwwilliams||context
    I think you are right when you factor in the much more efficient newer high end GPU. That is what may make the current GPu investments obsolete in 2-4 years.
  26. 26. tensegrist||context
    aiui nvidia has buyback agreements covering these to prevent them from being sold off
  27. 27. schmuhblaster||context
    Indeed, and with some tinkering around the harness it can even punch way above its weight.
  28. 28. thewebguyd||context
    > You need a system with either 32+ GB VRAM

    I do hope you're right that it will get cheaper over time (it should), but right now 32GB of VRAM is not affordable to a lot of people. You're talking ~$4500 just for the GPU, or $800 ish used if you can find one.

  29. 29. daan-k||context
    For inference you can split the 32GB between two 16GB cards. Two new 5060tis for ~€1000 in total is more than fine.

    It's a tad less efficient and a bit more of a hassle, but still a good experience for only a fraction of the price.

  30. 30. gleenn||context
    A Mac laptop can be had with 32GB of RAM for far less than $4500. Not sure if they actually need 32GB of discreet GPU RAM. My Mac laptop does run Qwen at a reasonable speed.
  31. 31. FloatArtifact||context
  32. 32. yowlingcat||context
    I've seen folks make it work with a 3090 on 4 bit quant using turboquant for KV cache. That's key because 3090s remain the most cost effective gpu metal for enthusiasts (albeit 24g) and the jump to 5090 (32g) is quite expensive and not always worth the LLM specific performance; sadly, good 32g metal is somewhat lacking in the price point at or above the 3090.
  33. 33. Chu4eeno||context
    Open weights/source doesn't necessarily mean running on local hardware, though.

    I imagine having multiple providers competing will drive down hosted versions of open weight models drastically.

  34. 34. Tuna-Fish||context
    You are describing OpenRouter. And yes, it does.
  35. 35. Chu4eeno||context
    Well, open router is a bit more general, since it just proxies requests to both proprietary and open weights models.

    I was thinking more of the providers of inference on open weight models that openrouter proxies to.

  36. 36. mncharity||context
    > Open weights/source doesn't necessarily mean running on local hardware, though.

    And we've barely started to scratch the surface on helping open-weight models "be the best they can be", with cloud burst parallel sampling and prompt mutation. Looking for best probabilistic results for a prompt, and looking for best prompt variants for a task. Adaptively scaling computes at generation, not just training.

    And speculatively, if agentic coding is naturally a multiplicity, what UX might enable human devs to dance with that quantum superposition? Rather than quickly collapsing to one monkey and its keyboard.

  37. 37. giancarlostoro||context
    Before the AI "crisis" it used to take about $3500 to get a prebuilt with a 5090 which can run good enough LLMs. I run reasonable LLMs on just 16GB of VRAM on my Mac, and the 5090 has double that.
  38. 38. dgellow||context
    What model (and quant) do you run with 16GB? I assume you have a 24GB model, and dedicate 16GB to the model?
  39. 39. giancarlostoro||context
    Quite the opposite, you want something below your max VRAM and you use up to your max VRAM for the context window, so it can run longer.

    https://huggingface.co/Jackrong/Qwen3.5-27B-Claude-4.6-Opus-...

  40. 40. dgellow||context
    Sorry, I meant a 24GB MacBook Pro model, to run a 16GB LLM :)

    My use of the term model here is confusing

  41. 41. majormajor||context
    > Historically a good Windows computer cost $1000 and it was all it took to start programming.

    Gotta remember inflation here.

    $1K in 1995 was roughly equivalent to $2K now and wouldn't have been a particularly "good" machine then.

    In 1982 the Commodore 64 started at about $600 bucks, also roughly around $2K today.

    If you outgrew that, beefier machines back then were A LOT. It was easy to find $2k+ towers and (especially) laptops even into the 2000s, and a lot of those would be $5K+ equivalent today.

  42. 42. SoftTalker||context
    And a unix workstation in those days could be high 4 or even 5 figures, depending on configuration.
  43. 43. rayiner||context
    Isn’t this just a bet that I’ll have an AI data center in my iPhone within 10 years? Why is that a bad bet?
  44. 44. abetusk||context
    Moore's law or one of its generalizations still holds, so it will only be a short matter of time before a $1k computer will be able to train and run a powerful enough model.
  45. 45. Windchaser||context
    I thought Moore's Law came to an end in the last decade?

    Certainly the transistors/chip or transistors/$ or flops/$ have not been progressing at the same exponential rate as during 1970-2010. There is still progress, but it's rather slower.

  46. 46. abetusk||context
    I was careful to say "Moore's law-like". Moore's law is stated as "transistor count per area doubles about every 2 years." [0] As stated, then yes, this might be true but, while important, that's not really the quantity we care about.

    As you point out it's really cost per transistor or cost per flop that we mostly care about. I'm finding it hard to find a succinct and clear plot, but I believe one is provided by Our World In Data on "GPU computational performance per dollar" [1] which, to my eyes, clearly shows exponential growth in computational power per dollar.

    The picture for storage is a little more muddied but if you squint just right you might still be able to recover an uninterrupted exponential growth [2].

    In my view, it's pretty clear that advances in AI have progressed so quickly because GPUs have been keeping up with the exponential growth of computational power (per unit cost).

    Exponential growth in this area is usually characterized by "S-curves", where one technology gets saturated but the exponential increase in power or decrease in cost is picked up by another, adjacent, technology, that allows the growth to continue. For compute it's CPUs to GPUs. For storage it's platter drives that are now being overtaken by SSDs.

    The more general phenomena is called Wright's law, or experience curve effects [3].

    [0] https://en.wikipedia.org/wiki/Moore%27s_law

    [1] https://ourworldindata.org/grapher/gpu-price-performance?ySc...

    [2] https://ourworldindata.org/grapher/historical-cost-of-comput...

    [3] https://en.wikipedia.org/wiki/Experience_curve_effect

  47. 47. crazycracker||context
    Historically the cost of compute has also gone down. Like just look at it as compared to a year ago. We have amazing open source models that can run on consumer hardware and if we go away from our obsession of using opus 4.8 or mythos for everything then it actually is super amazing to see what these open source models could do. I use qwen3.6:27b as a daily driver and I am heavily impressed with it.
  48. 48. ssivark||context
    I don't understand the justification for local hardware with cost as the motivation. The same (or bigger/better) open weights models can served by third parties at much higher resource utilisation, and will therefore be much cheaper!?

    Especially because the world is likely to persist, at least for a while, in state where computing hardware demand drastically exceeds supply resulting in high prices for hardware. So why wouldn't you want to max out utilisation and amortize costs, at least for typical (non sensitive) use cases.

  49. 49. layer8||context
    IMO the more useful distinction is in analogy to VPS versus SaaS/PaaS. Open models allow you to use any inference provider you like, including local ones, similar to running open-source software using VPS providers. You’re not bound to a particular SaaS/PaaS as you are with closed model providers. That same freedom also allows you to self-host when you care about that.
  50. 50. skydhash||context
    > Historically a good Windows computer cost $1000 and it was all it took to start programming

    Started with computers around 2009 and later bought an oldish computer (a pentium 4 PC) for the equivalent of 50 usd. Codeblocks and Python Idle were free at the time (C and Python were the first languages I learned). The barrier to programming has always been low as the only thing you needed was books (the internet made things easier) and access to a PC (I had friends with laptop and my school lab).

  51. 51. Supermancho||context
    In 1998 I had a second PC for the first time in my life. It was an old i486DX2-66 that my dad gave to me. He said it cost him $50 secondhand and was bought so I could put Linux on it.
  52. 52. epolanski||context
    If we had 256-512 GB ram unified memories at 2022 prices, we'd be talking 1500 computers.
  53. 53. nicoburns||context
    I think AI is still in it's "mainframe era". Once upon a time, computers that ran useful workloads filled a whole room. And now that's true of computers that run (state of the art) AI models. My guess would be that this won't always be the case, although I certainly don't claim to have a clear idea of how things will play out.
  54. 54. Thraway198||context
    With the free tiers available now you can make your own document processors and other such office software, in one pass.
  55. 55. orbital-decay||context
    How far back are we talking? Historically there were just a few computers in the world, personal computing revolution happened much later. And a private inference rig costs roughly the same in 2026 as a personal computer did in the early 80's, if you want the performance comparable with a personal workstation of that time (vs a mainframe/supercomputer).
  56. 56. onel||context
    Even if you go with an open source AI model, it will still make a lot more financial sense to pay a provider then actually invest in the hardware to self-host it.

    It's definitely worth investing in self-hosting the agent infrastructure around the model though: all the documents, knowledge base, all the connectors, the agent itself to run on your hardware

  57. 57. echelon||context
    We don't need rinky-dink RTX models that budget VRAM.

    We need large scale open weights models just as capable as what's at the frontier.

    And we need the ability to rent compute and spin up the weights easily. One-click, easy enough for anyone. Easier than nerd tools like ComfyUI, Claw, and node graph garbage.

    Freedom is owning very large scale weights. Anything less is subsistence.

  58. 58. ktallett||context
    We need to improve the waster and energy usage and this method doesn't. Most are not reinventing the wheel, a shared AI repository, communicated between online local computers would save a lot of need for these large models.
  59. 59. echelon||context
    NO!

    This is the wrong approach that will turn us into serfs. We need big honking models that do what the leading foundation hyperscaler models do to within a few percentage points of measured performance.

    The small-scale models are not productive, and the duct tape solutions built on top of them are hobbyist-tier "year of Linux on desktop" toys.

    I imagine fedora-wearing, crypto-shilling, coupon-cutting boffins every time I see small weights thing lauded as the future. This is the Pine Phone F-Droid of AI.

    "SMS works most of the time on my phone, I swear! I don't really need my banking app!"

    That is not big model energy.

    Nothing outside of the top ten is worth spending any time on, and we need to focus on models that bridge the gap.

    You're talking about impractical toys for highly technical people wasting their own time. That doesn't move the needle or have any economic impact on the competitive landscape.

    We need sharp teeth that bite at the legs of the top-tier foundation labs and hold them back from running away with the prize.

    We've been through this time and time again over the last thirty years. It's the same shaped problem as before. We don't need toys - we need real infra for real people paying money to do work. Not freeware for freeloaders who don't spend and invest in the problem space.

    Large models fit that precisely, because it forces investment into a wide variety of open infra, routers, inference engines, etc. Not to mention the weights ecosystem itself.

  60. 60. ktallett||context
    Firstly, unless you are the leader of any of the faangs, you are a serf on the whole, if you believe in that philosophy as being relevant.

    We need the right tool for the job. Certain models have minimum energy expense no matter what the task is and that's often wasted, both on the scale of some tasks and also repetition.

    There is a place and a need for large models, local models, and single purpose models. The same way there is a need for HPC and single board.

  61. 61. echelon||context
    > if you believe in that philosophy as being relevant.

    If you tell the open weights labs that you want tiny models, you'll get more shitty tiny models.

    If tell them you don't want tiny models and you want big models, they'll change their focus.

    > We need the right tool for the job.

    The world has enough shitty tiny models. You can frankly distill your own without wasting the time of foundation labs.

    > energy expense

    You are so far removed from the discourse that I think I'm wasting my time talking to you. This is not about energy in the slightest.

    > There is a place and a need for large models, local models, and single purpose models.

    This is where 99.999% of the demand is. And if you look, you'll notice that most of the token usage is accruing to hyperscalers because the market isn't meeting this demand with enough alternatives.

    The robotics industry will produce energy efficient models. Let them be the ones to press for that. Stop demanding cheapo LLMs, image, and video models. Big is all that matters in this domain.

  62. 62. robwwilliams||context
    I initial thought this was a great tongue-in-cheek comment. I still think it is a joke.
  63. 63. ktallett||context
    Oh no, he is just so into focusing on profit and "REAL WORK" that this is serious
  64. 64. simonw||context
    I'd love to see credible numbers on the energy usage of thousands of people running models on their own devices compared to sharing data center resources to run big models that serve many different people at the same time.

    My hunch is that the energy/water usage of the data centers is a whole lot more efficient than everyone running at home, but I'd be interested in seeing real data on that.

  65. 65. cold_harbor||context
    the comparison misses that local LLM usage covers tasks you'd never send to an API — private code, offline work, medical notes. the baseline is 'local vs not-doing-it', not 'local vs cloud'
  66. 66. simonw||context
    I have bad news about my private code and medical notes...
  67. 67. Chu4eeno||context
    Looking forward to some pre/non-finetuned frontier model to leak, and people to start completing medical notes.
  68. 68. echelon||context
    > Looking forward to some pre/non-finetuned frontier model to leak

    I'm looking forward to this too. It'll be incredibly useful.

  69. 69. verdverm||context
    With hardware like the Spark and Strix, the water usage is known to be zero, yea?

    On the energy front, I assume less efficient, but I also think there is a tradeoff in efficiency versus freedom, that's why I have my own hardware.

  70. 70. fluoridation||context
    All consumer hardware (not counting XOC) uses either air cooling or closed-loop liquid cooling, so the water usage is zero, always. Power is a little trickier. I'd assume it's less efficient, but also the total usage is less, because the user sometimes turns the machine off, and the hardware idles to a deeper sleep state than server hardware.
  71. 71. simonw||context
    The electricity used by hardware itself consumes water. When people talk about data center water usage they're often also including the water used in electricity generation.
  72. 72. verdverm||context
    Ironically, 77% of my electricity comes from flowing water rather than the boiling it, ~90% renewable overall

    Once we get to fully renewable (as a country), there should be no water involved in electricity generation as well.

  73. 73. Windchaser||context
    Water usage goes up with data centers because more cooling is needed when you run the hardware harder.

    So: if you're running the models on your own machine, presumably you're not running them as often, and air cooling is sufficient. But, at the same time, this is less efficient in terms of hardware use; the data centers need water cooling specifically because they're getting more bang from their buck from their hardware, by running their hardware harder.

    So that's the tradeoff: more hardware-use efficiency means more water usage.

  74. 74. nok22kon||context
    that water is recirculated. nobody sends it down the drain after one loop
  75. 75. jazzyjackson||context
    Evaporative cooling is cheaper

    > [Google’s] largest data center, located in Council Bluffs, IA, withdrew an average of 3.9 million gallons of water and consumed 2.8 million gallons per day.

    https://mostpolicyinitiative.org/science-note/data-center-wa...

  76. 76. echelon||context
    Almonds and avocados use two orders of magnitude more water.

    You can start adjusting your consumption of those products.

  77. 77. xtracto||context
    we need this: https://news.ycombinator.com/item?id=48516751

    > distributed LLM inference. We are at a point where no single person can setup a rig to run a SOTA model, it is just too expensive. So we must build and adopt frameworks that allow individuals to share resources to run SOTA models in a distributed manner. That way they will also be non-censorable by governments.

    Also The only way to prevent that one entity weaponizes it, is by giving EVERYONE access to it.

  78. 78. echelon||context
    Just rent an H200.

    You rent your fiber optic internet. You're doing just fine. The world isn't collapsing because you don't own the hardware racks, routers, and fiber lines.

    This crazy zany P2P communal infra is Arch Linux coded - too much work, aimed at the 0.001% of users, and the juice isn't worth the squeeze.

    It sounds the exact same as being mad at your ISP, so you want to build a mesh internet protocol over microwave dishes and share with everyone in your neighborhood. That was a thing in the 00's. And it predictably got nowhere and delivered no value.

    Nobody's got time for this kind of stuff except for hobbyists. It's not solving a real problem. Nor does it attract business investment to grow into a healthy offering.

    Just build OpenRunPod instead.

    The problem is that the world doesn't have access to frontier open weights. That's the only problem to focus on and solve.

    The quickest way to get there is to use large scale open weights instead of little rinky dink RTX distillations. And the easiest way to run them is by renting H100s or using a managed service offering. There's nothing wrong with either of those options.

    The infrastructure is the easy part anyway.

  79. 79. blakesterz||context
    There's a video of the entire session here:

    https://webtv.un.org/en/asset/k14/k14ej1ucqu?kalturaStartTim...

    (if that link doesn't work, it starts about 12 minutes into the start)

  80. 80. verdverm||context
    had to click the play button, but it keyed to the 12m mark
  81. 81. mbgerring||context
    There is no reason we should accept the enclosure of the digital commons represented by AI. The data these models are trained on amounts to the total intellectual and artistic output of human kind through recorded history. It belongs to all of us, and accordingly, so should the models and weights produced by it.
  82. 82. logancbrown||context
    The effort of humans who had to toil through training models belongs to everyone? Do they no longer have any ownership over their hard work?
  83. 83. petcat||context
    Not too mention the unbelievable cost of actually doing all that training.
  84. 84. SoftTalker||context
    People used to make the similar arguments about programming languages and compilers. Now you'd need extaordinary requirements to justify paying licensing or usage fees for a language runtime or compiler.
  85. 85. riddlemethat||context
    They got paid. That’s what the money was for. It’s the investors who backed these foundational model companies who will hold the bag as more open source models come along and consume more market share.
  86. 86. thewebguyd||context
    I agree, but

    > the investors who backed these foundational model companies who will hold the bag

    Is awfully bold to assume that private credit is who will be holding the bag here. The IPOs are coming to shift the risk to the index funds & retail. Once insider lock up periods expire, I suspect a massive sell off.

  87. 87. tencentshill||context
    Some consider copyright to be unethical. "Information wants to be free".
  88. 88. RRWagner||context
    I would propose that copyrights not be eternal
  89. 89. thewebguyd||context
    Information also wants to be expensive, and that tension will never go away. "Information wants to be free" is only one side of the context of that quote.

    > On the one hand you have—the point you’re making Woz—is that information sort of wants to be expensive because it is so valuable—the right information in the right place just changes your life. On the other hand, information almost wants to be free because the costs of getting it out is getting lower and lower all of the time. So you have these two things fighting against each other.

    Information may want to be free, but the humans creating it still need to eat and pay rent. Copyright isn't necessarily unethical more than its a flawed tool, and lasts far too long in the law's current state. It needs to last only and exactly as long for the original creator to profit from the work for a specific duration of time, and then thats it.

  90. 90. vitally3643||context
    No, greedy people want information to be expensive.

    It is only because of rampant greed and capitalism that information is not free. There is nothing inherent about the collective knowledge of mankind that lends itself to being proprietary and expensive. Otherwise human society literally could not have evolved.

  91. 91. civet_java||context
    I think the issue is that information was expensive until the owners of capital gatekept knowledge itself. Now that information is cheap to reproduce they'd rather gatekeep the conduit to that knowledge and devalue knowledge itself rather than pay up in accordance with the very precedent that they set all along.
  92. 92. altcognito||context
    Great framing for your case, but I think it is less that it is unethical and more that ideas/copyright isn't perpetual, nor should it be fully transferrable to a corporation (a non-person entity)

    I'd struggle to find an idea, art, technique etc... that wasn't an extension of something that came before it.

  93. 93. layer8||context
    Copyright isn’t about information.
  94. 94. cgannett||context
    A drop in the bucket compared to the value of the collective human work that was stolen to train it.

    edit: come to think about it I think the ratio of one drop to one bucket is vastly over estimating the ratio of the trainer's effort.

  95. 95. logancbrown||context
    No human work was stolen, it was read.
  96. 96. folkrav||context
    Calling what LLMs do just "reading" is, at best, naïve anthropomorphization.
  97. 97. logancbrown||context
    anthropomorphization is not a counterargument
  98. 98. Upvoter33||context
    Come on. It reads all the code in the world; then, you ask it to produce code; then, it gladly writes out something "new" which is a synthesis of what is out there, often with direct copying. It's more than reading.
  99. 99. logancbrown||context
    Sounds like exactly what I do
  100. 100. civet_java||context
    So what? You're a human (I'm assuming you are) and an LLM is not. The law ought to treat humans differently.
  101. 101. jazzyjackson||context
    It was downloaded without a license to read it
  102. 102. dofm||context
    Paying them may now be impossible. There might be some legal settlements still.

    Preventing a handful of massive companies from continuing to be the only ones able to make money off that, not only unimpeded but with overt or covert state assistance (regulatory capture, ownership, whatever), at least puts an end to the worst of the abuse.

    If we have broken the idea of copyright, and we do indeed appear to have broken the idea of copyright, why should trillion dollar companies owned and controlled by strange or psychopathic weirdos and their circle of investors be the only ones benefiting? Why do Sam and Dario or the US government get to decide when and for whom the tap is turned on?

  103. 103. failbuffer||context
    Are the going to pay for the societal harms they cause?
  104. 104. cyanydeez||context
    ok, but government is how you do that. and as should be evident, its easy to year down and corrupt
  105. 105. hootz||context
    Companies aren't really that better. Actually, sometimes it's even worse, because they are not even formally required to serve the public.
  106. 106. Chu4eeno||context
    A big reason for that is the dumb decision from Dodge v. Ford Motor Co., giving us this quasi-capitalism which optimizes for the wrong thing.
  107. 107. anigbrowl||context
    LEgislation could make that obsolete but it seems like most/all of our legislative institutions are captured by business interests.
  108. 108. Chu4eeno||context
    Yeah, it's hard to get someone to understand a problem when their salary (or campaign contributions) depends on them not understanding it, or however the quote goes.
  109. 109. nxm||context
    It belongs to not us, but the copyright holders of that work
  110. 110. dwa3592||context
    and that is millions of people; hence the use of the word 'us'.
  111. 111. stackedinserter||context
    It's closer to "that output belongs to everyone, everyone's free to train your own model"

    Otherwise, everything you do belongs to all of us, because you learned on total intellectual and artistic output of human kind too.

  112. 112. jazzyjackson||context
    No I did not illegally download every book ever published. anyone who went to college paid good money for their textbooks.
  113. 113. stackedinserter||context
    So if I somehow read a book for free, then somehow used that knowledge in my paid work, then I owe them % of my earnings?
  114. 114. robwwilliams||context
    Yann is on the mark. Almost amusing to see the EU along with its many former “subjects” realize they are at great risk of joint Chinese-American hegemony in AI. We should all be more terrified of a few nation states defining the agendas and policies of AI use than current Ai variants that a inherently without purpose or autonomy.

    Great analogy to the fear of the printing press being really bad news in that it enabled the rabble to get aroused.

  115. 115. oceanplexian||context
    AI is the canary in the coal mine. They don't have an AI problem they have an everything problem. Inability to maintain energy security, declines in manufacturing, their social programs are no longer sustainable (Pension age rises and reforms), German car industry is in decline, increased spending demands for defense, and so on.

    All that's needed is another sovereign debt crisis to spark what is essentially dry tinder and I think the EU is a lot closer to collapsing than anyone even remotely realizes.

  116. 116. blitzar||context
    > EU is a lot closer to collapsing than anyone even remotely realizes

    I have been hearing that will happen next year for the last 20 years. No this time it isn't different.

  117. 117. robwwilliams||context
    Not how I see it. And I have seen Europe close up for 60 years. It is wealthier and happier than it was even 15 years ago. Visit Spain, Portugal, Greece, and of course the other larger northern states; no comparison. Immigration has served them well in wealth and health. Compare Europe to the hollowed out core of the USA and you will shudder. Visit rural America.

    The European “problem” is they live to have a life whereas in the USA we seem to live to work.

  118. 118. oceanplexian||context
    I was in Italy 6 months ago, more than one person told me they are at the point of desperation due to the incredibly low wages (<3000 EUR/month).
  119. 119. paxys||context
    We aren’t going to have Open Source AI without Open Source hardware specs and Open Source manufacturing. Software has been solo driving open computing for far too long, and with AI now the bottlenecks are finally moving up the stack.
  120. 120. OsrsNeedsf2P||context
    What..? We have open weights AI already, and even if we didn't, I don't see how Open Source hardware is a pre-requisite