Anthropic/OpenAI may be spending more than $1000 for every $100 you pay them

Trilogy3452@lemmy.world · 2 days ago

Anthropic/OpenAI may be spending more than $1000 for every $100 you pay them

WPSteam@lemmy.world · 8 hours ago

AI Bubble burst coming sooner?

BigDanishGuy@sh.itjust.works · 17 hours ago

Nice try, I ain’t gonna pay anyway

Scrollone@feddit.it · 17 hours ago

Exactly, I just keep using the free plan and when I finish the amount for the day I just switch to another service

iocase@lemmy.zip · 8 hours ago

This is why IMO blitz scaling is dumb when your service is a commodity. I’m not any more loyal to Uber than Skip. If more investor money goes into making a cheaper meal or ride on Skip I use that. Consumers are mercenaries about that stuff.

The “blitz” part of blitz scaling assumes your customers can’t move.

Scrollone@feddit.it · 7 hours ago

Exactly. And LLMs don’t have a way to keep you inside of their walled garden; if anything, I prefer starting from a blank slate every time I ask something.

Lovable Sidekick@lemmy.world · 17 hours ago

I wonder how much they spend for every $0 I pay them.

M0oP0o@mander.xyz · 17 hours ago

$1000 I would guess. They are just burning money at this point.

nullspace@lemmy.world · 17 hours ago

I can’t imagine paying for AI when the open source tools have made it so easy to set up a model locally.

potustheplant@feddit.nl · 17 hours ago

Don’t be daft. The vast majority of people don’t have the knowledge or resources to set that up locally.

nullspace@lemmy.world · 8 hours ago

You’re right if we’re talking about the entire population of Earth. With these local models though, other people have already done all the hard stuff. Anyone with an RTX card and just a minimum level of patience can get going.

potustheplant@feddit.nl · 7 hours ago

Minimum for local models is 12 GB imo. There are several “rtx” cards that have 8 GB. Also, why nvidia? AMD works well too. My previous point stands, still. If you don’t already have the hw, buying a pc today is very expensive. I don’t know if you go out much but it ain’t pretty out there. People aren’t precisely swimming in cash.

Also, patience isn’t the only requirement. Keep in mind that some people struggle to even install a program.

ranzispa@mander.xyz · 16 hours ago

Easy to set up, but still needs a 15k $ graphics card and electricity bill. The price you pay openai/anthropic is much cheaper than that for that quality of model.

Sure, you can set up a small model on a consumer graphics card, but the output will be considerably worse and the processing speed considerably lower.

For 240€/year you get a subscription to anthropic which will happily ingest a whole repository and process it in about one minute. No matter what latest model GPU you installed on your computer, you won’t be able to do that.

Sure, this guy was able to run a 26B model on an old CPU: https://point.free/blog/gemma-4-on-a-2016-xeon/

But that was not easy at all and the speed you get is definitely not the same as the one provided for a very cheap price.

CovfefeKills@lemmy.world · 6 hours ago

There is a middle ground. Crypto farmers have transitioned into running AI workloads for money. There are things sort of like folding@home, but you let people use your GPU and you earn tokens, which are used to buy compute or sold to people who want to buy compute on the network. So you can set up a bigass open source model for private on-demand use. It’s still not cheap, but a lot closer to reality for a lot of people than a 15k initial purchase.

zbyte64@awful.systems · 8 hours ago

If you were paying the real price it would be 2 grand a year though. And in 5 years that 15k graphics card will be $200 and sip on electricity by comparison.

ranzispa@mander.xyz · 6 hours ago

A100 is 6 years old and is now sold at over 10k $. If you were paying a higher price it could be cheaper to buy the card, but since the prices are low, that is not the case.
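
The buy-versus-subscribe argument in this sub-thread comes down to a break-even calculation. A minimal sketch, using only the figures quoted above (a 15k card, a 240/year subscription, and the claimed "real price" of 2 grand a year); the function name is hypothetical, and the sketch ignores currency differences, resale value, and any difference in model quality:

```python
def break_even_years(hardware_cost, yearly_subscription, yearly_running_cost=0.0):
    """Years until buying hardware becomes cheaper than subscribing.

    Returns None when the subscription never costs more per year than
    running your own hardware (so owning never pays off).
    """
    yearly_saving = yearly_subscription - yearly_running_cost
    if yearly_saving <= 0:
        return None
    return hardware_cost / yearly_saving


# Figures quoted in the thread; electricity is left at 0 because no number was given.
print(break_even_years(15_000, 240))    # 62.5 years at the current price
print(break_even_years(15_000, 2_000))  # 7.5 years at the claimed "real" price
```

Any non-zero electricity cost pushes both break-even points further out.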

kuerbiskernoel@feddit.org · 7 hours ago

Currently nearly 5 year old used graphics cards are being sold for their initial price. Not sure how much they’ll get cheaper…

mfed1122@discuss.tchncs.de · 20 hours ago

I mean, this is no different than Walmart making prices low until other businesses die out and then raising them.

It is no different than police shoving all the homeless people and drug addicts into one area of town to crash the property prices, and then evicting them once developers buy everything for cheap.

They’re purposely operating at a loss in the expectation that they can get ingrained into a ton of workflows, and then gouge everyone absolutely to death while also worsening the quality of the service to make it cheaper for them to run.

If it weren’t so horrible for the environment, I’d kind of like it, because all the dumbass executives that are signing up for this are going to get exactly what they deserve. You’d think they’d recognize a scheme when they see one.

fishy@lemmy.today · 20 hours ago

My CEO (whom I don’t consider a particularly good or bad CEO) spent a day playing with AI then when asked if he’d sign the company up with the service he literally laughed in their faces and said it’s useless. I was honestly shocked because he’s totally into buzzword and popular crap. Gained a lot of respect for him that day.

WorldsDumbestMan@lemmy.today · 19 hours ago

An older co-worker seems to ask AI for help during work, we are blue collar. But the Owner of the company does not seem to use it whatsoever.

I ask Claude on occasion, to see if it will say something smart (it was mostly useless as fuck).

Scrollone@feddit.it · 17 hours ago

Honestly I think Claude is good at programming. Way better than ChatGPT.

But I ain’t going to pay for it.

ranzispa@mander.xyz · 16 hours ago

Published a library doing some very specific data processing. One of the algorithms I implemented was a bit too slow: it would take about a week to process data. I reckon implementation was a little bit sloppy, but I’ve been implementing a bunch of algorithms from research papers and this was pretty much the published implementation.

I asked Claude to analyse the implementation and check whether it could be improved, half an hour later I got a 26,000% improvement in performance with exactly the same results passing all tests.

Of course, I could have done that myself. But optimization had to go down to simd level; I doubt I would have been able to do that in less than a week of work.
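
For scale, a percentage improvement can be converted into a speedup factor. A rough sketch, assuming "26,000% improvement" means throughput rose by that percentage and taking "about a week" as 168 hours; both readings are assumptions, not figures confirmed by the comment:

```python
def speedup_from_percent(percent_improvement):
    """Convert an 'N% faster' throughput gain into a speedup factor."""
    return 1 + percent_improvement / 100


speedup = speedup_from_percent(26_000)  # 261x under the assumption above
hours_before = 7 * 24                   # "about a week"
hours_after = hours_before / speedup

print(f"{speedup:.0f}x faster, roughly {hours_after * 60:.0f} minutes instead of a week")
```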

AdolfSchmitler@lemmy.world · 21 hours ago

Trust me bro we’re so close to profitability bro, just need this IPO to secure funding one last time bro then we’ll be profitable bro I swear.

Burninator05@lemmy.world · 15 hours ago

I wish that was inversely proportional. The less I pay, somehow it costs them more money.

andallthat@lemmy.world · 9 hours ago

Yes, that’s called a “marketing budget”

ParlimentOfDoom@piefed.zip · 1 day ago

Now, I’m no MBA, but that seems like a bad business plan…

ReginaPhalange@lemmy.world · 19 hours ago

Oh come on bubble, why won’t you crash already?

Cornpop@lemmy.world · 1 day ago

What is the actual “cost” after they buy the hardware, is that $1000 really pure power usage cost?

Corkyskog@sh.itjust.works · 1 day ago

The problem is that the hardware has a 5 or 6 year depreciation schedule on paper, but NVIDIA keeps saying that their next generation chip will be twice as good as their last chip so there is a FOMO schedule of like every two years.
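
The gap between the two schedules can be put in numbers. A minimal sketch, assuming straight-line depreciation (the comment does not say which method is used) and a hypothetical helper name:

```python
def remaining_book_value(cost, schedule_years, years_in_service):
    """Undepreciated value under straight-line depreciation."""
    years = min(years_in_service, schedule_years)
    return cost * (1 - years / schedule_years)


# Replace after 2 years while the books assume a 5 or 6 year life.
for schedule in (5, 6):
    left = remaining_book_value(100.0, schedule, 2)
    print(f"{schedule}-year schedule: {left:.1f}% of the purchase price is still on the books")
```

Under these assumptions, hardware swapped out after two years still carries well over half of its purchase price as book value.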

nullspace@lemmy.world · 17 hours ago

Would be nice to see that used hardware for sale rather than it being junked as a writeoff.

rumba@lemmy.zip · 1 day ago

That’s the $84,000 question. They’re filling datacenters with the fastest possible equipment and need it to be 10x faster. That hardware is dinosaur fodder a year after they install it.

HereIAm@lemmy.world · 1 day ago

I’m curious as well. My knowledge is probably quite outdated, but from what I understood the training part is what’s expensive and then querying the model is pretty cheap. Is it still true (or was it ever) that the generated answers on search engines are cheaper to generate than the actual search results?

CheeseNoodle@lemmy.world · 1 day ago

I find that hard to believe. I recently had to uninstall co-pilot after it weaseled its way into my search bar. It’s not an exaggeration to say that my PC literally ran cyberpunk 2077 with path tracing better than it ran the fucking windows search bar with co-pilot.

NιƙƙιDιɱҽʂ@lemmy.world · 20 hours ago

That’s just a shitty front end interface implementation, it has nothing to do with the actual inference run by the models.

SorryQuick@lemmy.ca · 21 hours ago

Look at the public numbers, it seems true. Copilot on your taskbar is just windows being garbage, not the AI being bad. Just look at self-hosted AI and measure the power costs of your queries. It’s tiny.

Shteou@lemmy.world · 1 day ago

It is sorta. Training is orders of magnitude more intensive than inference, but we infer billions of times within a model generation.
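
That trade-off can be sketched by measuring training cost in units of "one query". The numbers below are purely hypothetical placeholders chosen to illustrate the shape of the argument, not measurements from any provider:

```python
def inference_share(training_cost_in_queries, queries_served):
    """Fraction of total compute spent on inference.

    training_cost_in_queries: training cost expressed as the number of
    queries that would cost the same amount of compute.
    """
    total = training_cost_in_queries + queries_served
    return queries_served / total


# Hypothetical: training costs as much as 1 billion queries,
# and the model serves 5 billion queries during its lifetime.
share = inference_share(1_000_000_000, 5_000_000_000)
print(f"inference is {share:.0%} of total compute")
```

Once the number of queries served exceeds the training cost expressed in queries, inference becomes the larger share of the bill, even though a single query is cheap.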

Fubarberry@sopuli.xyz · 1 day ago

Honestly Google is likely to beat openAI and Anthropic as things are.

OpenAI and Anthropic have to buy/rent their hardware from Nvidia, while Google is making their own TPU hardware. Google’s hardware costs on AI are way lower, so every dollar they spend on it goes a lot farther.

And unlike the other two, they’re already a profitable company. They’re making record profits right now. They don’t have a desperate need to figure out how to make back billions on their AI models, they can just keep offering Gemini at a comparatively cheap price and wait for anthropic and open AI to bankrupt themselves.

ryper@lemmy.ca · 20 hours ago

I guess you missed this story from last week: Google To Pay SpaceX $920 Million Per Month For Massive AI Compute Power

Fubarberry@sopuli.xyz · edit-2 17 hours ago

That’s definitely costing them more than running it on their own hardware, but it doesn’t mean AI is costing them more than the AI startups. Anthropic for example is already paying SpaceX 1.25 Billion a month for compute, and has agreed to pay Google 200 Billion over the next 5 years for access to Google’s compute and TPU chips.

Google’s deal with xAI specifically lets them terminate the deal with 90 days notice after the end of the year. Google is also investing heavily in building new data centers with their hardware. I’m assuming this deal means they’ve eclipsed their current TPU capacity, and are just looking for a short term bandaid until they can catch up with their new constructions.

NιƙƙιDιɱҽʂ@lemmy.world · 19 hours ago

Anthropic is doing the same too. SpaceX over here providing the shovels and pans for the modern day gold rush, sheesh.

zbyte64@awful.systems · 21 hours ago

I guess google’s announcement of renting xai compute could have been simply for show to boost SpaceX ipo.

Fubarberry@sopuli.xyz · 20 hours ago

They have big plans to build more data centers for themselves, so they definitely want more compute than they have access to right now. But even if they’re paying more to rent xai compute, they’re still paying less overall for hardware/access than their direct AI competition.

jj4211@lemmy.world · 1 day ago

Plus they have a hook with the common folk, the phone steers you toward Gemini (Android phones, obviously, and Apple currently partners with Google for Gemini for iPhone…).

For Claude and OpenAI, you have to explicitly want to go out of your way to use them, or use them indirectly through another service that has a hook.

Claude seems to have some software developers explicitly preferring them, though a lot of the corporate money is on Microsoft, and Microsoft leveraged Visual Studio and Github to become the business-friendly frontend, and sure, you can use Anthropic models too… Though Microsoft ultimately has control of what is reasonably available and how much each one costs. Anthropic has a shot but I could see Microsoft pivot to really mess with Anthropic. The one gap in Microsoft strategy is the “native AI” workflow where Claude Code has won hearts and minds, but it uses massively more tokens for frankly marginal or sometimes negative value compared to a more curated use in-editor.

OpenAI I see as the most exposed. Lots of data showing they are suffering from people being over the fad of going out of their way to use ChatGPT, especially since their phones have started embracing a ‘default’ Chatbot. Software developers that are inclined to use LLMs are also inclined to be pretty dismissive of anything other than either Anthropic or open weight models, depending on their inclination. Also Altman seemed the most aggressive in committing to spending money they didn’t have, though all of them exhibit this to some extent.

I predict Microsoft ultimately pivots to in-house models and convinces the businesses to go that way. Apple may continue with Gemini or roll their own eventually. Anthropic currently has the stronger position between OpenAI and them, but I think you are right that both have risk of just being left behind.

Bluescluestoothpaste@sh.itjust.works · 16 hours ago

Claude just kills the other models, it’s not close. Microsoft could ban the Claude extension from VSC tonight and I’ll start using command line Claude tomorrow. There’s just no comparison right now. It’d be like Microsoft trying to ban NVidia gpus from Windows, they’ll just lose.

SunshineJogger@feddit.org · 1 day ago

I really really really don’t want evil corporation Google to dominate even more.

I prefer plainly greedy corporations over evil ones

ChromaticMan@lemmy.world · 1 day ago

OpenAI and Anthropic aren’t less evil than Google.

SunshineJogger@feddit.org · 1 day ago

They aren’t great, though I do think Google is worse. And far too powerful

jj4211@lemmy.world · 1 day ago

Google is only worse by virtue of their reach. OpenAI and Anthropic don’t have the reach yet, but they absolutely will get there given the chance.

Before Google had the reach it has now, it was widely regarded as a comparative ‘good guy’ and people believed in the “don’t be evil”. Lo and behold once they got going, “don’t be evil” went away.

BarneyPiccolo@lemmy.today · 1 day ago

They’re all evil, so we just have to exploit the ones that offer us some value. If Google is cheaper, and has the ability to damage the others, then Google it is.

badgermurphy@lemmy.world · 1 day ago

Google is shaping up to fare better than the others, but I don’t think that means success. They, too, are spending more than they’re making, just at a less drunken rate than some competitors.

Fizz@lemmy.nz · 1 day ago

The author is right and wrong. It’s subsidised but not by anthropic. The power users who use their plans to the limit are subsidised by the rest of the users. I’m an AI hater but I do think anthropic will be profitable next year. Their revenue growth is insane and looks to just be getting started. Claude code took enterprise by storm and now cowork is out.

PattyMcB@lemmy.world · 1 day ago

Good thing I don’t personally pay them anything

elucubra@sopuli.xyz · 1 day ago

Oh, you are going to pay. The bubble is going to fuck us all quite thoroughly.

Bluescluestoothpaste@sh.itjust.works · 16 hours ago

Exactly, these companies will keep leveraging more and more because they know the govt will step in and print whatever number of trillions of dollars needed to fix the accounting. Then they’ll tell us “core” inflation is only 2.8%.

OberonSwanson@sh.itjust.works · 2 days ago

Of course it is, it’s essentially a scam. They just need enough humans to keep investing until they check out and run with a bailout.

DeckPacker@piefed.social · 2 days ago

Funny thing is, the US government doesn’t even have nearly enough money to bail all these mfs out. So we are heading into uncharted territory here

OberonSwanson@sh.itjust.works · 2 days ago

Of course they don’t, that’s why they’re building bunkers. Thinking it’ll slow us down, as we’ll open their bunkers like cans of tuna. A bunker only works for so long, then the survivors start hunting for them like delicious shipwrecks.

Tollana1234567@lemmy.today · 1 day ago

they are going to Argentina. apparently NZ has blocked Thiel’s compound.

Imperious_melange@lemmy.world · edit-2 19 hours ago

deleted by creator

DeckPacker@piefed.social · 2 days ago

Yeah, but it’s not like they would be smart enough to know that

Arghblarg@lemmy.ca · 2 days ago

And that’s why they’re trying underhanded tactics to inflate earnings and IPO directly into the index funds, so every American’s 401K will legally have to rebalance and invest in them. They’re racing to fleece retirement funds before the bubble bursts.

Not financial advice, of course :p but people should really consider getting their stuff out and into self-directed funds or whatever it is US people do to not depend on auto-allocated funds.

shortwavesurfer@lemmy.zip · 2 days ago

Money printer go brrrrrrr

Yliaster@lemmy.world · 2 days ago

I don’t get why companies get to legally bailout like this. Why do people have to suffer for their bullshit? Enslave the CEOs if you have to make things right, leave the people out of it.

Shellofbiomatter@lemmus.org · 1 day ago

That’s simple, because the people making laws and overseeing the adherence to those laws are great buddies with those same CEOs.

So, corruption.

Though I do agree with you, there is no such thing as too big to fail. Government shouldn’t have any handouts to corporations.

Yliaster@lemmy.world · 1 day ago

These levels of corruption are frustrating; money shouldn’t decide the law.

No handouts to corporations, indeed. Make them pay.

Tollana1234567@lemmy.today · 1 day ago

that’s why they are peddling it to governments for “surveillance AI”

Imperious_melange@lemmy.world · edit-2 19 hours ago

deleted by creator

Wildmimic@anarchist.nexus · 2 days ago

Both Uber and Spotify (and AWS too) had economies of scale going for them - the more users they have, the more the infrastructure could be leveraged. This does NOT work for LLMs. More users means using more compute, and more advanced tasks (like coding) use exponential amounts of compute. A single user running a complex task can make 8 Blackwell GPUs run full tilt, and you don’t even have any guarantee that the output will be useable.

There are a few narrow areas where LLMs might be successful, like scanning for security vulnerabilities or searching large amounts of documents. The massive amount of money invested will never be recouped with these usage scenarios.

Imperious_melange@lemmy.world · edit-2 19 hours ago

deleted by creator

T156@lemmy.world · 2 days ago

Although, most people aren’t talking about Alphafold when they’re talking about AI. They’re usually specifically referring to the generative transformer models that are currently all the rage.

I doubt anyone would care too much about a linear regression model, or multi-layer perceptron, for example.

Imperious_melange@lemmy.world · edit-2 19 hours ago

deleted by creator

ThirdConsul@lemmy.zip · 2 days ago

solving something like the Erdős unit distance conjecture

Tell me you listen to the media news cycle without understanding what that actually means without telling me that.

That’s not exactly what happened, is it?

Not to bring up what’s also been accomplished in cyber security

Multiple new vectors of attacks, automation of attack pipelines…

Imperious_melange@lemmy.world · edit-2 19 hours ago

deleted by creator

ThirdConsul@lemmy.zip · 23 hours ago

To clarify, my argument is that you don’t know what you’re talking about.

The Erdős unit distance conjecture is a proposed solution to the Erdős unit distance problem. What the LLM did was disprove the Erdős unit distance conjecture, not solve it (you don’t solve a conjecture), nor solve the problem (that remains unsolved).

Again, you seem poisoned by following the news media cycle without understanding what they talk about.

Multiple new vectors of attacks, automation of attack pipelines…

Like literally just put that into Google. It’s not one study that proves it, it’s multiple ones, and every cybersecurity expert talking about it. But if you want one source to argue about, then https://blog.checkpoint.com/research/global-cyber-attacks-rise-in-january-2026-amid-increasing-ransomware-activity-and-expanding-genai-risks/

🍉 DrRedOctopus 🐙🍉@lemmy.world · 2 days ago

reminder that during 2019 there were streaming services popping up left and right, all showing tremendous growth because they started from zero, and articles were about how bad Netflix was doing due to having practically no growth compared with the competition (they already had a massive subscriber base). Twist? Netflix was the only streaming service that was actually making a profit, the rest were a massive loss but big growth.

Needless to say most of those streaming services died; who remembers DC streaming service, or Yahoo’s? While Netflix is basically as strong as ever, despite the prevalent enshittification happening through the whole industry.

Point of the story? shareholders don’t care about stable profitable business, only cancerous growth. AI is like that, zero profits, ton of cost, but as long as they show growth the shareholders are happy, regardless of how cooked the books are.

MimicJar@lemmy.world · 2 days ago

2019 Yahoo

My immediate thought, there is no way Yahoo! Screen survived into 2019.

I looked it up and Yahoo! Screen (which featured Community season 6) was shut down in January 2016. But Yahoo! View launched in late 2016 (as a Hulu-like replacement), and that did shutter in mid 2019.

So Yahoo! was already dead, but it also died for real in 2019.

Tamo240@programming.dev · 4 hours ago

I love that nobody watched anything on Yahoo! Screen except for that one season of Community

🍉 DrRedOctopus 🐙🍉@lemmy.world · 2 days ago

Imagine having a streaming service so bad it fails twice

PattyMcB@lemmy.world · 1 day ago

Isn’t that kind of Yahoo!'s business model?

elucubra@sopuli.xyz · 1 day ago

Actually, when Yahoo was the search giant, before Google went mainstream, they were pretty damn good at what they did.

JohnEdwa@sopuli.xyz · 1 day ago

With how shit Google is these days, I kinda wonder if Yahoo could dust out their search engine from two decades back and it would just be… better.

Axolotl@feddit.it · 8 hours ago

Yahoo had its own web crawler only between 2004 and 2009, then they made a deal with Microsoft to use Bing indexes, so I highly doubt they even have their old index

thisbenzingring@lemmy.today · 1 day ago

Netflix was also late to streaming because their mail service subscriptions were THE major player

krashmo@lemmy.world · 1 day ago

Late to streaming? Netflix was the first big time streaming service that I ever heard of. The main reason their streaming service was able to take off like it did is that nobody else of significance thought that streaming was worth pursuing. What other companies were offering streaming services at anything approaching scale before Netflix?

thisbenzingring@lemmy.today · 1 day ago

YouTube and Hulu were basically all starting about the same time. But RealPlayer was the first big one.

Netflix just had the layout that everyone uses now. The Cable networks had streaming services, just not on demand. YouTube and Hulu also pioneered the on demand layout. YouTube focused on personal experiences so maybe that’s why you’re forgetting them

jj4211@lemmy.world · 1 day ago

YouTube started in 2005, but was not really a “streaming service”, it hosted random internet posted videos. The concept of engaging with the big content rights holders wasn’t remotely in sight back then.

Hulu came out about a year after Netflix started streaming. Hulu was inspired by Netflix’s move to have actual traditional media content as a streaming service instead of ad-hoc video uploads like youtube.

RealPlayer offered technology for websites to provide videos; I don’t recall them being a streaming platform in and of themselves.

Whatever one may say about Netflix, they were right there in the beginning with streaming traditional, professional media content. Yes, video playback over the internet wasn’t new, but that’s a technical detail that enables, but is not the core of the “streaming service” business model.

🍉 DrRedOctopus 🐙🍉@lemmy.world · 1 day ago

late to streaming, but practically the first subscription based system to watch movies/tv online.

First years of Netflix were the best, the product began degrading quite early on. but that was mostly companies realizing that instead of licensing their content on Netflix, they can make their own platforms.

Corkyskog@sh.itjust.works · 1 day ago

I think people forget that there is also the problem of being “too early” where people or the technology isn’t ready yet. Netflix timed their entry perfectly.

There are so many defunct websites or businesses that no one has ever heard of that were precursors to modern day services we view as conveniences.

🍉 DrRedOctopus 🐙🍉@lemmy.world · 1 day ago

it’s not about being the first, just the first one when the technology/cost are just right.

UnderpantsWeevil@lemmy.world · 2 days ago

who remembers DC streaming service, or Yahoo’s?

Quibi will always have a place in my heart. Or, at least, my golden arm