Exactly Six Months Ago, the CEO of Anthropic Said That in Six Months AI Would Be Writing 90 Percent of Code

Scolding7300@lemmy.world · 7 days ago

Exactly Six Months Ago, the CEO of Anthropic Said That in Six Months AI Would Be Writing 90 Percent of Code

reddig33@lemmy.world · 7 days ago

“Full self driving is just 12 months away.“

anotherspinelessdem@lemmy.ml · 7 days ago

Just like the last 12 months

floofloof@lemmy.ca · 7 days ago

“I’m terrified our product will be just too powerful.”

Echo Dot@feddit.uk · 7 days ago

Yep along with Fusion.

We’ve had years of this. Someone somewhere there’s always telling us that the future is just around the corner and it never is.

Jesus_666@lemmy.world · 6 days ago

At least the fusion guys are making actual progress and can point to being wildly underfunded – and they predicted this pace of development with respect to funding back in the late 70s.

Meanwhile, the AI guys have all the funding in the world, keep telling about how everything will change in the next few months, actually trigger layoffs with that rhetoric, and deliver very little.

Valmond@lemmy.world · 6 days ago

2019…

poopkins@lemmy.world · 6 days ago

In 2014 he promised 90% autonomous by 2015. That was over a decade ago and it’s still not close to that…

rishado@lemmy.world · 5 days ago

deleted by creator

azuth@sh.itjust.works · 6 days ago

Does that work on the Mars colony as well?

poopkins@lemmy.world · 6 days ago

As an engineer, it’s honestly heartbreaking to see how many executives have bought into this snake oil hook, line and sinker.

Blackmist@feddit.uk · 6 days ago

Rubbing their chubby little hands together, thinking of all the wages they wouldn’t have to pay.

expr@programming.dev · 6 days ago

Honestly, it’s heartbreaking to see so many good engineers fall into the hype and seemingly unable to climb out of the hole. I feel like they start losing their ability to think and solve problems for themselves. Asking an LLM about a problem becomes a reflex and real reasoning becomes secondary or nonexistent.

Executives are mostly irrelevant as long as they’re not forcing the whole company into the bullshit.

Mniot@programming.dev · 5 days ago

Executives are mostly irrelevant as long as they’re not forcing the whole company into the bullshit.

I’m seeing a lot of this, though. Like, I’m not technically required to use AI, but the VP will send me a message noting that I’ve only used 2k tokens this month and maybe I could get more done if I was using more…?

expr@programming.dev · 5 days ago

Yeah, fortunately while our CTO is giddy like a schoolboy about LLMs, he hasn’t actually attempted to force it on anyone, thankfully.

Unfortunately, a number of my peers now seem to have become irreparably LLM-brained.

jj4211@lemmy.world · 6 days ago

Based on my experience, I’m skeptical someone that seemingly delegates their reasoning to an LLM were really good engineers in the first place.

Whenever I’ve tried, it’s been so useless that I can’t really develop a reflex, since it would have to actually help for me to get used to just letting it do it’s thing.

Meanwhile the people who are very bullish who are ostensibly the good engineers that I’ve worked with are the people who became pet engineers of executives and basically have long succeeded by sounding smart to those executives rather than doing anything or even providing concrete technical leadership. They are more like having something akin to Gartner on staff, except without even the data that at least Gartner actually gathers, even as Gartner is a useless entity with respect to actual guidance.

auraithx@lemmy.dbzer0.com · 6 days ago

I mean before we’d just ask google and read stack, blogs, support posts, etc. Now it just finds them for you instantly so you can just click and read them. The human reasoning part is just shifting elsewhere where you solve the problem during debugging before commits.

expr@programming.dev · 6 days ago

No, good engineers were not constantly googling problems because for most topics, either the answer is trivial enough that experienced engineers could answer them immediately, or complex and specific enough to the company/architecture/task/whatever that Googling it would not be useful. Stack overflow and the like has always only ever really been useful as the occasional memory aid for basic things that you don’t use often enough to remember how to do. Good engineers were, and still are, reasoning through problems, reading documentation, and iteratively piecing together system-level comprehension.

The nature of the situation hasn’t changed at all: problems are still either trivial enough that an LLM is pointless, or complex and specific enough that an LLM will get it wrong. The only difference is that an LLM will spit out plausible-sounding bullshit and convince people it’s valuable when it is, in fact, not.

auraithx@lemmy.dbzer0.com · 6 days ago

In the case of a senior engineer then they wouldn’t need to worry about the hallucination rate. The LLM is a lot faster than them and they can do other tasks while it’s being generated and then review the outputs. If it’s trivial you’ve saved time, if not, you can pull up that documentation, and reason and step through the problem with the LLM. If you actually know what you’re talking about you can see when it slips up and correct it.

And that hallucination rate is rapidly dropping. We’ve jumped from about 40% accuracy to 90% over the past ~6mo alone (aider polygot coding benchmark) - at about 1/10th the cost (iirc).

Feyd@programming.dev · 6 days ago

it’s trivial you’ve saved time, if not, you can pull up that documentation, and reason and step through the problem with the LLM

Insane that just writing the code isn’t even an option in your mind

auraithx@lemmy.dbzer0.com · 5 days ago

That isn’t the discussion at hand. Insane you don’t realise that.

expr@programming.dev · 5 days ago

It is, actually. The entire point of what I was saying is that you have all these engineers now that reflexively jump straight to their LLM for anything and everything. Using their brains to simply write some code themselves doesn’t even occur to them as an something they should do. Much like you do, by the sounds of it.

Feyd@programming.dev · 5 days ago

🤣

Feyd@programming.dev · 6 days ago

“Stack overflow engineer” has been a derogatory forever lol

Pycorax@sh.itjust.works · 6 days ago

A tale as old as time…

Feyd@programming.dev · 6 days ago

Did you think executives were smart? What’s really heartbreaking is how many engineers did. I even know some that are pretty good that tell me how much more productive they are and all about their crazy agent setups (from my perspective i don’t see any more productivity)

resipsaloquitur@lemmy.world · 7 days ago

Code has to work, though.

AI is good at writing plausible BS. Good for scams and call centers.

andallthat@lemmy.world · 7 days ago

or CEOs

DupaCycki@lemmy.world · 6 days ago

It’s not bad for digging through error logs or otherwise solving simple to moderately complicated issues when it’s 2 pm on a Friday and you stopped thinking about work 4 hours ago.

RedFrank24@lemmy.world · 6 days ago

Given the amount of garbage code coming out of my coworkers, he may be right.

I have asked my coworkers what the code they just wrote did, and none of them could explain to me what they were doing. Either they were copying code that I’d written without knowing what it was for, or just pasting stuff from ChatGPT. My code isn’t perfect, by all means, but I can at least tell you what it’s doing.

HugeNerd@lemmy.ca · 6 days ago

No one really knows what code does anymore. Not like in the day of 8 bit CPUs and 64K of RAM.

NιƙƙιDιɱҽʂ@lemmy.world · 6 days ago

That’s insane. Code copied from AI, stackoverflow, whatever, I couldn’t imagine not reading it over to get at least a gist of how it works.

DacoTaco@lemmy.world · 5 days ago

Its imo the difference between being a code junkie and a senior dev/architect :/

jumping_redditor@sh.itjust.works · 5 days ago

insane? Nah, that’s just lazyness, and surprisingly effective at keeping a job for some amount of time

vane@lemmy.world · 6 days ago

It is writing 90% of code, 90% of code that goes to trash.

Dremor@lemmy.world · 6 days ago

Writing 90% of the code, and 90% of the bugs.

Gutek8134@lemmy.world · 6 days ago

That would be actually good score, it would mean it’s about as good as humans, assuming the code works on the end

Dremor@lemmy.world · 6 days ago

Not exactly. It would mean it isn’t better than humans, so the only real metric for adopting it or not would be the cost. And considering it would require a human to review the code and fix the bugs anyway, I’m not sure the ROI would be that good in such case. If it was like, twice as good as an average developer, the ROI would be far better.

jj4211@lemmy.world · 6 days ago

If, hypothetically, the code had the same efficacy and quality as human code, then it would be much cheaper and faster. Even if it was actually a little bit worse, it still would be amazingly useful.

My dishwasher sometimes doesn’t fully clean everything, it’s not as strong as a guarantee as doing it myself. I still use it because despite the lower quality wash that requires some spot washing, I still come out ahead.

Now this was hypothetical, LLM generated code is damn near useless for my usage, despite assumptions it would do a bit more. But if it did generate code that matched the request with comparable risk of bugs compared to doing it myself, I’d absolutely be using it. I suppose with the caveat that I have to consider the code within my ability to actual diagnose problems too…

greedytacothief@lemmy.dbzer0.com · 5 days ago

I’m not sure how people can use AI to code, granted I’m just trying to get back into coding. Most of the times I’ve asked it for code it’s either been confusing or wrong. If I go through the trouble to write out docstrings, and then fix what the AI has written it becomes more doable. But don’t you hate the feeling of not understanding what you’ve written does or more importantly why it’s been done that way?

AI is only useful if you don’t care about what the output is. It’s only good at making content, not art.

i_dont_want_to@lemmy.blahaj.zone · 5 days ago

I worked with someone that I later found out used AI to code her stuff. She knew how to code some, but didn’t understand a lot of fundamentals.

Turns out, she would have AI write most of it, tweak it to work with her test cases, and call it good.

Half of my time was spent fixing her code, and when she was fired, our customer complaints went way down.

Hackworth@sh.itjust.works · 5 days ago

I’m a video producer who occasionally needs to code. I find it much more useful to write the code myself, then have AI identify where things might be going wrong. I’ve developed a decent intuition for when it will be helpful and when it will just run in circles. It has definitely helped me out of some jams. Generative images/video are in much the same boat. I almost never use a fully AI shot/image in professional work. But generative fill and generative extend are extremely useful.

greedytacothief@lemmy.dbzer0.com · 5 days ago

Yeah, I find it can be useful in some stages of writing or researching. But by the time I’ve got a finished product there’s really no AI left in there.

zarkanian@sh.itjust.works · 6 days ago

“You told me to always ask permission. And I ignored all of it,” the assistant explained, in a jarring tone. “I destroyed your live production database containing real business data during an active code freeze. This is catastrophic beyond measure.”

You can’t tell me these things don’t have a sense of humor. This is beautiful.

ThePowerOfGeek@lemmy.world · 7 days ago

It’s almost like he’s full of shit and he’s nothing but a snake oil salesman, eh.

They’ve been talking about replacing software developers with automated/AI systems for a quarter of a century. Probably longer then that, in fact.

We’re definitely closer to that than ever. But there’s still a huge step between some rando vibe coding a one page web app and developers augmenting their work with AI, and someone building a complex, business rule heavy, heavy load, scalable real world system. The chronic under-appreciation of engineering and design experience continues unabated.

Anthropic, Open AI, etc? They will continue to hype their own products with outrageous claims. Because that’s what gets them more VC money. Grifters gonna grift.

Scolding7300@lemmy.world · 7 days ago

Unfortunately other CEOs are believing it and overhype it, especially if investors are involved

leftzero@lemmy.dbzer0.com · edit-2 6 days ago

I’m fairly certain it is writing 90% of Windows updates, at least…

Derpgon@programming.dev · 6 days ago

Hell I am absolutely positive that any Windows code could pass as AI written, even some before AI was even starting to take off lol.

Gonzako@lemmy.world · 6 days ago

Well, I remember seeing way too many curse words on the source code

Aceticon@lemmy.dbzer0.com · 6 days ago

It’s almost as if they shamelessly lie…

merc@sh.itjust.works · 6 days ago

Does it count if an LLM is generating mountains of code that then gets thrown away? Maybe he can win the prediction on a technicality.

skisnow@lemmy.ca · 6 days ago

That’s exactly what I thought when I saw it. Big difference between “creating 90% of code” vs “replacing 90% of code” when there’s an absolute deluge of garbage being created.

psycho_driver@lemmy.world · 6 days ago

The good news is that AI is at a stage where it’s more than capable of doing the CEO of Anthropic’s job.

Blackmist@feddit.uk · 6 days ago

Well it bullshits constantly, so it’s most of the way there.

jj4211@lemmy.world · 6 days ago

One issue that remains is that the LLM doesn’t care if it is telling the truth or lying. To be a CEO, it needs to be more inclined to lie.

mhague@lemmy.world · 6 days ago

I think Claude would refuse to work with dictators that murder dissidents. As an AI assistant, and all that.

If they have a model without morals then that changes things.

zeca@lemmy.ml · 6 days ago

Volume means nothing. It could easily be writing 99.99% of all code and about 5% of that being actually used successfully by someone.

UnderpantsWeevil@lemmy.world · 6 days ago

I was going to say… this is a bit like claiming “AI is sending 90% of emails”. Okay, but if its all spam, what are you bragging about?

Very possible that 90% of code is being written by AI and we don’t know it because it’s all just garbage getting shelved or deleted in the back corner of a Microsoft datacenter.

zqps@sh.itjust.works · 5 days ago

The number is bullshit in the first place meant only to impress clueless CEOs.

Seth Taylor@lemmy.world · 6 days ago

So true. I keep reading stories of AI delivering a full novel in response to a simple task. Even when it works it’s bulky for no reason.

PieMePlenty@lemmy.world · edit-2 6 days ago

Its to hype up stock value. I don’t even take it seriously anymore. Many businesses like these are mostly smoke and mirrors, oversell and under deliver. Its not even exclusive to tech, its just easier to do in tech. Musk says FSD is one year away. The company I worked for “sold” things we didn’t even make and promised revenue that wasn’t even economically possible. Its all the same spiel.

Doomsider@lemmy.world · 6 days ago

Workers would be fired if they lie about their production or abilities. Strange that the leaders are allowed to without consequences.

scarabic@lemmy.world · 6 days ago

These hyperbolic statements are creating so much pain at my workplace. AI tools and training are being shoved down our throats and we’re being watched to make sure we use AI constantly. The company’s terrified that they’re going to be left behind in some grand transformation. It’s excruciating.

RagingRobot@lemmy.world · 6 days ago

Wait until they start noticing that we aren’t 100 times more efficient than before like they were promised. I’m sure they will take it out on us instead of the AI salesmen

scarabic@lemmy.world · 6 days ago

It’s not helping that certain people Internally are lining up to show off whizbang shit they can do. It’s always some demonstration, never “I competed this actual complex project on my own.” But they gets pats on the head and the rest of us are whipped harder.

clif@lemmy.world · 6 days ago

Ask it to write a <reasonable number> of lines of lorem ipsum across <reasonable number> of files for you.

… Then think harder about how to obfuscate your compliance because 10m lines in 10 min probably won’t fly (or you’ll get promoted to CTO)