Welcome back to The Hidden Layer, my twice-weekly private email that’s all about artificial
intelligence. I’m Ian Krietzberg.
I appreciate all the feedback on my last issue. As always, if you’ve got any thoughts or questions, just reply to this email. You can also message me on Signal at 732-804-1223. And if you’re not yet subscribed to Puck,
click here to change that.
In today’s issue, a deep dive into the rather nuanced relationship between A.I. and software engineering, and the election misinformation dog that didn’t quite bark.
Mentioned in this issue: Grady Booch, Thinking Machines Lab, Mira Murati, OpenAI, Nvidia, Jensen Huang, AMD, Common Sense Media, Joel Becker, Nate Rush, and many more…
|
“I do not fear the rise of superintelligence; I do, however, fear the rise of rapacious and selfish
individuals, organizations, and governments who seek to use computing in any form to extend their power and control over others.” —Grady Booch, a storied IBM fellow and critical researcher, on X, explaining that his “prediction of doom,” otherwise known in the industry as “p(doom),” has less to do with A.I.’s technological capabilities than with the people
developing it.
|
-
An A.I. startup comes out of stealth mode: For months, rumors have been swirling around Thinking Machines Lab, a secretive startup founded by former OpenAI C.T.O. Mira Murati. This week, Murati announced an eye-popping $2 billion fundraising round at a $12 billion valuation—one of the largest seed rounds ever recorded, led by a16z with
participation from Nvidia, Accel, Cisco, AMD, ServiceNow, and Jane Street, among others. Wired reported that her co-founders include former OpenAI computer scientists John Schulman, Barrett Zoph, Lilian Weng, Andrew Tulloch, and Luke Metz, a list confirmed
by a company spokesperson. Murati also shared that Thinking Machines Lab is working on a multimodal A.I. system, which will be unveiled in the coming months, that includes a “significant open source component.”
Unlike OpenAI, whose mission is the advancement of artificial general intelligence, Murati wrote on X that her company is focused on the advancement of collaborative general intelligence, or, in her words, “A.I. that works with how you naturally interact with the
world—through conversation, through sight, through the messy way we collaborate.” She noted that her team will share its “best science” with the research community as a way to further its understanding of “frontier A.I. systems”—something that pretty much everyone is pursuing at the moment. In any case, $2 billion without a product is quite an achievement and a true metaphor for this extraordinary fundraising environment. - The chip race is back on: On
Tuesday, major U.S. chip manufacturers Nvidia and AMD announced they will resume selling A.I.-enabling chips in China. In a statement, Nvidia noted that the U.S. government has “assured” the company that licenses to sell their G.P.U. will soon be granted, which came on the heels of a meeting between Nvidia chief Jensen Huang and President
Trump. For its part, AMD said that the government would soon begin reviewing its licenses to sell chips in China. Back in April, when the U.S.-China trade war escalated, Nvidia warned that it would take a $5.5 billion quarterly hit associated with sales of its H20 chip in China; AMD
warned of $800 million in similar export-related fees.
- The “dog that didn’t bark”: One of the biggest concerns surrounding A.I. last year was the threat of election misinformation, which I often covered in my reporting for The Deep View. Indeed, there was no shortage of hand-wringing from
researchers and politicians about bad actors potentially manipulating the electorate through gradual social engineering, deepfakes featuring politicians, and targeted misinformation. And, sure, there were several incidents that supported these fears—like the robocall in the voice of then-President Joe Biden urging New Hampshire voters not to participate in the primary. (The voice clone was created with ElevenLabs.)
But in the end, A.I.’s impact on the election
was relatively minimal, according to a new report by Dr. Felix Simon, a researcher at the Oxford Internet Institute, and Dr. Sacha Altay, a misinformation expert at the University of Zurich. That’s because, as Altay told me, there is “already so much content” that an infusion of
generative A.I. content “basically doesn’t matter. What matters is whether people will pay attention to that content.”
In other words, as long as the information ecosystem is ballasted by trusted news organizations—or even social media influencers who disseminate information from those news organizations—Altay doesn’t expect A.I. content to have an outsize, negative impact on our information ecosystem, even as trust in that system continues to erode. The threat is more
pronounced, however, in countries where the information ecosystem is already weak.
In a final, fascinating point, Altay noted that in order for A.I.’s overall election-related impact to be considered negative, more bad actors on the whole would need to benefit from A.I. than non-bad actors, who might also be using it. After all, not all election-related use cases of A.I. are meant to disrupt the democratic process. For example, there have been
experiments testing the efficacy of A.I. assistance in legislator-constituent communications. The paper, which confirms earlier research on the subject, notes that, when it comes to electoral disinformation, the
focus on A.I. “distracts from more structural threats to elections and democracy, including voter disenfranchisement and attacks on election integrity.”
|
From the Frontier:
The Imaginary Friend Boom
|
A recent
survey of teens by Common Sense Media found that A.I. companionship is starting to become mainstream. The organization found that 72 percent of surveyed teens have used an A.I. companion at least once, and more than half are “regular” users. Still, the survey found that most users approach the bots “pragmatically,
rather than as substitutes for human relationships.” Half are distrustful of A.I.-generated advice, 80 percent of those who use A.I. companions still spend more time with their real friends, and 67 percent of respondents find conversations with actual humans more satisfying. (Yes, I found that a bit low, too...)
As the technology underpinning generative A.I. became more advanced, perhaps it was inevitable that chatbots would be leveraged to sell artificial companionship. The
chatbot company Replika, for instance, sells “the A.I. companion who cares,” one that’s “always here to listen and talk.” Character A.I.’s tagline used to be “A.I. that feels alive,” before eventually evolving into “interactive entertainment.” (The company, as you recall, has been the target of litigation—including a lawsuit related to the tragic suicide of a 14-year-old boy who was allegedly influenced by his interactions with characters on the app. At the time, Character
said it was “heartbroken by the tragic loss of one of our users,” and detailed the addition of a number of new safety features.)
Michael Robb, Common Sense’s head of research, told me he was surprised by the survey results—namely, that so many teens are seeking out this form of companionship this early in the A.I. era. “My concern is always about
displacement or replacement of human relationships,” he told me. “Teens need high-quality human interaction with peers, with trusted adults, with parents, in order to have healthy development. If they decide they want to spend more time with A.I. companions than humans, that kind of displacement could result in some pretty bad outcomes.” He added that he’s particularly concerned about kids who are already isolated. By seeking out A.I. companions, they might inadvertently isolate themselves
further.
And now, on to the main event…
|
|
|
A new study confirmed what many artificial intelligence skeptics have long championed, and
what its own researchers didn’t expect: A.I. coding tools don’t always speed up users’ workflow. In fact, the technology can slow them down. As one MIT researcher put it: “The need for people to gain deep expertise in the topics they work on is not going away.”
|
|
|
Last week, a relatively small A.I. safety research nonprofit called Metr published the results of a
study exploring one of the more highly touted use cases for artificial intelligence: coding assistants, and particularly how they perform “in the wild.” The results didn’t necessarily break the internet, but they did garner millions of views and thousands of reposts on X. That’s because the main takeaway was that, among the software developers who
participated—nearly all of whom expected A.I. coding tools to speed up their workflow by 20 percent—the tools in fact slowed them down by 19 percent.
In benchmark tests, of course, A.I. coding tools tend to perform incredibly well. And while there seems to be widespread real-world adoption and plenty of hype around the potential for these tools to improve coders’ workflow, there’s been little empirical evidence to support it. Meanwhile, the A.I. software development
market—worth a modest $674 million in 2024, according to Grand View Research—continues to be a point of heavy investment, and is expected to grow at a 42 percent annual rate until 2033, when it will potentially be valued at around $15 billion. This year, both Google and Microsoft indicated that around a third of their respective
codebases are A.I.-generated. In short, the excitement around A.I. coding companies is far from fading.
When I caught up with Joel Becker and Nate Rush, two of the study’s principal architects, they told me that they were as surprised by the results as anybody else. Metr had recruited a group of 16 experienced software developers, who were asked to complete a total of 246 tasks that, on average, took about two hours each. Each task, at random, either
allowed or disallowed the use of an A.I. coding tool. (Becker noted that they used early-2025 coding tools that have since become more capable, which gives you a sense of how quickly this technology is progressing.)
In their paper, the researchers laid out 20 factors that might have contributed to the slowdown in completion rate when using A.I. tools. And though Becker and Rush said it’s essentially impossible to determine the weight of one factor over another, two of the more likely
ones involved over-optimism around usability—that the coders placed too much faith in the tool to help them complete the task, and therefore took a backseat—and the low reliability of the tools, themselves. One developer also said that some of his speed gains were eaten up by time spent circling the social media black hole in between prompts. Another, who experienced a 38
percent slowdown when using A.I., said, “We like to say that L.L.M.s are tools, but treat them more like a magic bullet.”
Rush and Becker, who plan to conduct additional studies on this same subject in the future, were quick to say that these results are not necessarily generalizable beyond this particular test setting—experienced software developers working with
early-2025 tools in code bases with which they were already familiar. “We certainly don’t think that this applies everywhere,” Rush said. “We see it as a demonstration that self-reports are not particularly reliable.”
|
The study, according to Dr. Armando Solar-Lezama, a distinguished professor at the MIT
Schwarzman College of Computing, has been “really valuable” in advancing our understanding of A.I. coding tools beyond simple benchmark tests. Still, he noted a number of caveats, including the small scale of the study, and the fact that the models they were using were likely trained on that code. (The developers worked on open-source projects.) “So [the models] have a level of expertise on that code that they don’t have on internal, proprietary company code, where a lot of commercial
development is taking place,” he told me.
As an aside, Solar-Lezama noted that the findings aligned with his own experience of using A.I. coding tools, primarily Cursor. At first, he explained, the tool “really gets you hooked. But then once you start using it, you find that, well, actually, not so much.” He continued: “Sometimes you get a big win, and sometimes you realize you wasted a lot of time reading through code that is not really what you actually wanted to do.”
His
observations touched on the perennial issue of A.I. reliability, and underscored the fact that, as Solar-Lezama told me, the “need for people to gain deep expertise in the topics they work on is not going away.” In the study, for example, the developers accepted less than 44 percent of A.I.-generated code, meaning they were able to identify bad code before implementing it. Solar-Lezama envisioned a scenario in which less-experienced developers implement bad code, then take quite a while
to find, understand, and rectify the errors.
Perhaps there is some future in which a C.E.O., for instance, could prompt a model to generate a fully functional, ready-to-deploy app. But Solar-Lezama noted that there are still a “lot of roadblocks” before we get to that point. “I don’t think that’s going to happen in my lifetime,” he said, chuckling. “We’ll see.”
|
I’ll see you next week. Ian
|
|
|
Join Emmy Award-winning journalist Peter Hamby, along with the team of expert journalists at Puck, as they let you in on the
conversations insiders are having across the four corners of power in America: Wall Street, Washington, Silicon Valley, and Hollywood. Presented in partnership with Audacy, new episodes publish daily, Monday through Friday.
|
|
|
A professional-grade rundown on the business of sports from John Ourand, the industry’s preeminent journalist, covering the
leagues, players, agencies, media deals, and the egos fueling it all.
|
|
|
Need help? Review our FAQ page or contact us for assistance. For brand partnerships, email ads@puck.news. You received this email because you signed up to receive emails from Puck, or as part of your Puck account associated with {{customer.email}}. To stop receiving this newsletter and/or manage all your email preferences, click here.
|
Puck is published by Heat Media LLC. 107 Greenwich St, New York, NY 10006
|
|
|
|