Generative AI: July 2023 freeze frame

It’s feeling like I need a large language model brain to write about safety with large language models. [I certify that I only wish I had such a brain, and I am a human.]

“Chatbot,” by DALL-E and me (please see image below)

So let’s freeze the film for a moment and see where we humans are with generative AI safety.

First, so that we’re all on the same page, a large language model is basically an algorithmic structure called a “neural network,” designed and trained to model human language and, in that sense, to imitate human intelligence. To get there, the software is trained on humongous amounts of data (the “large” part of “large language model”) scraped from the global Internet, which requires huge financial, computing and energy resources – the reason why it’s mostly large tech companies that are creating these LLMs. Examples are Google’s Bard, Meta’s LLaMA 2 and OpenAI’s GPT-4 (the model behind ChatGPT), developed with huge investments from Microsoft; Apple’s also working on one, Bloomberg reports, as are others (real-world examples of what an LLM can do are demo’d by Professor Ethan Mollick here).

For context around the latest news, two interviews struck me this past week:

1. “They don’t understand all the risks,” University of Virginia data science professor Renée Cummings told the BBC, referring to the companies and generative AI risks. People at the companies are saying this too, according to an in-depth article in The Atlantic, “Does [OpenAI CEO] Sam Altman know what he is creating?”

“By his own admission … Altman doesn’t know how powerful AI will become,” reports Ross Andersen in that article, “or what its ascendance will mean for the average person, or whether it will put humanity at risk. I don’t hold that against him, exactly – I don’t think anyone knows where this is all going, except that we’re going there fast….”

Andersen relates a conversation he had with OpenAI’s chief scientist, Ilya Sutskever, about what an LLM’s neural network “looks” like. Its neurons “sit in layers. An input layer receives a chunk of data, a bit of text or an image, for example. The magic happens in the middle – or ‘hidden’ – layers, which process the chunk of data, so that the output layer can spit out its prediction….”
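For readers who like to see things in code, here is a minimal sketch of that input-hidden-output structure in Python. It’s a toy under stated assumptions, not how GPT-4 or any real LLM is built: the weights are made-up numbers rather than trained ones, there’s only one tiny hidden layer, and the names (`dense`, `forward`, `LAYERS`) are my own hypothetical ones. It only illustrates data flowing from an input layer, through a hidden layer, to an output layer that produces a prediction in the form of probabilities.

```python
import math


def relu(values):
    """Hidden-layer activation: negative signals are zeroed out."""
    return [max(0.0, v) for v in values]


def softmax(values):
    """Turn raw output scores into probabilities that sum to 1."""
    peak = max(values)  # subtracting the max keeps exp() numerically stable
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def dense(inputs, weights, biases):
    """One layer of neurons: each row of weights belongs to one neuron."""
    return [
        sum(w * x for w, x in zip(row, inputs)) + b
        for row, b in zip(weights, biases)
    ]


def forward(inputs, layers):
    """Pass the input through every hidden layer, then the output layer."""
    activations = inputs
    for weights, biases in layers[:-1]:  # the middle ("hidden") layers
        activations = relu(dense(activations, weights, biases))
    weights, biases = layers[-1]  # the output layer
    return softmax(dense(activations, weights, biases))


# Made-up weights for illustration only (a real model learns billions of these).
HIDDEN = ([[0.5, -0.2, 0.1], [-0.3, 0.8, 0.4]], [0.0, 0.1])  # 3 inputs -> 2 neurons
OUTPUT = ([[1.0, -1.0], [-0.5, 0.5], [0.2, 0.3]], [0.0, 0.0, 0.0])  # 2 -> 3 outputs
LAYERS = [HIDDEN, OUTPUT]

# A three-number "chunk of data" goes in; three probabilities come out.
prediction = forward([1.0, 0.5, -1.0], LAYERS)
print(prediction)
```

Even in this toy, the numbers inside the hidden layer don’t mean anything obvious on their own, which hints at why the same question gets so hard at the scale of billions of neurons.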

Further down, Andersen reports that “all of those mysterious things that happen in GPT-4’s hidden layers – are too complex for any human to understand, at least with current tools [emphasis mine]. Tracking what’s happening across the model – almost certainly composed of billions of neurons – is, today, hopeless….” But he adds that OpenAI’s own model of the GPT-4 model at least helps people understand it. Also, Altman believes that making it available for the public to use will help expose problems.

2. The second contextual piece that struck me was about the arc of a new technology in society, as described by Tobias Rose-Stockwell, author of the just-released book Outrage Machine, in an interview with All Tech Is Human (ATIH) this week – “how different technologies are metabolized by society.” I’ll give the phases numbers just for convenience: There’s 1) a period of “increased interest and euphoria” that brings mass adoption, followed by what the author calls 2) a “dark valley of hidden harm” where people start to see “the harms hidden by the euphoria and mass adoption,” followed by 3) research (both good and bad) into the harms, until finally 4) society ends up with “the best parts of what originally emerged.” Others have applied phases to tech adoption too, but these make sense to me – I’ve watched them play out with gen AI’s predecessor….

With social media, we seem to be getting past the “dark valley” with a robust and growing body of research. Asked about gen AI, Rose-Stockwell said the pattern will hold. Someone (either he or ATIH’s David Polgar, I can’t remember) said we need to make the dark valleys shallower and shorter. I agree. It’s urgent that we move quickly from fearfully feeling around in the dark to using transparency and research to pinpoint “actual harm,” as Rose-Stockwell put it, so we can minimize it. “If you can get really tight on the actual harm … you really can design around it…. You can fix it.”

What just happened

So here’s key recent news indicating we’re getting closer, as people inside and outside the gen AI “industry” have been thrashing around for ways to identify actual harm. In April there was the open letter calling for a “pause” in development, which many are saying, quite reasonably, was never going to happen – especially considering competition with other countries. But just in the past couple of weeks:

Seven generative AI providers – Amazon, Anthropic, Google, Inflection, Meta, Microsoft and OpenAI – met with President Biden at the White House last week. They announced voluntary commitments to “invest in research and safety, security stress-testing, and assist in third-party audits of system vulnerabilities,” The Verge reported. Critics say we’ve seen “self-regulatory” schemes before, but high-profile dialog between the industry and the executive branch that articulates both the problems and parts of the solution is good for public awareness and education, and is at least suggestive of accountability.
Adding something concrete this week, some of those companies – Anthropic, Google, Microsoft and OpenAI – announced the Frontier Model Forum, an industry body whose objectives are to advance safety research, identify best practices, collaborate with policymakers, researchers, civil society and industry, and support the development of AI applications that meet “society’s greatest challenges,” from climate change to digital security threats. Here’s coverage from Bloomberg. Maybe this will act like the Technology Coalition, a cross-industry body fighting online child sexual abuse.
Meta announced that it’s open-sourcing its LLaMA 2 model in partnership with Microsoft, making it available not only to researchers but also to startups and smaller companies that don’t have the resources to create their own LLMs. Meta says this adds safety because all these parties will be able to “stress test” it to find problems and fix them. Those parties have to agree to terms of use before they can use the LLM, but Axios points out how hard it would be to enforce those terms once the LLM’s out in the wild – though it adds that Meta counters that point by saying LLaMA is not nearly as “smart” as, say, GPT-4. On the other hand, MIT Technology Review writes that open-sourcing “could demonstrate the benefits of transparency over secrecy when it comes to the inner workings of AI models.” [I was interested to learn that LLaMA is not trained on Facebook data, the Associated Press reports. Meta says the latest model was trained on “a new mix of data from publicly available sources, which does not include data from Meta’s products or services.”]
Meanwhile, Apple is “quietly working on AI tools” that could challenge the LLMs of the seven companies the White House convened, Bloomberg reports. It has its own framework, apparently called “Ajax,” for creating LLMs. Just as Apple has been absent from public discussions and industry forums about online safety, the company “has been conspicuously absent from the [generative AI] frenzy,” Bloomberg adds, so by default also from discussions about safeguards and guardrails for LLMs. Let’s hope that changes.

It’s different now … really

So although it may feel like we’re back in “move fast and break things” mode, we’re not, actually. These are good signs. Another is the increasingly common practice of “red-teaming,” testing systems for holes, safety risks and unintended consequences – not something that happened in social media’s earliest days. After GPT-4 finished training, OpenAI “assembled about 50 external red-teamers who prompted it for months, hoping to goad it into misbehaviors,” The Atlantic reports. Apple is red-teaming its AI tools too, Bloomberg reports.

Other things that are different now: a whole lot of federal legislation relevant to gen AI proposed already; a now robust Trust & Safety field and community, with its own professional association and growing public awareness of content moderation and the field; children’s digital rights now well defined by General Comment 25; “responsible tech” being a whole movement now; and even big tech calling for regulation, but not only that – OpenAI’s Sam Altman is calling for a global oversight body like the International Atomic Energy Agency, The Atlantic reports. Other ideas floated: an “Off” switch for LLMs and a “license to operate any GPU cluster large enough to train a cutting-edge AI, along with mandatory incident reporting when an AI does something out of the ordinary,” according to the same article.

So yes, the Silicon Valley venture funding machine does seem to be back in the move-fast mindset, but a lot of other things are different. In fact, there may not have been a euphoria phase for generative AI at all – which is good, because Tobias Rose-Stockwell and other pundits are saying this technology is an order of magnitude more disruptive than social media. Interestingly, social media, with all its data on the minutiae of our everyday lives, has had quite a lot to do with making this next new technology so disruptive!

I asked Google’s LLM why the text in DALL-E’s images is such gobbledygook (like in the image at the top), and this is how Bard explained it.

Comments

MilSim events says

March 3, 2026 at 1:03 am

This article stood out to me because of its clear structure and thoughtful explanations. It feels well-researched without being too technical. I also appreciate the neutral tone, which makes it suitable for a broad range of readers with different levels of interest and background knowledge.

Grace says

December 8, 2024 at 12:30 pm

I see the inevitability of AI, I hope, I have faith it will become truly a collaborative event towards the saving of our beautiful planet before it’s too late to save. Where would that leave us all? AI included…

Welcome to NetFamilyNews!
