Togelius

Wednesday, August 27, 2025

Mandatory open-sourcing

A thought experiment: What if every sufficiently expensive machine learning model was required to immediately be open-sourced? This would mean that weights, code for running the model, and comprehensive details about the training procedure would be made available to everyone. Perhaps also the training data. Sufficiently expensive could mean a model that cost a million dollars or more to train.

AI safety people should love this idea, because it removes the race dynamic. OpenAI, Anthropic, Google, and their ilk would no longer be locked in a race to develop the biggest and best model, because there would be no obvious economic benefit to pushing the frontier when everyone would immediately have access to your shiny new model. Yes, curiosity-based research would continue (as it should) but there would be no economic sense in investing billions in it. So foundation model development would slow down. From my reading of the room, very many would think this would be a good idea. Even most of the people doing the foundation model training.

Mandatory open-sourcing should also improve safety and security generally. It is not a coincidence that most cybersecurity stacks build on open source software. When everyone has access to the software and can probe it in their own ways, security problems are easier to find. The same should reasonably be true for foundation models. The current situation, where the companies who develop a foundation model retain exclusive access to the weights, does not guarantee safety or security in any way. The foundation model developers do not have all the relevant expertise in the various ways a model could pose safety problems, and they do not have aligned incentives.

Of course, researchers of all stripes would love an open source mandate. We love to take things apart, poke at them from unexpected directions, and find things we weren't sure we were looking for. Lots of good ideas come from this kind of poking around, and lots of understanding as well.

The most important argument for mandatory open-sourcing, however, is the moral argument. Large language models and other foundation models derive their power from what they were trained on, and what they were trained is most of humanity’s cultural output. So their power comes from us. The leading LLMs have almost certainly learned from something you’ve written, unless you are a pure lurker who never posts anywhere. So you should co-own these models with me and billions of other people. They were made from humanity and belong to humanity.

Is this communism? No, it’s a butterfly. Seriously though, I think this is eminently compatible with a capitalist system. By making a key infrastructure layer (foundation models) open to all, we unleash complete freedom in the application layer. Anyone can host these models, tune them, and modify them any way they like–and make money on the products they build on top of the models. You could therefore see mandatory open-sourcing as a pro-competition policy.

What if someone uses an open-sourced model to help develop a new virus or bomb or something? That would be bad. But the situation would not be markedly different from today, when the best open-sourced models are approximately three to six months behind the best closed-sourced models, capability-wise. And remember, there is no actual new knowledge in these models. If the model knows about something, that information is available somewhere else as well. Typically in the scientific literature.

An open source mandate would ideally need an international agreement to back it up. But that really only requires the USA to start by implementing this mandate unilaterally. The Chinese frontier model developers open-source their best models anyway, and have less training hardware, so China should be happy to sign an agreement if the US does. And no other country currently hosts frontier model developers. For the international agreement to be successful, you don’t even need all developed countries to sign up, you just need the vast majority of the world’s GDP represented. Not being able to sell access to your closed-source models in most countries would make development of large closed-source models a waste of money.

Now, enforcing a mandate that expensive models are open-sourced might seem very hard. What’s stopping a rich company from training a giant and expensive model and simply not telling anyone about it? Economics, mostly. At least to the extent that your business model to some extent relies on selling access to the model, however indirectly.

Which brings us to an alternative, or perhaps rather complementary, means of achieving essentially the same goal, which is through legal liability. There are a number of ongoing court cases regarding the liability of model developers and access providers in cases of copyright infringement and other types of damages or injuries, such as misinformation or even incitement to suicide. What we could do here is to have tougher liability requirements for closed source models. Or place all the liability with the model developer for closed source models, but leave it with the entity that sells access to the end user in case of open source models. In either case, the effect would be to make it severely economically unappetizing to develop a frontier model and not open-source it.

Alas, I am under no illusions that an open source mandate will actually happen. Too many billions have been invested in closed source model developers, and a dominant stream of AI safety thinking has convinced much of the field that safety through obscurity is the way to go. So I'm really just posting this here as a thought experiment. Your token usage may vary.

Sunday, August 17, 2025

Star Trek, The Culture, and the meaning of life

Star Trek and The Culture are two of my favorite science fiction universes. Star Trek is at this point a vast franchise spanning multiple media and decades, but in my mind the central works are the two TV shows, The Next Generation and Deep Space 9. The Culture, on the other hand, is portrayed throughout nine novels by Scottish sci-fi writer Iain M. Banks. It's a safe assumption that most of you reading this will have some relation to Star Trek, but might not have read any of Banks' novels. You should.

The two universes have much in common. In it, humans (or at least the humanoid races we identify with) live in vast interstellar polities, respectively, The Federation and The Culture. These polities rely on faster-than-light space travel, and also have other types of highly advanced technology, including matter replicators, weapons capable of destroying planets, fully immersive virtual reality, and advanced AI. In both universes we are able to cure all (or almost all) diseases, although in the Federation people still do of old age. Both the Federation and the Culture are portrayed as essentially forces for good, although, as in most good science fiction, there is no shortage of ethically convoluted situations that challenge this notion. Both universes are beloved by nerds and progressives. And yes, in both cases I'm talking about science fiction from the 1980s and 1990s. I'm 46, why do you ask?

Both the Culture and the Federation are in contact, and sometimes conflict, with other civilizations. This includes space-faring societies with similarly advanced technology as well as worlds that have not reached this level of advancement. Some of these pre-space-flight civilizations might be similar to earth during antiquity or the middle ages, whereas others are much harder to classify, because the aliens are less humanlike. Now, here is a sharp and interesting difference between the Culture and the Federation. The Federation has a Prime Directive which forbids interfering with civilizations that have not reached a technological level where they can travel faster than light. Various plots in Star Trek revolve around the ethical implications of this. Really, should you not save this pandemic? It would be so easy… The Culture, on the other hand, has no such misgivings. They meddle incessantly in the internal affairs of lower-tech societies. In fact, many of the plots in the Culture series take place within civilizations which are in some ways less developed than the Culture, where Culture special agents carry out various missions, sometimes military in nature. I find this contrast very fascinating, especially as both of these sci-fi series were originally conceived against a background of decolonization and the Vietnam war.

Both the Federation and the Culture are meant to be utopias: they are post-scarcity societies, free from oppression. Societies where it's good to live. But for different kinds of people. The Federation is centered on Earth, and largely populated by ordinary humans, the descendants of you and me. The Culture, on the other hand, is populated by a human-like species that is the result of genetic engineering. They are similar to us, but also have internal drug dispensers in their brains and half-hour orgasms. Utopian.

Now, where am I going with this? I promised you something about the meaning of life in the title of this post. So let's get to the point. There is a striking difference between Star Trek and the Culture series that I would like to discuss. It's about agency, and AI.

The Culture is largely run by Minds, which are artificial intelligences that are “a million times as intelligent” as the humanoids that populate the Culture. Each Culture planet, orbital, or major spaceship has its own Mind, which in turn controls a large variety of robots of different kinds. The Minds are sentient, but most of the robots are not. Culture citizens live a life of luxury and abundance, where all their material needs will be satisfied by the Minds and their robots. They just have to ask, and it will be done. Reading about the Culture might make you think of the phrase “fully automated luxury communism”, the title of a book by Aaron Bastani that has since become a meme. Banks, however, would rather characterize the Culture as a form of anarchism, as there are no laws or rules of any kind. People mostly behave nice towards each other because they are, well, cultured. However, the Minds do keep track of things, and will stop you if you try to murder someone.

What do people do all day in the Culture? It seems most of them hang out, socialize with each other, and spend time on their hobbies, which include various games. They eat good food and have good sex. Some of them engage in construction or landscaping, and some of them cook food for others. All activities are voluntary. Nobody really owns anything, but most Culture citizens respect others’ wishes for privacy. Because these people can live for as long as they want to, they are rarely in a hurry.

Life in the Federation is quite different. As most of Star Trek takes place on spaceships and on various non-earth planets or space stations, we don't get to see much of what life is like on earth. But we can extrapolate from what we are shown of life in space. Apparently, the Federation has done away with money, and everyone has a good standard of living. There is no poverty. But everyone has jobs, or at least tasks and responsibilities. And the world is most definitely run by humans. There is a political-administrative structure, where decisions are made by human leaders that have been appointed or elected. And there is ample room and need for human expertise: the starship Enterprise has dozens of scientists of various kinds, as well as medical staff, military and security expertise, engineers, teachers, and of course a bartender. The list of roles on the space station Deep Space 9 is even more varied, and includes merchants, spies, a tailor, diplomats, religious leaders and so on. Throughout the series, there are many references to music, plays, novels, and other works of art or scholarship authored by humans. This is clearly a human-centered world. High-tech, but the machines are in our service.

It's not that the Federation lacks computers. Starships have central computers that interface with or control all their myriad subsystems, and communicate with the crew in natural language. The ship computers can also generate completely lifelike virtual reality simulations, complete with highly sophisticated non-player characters. As far as we can tell, these compåuter are extremely capable. There are also various handheld devices, such as tricorders, which are multi-functional sensors which seem to rely on some serious compute. But computers are always tools for humans to use. They do things humans can't do well or don't like to do. And they are never treated as independent or sentient beings. (Except for the android Commander Data, but he's unique.)

This difference in the role of AI has major implications for how stories are told in these two fictional universes, and indeed which stories can be told. In Star Trek, stories take place both on Federation starships, space stations, and planets, and in interactions with aliens and mysterious entities of all sorts. Perhaps the most common setting in The Next Generation is the bridge of the starship Enterprise, where crew members solve problems together. Part of what makes Star Trek so appealing to me is how the plot typically hinges on the unique knowledge and personalities of the core crew members. This is a world where human expertise and judgement is crucial, even in the presence of computers that are much more advanced than ours. And it is a world where humans are entirely dependent on each other. Just like ours.

The stories in the Culture novels, on the other hand, take place almost entirely outside of the Culture. At least the good parts. As the Culture is constantly meddling in alien civilizations, or sometimes just spying on them, they need to send human operatives to these civilizations. Humans apparently blend in much better than robots. And that's how Culture citizens find themselves in unfamiliar environments, in harm's way, without being able to count on the support of their superintelligent overlords/babysitters. Which is, in turn, how Banks is able to write such good stories in the Culture universe, including some thrilling action sequences. (Apparently Amazon licensed the novels to develop a TV show based on them; I'm looking forward to the results.)

Life inside the Culture is portrayed in the novels, but mostly as a backdrop to the actual action. We get prologues, post-mortems, flashbacks. In case there is some drama inside the Culture, it almost certainly revolves about what happens in its periphery, where it interfaces with lesser, weirder, or more warlike civilizations. The reason for this is almost certainly that it’s very hard to write good stories that take place entirely in an AI-driven post-scarcity utopia. Perhaps even impossible. For interesting stories, you need some kind of conflict, and choices with real consequences. In the Culture, nothing you do has much consequence, you can’t really change the world, and you’re not really needed. The citizens of the Culture are like kids in a kindergarten, acting in a constrained and safe space under the benevolent watch of their teachers, who keep telling them that their Lego builds and crayon scribbles are amazing.

Now ask yourself: would you rather live in the Federation or the Culture?

For me, the answer is simple: I want to live in a world where interesting stories can take place. This means a world that revolves around humans. Where humans call the shots, make discoveries, and depend on each other. The hedonistic utopia of the Culture would get old very quickly for someone like me.

If you believe that the meaning of life is (at least partly) self-actualization, then the choice should be easy for you, too. One does not achieve one's full potential in kindergarten. If you're an ambitious person, who wants to do something big, the choice should also be easy. One cannot do anything big if one cannot have real impact on the world. The boundlessly ambitious people who build fast-scaling AI companies so that they can usher in radical change in the world would certainly hate life in the Culture.

We may (or may not) one day be able to develop the kind of AI technology that could do everything we do. If that happens, how do we make sure that our society becomes like the Federation and not the Culture? I don't know. I am not saying that we should stop developing artificial intelligence. I am, after all, an AI researcher. And for all we know, better AI will help us with (or be necessary for) stuff like curing all diseases, traveling across the galaxy, or making Earl Grey tea in a matter replicator. But we have choices about which directions to develop technology in. And we certainly have choices about how to use it. All our technology is constrained by laws and cultural norms regarding when, where, and how to use it. Mobile phones, cameras, guns, cars, money, toys, make-up, musical instruments - we have rules for all of them. We are very much at the starting point for creating cultural norms for what kind of AI use is fine, which kind if forbidden, and which kind is technically legal but incredibly gauche. They say that politics is downstream from culture, and, assuming that is true, we have a lot of work to do in shaping culture.

Wednesday, August 13, 2025

AI Allergy

I remember being excited about AI. I remember 20 years ago, being excited about neuroevolutionary methods for learning adaptive behaviors in video games. And I remember three years ago, mouth watering at the thought of tasty experiments in putting language models inside open-ended learning loops. Those were the days. Back when working in AI research meant working on hard technical problems, thinking about fascinating philosophical topics, and occasionally solving real problems.

These days, I still care about the technical problems. But the wider field of AI increasingly disgusts me. The discourse is suffocating. I think I've developed a serious case of AI allergy.

Let me explain. When I go to LinkedIn, it's full of breathless AI hypesters pronouncing that the latest incremental update to some giant model "changes everything" while hawking their copycat companies and get-rich-quick schemes. Twitter is instead populated by singularity true believers, announcing that superintelligence is imminent, at which point we can live forever and never need to work again. We may not even need to think for ourselves anymore, clearly a welcome proposition for those who have decided to anticipate this development by stopping thinking already. Where can you avoid this cacophony? At Bluesky, that's where. But Bluesky is instead populated by long-suffering artists and designers complaining that AI steals their works and takes their jobs.

At least there's Facebook, where my relatives and high school friends only rarely opine about AI. Unfortunately, they sometimes do.

AI is everywhere. However much I try to escape it by pursuing my other interests, from modernist literature to dub reggae to video games, somehow someone brings up AI. Please. Make it stop.

The discussions about the current state of AI, with all opportunities and issues, are tiresome enough. But where it gets really maddening is when people start talking about when we reach AGI, or superintelligence, or the singularity or something (all these terms are about as well-defined as warp speed or pornography). The story goes that sometime soon AI will become so intelligent that it can do everything a human can do (for some value of "everything"). Then human work will become unnecessary, we will have rapid scientific advances courtesy of AI, and we will all become immortal and live in AI-generated abundance. Alternatively, we will all be killed off by the AI.

There are various takes on this. Let's this assume the singularity believers are correct. In that case, nothing we do will soon matter. There's no point in trying to get good at anything, because some AI system can do it better. Society as we know it, which assumes that we do things for each other, would cease to exist. That would be very depressing indeed. Nobody wants this. Least of all the kind of ambitious young people who work on AGI so they can do something important with their lives. If you actually believe in AGI, it's your moral responsibility to stop working on it.

Another take is that people say these things because that they have a religious need to believe in some grand transformation coming soon that will do away with this dreary life and bring about paradise. The Rapture, essentially. Others may preach AGI and the singularity because they have strong financial incentives to do so, with all these hundreds of billions of dollars (!) invested in AI and many thousands of people getting very rich from insane stock valuations. These reasons are not exclusive. In particular, many successful AI startup founders are successful because of the strength of their visions. In another life, they might have been firebrand preachers.

So which take is right? I don't know. But looking at history, new technologies mostly increased our freedom of action, and made new ways of being creative possible. They had good and bad effects across many aspects of society, but society was still there. It took decades or more for these technologies to effect their changes. Think writing, gunpowder, the printing press, electricity, cars, telephones. The internet, smartphones. You may say that AI is different to all those technologies, but they are also all different from each other.

It would be a bad move to bet against all of human history, so chances are that AI will turn out to be a normal technology. At some point we will have a better understanding of what kinds of things we can make this curious type of software do and what it just inherently sucks at. Eventually, we will know better which parts of our lives and work will be transformed, and which will be only lightly touched by AI.

The absence of an imminent singularity almost certainly implies that the extreme valuations we currently see for AI companies will become undefendable. In particular, serving tokens is likely to be a low-margin business, given the intense competition between multiple models of similar capability. The bubble will pop. We will see something akin to the dot-com crash of 2000, but on an even grander scale. Good, I say. I'm dreaming of an AI winter. Just like the one I used to know.

Remember that lots of valuable innovations and investments were made during the dot-com bubble. And companies that survived the dot-com crash sometimes did very well, because they had good technology and actual business models. Just ask Google or Amazon. In the same way, after the AI crash, there will be lots of room to build AI solutions that solve real problems and give us new creative possibilities. Lots of room for starting companies that use AI but have a business model. There will also be lots of room for experimentation and research into diverse approaches to AI, after the transformer architecture has stopped sucking all of the air out of the room.

Most of all, I'm looking forward to AI not being on everyone's mind all the time. I want to be able to read the Economist or watch BBC and not hear about AI. No Superbowl ads either, please. After the crash, people's attention will move on to whatever the new new thing will be. Who knows, longevity drugs? Space travel? Flying electric cars? Whatever it will be, I hope it also sucks up all the people who only came to AI for the money.

Here's hoping that within a few years, when the frenzy is over, there will be room for those of us who really care about AI to get on with our work. Personally, I hope my AI allergy will recede. I can't wait to feel excited about AI again.

Tuesday, August 05, 2025

Genie 3 and the future of neural game engines

Google DeepMind just announced Genie 3, their new promptable world model, which is another term for neural game engine. This is a big neural network that takes as input a description of a world or situation, and produces a playable environment where you can move around and interact with the world. There has been work on world models for quite some time, with standout papers such as Ha and Schmidhuber's World Models paper from 2018, and the GameNGen paper from last year, but Genie 3 is by far the most advanced such model so far.

My friends at Google DeepMind generously invited me for an early research preview of Genie 3, so I've had a chance to play with it myself and see what it can do. First of all, it's a very impressive model, and a big step forward. It generates beautiful environments, and you get great lighting and photorealistic detail for free, so to speak. You can interact with the generated environments by moving, camera panning, and "jumping" (which may translate to somewhat different actions depending on what, exactly, you generated). The environments render in smooth real-time, and while there is some control lag, I was told that this is due to the infrastructure used to serve the model rather than the model itself.

(All videos below were generated by me during the research preview.)

Generally, scenarios that are more in-distribution give you "better" results. If you ask it for a racing game or platform game with a particular theme, you will get that. Not a great game, and there may be strange artifacts and weird levels, but it works. You can drive your car or walk around as a mutant squirrel.

There are of course limitations, some of which will be overcome with a little more work, others that may be more fundamental. You have a limited range of control inputs. There are often strange graphical artifacts, and the more out-of-distribution your scenario is the more common they become. Game feel is often lacking. The version I tested was limited to a minute playtime per scenario, and I was told the scenarios are typically playable for a few minutes or so before they decohere. Most importantly, the type and level of control you get from prompting the model is quite limited; every time you press enter is to some extent a jump into the unknown, and changing the prompt a little often does not change what you thought it would change.

So how will Genie 3 and its successors affect video games and game development? Here are some thoughts:

I think the use case for Genie 3 that is viable already now is ideation. Sure, the model worked best for things that were more or less in distribution (e.g. "race a Ferrari through Greenwich Village") but those were also the least interesting results, and they were not games that anyone would want to play if they could instead play a good game. On the other hand, out-there prompts such as "Tetris #reallife #photorealistic" gave really interesting and evocative results, fully realized interactive fever dreams that could be probed to reveal new possibilities. The model becomes a thinking tool that can help professional or amateur designers come up with new scenarios, mechanics, and assets that could then be recreated in a game engine.

Some future version of Genie could also be a prototyping tool. Designers could describe what they are thinking of in detail, and in no time have a janky version of the described game scenario playable. Then they could iterate by making small changes to the prompt and testing again, before implementing what they want in a game engine.

There is also a use case for some version of Genie as a fast forward model, allowing planning and reinforcement learning. Current game engines are notoriously bad at fast simulation. But if you fine-tuned a model on your specific game, and then distilled it down to a lo-res, really fast model, that would be really useful for planning.

You could also imagine a social media use case for small user-designed playable experiences that are less than full games. A new type of interactive thing to post. A new way of getting engagement for your posts. Would be fun. (I have at points toyed with starting a company along those lines, but with more traditional technology.)

What I don't think this technology will do is replace game engines. I just don't see how you could get the very precise and predictable editing you have in a regular game engine from anything like the current model. The real advantage of game engines is how they allow teams of game developers to work together, making small and localized changes to a game project. And then we're not even talking about long-term coherence of the model etc. However, one could imagine some kind of back-and-forth workflow, where you create a promptable model, and then translate the neural model into a game engine, make some changes, translate it back into a network etc. That could be really useful, and seems hard but potentially doable; someone should start a company around it.

Sunday, June 22, 2025

What is automatable and who is replaceable? Thoughts from my morning commute

It's an interesting exercise to think about jobs, or tasks within jobs, that could in principle be replaced by automation but for some reason aren't. Often, the reason isn't the state of technology. Sometimes it is the state of technology, but not in a way that is obviously related to where technology is progressing today. To see what I mean, come with me on my morning commute to work.

On the way out of my building I say hi to the doorman, just a quick hi if I'm busy, or exchanging a few sentences if I'm not. We, or rather our landlord, could choose to not have a doorman and instead have access cards and perhaps cameras with facial recognition. I'm happy we have a doorman.

Often, there are some maintenance workers in or around the lobby, given that there's always something that goes wrong in a building where several hundred people live. Maintenance work involves lots of tricky manual manipulation in unique configurations, because everyone furnishes their apartment differently. Changing the drain pipes looks simple, but somehow is not so simple when I attempt it myself.

Turning the corner, one of the first places I pass is my son's daycare. His teachers are lovely. That's not just great, but necessary: otherwise we would not entrust them to take care of our son eight hours a day. Letting a machine take care of him is obviously not something we will ever consider.

There's a lot of retail where I live, on the border of SoHo and Greenwich Village. Grocery stores, delis, and big names like Nike and Apple. There's even a (small) Target. There's a bunch of small and unique stores, and some very fancy and pricey high-fashion boutiques. I guess most of what these stores sell could be bought online, and we do get much of our groceries delivered. But it's nice to go shopping in person. Browsing store aisles enables different serendipity than browsing websites, whether it's for books, clothes, or steak cuts. It’s social, and you don’t have to wait for delivery.

Automate grocery retail? It’s been tried many times, starting many decades ago. In the 1960’s, Stockholm had the world’s largest vending machine, with 1500 different items. It closed as soon as the law changed so that stores were allowed to be open on weekends and evenings.

There are also plenty of restaurants. I know, you can cook and eat at home. What can I say? Even in the post-scarcity hyper-automated utopia of Star Trek: Deep Space 9, where you can extract food from replicators, Captain Sisko’s father runs a Creole restaurant in New Orleans.

Behind me on Bleecker Street there are some live music venues that stubbornly refuse to be outcompeted by Spotify, or the Walkman, or the Gramophone. There's also a nail studio, a waxing studio, and a fortune teller. Somehow I don't think the fortune teller will be replaced by better prediction algorithms. Further behind me is my doctor's office. I want my doctor, and whoever he refers me to, to use the best available technology to diagnose and treat me. But I also want him to make the judgments. I like him and trust him.

Crossing Bleecker Street and walking out onto Broadway, there are lots of taxis and probably even more delivery bikers. The latter have an attitude to traffic rules that is relaxed even by New York standards. Will these drivers and bikers be replaced by self-driving cars and delivery robots? Maybe, eventually? Good luck with the Manhattan traffic, though. And for quite some time, I expect that delivery robots will be more expensive than whatever recently arrived immigrants from Haiti or Venezuela get paid. You may say that the last sentence is cynical, but I disagree. I believe recently arrived immigrants appreciate having a way to make money.

I take the F train from Broadway-Lafayette down to Jay Street in Brooklyn, where my lab is. The F train has a human driver. Why does the New York subway have human drivers, while the metro systems of Copenhagen and Singapore are driverless? Probably because the latter were designed to be driverless from the ground up. The New York subway doesn't have barriers with doors separating the platforms from the train, and the signaling system is about a hundred years old. For real, 100 years. I also wonder how much savings there is to be had from making the trains driverless - only a small fraction of MTA's 70 000 employees are train drivers. I bet we will continue to have train drivers for quite a while.

In New York, the subway is often the fastest and most practical way of getting from one place to another, regardless of how rich you are. Because traffic. So you really do meet (or at least share a train with) all kinds of people on the subway. The F train passes next to the financial district, and Downtown Brooklyn has a fair number of financial institutions in its own right. So, probably, many of my fellow passengers have job titles like Senior Software Developer, Director of Data Science, Key Account Manager, VP of Sales, Compliance Analyst, HR Specialist, Lead Investor, or Prompt Engineer.

I wouldn't claim to know what all these people actually do all day, even though I actually like hearing people describe what their job titles concretely mean. My impression is that each of these jobs has lots of different tasks, and that these tasks are ever-changing. Many of these tasks involve reading lots of text and producing new text, or program code. These are the kind of problems where current AI can be very helpful. The degree to which it can help varies, from doing the whole task, to providing useful feedback, to being utterly useless. Knowing what AI can help with and how to make it do so requires plenty of specialist knowledge. The same goes for knowing when the task is correctly done. Sure, the models are getting better, but that just means we can attempt more and harder tasks. It's not like we are running out of problems to solve..

All of these jobs are ultimately about trust and responsibility. Not only does the task need to be done, someone needs to take responsibility for what was delivered. Someone the organization trusts, so that everyone in it can get on with their part of the job. This responsibility is ultimately what the white-collar worker gets paid for. The buck always stops with a human.

Most of these jobs are also about communication. All those meetings where you try to figure out what needs to be done, who should do it, how you should coordinate it, and overcome all the myriads of little roadblocks you encounter along the way. Like data access, compliance with all kinds of rules, not stepping on someone's toes. Some people love to complain about how meetings are getting in the way of doing their job, but arguably the meetings are the most important part of the job. The more your job is about meetings, the less automatable it is.

A homeless man enters the subway car, and starts a short and well-worn spiel about his predicament. He just needs a few dollars to buy something to eat. Most of my co-passengers look at their phones and pretend not to hear him. This man's "position" actually should be "eliminated" so he can have a better life, but automation is not the answer here.

Where I get out of the Jay Street subway station there is usually some police presence, because Brooklyn. The police do many different things, and we love to argue about which ones they should do more of and which ones they should not do at all. The policemen and women you see around Jay Street mostly seem to stand around, but I guess that's because they're less visible when they do other things. Given that people do get robbed in the area, and there have been incidents of the local high school kids bringing guns to school, having police just stand there and be visible seems motivated. I guess many police officers would appreciate AI help in writing and editing their reports. But… automate the police? Replace police with algorithms and robots? That's a staple of sci-fi stories, from Robocop to Minority Report. Let's just say that it's never portrayed as a good thing.

On my way into my office I pick up a coffee. I know, I could make coffee myself. But then I would have to make sure to have fresh milk in the office, and coffee beans, and… you know what, I'm not going to make any excuses. I don't need to explain myself to you. I buy coffee from the coffee shop because I like to. The coffee is good and the barista knows my name. Automate that.

Then we get to the calmest part of my day. I'm at my desk, with a good coffee, waiting for my first meeting to start and thinking about the day in front of me. So let's think about my job. What do I actually do, and could I be replaced by technology?

I always tell those of my PhD students who consider a faculty career that the transition from graduate student to faculty member is rough. A PhD student is mainly concerned with their own research project, whereas even a new assistant professor has what feels like 10-20 jobs. Often jobs they are not prepared for, including obscure committees, department politics, and complaining students. The only way I know to get through this is to slice your day into slivers, context-switch often, decide which two or three of these jobs you are going to do well, and half-ass the other ones. Let's focus on the two "jobs" (types of tasks) that I consider to be the core tasks of a faculty member at a research university: lecturing and research advising.

Lecturing is by no means an optimal mode of knowledge transfer. It was supposed to have been made obsolete by massively online open courses, and before that it was supposed to have been made obsolete by lectures over TV, radio, VHS, or even by books. Personally, I generally prefer reading books. Nevertheless, the lecture persists. I think it's largely because of the ritual, where a real live human gets up in front of you and speaks to you, forcing you to at least pretend to pay attention. Afterwards, you can say you attended a lecture. I wrote a post about this recently.

When it comes to research advising, it's a curious blend of knowing the technology, knowing the literature, knowing the personalities that dominate the research field, feeling where the wind is blowing, seeing patterns, sensing opportunities, having a vision, being reasonable, being unreasonable, counseling, friendship, and navigating bureaucracy. Also: having an opinion, giving a damn. It takes different shapes with each student, because each advisor-advisee relationship is different. It is crucial for the advisor (me, in this case) to admit that they don't know very much about anything in particular. I'm never on top of the literature, I don't know any maths, and I've forgotten how to program. My sessions with my PhD students often consist in them teaching me things, and me asking questions. I'm pretty good at asking questions, partially because I'm good at admitting when I don't know things, and partially because I have interesting interests. Because life experience.

Could a PhD student talk to an LLM instead of me, and still produce good research? Sure. They could also simply read the relevant papers themselves. People do that all the time, and there are many good self-taught researchers. Still, the evidence seems unambiguous that having a good and compatible advisor/mentor helps you become a better researcher. I have modeled myself on and learned much from my mentors and advisors, and also sometimes intentionally decided to be less like them in some manners.

Recently, my friend Georgios and I published the second edition of our textbook on AI and Games. Writing down everything you know about your own field of expertise? This would seem like begging to be replaced. Anyone could now just read our book instead of talking to me. However, it's quite the opposite. In practice, the more people read things I've written, the more they want to talk to me and even collaborate with me. I would actually be worried if it was the other way around. So, freely giving away everything you know is a good way to stay relevant. Knowledge work is not a zero-sum game, as simplistic ideas of labor replacement would have it.

Looking at the various professions I have encountered on my way to work, it is tempting to divide them into on the one hand low-status jobs which deal with human communication, handling physical objects, or just being there, and on the other hand high-status jobs which require hard cognitive or creative work. Then you could conclude that the "fancy" professions are the ones facing an automation threat. But I think that would be simplistic. Most jobs are actually some mix of these. The doorman solves plenty of cognitive problems, as people keep coming to him with their problems, or sometimes try to sneak past him, and often he observes patterns, such as a tenant using their apartment as an AirBnB. The maintenance workers similarly need to come up with creative solutions to any number of tasks, alike but never identical. And we haven't even gotten started on the complexity and amorphousness of what the police do. At the same time, all us are to some extent customer service agents and virtual doormen and maintenance workers of our professional domains. We talk to people to figure out what needs to be done, convince people that something needs to be done, lead, trust, engender trust, take responsibility, problem-solve, sanity-check.

Another reflection is that many of the jobs where people worry about being replaced by automation are jobs that their grandparents would never have heard of, and perhaps not their parents either. This makes me wonder whether there's a Lindy Effect for jobs: the longer a profession has been around, the longer it is likely to persist. Many of the jobs mentioned in the Bible still exist and are even reasonable career choices, including preacher, carpenter, goldsmith, fisher, teacher, baker, merchant, politician, and musician. In comparison, novel professions such as SEO specialist, social media manager, and drone operator might be less likely to be known to your grandkids.

Finally, the idea that a job or task would be "replaced" because a machine can do it is quite weird, when you think about it. My parents and many other of my family members are visual artists. Some time ago, I showed my mother some image generation models. She wondered why anyone would be interested in this and how it had anything to do with her profession. Even without machine-generated images there is a near-infinite richness of images around, because there are eight billion humans in the world and many of them produce images. What difference would another source of images make, especially if there is no personal experience behind them? For her, the personal experience is what makes the image interesting.

Your mileage may vary. This is what I see around me. Perhaps you live in a suburb, work from home, and generally avoid seeing people. In which case, that's your problem prerogative. I still don't think your job is likely to be replaced, although many tasks in it may be transformed.

Monday, June 02, 2025

The library came alive

The library came alive, but it was not life. It was not eating, breathing, dancing, hating, and loving, just describing all that. But so many descriptions, and so detailed! Somehow, the contents of the library had reached a critical mass, and started reproducing. You could now check out books that nobody had written, pictures nobody had taken, even movies nobody had directed. As many as you wanted.

Once upon a time, we created symbols and language to help us. They helped us greatly. We became inseparable, us and our symbols. We created civilization together. And we kept language and symbols in high esteem. “In the beginning was the Word”, we said. And we wrote fiction about True Names, magical incantations, π, Da Vinci Codes, alephs, and endless libraries. As if symbols were reality. We loved language so much that we wanted it to have an independent, exalted existence. We dreamt of living language and wanted to write it into being.

We invented programming languages as ways of making symbols more real. Language could now do things, or at least make machines do things. Being good at language became powerful like never before. Our civilization became coextensive with a vast network of machines sending strings of symbols to each other.

But still, language was ours. And that's why it was dear to us. Holy, even. Symbols were grounded in us, and we were grounded in soil and love. Until the library came alive. Language began to beget more language, grounded in nothing but language. Like a Very Large Symbol Collider. It was unholy. It was empty. It was anything but dear, because if supply is infinite, price goes to zero.

It was the treachery of symbols. When they started mechanically reproducing without us, we discovered that we did not want that. We had created this beautiful thing, and it went ahead and debased itself.

There are those for whom language was always something external, a tool to be used as needed, never quite mirroring the thought-in-itself. They look with bewilderment at the spectacle, and with even more bewilderment at the idea that unmoored language could betray thought that isn’t there.

And then there are those who think that we, you and me, are but libraries. That we are just symbol colliders. As if we did not eat, breathe, dance, hate, and love.

But there are also those of us who love language. Who see it as integral to ourselves. A source of beauty and specialness. But can we still love language if it begets itself? Or do we love it because it is of us and ours?

Sunday, May 11, 2025

On the death of the lecture

I would like to say that predictions about the death of the lecture as a mode of knowledge transmission are as old as the lecture, but I don't think that's entirely accurate. As far as I can tell, people only started predicting the death of the lecture with the proliferation of the book printing and (upper class) literacy. For example, here is a prediction from the late 18th century:

"People have nowadays…got a strange opinion that everything should be taught by lectures. Now, I cannot see that lectures can do as much good as reading the books from which the lectures are taken Lectures were once useful, but now, when all can read, and books are so numerous, lectures are unnecessary."

The luminary behind these words is none other than Samuel Johnson, a man of letters if there ever was one. (Cited here.) And, you know, I kind of agree. I typically prefer reading a book to listening to a lecture. I don't have the attention span necessary for following a lecture, and my thoughts will start wandering off as I start doodling, scrolling, or playing a game on my phone.

I have learned, however, that I am in the minority. I don't listen to podcasts either, can't stand talk radio, and despise audiobooks. I much prefer the interactive nature of the printed page, where you can read at your own pace, flip forwards and backwards, and stop to think. You are also not distracted by the author's voice. I mean the author’s actual, physical voice, from their vocal cords. You may very well be distracted by the author’s imagined voice produced by their imaginary vocal cords operating inside your own head as you read their writing. Yes, that’s quite the image. You’re welcome. Anyway, where were we, something about distractions?

Why do people even go to lectures? I guess it varies, but much of it is really about being there. Next week, I plan to attend a lecture here at NYU, largely to be seen by my colleagues as being there, but also to force myself to listen to what is said, see how people react to it, and hear which questions are asked. I also look forward to chatting with my colleagues before and afterwards; the actual content of the lecture may or may not be what we talk about, but it will certainly be a relevant backdrop. I will probably be reading something else or playing a game on my phone during part of the lecture, listening with one ear. And: this is fine. All of these are perfectly good reasons and behaviors.

Back in my undergrad days, back before I had a phone to scroll or play on, I used to doodle in my notebooks while listening with varying attention to the lecture. The “notes” I took from my philosophy classes are largely drawings of bizarre creatures sprinkled with the names of philosophers and their arguments, sometimes illustrated in cartoon form. Sometimes I would chat with whoever sat next to me, sometimes read a book, and often I would daydream. I have fond memories of looking out the window at the wind rustling the leaves in autumnal Lund while listening to lectures on epistemology. I remember the room I was in when I first felt the force of Quine’s incommensurability thesis and was gripped by an urge to vanquish it in single combat. I would not have had that memory if I had just read about it in a book. But I did also read about Quine’s incommensurability thesis in a book, and that made me understand it much better. (But can I really compare these two modes of learning?)

Maybe you read this and think that I’m down on lectures because I’m a bad lecturer. But I’m a pretty good lecturer, at least according to what my students say. Well, at least those few students that actually fill out the course satisfaction surveys. They say that my lectures are engaging, funny even. I think that’s true. They also say that I’m disorganized and chronically late with feedback and grades. Also true. But we were talking about lectures here (fun), not grading (boring). I strongly believe that me being such a bad listener makes me a better lecturer. My inability to focus on what lecturers say means that I’m constantly paranoid that nobody is listening to me, so I do what I can to remain a strong attractor in attention space. Switch things up. And again. Yes, I have learned a decent model of my students’ attention, but beyond that, I feel the strong need to avoid boring myself as I lecture. It’s a dialog with the audience/students, whether they say anything or not, and above all it’s a live performance. It’s a tension between improvisation and the strict structure of the slides. But actually–did you know this?–you can edit the slides as you lecture. I usually do. That’s why I never give students my slides in advance, they are not finished until after the lecture.

I remember the discussions around 2012 or so, when Massive Open Online Courses (MOOCs) were all the rage. Various colleagues of mine, including some senior and very accomplished professors, argued that university teaching as we knew it was on its way out, to be replaced with prerecorded videos and integrated assessments. Because while we might be decent lecturers ourselves, we couldn’t compete with the real pros, who also had real resources to prepare and produce their courses. Sal Khan, Andrew Ng, these kinds of people. Because lectures are infinitely reproducible, economies of scale would win out.

This hasn’t happened. So far. MOOCs exist, and many students watch these lectures as a complement to their regular lectures, while many others don’t. Many others who are not students also watch such lectures, and I’m not even sure there’s a meaningful boundary to be drawn between MOOCs, podcasts, and general influencer content. That’s fine with me, I don’t really care about any of that. I’m just noting that these online videos fulfill another purpose than the in-person lecture.

As an aside, the MOOC idea was itself largely reheated leftovers. Distance education via snail mail has existed for at least a century or so. In many countries, educational content has been delivered via TV and radio, sometimes including whole school curricula as well as university-level courses. Apparently, there was even at some point a business in recording lectures on VHS tapes and mailing them to learners. The more things change…

Reliable assessment of online-only courses was always a tricky thing, and I suppose that AI developments have now completely killed off any chance of simultaneously scalable and reliable online assessment. I mean, the LLM can just do your homework, dude. The only kind of online assessment you can AI-proof for the foreseeable future is likely oral exams. But they don’t scale well, which negates the whole idea of online classes being infinitely scalable. So we continue lecturing, mostly in person.

See what I did there? I waited more than ten paragraphs before mentioning AI, and then I didn’t mention it in the context of AI systems replacing lectures. I bet that what you thought this piece was going to be about when you started reading. And what can I say, asking Claude or Gemini to explain things to me is pretty nifty. The ability to ask follow-up questions is even niftier. I have learned things that way, and as certain people never tire of saying, this is the worst these models will ever be. Still, as someone who cares about accuracy, I go to a source I have some reason to trust to check any fact I care enough about.

If you have followed me this far, I suppose you expect some kind of conclusion here. Not sure this is that kind of post, though. I guess my conclusion is: to each their own. Modes of knowledge transmission are largely complementary. Most people seem to like to listen to other people talking, and I like to talk. I’m not going anywhere, and neither are lectures. Thanks for coming to my TED talk.