The Little Window in the Corner. We Don't Watch YouTube Anymore. We Keep It On.
The window is the size of a playing card. It sits in the bottom corner of someone's phone while the rest of the phone is full of an email, a recipe, a half-typed message, a delivery app open to an order they have not yet placed. Inside the window, very small, a man is talking. His voice is just loud enough to follow. His face is just visible enough to recognize. He is not really being watched. He is being kept on.
This is what YouTube quietly became. For four years, picture-in-picture was a feature you had to pay a monthly subscription for if you lived outside the United States. On April 29, this week, YouTube made it free for everyone in the world. The little window will float over your phone now, no fee, in any country with a working app. The technical change is small. The admission inside the technical change is enormous.
We do not watch YouTube anymore. We keep it on.
The Mode YouTube Did Not Invent
For most of YouTube's history, the gesture was the same. You opened the app, picked a video, watched it, scrolled for the next one. The screen had your full attention because the screen took up the whole screen. The viewer sat down. The viewer was a viewer.
Picture-in-picture did not invent the second mode. It just admitted it. The second mode is older than picture-in-picture and older than YouTube. The radio in the kitchen was the second mode. The TV that nobody was looking at on a holiday afternoon was the second mode. The bookshop where the speakers play classical music at low volume was the second mode. We have been keeping media on while we do other things for a hundred years. The platforms simply took a long time to admit they wanted in.
What the floating window quietly says, by existing, is that there is a kind of YouTube creator who does not want your full attention. They want your steady attention. They are not chasing the cliff and the hook and the spike. They are making things to play in the corner of your room, or the corner of your screen, while you do something else. They have been making things this way for years. The platform finally caught up.
A Concert You Can Have on While You Work
NPR's Tiny Desk Concerts began in 2008 when Bob Boilen and Stephen Thompson came back from South by Southwest annoyed by the crowd noise and decided to record a small acoustic set behind Bob's desk at the office. There are now more than 1,100 of them. The NPR Music YouTube channel averages about 45 million views a month. The format is a fixed promise. Fifteen to twenty minutes. No host introduction. No interview. No buyout. It is the closest thing the internet has to office radio.
A few days ago they posted Milo J, a 19-year-old from Argentina, with six songs from his album La Vida Era Más Corta, behind a desk full of charango and tiple and Argentine woodwinds, accompanied by a Uruguayan murga group from Carnival season. The set is sixteen minutes long. The arrangement is built for a desk and a room and an afternoon, not for a stadium.
Sixteen minutes behind one desk in Washington, with a charango and a Uruguayan Carnival group. The set was built for a room. The room is yours.
The reason a Tiny Desk works in a small floating window is the same reason it worked behind an actual desk. The constraint is the form. There is no laser show, no jumbotron, no pyrotechnics, nothing that would feel betrayed by being shrunk. A Tiny Desk performance and a desk job were always going to share a room.
A Walk Someone Else Has Already Taken For You
There is a YouTube genre that did not exist twenty years ago and now has whole channels with more than a million subscribers. The biggest of them is called Nomadic Ambience. The man who runs it walks through cities and along beaches with an 8K camera and binaural microphones, and he does not say a word. His videos are titled like postcards. Sunrise Ocean Sounds. Morning Walk on Koh Samui. Windy Morning Walk on Folly Beach. Relaxing Walk in Charleston. The channel has 1.33 million subscribers and a little under 174 million total views, according to third-party YouTube analytics.
An 8K HDR walk along a Thai beach at sunrise. He does not narrate. He does not ask you to subscribe. He just walks for you.
These videos are sometimes thirty minutes, sometimes two hours, sometimes more. They are not made to be watched the way a film is watched. They are made to be left on. You put one in the corner of your screen, or on the TV in the room where you happen to be working, and the waves keep going, the camera keeps moving, you do not. The screen is doing the leaving for you.
The closest existing label for this genre is virtual tourism. But that is not really what is happening. Most viewers are not pretending to be in Thailand. They are letting Thailand be in their kitchen. The screen is not a window into a place. The screen is a place itself.
A Conversation Long Enough to Keep You Company
The third kind of companion video is the one I have on most often. It is two musicians, or a journalist and a guest, or a podcast host and an expert, talking for an hour or more. The talking head fills the frame. The conversation goes everywhere. The video does not really need to be looked at. The voices are doing all of the work.
A few days ago, Rick Beato put up an 84-minute conversation with Tigran Hamasyan, an Armenian pianist who combines metal, polyrhythms, and 3,500-year-old folk music. They talk for an entire afternoon. About modes. About the time signatures of Armenian liturgical chant. About the metal bands neither of them ever expected to namedrop in the same sentence as a folk dance.
Eighty-four minutes of two musicians talking at a small studio table. You can put it in the corner. The conversation will still be there when you come back.
You can watch every minute of this if you want. Most people do not. They put it on while they cook, or while they fold laundry, or while they answer email, and they tune in for the parts that catch them. The video does not punish them for half-watching. The video does not even know.
This is the third PiP-native form. The long conversation. It is the thing podcasts have always been. YouTube quietly became, for a generation, the largest podcast app on earth that also has video.
The window is small for a reason. It is the size of how much attention you actually have to give it. The video knows. The video has known for years.
The Honest Numbers on Background Listening
There is a 2024 study from Acast, the podcast platform, that did something most attention research does not. They put real-life multitasking into the test. Each participant watched a video of someone washing up, or someone ironing, while audio played in the background. Then the researchers tested how much of the audio the participant had actually retained. The point was to capture the way a real listener listens. Not at full attention. Not in a quiet room. With a sink full of dishes in the way.
The result is one of the more counterintuitive small numbers in modern media research. Sixty-four percent of podcast listeners reported paying full attention to the podcast while doing the multitasking task. The same number for music streaming was forty-nine percent. For radio, forty-four percent.
What this number actually means is the opposite of what a platform optimizer would want. The medium with the highest reported attention is the one being listened to in the background. The most engaged listeners are the ones not looking at a screen at all. The thing that keeps your attention is not the thing in front of your eyes. It is the thing you have welcomed into your room.
YouTube, increasingly, is that thing. The platform that has spent a decade chasing watch time as if attention were a video metric is full of audiences who are not looking. They are listening with the screen on. The screen is keeping its part of the bargain. The viewer is doing the rest.
Two Ways the Room Watches a Video
The little window did not invent the right column. It admitted that the right column has been most of how we use YouTube.
What the Floating Window Admits
Ethan wrote about this morning's announcement from the business angle. The geographic paywall, he argued, was indefensible. The 19-day gap between Premium's price hike and the global free PiP rollout was not a coincidence. The business reasoning is sharp and worth reading on its own terms.
The reason I keep coming back to is gentler. For four years YouTube charged people in most of the world a monthly fee to do something American users did for free. The thing they were charging for was, at heart, the right to keep YouTube on while doing something else. That right was always free. It just needed a small floating window to admit it.
When you open a PiP today, you are not really watching YouTube. You are letting YouTube be in the room. The window is small for a reason. It is the size of how much attention you actually have to give it. The creators who already understood this, the Tiny Desk producers, the silent walking videographers, the long-conversation hosts, were not making content for a feed. They were making it for a corner of a kitchen. The little window is the place that corner finally lives on the platform.
The little window is the most honest size YouTube has ever been. It is not asking for the screen anymore. It is asking for a corner of one. That is how we were already using it. We just needed somewhere to put it.
A man is talking, very small, in the corner of a phone, while the rest of the phone is full of someone's evening. He is not being watched. He is being kept on. He has been kept on, in one room or another, for as long as either of you can remember. The little window did not change anything about how you watch YouTube. It is just where, finally, the watching admits it has been doing two things at once.

Join the conversation