Tag: Gemini

The Floor

I compared the frontier to a three-star chef making grilled cheese in “Context Rot” — the smartest models on earth spending most of their time on work beneath them, the way a chef trained at Le Bernardin might still melt cheese between two slices of bread on a Tuesday night and call it dinner. The comfort was the point: if the sharpest tool is saved for hard problems and something merely-very-good handles the rest, nobody’s losing anything. The floor was never the interesting part.

I’ve kept turning the joke over, and I think I had the wrong worry.

Watch what companies do with their AI spend, not what they say. Coinbase moved engineers off frontier models onto open weights and cut its AI spend nearly in half while usage kept climbing. Nvidia runs a closed model as orchestrator and routes the actual volume — the daily uncelebrated bulk of it — to open weights it controls. The frontier is becoming a dispatcher, deciding where the request goes and rarely doing the work itself. The instinct is to worry about whose open weights end up running that volume, and right now the most capable ones at scale are Chinese — GLM, Kimi — which makes it tempting to read this as a contest America is quietly losing: the floor of the AI economy built somewhere else, at a price export controls can’t touch. You cannot embargo a file already downloaded. You cannot price-match free.

But that framing has a hole. Google’s own Gemma family is open-weight and good enough to handle that daily volume without anyone reaching for GLM or Kimi. “Open weights are a Chinese story” only holds if you don’t count the open models the company running Android and half the internet’s search traffic has already shipped.

And once I saw that hole, a bigger one opened behind it. I’ve been trying Apple’s new Siri — arriving with iOS 27 this fall, genuinely surprisingly good in beta — and it made me realize open weights, of any nationality, were never going to cook most of the world’s dinners. Apple and Google are.

Consider what actually determines where the world’s routine inference runs. Not which model benchmarks best, not which weights are downloadable — what’s already installed. Apple ships to well over a billion active devices before routing a single query through Siri’s new architecture. Nobody has to be persuaded to try it, or hear about it on a podcast; it’s the thing that answers when you press the button you’ve pressed for a decade. Google owns the search bar and the Android default the same way. Between them, that’s most of the world’s phones — and phones are where most of the world’s questions get asked.

The open-weight framing assumes the floor is up for grabs, that whoever ships the best free model wins the daily grind by merit. But the floor was never a bazaar. It’s a set of defaults, owned by whoever already has the device in your hand, not whoever holds the most generous license. Apple didn’t need to win the model war to win this. Its heaviest reasoning tier is built with Google, running on Nvidia chips in Google’s cloud, under a deal reported at roughly a billion dollars a year — Apple doesn’t fully own the engine doing the thinking. It doesn’t need to. It owns the button.

That’s a quieter concentration than an export-controls fight, and a harder one to dislodge. An open model can be forked, distilled, undercut, or out-competed by the next release. A billion phones with an assistant built into the lock screen cannot be routed around. Whoever’s weights hum underneath barely matters, the way it barely matters to a diner which supplier delivered the flour. What matters is whose kitchen the meal came from, and whose name is on the door.

The grilled-cheese chef was never the risk. Two chefs are about to own nearly every kitchen on earth, and most of us will never notice — because a kitchen you’ve been eating out of for a decade doesn’t feel like something that was won. It just feels like home.

Owning the kitchen and getting paid for what’s cooked in it, though, turn out to be two different questions. That one’s for another post.

AI Google Google Gemini

Fun with Nano Banana 2

Google just released a new version of its image creation tool Nano Banana. It’s pretty amazing at creating all kinds of images.

On X a prompt was shared that I wanted to try out:

I need a flowchart for how to scramble eggs, make it as wacky and over the top and complicated as possible.

So I gave it a try:

Here are a couple of additional examples:

What a McKinsey partner does to prepare for a client’s board meeting presentation

The credit and debit card systems in the U.S.

David Allen’s Getting Things Done methodology

Pretty amazing! Conceiving and drawing one of these “flowcharts” would take me many hours!

Tags ai, creativity, flowchart, Gemini, image creation, infographic, innovation, Nano Banana, nano banana 2, Scott Loftesness

Music

Every Blog Needs a Theme Song!

Post author By Scott Loftesness
Post date February 19, 2026
No Comments on Every Blog Needs a Theme Song!

Google has added a new music generation model called Lyria 3 to its Gemini 3 models.

I was playing around with it last night – having it generate happy birthday greetings for a friend whose birthday is coming up in a few days, another song for a longtime business partnership I was part of, and more. It’s kind of crazy! And a lot of fun.

When you use Lyria 3 as a tool in Gemini 3 you get back an image and an MP3 file that’s 30 seconds long (longer coming soon according to Google). Turns out the 30 second length is just about perfect for the “quick hit” from a snippet of music.

Google provides several genres you can choose from to start with – or you can just go with whatever you want to say in the prompt – here’s a rough template for doing that:

[Topic] + [Genre] + [Mood] + [Instruments] + [Vocals]

This morning I went for my morning walk and had a thought – how about generating a theme song for my blog. So when I got back home I opened up Gemini, selected the Music tool and entered:

Take a look at my blog and compose my theme song! blog: https://sjl.us

You can see with that prompt that I really didn’t provide it much direction – just a pointer to my blog so that it could try to generate something appropriate.

It took a few seconds for Lyria to read my blog and then use what it found to generate my blog’s theme song – and I like it!

You can play the theme song for yourself here:

Tags ai, ai music, ai music generation, artificial intelligence, creativity, Gemini, gemini 3, google, innovation, music, music generation, Scott Loftesness

AI Business

The Gravity of Compute

We are currently witnessing the single largest deployment of capital in human history. The “Hyperscalers”—the titans of our digital age—are pouring hundreds of billions of dollars into the ground, turning cash into concrete, copper, and silicon.

The prevailing narrative is one of unceasing, exponential growth: bigger models require bigger clusters, which require more power plants, which require more land. It relies on the assumption that the demand for centralized intelligence is insatiable and that the current architecture is the only way to feed it.

But history suggests that technology rarely moves in a straight line; it swings like a pendulum. Two forces are emerging from the periphery that could impact the ROI of this massive infrastructure build-out. One is hiding in your pocket, and the other is waiting in the sky.

A recent conversation with Gavin Baker outlines a potential “bear case” for datacenter compute demand: the rise of Edge AI.

We often assume we need the “God models”—the omniscient, trillion-parameter giants hosted in the cloud—for every interaction. But do we?

Baker suggests that within three years, our phones will possess the DRAM and battery density to run pruned versions of advanced models (like a Gemini 5 or Grok 4) locally. He paints a picture of a device capable of delivering 30 to 60 tokens per second at an “IQ of 115.”

“If that happens, if like 30 to 60 tokens at… a 115 IQ is good enough. I think that’s a bear case.” — Gavin Baker

Consider the implications of that specific number. An IQ of 115 isn’t omniscient, but it is competent. It is capable, nuanced, and helpful.

If Apple’s strategy succeeds—making the phone the primary distributor of privacy-safe, free, local intelligence—the vast majority of our daily queries will never leave the device. We will only reach for the cloud’s “God models” when we are truly stumped, much like we might consult a specialist only after our general practitioner has reached their limit. If 80% of inference happens on the edge for free, the economic model of the trillion-dollar data center begins to look fragile.

Then there is the second threat, one that attacks the terrestrial constraints of the data center itself: the Orbital Data Center. Elon Musk and SpaceX – along with Google’s Project Suncatcher – envision a future where the heavy lifting isn’t done on land, but in orbit. Space offers two things that are scarce and expensive on Earth: unlimited solar energy and an infinite heat sink for radiative cooling. If Starship can reliably loft “server racks” into orbit, the terrestrial moat of land and power grid access—currently the Hyperscalers’ greatest defensive asset—evaporates.

We are left with a fascinating juxtaposition. On one hand, we have the “Edge,” pulling intelligence down from the clouds and putting it into our hands, making it personal, private, and free. On the other, we have “Orbit,” threatening to lift the remaining heavy compute off the planet entirely to bypass the energy bottleneck.

There are hundreds of billions of dollars betting on a future of heavy, centralized gravity. But if the edge gets smart enough, and the orbit gets cheap enough, the gravity may have shifted.

AI AI: Large Language Models Investing

The Ledger of Curiosity

We often romanticize the “back of the napkin” idea. It is the symbol of spontaneous genius—the startup mapped out in a coffee shop, the ticker symbol hurriedly scribbled during a dinner party. But we rarely talk about what happens to the napkin afterwards.

Usually, it gets thrown away. Or lost. Or stuffed into a drawer, becoming just another artifact of a fleeting thought that had momentum but no direction.

In the first two parts of this experiment, I used Gemini 3 Pro to solve the friction of entry (transcribing my messy handwriting) and the friction of analysis (stress-testing the ideas against 10-K realities). But there was one final gap: Permanence.

An analysis that lives and dies in a chat window is barely better than one that lives and dies in a notebook. It is still ephemeral. To truly build a “Second Brain” for investing, the data needs to leave the conversation and enter a system.

“The goal of technology should be to stop us from losing the work we’ve already done.”

I tweaked my workflow one last time. I asked the AI to not just judge the stocks, but to format its judgment into a raw CSV block.

With a simple copy-paste, my handwritten scribble wasn’t just digitized; it was database-ready. It went from a piece of paper to a row in Google Sheets with columns for “Market Cap,” “P/E Ratio,” and “Primary Risk.”

Suddenly, I wasn’t just looking at a list; I was building a ledger. I can now track these ideas over months. I can see if the “Red Flag” the AI identified actually played out. I can measure my own batting average.

The goal of technology shouldn’t just be to make us faster at doing work. It should be to stop us from losing the work we’ve already done. By turning ink into data, we stop treating our ideas as disposable. We give them the respect of memory.

AI AI: Large Language Models Investing

The Digital Devil’s Advocate

Post author By Scott Loftesness
Post date January 20, 2026
No Comments on The Digital Devil’s Advocate

There is a seduction in the handwritten note. When I scribble down a company name in a notebook, it is purely additive. It represents potential upside, a future win, a brilliant insight caught in ink. The notebook is a safe harbor for optimism because it lacks a “Reply” button. It doesn’t argue back.

But optimism is an expensive luxury in investing.

After my initial experiment—using Gemini 3 Pro to transcribe my messy list into tickers—I felt a surge of productivity. But productivity is not the same as discernment or understanding. I had a list of stocks, but I didn’t have a thesis. I just had digitized hope.

So, I took the next step. I didn’t ask the AI for validation; I asked for a fight. I fed the tickers back into the model with a specific directive: “Act as a contrarian hedge fund analyst. Find the red flags. Kill my enthusiasm.”

“I didn’t ask the AI for validation; I asked for a fight.”

The results were immediate and sobering. The “promising tech play” I had noted? The AI highlighted a massive deceleration in user growth hidden in the footnotes of their latest 10-Q. The “stable dividend payer”? It flagged a payout ratio that was mathematically unsustainable.

In seconds, the warm glow of my handwritten discovery was doused with the cold water of 10-K realities. And it was fantastic.

We often view AI as a tool for creation—generating text, images, and code. But its highest leverage application might actually be destruction. By using it to stress-test our assumptions, we outsource the emotional labor of being the “bad cop.” It allows us to kill bad ideas quickly, cheapy, and privately, before we pay the market tuition for them.

My notebook is still where the dreams live. But the digital realm is now where they go to survive the interrogation.

AI AI: Large Language Models Investing

From Ink to Insight

There is a distinct friction that exists between the analog world and the digital one. For years, analog notebooks have been the graveyard of good intentions—lists of books to read, article ideas to write, and companies to investigate, all trapped in the amber of my barely legible handwriting.

I recently found myself looking at one of these lists: a scrawl of company names I had jotted down while reading an article discussing possible companies for investment in 2026. Usually, this is where the work begins—taking my handwritten notes, typing them out one by one, searching for tickers, opening tabs, etc. It is low-value administrative work that often kills any spark of curiosity before it can turn into useful analysis.

“The barrier to entry for deep research drops to the time it takes to snap a photo.”

On a whim, I snapped a photo and uploaded it to Gemini 3 Pro. “Transcribe this,” I asked. “Give me the tickers.”

I expected errors. My handwriting is, to put it mildly, not easy to read (even for me!).

Instead, the AI didn’t just perform Optical Character Recognition (OCR); it performed contextual recognition. It understood that the scribble resembling “Apl” in a list of businesses was likely Apple, and returned $AAPL. It deciphered the intent behind the ink.

But the real shift happened when I asked Gemini to pivot immediately into research. Within seconds, I went from a static piece of paper to a dynamic analysis of P/E ratios, recent news, and market sentiment. The friction was gone.

This experience wasn’t just about productivity; it was about the fluidity of thought. We are moving toward a reality where the interface between the physical world and digital intelligence is becoming permeable. When the barrier to entry for deep research drops to the time it takes to snap a photo, our curiosity is no longer limited by our patience for data entry. We are free to simply think.

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this: