Categories
AI AI: Large Language Models AI: Transformers ChatGPT

The Best Explainer of ChatGPT

Just came across a great explainer of ChatGPT and the underlying technologies by Stephen Wolfram: What Is ChatGPT Doing … and Why Does It Work?

That ChatGPT can automatically generate something that reads even superficially like human-written text is remarkable, and unexpected. But how does it do it? And why does it work? My purpose here is to give a rough outline of what’s going on inside ChatGPT—and then to explore why it is that it can do so well in producing what we might consider to be meaningful text.

Highly recommended!

Categories
AI AI: Large Language Models AI: Transformers ChatGPT

Hallucinating

woman in white knitted sweater
Photo by cottonbro studio on Pexels.com

It’s been just over a last week since Microsoft made such a big deal about an enhancement to its Bing search engine that adds OpenAI’s GPT chat capability to it. In the process, Microsoft declared how it expected this new capability to help drive significant market share growth in search.

But it’s become clear in the last week that this kind of chat capability is much less about enhancing search (particularly if you want accurate answers) and much more about generating creative text useful for other purposes. This capability has become known as hallucination – where a chatbot just started string text together. Cade Metz writes: “hallucinate is just a catchy term for “they make stuff up.”

I’ve had some great fun playing around with this to get help writing, for example, short stories. I’ll provide a few sentences to seed the chatbot’s “thinking” and then ask it to complete a 1,000 word short story based on that input I provided. It’s been fun to see what results.

Even more fun has been asking the chatbot to adjust the style to make the writing similar to other famous authors such as Hemingway, Steinbeck, Twain, George Saunders and others. It’s been fun to see the stylistic changes it makes to the same basic story based on the writer’s style that I specify.

None of this work has anything to do with search – nor does it help in any way by enhancing search results. It’s something completely different, strikingly interesting, and a heck of a lot of fun to play with. Whether it’ll be really useful in helping me do any real writing remains an open question – but meanwhile I’m enjoying sparring with a seemingly smart creative “mind” on the other end of my computer screen!

Categories
AI AI: Large Language Models AI: Transformers ChatGPT

Attention is all you need – or is it?

woman in white long sleeve shirt standing near white and gray house during daytime
Photo by Julian Jagtenberg on Pexels.com

How important is accuracy? Sort of feels like the pursuit of quality in Zen and the Art of Motorcycle Maintenance!

I’ve been enjoying following the evolution of AI technology which seems to be accelerating at an ever increasing rate. Speaking with a good friend earlier this week, he said “Scott, it really feels to me like the early 90’s – when change was accelerating (the Internet) and we could feel it but didn’t really know what to make of it.” Indeed, it does feel like that again.

After spending some time both playing with ChatGPT, Poe, and others, I’ve come to respect what they’re capable of. But I’ve also come to learn more about what they’re not capable of – namely, dealing with facts in an accurate way. These tools all provide a disclaimer that they may generate inaccurate results – and that their results must be checked for accuracy. And for good reason. Once you understand how the large language models work, you can understand why.

In my simple understanding, what these LLMs do is get trained on very large corpuses of textual data – like the “whole Internet” – and that training is then “validated” by humans who test it with lots of queries and inspection of the generated results. That combination of training and verification is used to essentially set the weights inside the model which then are used in a kind of simplistic way to generate text – by moving from word to word (or word fragment to word fragment) and “writing” answers to queries. In other words, they’re using their training to come up with the best possible next word to output given the query they’ve been given.

Clearly, the best possible next word isn’t necessarily an accurate one. Rather it’s one that the model has seen most frequently. Because of this fundamental characteristic in how these things work, you can get results that aren’t necessarily accurate. As a personal example, it asked ChatGPT to give me a history of a small town where my father grew up. I knew that history – including where the name of the town came from. ChatGPT gave me the wrong attribution for the name of the town. But when I asked a bit differently whether the name actually came from the location I knew to be correct the model came back and agreed with me.

In my mind, coming to better understand these characteristics has helped me understand where these kinds of models may be very useful and other situations where they might be less useful. For example, asking one of these LLMs to help write a short story is a very good use. See an earlier example I wrote about using ChatGPT to write haiku poetry. That use case isn’t one that depends on any accuracy – it’s just one where clever use of text is all you want. Similarly, I can see how using LLMs to help write computer code (like GitHub Copilot) makes a lot of sense. But I question how much value an LLM can bring to helping deliver better search results – which is the current basis for what Microsoft is trying to do with its new Bing and enhancements to the Edge browser. It’s less clear that I want potentially inaccurate results from an LLM being delivered to me in response to search.

It’s early days to be thinking about all of this – and I’ve got a lot more to learn. Meanwhile, I’m really having some fun using these tools to help me write short stories – based on a start that I provide and then asking for the result to mimic the writing style of a Hemingway, Steinbeck or Didion. That kind of creative use definitely is helped using these tools. Getting accuracy in search results seems much less promising.

Categories
AI ChatGPT

AI Chatbots are the new attention getters

The wave of AI chatbots continues to build – with the recent public release of ChatAPT, the new Poe app from Quora, today’s Bing announcement from Microsoft and whatever might be coming from Google, Facebook, Apple and others.

I’ve been doing quite a bit of experimenting with ChatAPT in particular – as you can see from some of my recent posts. Today I’ve been spending time with Poe on my iPhone – it’s a really well done app that makes it even easier to interact conversationally with AI chatbots (Poe supports multiple chatbots).

One thing is clear from my play – these new “tools” are true “attention sinks.” As you ask questions and research a topic, the ability to interact and dive deeper is quite addicting – and time consuming! It’s all an interesting experience.

With the potential for significant monetization for companies that can captivate our attention through their apps, the time we spend with AI chatbots may come at the expense of time spent on other apps, which could impact the ability those other apps to continue to monetize at the same level. This shift will be intriguing to observe.

Categories
AI AI: Diffusion Models ChatGPT

Diffusion Models and Image Creation

I’ve been doing some more exploring with ChatGPT – this time asking it to educate me on how diffusion models work – they’re the underlying technology behind apps like DALL-E, Stable Diffusion, Midjourney, etc. Below is what I learned during my conversation with ChatGPT. The image below was created using DiffusionBee, an app based on Stable Diffusion. I used the input: a colorful illustration of how diffusion models work.

Categories
AI ChatGPT

The Ghost in the Box

I happened to hear a discussion on a YouTube video yesterday about Niklas Luhmann’s mention of how his “zettlekasten” seemed to act like a “ghost in the box”. I did a quick Google search for that phrase but didn’t find anything particularly helpful.

Earlier I had watched Nick Milo’s video about how he’s been using OpenAI’s ChatGPT for research and other things. He talks about a near future where he expects to have it nearby whenever he’s working on his computer.

So I decided to ask Chat GPT to help me understand more about the “ghost in a box” by writing a 1,000 word article about it. Here’s what I got back from ChatGPT:

The Ghost in the Box

The concept of “ghost in the machine” was popularized by philosopher Gilbert Ryle in his 1949 book “The Concept of Mind.” The phrase refers to the idea that human consciousness and mental states cannot be reduced to mere physical processes in the brain.

German sociologist and systems theorist Niklas Luhmann took this concept a step further in his theory of social systems. He proposed that society itself can be seen as a “ghost in the box,” a complex system that is made up of smaller, interconnected parts that cannot be fully understood or controlled by any single individual or group.

According to Luhmann, society is made up of various “function systems,” such as the legal system, the economic system, and the political system. These systems operate independently of one another, but are also interconnected and dependent on each other. Each system has its own “code,” or set of rules and norms that govern its operation.

Luhmann argues that society as a whole can be seen as a “communication system,” which is constantly producing and processing information. This information is then used by the various function systems to make decisions and take action. However, the complexity and sheer volume of this information make it impossible for any one individual or group to fully understand or control the system.

This idea of society as a “ghost in the box” has important implications for how we understand and interact with the world around us. For example, it suggests that we can never fully understand the causes of social phenomena or predict their outcomes. It also implies that we should be cautious about attempts to manipulate or control the system, as these efforts are likely to have unintended consequences.

Furthermore, Luhmann’s theory also highlights the importance of communication and understanding the code of the different function system in order to function in the society. He also emphasizes on the importance of social differentiation, as it allows different function system to operate independently while still being interconnected.

In conclusion, Luhmann’s concept of society as a “ghost in the box” offers a unique perspective on the complexity and unpredictability of social systems. It reminds us that we can never fully understand or control the world around us, and that our actions and decisions are likely to have unintended consequences. Thus, it is important to be aware of the limitations of our understanding, and to approach the world with humility and caution.

Note: This was generated using ChatGPT Jan 9 Version.

Categories
AI ChatGPT Haiku

Haiku with ChatGPT

Note: see also Haiku with Google Bard.

Like over a million others, I’ve been enjoying playing with the new ChatGPT tool from OpenAI.com. Among many other things, ChatGPT can help generate haiku – here are some examples from a recent session of mine:

Write a haiku about film photography

In moments captured
Light on film, forever frozen
Art of the past lives

Write a haiku about a quiet morning in a boat on a lake

Quiet morning calm
Ripples gently lap the shore
Birds in harmony

Write a haiku about snow falling on cedars

Snowflakes fall like tears
Upon the cedars’ green boughs
Winter’s silent grace

Write a haiku about stillness in Kyoto

In the still of night
Kyoto’s ancient temples bask
In the moon’s soft glow

Write a haiku about watching the first rain of the year

First rain of the year
Falling gently from the sky
Refreshing the earth

Write a haiku about geese flying overhead

Geese fly in formation
Honking as they pass overhead
Autumn’s wild symphony

Write a haiku about farming with a John Deere tractor

Green John Deere tractor
Plowing fields under bright blue sky
Farming’s steadfast work

Write a haiku about writing haiku

Ink flows from my pen
Expressing thoughts in simple verse
Haiku, my meditation

Try it ChatGPT yourself with a free account at OpenAI.com. Image created using DiffusionBee with the phrase: “a painting of an poet in a red beret writing haiku on a paper tablet”