11 Comments

So my original academic background is in natural language processing. I wrote some very primitive statistical language models, and I worked through much of the '00s at Motorola, on predictive text entry software. (All of my Motorola patents got sold to Google at some point, so whenever you say "damn you, autocorrect!" to an Android -- sorry!) I studied at Johns Hopkins in the back half of the '90s, under Fred Jelinek, who's widely regarded as the originator of the revolution in speech recognition and text processing, replacing attempts to code formal grammars with abstract statistical models.

So I have some rudimentary understanding of what these modern much-fancier statistical chat-bots are doing, and I feel fairly confident in saying that in terms of reaching "artificial general intelligence", they are a dead end.

Boston Dynamics' terrain-navigating robots are a much more promising avenue of research. AGI, if it ever emerges, is going to need some kind of grounding in reality. It first needs a model of the world, and a model of itself in relation to the world. Building from that, you'd want to gradually generalize its model of itself, to recognize that some of what's in the world is other agents like itself -- so then you get theory of mind. (In this area, AIs that control avatars in games -- where they deal with a virtual world and other agents, and try to achieve goals -- are potentially a relevant area of research. We've already seen some transfer of code for efficient control loops from gaming to controls for drones, self-driving vehicles, etc.) Even the most primitive animal has something like this -- prey animals are constantly trying to detect predators in the environment and predict their attack vectors, in order to escape.

Once you have an AI that is reality-grounded, _then_ you can start to layer on language, teaching it how to explain itself, and understand the intentions and requests of other agents. (Though of course this raises all kinds of interesting questions around the alignment problem. These explanations could be false both for reasons of willful deception -- the AI becoming trained to tell you what you _want_ to hear, regardless of the truth -- and just because explaining motivation is hard. _Humans_ are quite bad at understanding their own motivations, let alone interpreting other people's.)

Without attachment to reality, a language model has no way of understanding what is true or useful. As you say, it lacks any kind of grounding in a theoretical framework. It's just producing words that sound vaguely similar to its source material. It's Frankfurtian bullshit.


The hope is that you can, via proper prompting, send it to a corner of the training data space that says true things about the subject matter of the question asked...
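Roughly, that steering is just prompt construction. A minimal sketch of the idea, with the `build_prompt` helper and the framing text invented purely for illustration (no particular model API assumed):

```python
def build_prompt(question, domain_framing):
    """Wrap a question in framing text meant to steer the model toward the
    corner of its training data where careful, sourced writing lives."""
    return (
        f"You are answering as a careful economic historian. {domain_framing}\n"
        "Name the mechanism behind any claim you make.\n\n"
        f"Question: {question}\n"
        "Answer:"
    )

# Example: nudge the model toward Malthusian-framework text rather than
# generic "the past was poor" boilerplate.
print(build_prompt(
    "Why was humanity so poor back in the long Agrarian Age?",
    "Reason in terms of population growth, resource constraints, and slow "
    "technological progress.",
))
```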


Sure, but as long as there is nothing in the model itself that represents the difference between "true" and "truthy", you're always stuck with that kind of prompt engineering. It's not artificial intelligence, it's artificial smart-sounding-ness.

I rather like the joke that it's "mansplaining as a service".


MaaS is very good...


In somewhat old-fashioned terms, bare syntax vs. syntax+semantics.

A point which at some level seems to have been fairly well understood about 50 years ago (in the computer community specifically; much longer ago elsewhere).

I don't follow what's being done but I assume something is still being done on the alternate pathway. Though if you want semantics you can't just throw a large dataset at it and hope that something will emerge (doesn't work too well for people either, past the language acquisition/visual analysis phase).

We do know how to do semantics, theoretically.

On the other hand, I've seen students in over their heads in class because they're stuck in a mode of trying to answer questions without building a semantics for the subject matter. That can work for a very long time in a traditional academic setting, but they do hit a dead end.


What I think is that the ChatBot does not have a theory of mind for you, it has only a theory of text. So if the response to the prompt "Why was humanity so poor back in the long Agrarian Age?" does not include the word "Malthusian", that is because whenever the text it was trained on - presumably Slouching Towards Utopia? - says that humanity was poor back in the long Agrarian Age, it does not usually mention that this was a Malthusian condition. When you wrote the text, you were thinking "Malthusian", because you have a theory about the Agrarian Age; and when I read that text, I was thinking "Brad is thinking Malthusian", because I have a theory of mind for Brad DeLong. But that's the difference between ChatBot and me.

Put in the terms you use below, the ChatBot can go to a corner of its training space, and its space is a very very high dimensional hypercube not legible to the human mind, so this allows it to surprise us; but it cannot create a corner of space where it was never trained.


It seems these modern transformers still have a problem of "forgetting" text and information given more than a certain distance back.
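One concrete source of that forgetting is the fixed context window: anything past it is never seen at all. A toy sketch (the window size here is made up):

```python
def visible_context(tokens, window=2048):
    """A model with a fixed context window only attends to the last `window`
    tokens; anything earlier is simply not part of its input."""
    return tokens[-window:]

conversation = ["tok"] * 5000               # pretend 5,000 tokens of prior text
print(len(visible_context(conversation)))   # 2048 -- the first ~3,000 tokens are gone
```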


Yep. The "attention" mechanism helps diminish this problem a lot...
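For reference, the core of that mechanism is scaled dot-product attention: every position gets a weighted view of every other position in the window, so distance alone doesn't erase information. A minimal numpy sketch:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Q, K, V: (seq_len, d_k) arrays of query/key/value vectors."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # how strongly each position attends to every other
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability for the softmax
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V                            # each output is a weighted mix of all positions' values

# Toy example: 4 positions, 8-dimensional vectors.
rng = np.random.default_rng(0)
Q, K, V = (rng.normal(size=(4, 8)) for _ in range(3))
print(scaled_dot_product_attention(Q, K, V).shape)  # (4, 8)
```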


My knowledge of this AI comes from cursory reading about the technology. I have a general idea of the strategy and goals of the transformer, though I have no idea how correct I am. But it has led to some ideas. I wonder what would happen if the hidden representation were sent through the transformer again, bringing long-distance terms even closer together. The idea is similar to taking derivatives in calculus. Starting from position as a function of time, the first derivative yields velocity (v = dx/dt), and the second derivative yields acceleration (a = d^2x/dt^2). With each differentiation a constant term is lost; likewise, when integrating back, a constant has to be added in again.
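Spelled out with the standard kinematics example (added here just to make the analogy concrete):

```latex
% Position as a function of time:
x(t) = x_0 + v_0 t + \tfrac{1}{2} a t^2
% First derivative: velocity (the constant x_0 is lost):
v(t) = \frac{dx}{dt} = v_0 + a t
% Second derivative: acceleration (now v_0 is lost as well):
a(t) = \frac{dv}{dt} = a
% Integrating back recovers x(t) only up to the lost constants of integration.
```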

So in this model, starting with a paragraph you have a thought; transforming it again yields a concept; transforming once more yields a theme. Reverse the process to get an output. This process might yield different types of output across similar disciplines. Of course, this may just be my imagination running wild.


I think that as you feed it to itself over and over again, things become compressed, but after two or three passes you lose coherence pretty much completely.
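A toy way to see that: apply any lossy compression pass to its own output a few times. The crude "keep every other sentence" pass below is invented for illustration and much dumber than re-running a model, but the degradation has the same shape:

```python
def crude_compress(text):
    """Toy lossy pass: keep every other sentence. Stands in for feeding a
    model its own summary; invented purely for illustration."""
    sentences = [s.strip() for s in text.split(".") if s.strip()]
    return ". ".join(sentences[::2]) + "."

doc = (
    "Humanity was poor in the Agrarian Age. Population grew whenever incomes rose. "
    "Extra mouths ate up the gains from better technology. Living standards snapped "
    "back toward subsistence. That is the Malthusian condition."
)
for i in range(3):
    doc = crude_compress(doc)
    print(f"pass {i + 1}: {doc}")   # by pass 3 only one disconnected sentence is left
```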


I spent some time thinking about this. I suppose that if transformations on the meta- and meta-meta-data have already been done, then it may be a dead end.

The opening chapter of Slouching was about Thomas Malthus's Devil and the conflict between food growth and population growth. Keynes and Malthus's Devil are mentioned together in the same sentence three pages later, and Keynes is then mentioned a few more times.

If the desire is for a more thematic analysis, then a meta-layer might be created from the word embeddings and the position array. For example, for every position group, a query could pull the top 3 or 5 keywords and a score could be calculated for the group's relevance; essentially this would be scoring an entire sentence or paragraph. Clusters of closely related ideas might then indicate an important theme, and from the theme the query words could be recovered by drilling backwards to the embedded scores for each word in the decoding phase.
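A rough sketch of that scoring idea, using plain word counts as a stand-in for embeddings and treating each sentence as a "position group" (the helper names and thresholds are invented for illustration):

```python
from collections import Counter
import re

def top_keywords(text, k=5):
    """Pick the k most frequent content words as crude 'query' keywords."""
    stop = {"the", "a", "an", "and", "of", "in", "to", "was", "is", "that", "this"}
    words = [w for w in re.findall(r"[a-z']+", text.lower()) if w not in stop]
    return [w for w, _ in Counter(words).most_common(k)]

def score_sentences(text, keywords):
    """Score each sentence ('position group') by how many top keywords it contains."""
    sentences = [s.strip() for s in re.split(r"[.!?]", text) if s.strip()]
    scored = []
    for s in sentences:
        words = set(re.findall(r"[a-z']+", s.lower()))
        scored.append((sum(w in words for w in keywords), s))
    return sorted(scored, reverse=True)

doc = (
    "The opening chapter is about Malthus's Devil and the conflict between food "
    "growth and population growth. Keynes and Malthus's Devil appear together a "
    "few pages later. Population pressure kept incomes near subsistence."
)
kws = top_keywords(doc)
for score, sentence in score_sentences(doc, kws)[:2]:   # highest-scoring sentences ~ themes
    print(score, "|", sentence)
```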
