Discussion about this post

User's avatar
Kaleberg's avatar

Page level autocomplete seems to work as well as short phrase autocomplete. It's accurate maybe 10%-15% of the time. The AI/ML systems that are useful are the product of extensive and focused data gathering and do an advanced form of compression or curve fitting. If you want to optimize a metal alloy or find a protein structure, they can work fairly well, but that's because of the clean, standardized data set.

A friend of mine has been exploring various AI chat systems' literary abilities, answering questions about English language literature. She says the completion systems have access to interesting things, but you have to know those interesting things in order to get their completions. It is a chicken and egg problem that leaves one stuck with a chicken or an egg and no way to get from one to the other.

It's a lot like internet search. If you know the right terminology and phrasing, you can - or at least could - find all sorts of interesting things, but if you don't, you'll have to keep trying alternative search phrasings and hope that hit #267 provides a useful key phrase that leads one to a more interesting place. Search systems have actually gotten worse in this regard since they have done a lot of work to "improve" search over the last decade. It is often impossible to find certain things without precise phrasing and priming.

What happened to search? It moved away from simple textual search and tried to understand the semantics. Suddenly, things that were accessible vanished, since their web pages didn't fall into the correct place in semantic vector space. It's much easier to figure out an appropriate textual search than trying to outsmart a relatively opaque semantic space algorithm. Conversational search seems to have the same problem. At least one can look at the URL and original text of a web page to determine if it is bogus rather than having a credulous semantic algorithm accept and incorporate it. (Maybe these things should provide footnotes.)

To deal with this, most search engines now have a list of approved web sites that they use to resolve certain search queries. The same sites, often encyclopedias or standard reference works, appear again and again at the top of one's search results. The reason is obvious, the internet is full of incorrect and poorly indexed information. If the answer can be found in Encyclopedia Wikipedia, it is at least likely to be useful. On the other hand, there is a reason one learns to move beyond encyclopedias and such secondary sources if one is going to find out things.

Expand full comment
BS Bro P's avatar

RE: cut-rate MarshallBermanChatBot - like BradBot, an excellent demonstration & application of LLM

Expand full comment
5 more comments...

No posts

Ready for more?