Discussion about this post

glc

Here's an April 1 post: https://www.rfc-editor.org/rfc/rfc9564.html h/t Gergely Nagy.

However, every day is April 1 in this area. And much else.

Alex Tolley

"The… amount of tacit knowledge that's involved in successfully training a high-quality large model is still quite high. So you can read the papers, you can look at the open source, but getting these things to train and converg… over these large clusters and managing all of that—there's still quite a lot of that knowledge… not published, not written down…. The individual items are probably small, but they really add up…"

Ahem. Isn't this exactly the problem that ultimately sank Expert Systems? Too much handcrafted knowledge, and too little of the tacit knowledge that was needed. Training language models seems to be on a similar level to extracting expertise and encoding it in rules. What was the saying about "not learning from history..."?
