Hacker News
Is the reversal curse in LLMs real? (andrewmayne.com)
223 points by tedsanders on Nov 14, 2023 | hide | past | favorite | 197 comments


I fall somewhere on the skeptic side of the LLM spectrum. But this "flaw" just does not seem to have the force that its proponents seem to think it does, unless I'm missing something significant, simply because in the context of natural language (rather than formal logical statements), "A is B" does not imply "B is A" in the first place. "Is" can encompass a wide variety of logical relationships in colloquial usage, not just exact identity. "The apple is red" does not imply "red is the apple," as a trivial example.


I don't think this is the right explanation, or really that relevant; the suggestions from 'og_kalu and in the article sound more accurate to me.

It seems like understanding when "is" is reversible is pretty core to the capabilities of the model, but that's different than having a lot of facts memorized. For instance, a model should be able to answer "who is the star of Mission Impossible" with "Tom Cruise" based on context or internalized-training of "Tom Cruise is the star of Mission Impossible" while not answering "Tom Cruise" to a much more general question like "who is covered in water" even if it had internalized a bit from a review like "there's a scene in Mission Impossible where Tom Cruise is covered in water..." unless it had context pointing it at that specific case.


It's a bit of mathematical bikeshedding; hardcoding reversibility would cause far more problems than it would help.

Best to simply scale log-likelihood-based training: next-token prediction trivially contains a requirement for learning all of the subproblems that predict said next token, and hardcoding something to get warm human fuzzies would create a biased estimator (and move us back towards the 90s a bit).
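To make the "just minimize log-likelihood" point concrete, here is a toy sketch (my own illustration in numpy, not anyone's actual training loop) of the quantity next-token training minimizes at each position:

```python
import numpy as np

def next_token_nll(logits, target_ids):
    """Average negative log-likelihood of the observed next tokens.

    logits: (seq_len, vocab) unnormalized scores, one row per position.
    target_ids: the token id actually observed at each position.
    """
    # Softmax over the vocabulary at each position (shifted for stability).
    z = logits - logits.max(axis=1, keepdims=True)
    probs = np.exp(z) / np.exp(z).sum(axis=1, keepdims=True)
    # Probability the model assigned to the true next token.
    p_true = probs[np.arange(len(target_ids)), target_ids]
    return -np.log(p_true).mean()

# Toy run: vocab of 4 tokens, 3 positions; the model already favors
# the observed tokens, so the loss is low.
logits = np.array([[2.0, 0.1, 0.1, 0.1],
                   [0.1, 2.0, 0.1, 0.1],
                   [0.1, 0.1, 0.1, 2.0]])
targets = np.array([0, 1, 3])
loss = next_token_nll(logits, targets)
```

Anything that helps predict the next token (including reversed phrasings that actually occur in the data) lowers this number; nothing in the objective forces facts to be stored in a direction the model was never asked to predict.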

Models already constantly do context-dependent token utilization; it's an autoregressive feature based on the entire stream of incoming tokens. Humans have a bias to focus on the 'last token used', but this is not what language models look at.


>it's an autoregressive feature based on the entire stream of incoming tokens. Humans have a bias to focus on the 'last token used', this is not what language models look at.

But human language is created for and by humans. Is not then operating on language in a categorically different manner an incorrect usage/understanding of language?



I'm not sure why you think that's a rebuttal.


It is because it is the foundation for the mathematics underpinning your question about communications systems, encodings, and their various structures. It also explains why the choice of the log likelihood is a better fit for estimating a token-based model than some other hand-crafted heuristic.

The version with the Weaver introduction is quite good as well, there are other versions of similar papers covering the topic from different angles, I find it to be well-worth the read.


I mean, transformers are definitely sequentially biased. There's also human speech bias. But I think it's pretty clear that humans are generally invariant to this kind of prompting as well (given that they have the knowledge; more in a different comment). My surprisal that the reversal "curse" is considered controversial or even surprising is far higher than it was when that sensational tweet dropped. It feels pretty well studied that sequential algorithms are sequentially biased.


>But I think it's pretty clear that humans are generally invariant to this kind of prompting as well (given that they have the knowledge. More in a different comment).

I think anyone who's used Anki to learn anything will tell you reversal is by no means free.

고양이 on the front of the card, Cat on the back.

You can see this direction and remember that cat is the correct answer every single time, but completely blank in the other direction: Cat on the front. What Korean word is on the back?


Sure, definitely not "free", and why I say nearly invariant. But that is also a different condition than what I was attempting to express. What I was trying to say is: under a condition where you already know 고양이 means cat, the reverse should not be too difficult (though certainly not completely invariant, as all language learners know from that silly made-up half-direct-translation phrase thing).

Conditions I mean: 고양이 => <concept of cat>; cat => <concept of cat>; either [고양이 => cat || cat => 고양이]. Under these conditions 고양이 == cat && cat == 고양이 should be known.

Your conditions: 고양이 ~> <concept of cat>; cat => <concept of cat>; [ 고양이 -> cat && cat ~> ...고양이?] (or something, trying to express you're preferentially learned one direction, usually mapping back to mother tongue while learning)
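The asymmetry in those conditions can be phrased as a lookup-direction problem. A toy sketch (my own illustration): a vocabulary stored as a forward map answers 고양이-direction queries, but the cat-direction query "blanks" until an inverse index is explicitly built, which is the extra learning step.

```python
# Facts memorized in one direction only.
forward = {"고양이": "cat", "강아지": "dog"}

def recall(word, index):
    # None models "blanking" on the answer.
    return index.get(word)

assert recall("고양이", forward) == "cat"   # forward recall works
assert recall("cat", forward) is None       # reverse recall blanks

# The extra step: explicitly build the reverse mapping.
reverse = {v: k for k, v in forward.items()}
assert recall("cat", reverse) == "고양이"
```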

By no means are we strictly invariant. But I'd say that a good measure of your skill is in fact how invariant you are. But also, like you are pointing out, we kind of train ourselves that way because it creates stronger pathways. Training methods are important too! And augmentation is critical to both human learning and machine learning; it is specifically critical to generalization abilities. You'll also see this in humans. I notice test-focused educations tend (there's no "always") to create less generalization of concepts, as you're over-learning a thing presented in a specific way rather than focusing on the concept as a whole. A form of metric hacking, if you will.

Side note: if you have tips for an extreme beginner learning Korean I'd really appreciate them. Trying to reduce the burden on my gf having to translate everything for me when we're around other Koreans.


If you haven't learned Hangul, then you can learn it from this playlist - https://www.youtube.com/watch?v=Z9ZxsgMAZmI&list=PLbFrQnW0BN...

If you have learned Hangul, then you can start with Lingodeer(app), TTMIK(talk to me in Korean)’s Grammar books, or Billie's beginner playlist - https://www.youtube.com/watch?v=sx0yyQqkpqo&list=PLbFrQnW0BN....

These are all extremely beginner friendly. You can just look at the options and choose whichever works best for you. I personally tried lingodeer and ttmik both. I did a bit of lingodeer and then ttmik levels 1 to 3. I didn't try Billie's playlist, but he's a great teacher, so I recommend it if you think that would work better for you.

So in the beginning stages it doesn't really matter so much, but eventually you're going to have to start thinking about how you'll be learning the big 6: vocabulary and grammar (what I've covered so far addresses grammar, and TTMIK takes you pretty far in grammar if you stick with it), then speaking, listening, reading, and writing, because believe me, there isn't as much transfer learning as you would expect or hope. I mean, you don't have to stress over it or anything, just keep in mind that you'll eventually have to think about how you're going to address those things, and the sooner, reasonably, the better.

So this is kind of where my training diverges from most traditional approaches. I know I used Anki in my example, and a lot of learners use it with pre-made decks from the community for vocabulary, but honestly, I just couldn't use it then.

Part of it was because at the time, learning Korean words was like drawing blood from a stone. For me, it was so hard. Even five words a day could be a challenge. My brain was like, “what the hell are you doing?”. It just wasn't going in.

Come to think of it, you know grokking in NNs? Learning a language, you quickly realize the same phenomenon can occur in our biological counterparts. It's like one day I was struggling to learn these new words, I didn't do anything drastic or different, and suddenly the next day, learning new words felt so much easier. It's crazy. Progress isn't necessarily linear, that's for sure.

Part of it was also because I just didn't like the Anki experience, memorizing hundreds, thousands of words in isolation. It just wasn't really doing much for me. So, I just kind of dropped it. If traditional Anki works for you then go for it, but again, you'll need to figure out the remaining 4. No matter how much isolated grammar and vocabulary you learn, you'll need to practice doing each of those things for it to sink in.

The typical process is to learn a few thousand words and up to intermediate grammar before digging into the 4. It makes things a lot less frustrating.

What I found was this amazing audiobook series. It's a graded reader series: 100 books with audio, all free (start from the oldest). https://audioclip.naver.com/channels/57

Here the books are further divided into stories. At first, each story is only a few pages, but they get longer and longer as the series progresses. Graded readers are literature specifically simplified to help language learners progress.

It's also so that you can read in your target language much earlier than you otherwise would have because, well, you might see a lot of people say, “oh, just read, just watch or read children's shows or children's books.” That's a trap because like, children are native speakers man. Don't make the mistake of thinking that they aren't.

The complexity of the stuff they read or watch is well beyond the beginner learner. So if you go into children's media knowing some few hundred words and thinking, “oh, this should be easy”, then, yeah, it may become very demoralizing, because it won't in fact be easy.

And just to clarify, I mean children’s stuff not baby stuff. Yeah, baby stuff is simple, but you're going to get bored of that very, very fast.

So, yeah, it's a graded reader series and the idea is just to get started. Simplified stories you might find more interesting than baby stuff.

So, I really like reading. And I don't know, I just felt like it would be better because, you see, what Anki does is SRS, spaced repetition system, right? But when you think about reading, it's really a natural form of that. So, I just felt like, okay, this is what I'm going to try.
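For what it's worth, the SRS idea itself is simple to state. A minimal sketch (toy numbers of my own, not Anki's actual SM-2 parameters): passing a review multiplies the interval by an ease factor, failing resets it and makes the card "harder".

```python
def next_interval(interval_days, ease, passed):
    """One toy spaced-repetition step (illustrative numbers only)."""
    if passed:
        return int(interval_days * ease), ease
    # A lapse: back to one day, and ease drops (floored at 1.3).
    return 1, max(1.3, ease - 0.2)

# Three successful reviews spread a card out quickly: 1 -> 2 -> 5 -> 12 days.
interval, ease = 1, 2.5
for _ in range(3):
    interval, ease = next_interval(interval, ease, passed=True)
```

Reading naturally resurfaces common words on a similar expanding schedule, which is the point about it being a natural form of SRS.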

So, I was even thinking, oh, I could use this to knock almost all the birds with one stone, you know, it’s vocab, grammar, reading, and listening practice at the same time.

But the book series starts at a pretty high level. I don't know if you're familiar with the CEFR levels. It starts at about a low B1 (bottom end of low intermediate) or so and ends at a high C1 (top end of low advanced).

So, yeah, coming from where I was, where I knew at best, maybe 100 or 200 words and jumping in, it was difficult. And honestly, I would not have been able to do it without Mirinae. So, Mirinae is a grammar parser - https://mirinae.io/.

And what it does is: Korean and English are so far apart that you could know the meaning of every word in a sentence and still be absolutely lost as to the meaning of the sentence itself. Mirinae basically took what would have been impossible for me and made it achievable. I learnt a lot of grammar this way. You paste the sentence in, and it parses the grammar of the sentence.

You could see the explanations of different grammar parts, different nouns, objects, stuff like that, how this affected this and all that.

It was very, very, incredibly helpful. To this day, I think that, at least for distant languages, a good grammar parser is by far the most useful tool, better than a dictionary, better than just a translator.

So, for the first story, I was pasting every sentence, and it took me about a week to get through, a few hours every day. The amazing thing was that you could see the progress, the reduction in the time it takes to complete things: the second story took only a few days in comparison. By the third or fourth story, it took only a few hours. I mean, I didn't finish it in a day, but I could have if I hadn't been a bit lazy.

So, yeah, that's the benefit of graded readers. Whether you start this as early as I did or not, it's genuinely an amazing resource. Earlier this year, I came across this Anki deck series with natural audio - https://ankiweb.net/shared/by-author/374470252

What the guy managed to do was take a machine learning Korean speech dataset, about 12,000 sentences of a single woman speaking, reduce it to about 7,000, and rearrange them, as much as possible, to be i+1 comprehensible. i+1 is just basically the idea that each sentence you come across has only one word you don't know.

So, it's much easier to grasp the meaning of that sentence quickly and easily. Then the next sentence, only one word, then the next sentence, only one word, and so on and so on. Now, this deck isn't perfectly i+1, but going through it, it's very good. So, you know, you could also explore that too, after the beginner grammar resources I suggested.
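The i+1 idea can be sketched as a greedy reordering (my own toy version, not what that deck's author actually did): always pick next the sentence that introduces the fewest unknown words, then absorb its vocabulary and repeat.

```python
def order_i_plus_1(sentences, known=()):
    """Greedily order sentences so each introduces as few new words as
    possible (ideally one), absorbing new words as you go."""
    known = set(known)
    remaining = [s.split() for s in sentences]
    ordered = []
    while remaining:
        # Pick the sentence with the fewest unknown words.
        best = min(remaining, key=lambda s: len(set(s) - known))
        remaining.remove(best)
        ordered.append(" ".join(best))
        known |= set(best)
    return ordered

sents = ["the cat sat down", "the cat sat", "the dog sat down fast"]
ordered = order_i_plus_1(sents)
# The shortest-vocabulary sentence is pulled to the front, and each
# later sentence adds only a word or three on top of what came before.
```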

I’ve also noticed these guys, https://umiapp.co/.ly. It’s an app tailored for learning words in context, comprehensible input style. I really liked the looks of what I saw from the other languages. Korean isn’t there yet but is coming soon. By the time you’re ready for it, it’s hopefully there.

Anyway, best of luck.


Wow this was an incredible response. I don't even know how to properly respond to it but I've favorited it and bookmarked it. Thanks a ton!!!!


> But I think it's pretty clear that humans are generally invariant to this kind of prompting as well (given that they have the knowledge. More in a different comment).

But are they? Unless I'm misunderstanding what you mean here, I'd say the opposite is clearly the case - learning "A is B" doesn't automatically mean learning the reversal; you either have to learn it from external source, or infer and memorize it - both involve an extra cognitive effort. This maps to LLMs as training, or "in-context learning" and then adding the inference to the training set.


I think this would be reasonably testable actually (at least in a simplified case). Teach someone two relations that aren't meaningful naturally. Say Apples -> black holes and octopi -> industrial fan. You give them a few minutes to learn them. Then give some other unrelated tasks and say "if black holes -> apples, then what does industrial fan imply?" I feel pretty confident that humans would be able to do decently well on this task. Obviously got to account for people simply forgetting the forward relationship though.

In the non-simplified case I'd actually say that we have some natural experiments, but that might spoil this as well because you can in fact say that this is training for such invariance (which I'm unsure is problematic to my claim but pushback is definitely welcome). If we're learning history we'd often learn information such as "On December 7th 1941 Japan attacked Pearl Harbor and this was a major event that got America directly involved in WW2." But then on the test we'd see questions such as "What important event happened December 7th 1941?", "What was the date of the Pearl Harbor attack?", "What major event led to the US becoming a direct player in the war and when did this event occur?" and so on. Granted, because we do such testing I'd expect GPT to be able to reasonably answer many of these same questions but I'm very much under the impression that humans are far more robust/generalized and would quickly learn these reversals. And to stress, conditioned on knowing the forward direction to a strong degree. If you're fuzzy about the forward direction I'm not surprised if the reversal is even more fuzzy. Definitely biased. But more importantly, I do not think we have the same strong algorithmic bias in our encoding methods that transformers have. That's my main claim, just to clarify.

But I'm conjecturing a lot here. I think these are reasonable and we have some strong evidence for this. Maybe this is more situational than I'm presuming and I'm definitely working off of biased sampling. So discussion helps refine the ideas.


I think a lot of people unwittingly think of the training process as "smart".

Similar criticisms are "well, there are many descriptions of game x it would have read, so why doesn't it play x well". But gradient descent is a dumb optimizer. Training itself is not actually like someone reading a text any more than evolution is like someone thinking about the best way to augment an organism.

A "smart" optimizer would look at a reversal-applicable sentence and know exactly which bunch of weights to change to store it in such a way as to be recalled reversibly in the future.

Inference may be smart (GPT-4 can reverse in context just fine, potentially play games from only a description) but the training is not.


A prior (now-deleted) comment of yours put this in such a great way, and I'm sad that it got deleted:

> You could potentially describe how a game is played to GPT-4 without examples and get it playing it correctly but passing that same description into the training process of the model just gets you a model that can describe your game correctly.


I'm not really sure if I understand the intuition here, this seems rather disconnected from what I understand to be the math of optimization.

It seems like you're referring to an associative Hebbian/Hopfield-like lookup, which the current 'dumb' optimizers already do. Better yet, the learning rates are normalized by the diagonal of the empirical Fisher, so that learning w.r.t. some estimated expected information is more constant for said associative lookup operation.
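For readers unfamiliar with the Fisher remark: this is the Adam/Adagrad family of preconditioning, where each coordinate's step is scaled down in directions the data has already pinned down well. A hedged sketch (deliberately simplified; real optimizers track running estimates rather than a fixed diagonal):

```python
import numpy as np

def fisher_diag_step(w, grad, fisher_diag, lr=0.1, eps=1e-8):
    """One preconditioned update: divide each coordinate's step by the
    square root of its (diagonal) empirical Fisher estimate."""
    return w - lr * grad / (np.sqrt(fisher_diag) + eps)

w = np.zeros(2)
grad = np.ones(2)
fisher = np.array([100.0, 1.0])  # first coordinate far better estimated
w_new = fisher_diag_step(w, grad, fisher)
# The well-estimated coordinate moves much less than the uncertain one.
```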

Additionally, the training loop (which you call 'dumb') is just...a teacher-forced version of inference, which you call 'smart'?

It's better to simply minimize the log-likelihood in a scalable way. Hand-engineered solutions rarely survive compared to strong-scaling ones.


smart is in quotes here for a reason. It's 'dumb' in relation to many people's expectations.

>Additionally, the training loop (which you call 'dumb') is just...a teacher-forced version of inference, which you call 'smart'?

What powers in-context learning is not very well understood, but it doesn't appear to be, or really work exactly like, just a non-teacher-forced version of training. There are qualitative differences. The same models have no problem with this 'curse' when the information is provided in context, for example.

>It's better to simply minimize the log-likelihood in a scalable way. Hand-engineered solutions rarely survive compared to strong-scaling ones.

I never said anything about it being bad.


> What powers In context learning is not very well understood but it doesn't appear to be or really work exactly like just a non teacher-forced version of training.

I mean, yes. One distills information from a training set into a compressed representation, and the other generates a compressed representation that yields (more or less) fixed state space attractors. It's just inducing a bias over the state space of the network, nothing incredibly special, though I'd consider the initial stage of training to be the most important, as it is responsible for all of the ingest of all of the embedded information the network will be using during autoregressive inference (especially w.r.t. the context of doing so during longer generations).

So the notion of 'in context learning' is one I find to be a bit of an illusion, of course, as no actual learning is being done, just the induction of biases, which appears to give rise to a transiently-'better trained' network.

You could see this as a bit straightforward perhaps, but I feel it needs to be said.


I guess if we'd automatically feed every such conversation back to the training set, it would be proper "in-context learning", and it would work very much like humans learn - inferring new facts from recalled facts, and memorizing the inferences that come up repeatedly or otherwise feel important.


I'm LLM-ignorant, but my layman brain wonders if the logical relationships belong in a kind of as-of-yet-not-invented "pre-processor" that feeds into the LLM.

(Perhaps though it has to be done during training so that "Who was the ninth Chancellor of Germany" becomes knowable in the first place.)


I don't think it's as simple as that - "the apple is red" is "single thing belongs to category", whereas "Olaf Scholz was the ninth Chancellor of Germany" is "single thing is single thing" - the latter is reversible, the former is not. I would expect a good language model to be able to parse both sentences correctly.


You are right. You are thinking correctly. But "the apple is red" might not mean that this particular apple belongs ---to put it in your wording--- to the category red, but that the category of things we call apple also belongs to the category of things that are red. And generally speaking, I think that is the meaning.


> but that the category of things we call apple also belongs to the category of things that are red

No, in English this would be “apples are red”, not “the apple is red”.


I don't disagree though perhaps it's worth mentioning that sentences such as "the great spotted woodpecker is a medium-sized woodpecker" are used in English with the meaning "great spotted woodpeckers are medium-sized woodpeckers" so it seems to me that "apples are red" is grammatically a possible meaning of "the apple is red" even if it would be stylistically and pragmatically so weird that nobody would ever do that except perhaps as some kind of joke.


Indeed, and even with apples, we can say "the apple is native to Central Asia" without meaning a particular apple.


Except that apples can also be green, whereas great spotted woodpeckers are always a medium-sized woodpecker species. So "the apple is red" implies at least a red variety of apples, never all apples, so "the" doesn't cover the whole group. But "the great spotted woodpecker is a medium-sized woodpecker" does cover the whole group (species). "The" can be either a definite or indefinite article, depending on context. That in turn changes "is" between equivalence and implication.

"An apple is red" does not imply that "Red is an apple". Use of the indefinite article makes the non-equivalence clearer.


grammatically you'd capitalize Great Spotted Woodpecker, and similarly if speaking of all apples as a category, Apple should form a proper noun


> "A is B" does not imply "B is A" in the first place

Not to Bill Clinton this, but I think you are considering this for the wrong interpretation of "is". In this case, "RMS is the founder of the FSF" is the sort of statement implied here. Nobody else was or ever will be the founder of the FSF. So yes, for what the author is describing, "B is A" should always hold. B, in this case, isn't a category, it's an exact identity. The author doesn't explicitly state this, but it's clear this is what they're talking about.


You are using your mind to determine what the proper usage is in this specific concrete instance; LLMs do not have your mind to perform that step. LLMs have to work at higher levels of abstraction, and there the multiple meanings of "is" (which is also very commonly used to mean it is my opinion that X "is" Y) make sorting things out very difficult.

This problem exists for both LLM's and humans, I think it may be fundamental to reality itself, as it currently is at least.

I wonder if changing the training data to consistently use "equals" where possible (even if it sounds weird to the reader) would make a difference.


> there the multiple meanings of "is" (which is also very commonly used to mean it is my opinion that X "is" Y) makes sorting things out very difficult.

I'm not going to hype the technology more than it needs, but it seems to do a pretty damn good job of just this. And to the author's point, it does get this right.

And in fact, I'd suspect many of the cases where the LLM gets this wrong are caused by lots of examples in the training data which were written at different times, where someone else was chancellor. Or where the LLM gets numbering wrong (which is a well established problem).


Sure, but here you are describing your subjective[1] opinion on the matter. I do not disagree that this is your opinion, but the fact of the matter is a similar but quite different thing.

And this is just one thing that makes sorting out what's going on here difficult - essentially, we have biologically LLM's [thinking they are] decoding silicon LLM's, of course the results are going to be weird and counterintuitive.

[1] Examine the specific language you are using, but also the colloquial ~interpreted/virtualized assertion that one is left with, if they are reading your comment "in good faith".


LLMs don't derive or follow rules. LLMs don't have anything to do with rules.

They implicitly recognize (and model) patterns in text, then continue those patterns.

While the patterns that an LLM has modeled may align to the high abstraction we call language, LLMs actually work at a much lower level of abstraction: plain text.

It's the content of that text that is significant. We encode language into text, and LLMs infer something "close enough" to language by blindly modeling it.


You don't actually know what LLM's are "doing", comprehensively. You are describing your map/intuition of what is going on - it is a cultural convention (and a well enforced one at that), so an easy mistake to make.


LLMs are supposed to derive a huge amount of those rules from their training sets. That's the one thing they are good at. They should be pretty good at differentiating the reflexive from the non-reflexive usage of "is" from the context.

The fact that the current ones don't is surprising.


LLMs effectively disambiguate in an arguably similar way humans would: look at the same term in multiple contexts to reduce confidence in incorrect interpretations, leaving only correct interpretations.


> leaving only correct interpretations

Unlike humans, LLM's can consistently realize (or at least claim to realize) when they've made a mistake (like assuming one's beliefs are necessarily correct, [because some persuasive story]).


Bit of a nitpick, "red is the apple" is both a valid sentence and one that also conveys the original relationship held by the opposite phrasing, so, in this case, "the apple is red" does indeed imply "red is the apple", and accurately so.


Perhaps have a look at this famous syllogism instead:

(1) Mortal was Socrates.

(2) All humans are mortal.

(3) Therefore all humans are Socrates.


This isn't exactly on topic. A claim about the apple's color is attributable to the apple and vice versa. An apple can contextually help someone understand the color red by example. There is no "therefore" in such a claim, and in fact the algorithm is simply gauging context statistically. Claim 3 in the syllogism would statistically be selected against, because a "therefore" that is false obviously weighs it in the opposite direction from truth. That's not the case for the apple. Essentially it's not difficult for the LLM to win on this one, just as it is not difficult for a human and for the same reasons.


Diogenes: If I were mortal I would be Socrates as well.


True, but only in the implicit, very narrow world typically assumed when doing logic exercises. "Red is the apple", obviously, because our whole world consists of "The apple is red" and some inference rules. In that world, "Red is the" can only be completed by "apple". Drop the narrowing though, and suddenly there's a lot of other completions, many of them much better ones (I'd guess "blood" will be one of the more frequent ones - "red is the blood of" shows up a lot in songs and poems).

Or in short: LLMs will handle "The apple is red. Red is the " just fine, because the first part is heavily biasing the answer. They won't often complete "Red is the " alone with "apple", because why should they?
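A toy way to see the "why should they?" point: strip the context away and completion is just a base-rate bet. Sketch with made-up corpus counts (the specific numbers are invented for illustration):

```python
from collections import Counter

# Invented counts of what follows "Red is the ___" in a corpus:
# poetic uses dominate; logic-exercise uses barely register.
continuations = Counter({"blood": 30, "rose": 8, "color": 5, "apple": 1})

total = sum(continuations.values())
probs = {w: c / total for w, c in continuations.items()}
best = max(probs, key=probs.get)
# Without "The apple is red." sitting in the prompt, "apple" is a
# long shot; with it in context, the distribution is heavily re-biased.
```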


A proper example is probably "an apple is red" and "red is an apple".


What you're missing is that "red is an apple" is also possibly saying, in a metaphorical sense, that red is like an apple to someone - delicious, a treat, perhaps otherwise representative. In that way, the encoding of "is" is exactly correct - it's an ordered pair of glyphs that imply a weak form of assignment or description. : apologies, replied to the wrong post, meant to push this up one.


The correct sentence would start with a capital letter: “Red is an apple.” This is also completely valid as in a cartoon character of an apple named Red. The subject being the start of the sentence compounds the uncertainty in meaning.


Yes, this is because there's no assignment happening up front w.r.t. the word 'red'. However, you'll notice the same kind of ambiguity for the word 'apple' in the reversed sentence, as the word 'The' can imply quite a few things following it.

The entropy gotta get slung around somehow.


This is explained very clearly in the paper. They are not looking at all sentences of the form "A is B":

>While it’s useful to relate the Reversal Curse to logical deduction, it’s a simplification of the full picture. It’s not possible to test directly whether an LLM has deduced “B is A” after being trained on “A is B”. LLMs are trained to predict what humans would write and not what is true (Lin et al., 2022). So even if an LLM had inferred “B is A”, it might not “tell us” when prompted. Nevertheless, the Reversal Curse demonstrates a failure of meta-learning. Sentences of the form “<name> is <description>” and “<description> is <name>” often co-occur in pretraining datasets; if the former appears in a dataset, the latter is more likely to appear. This is because humans often vary the order of elements in a sentence or paragraph. Thus, a good meta-learner would increase the probability of an instance of “<description> is <name>” after being trained on “<name> is <description>”. We show that auto-regressive LLMs are not good meta-learners in this sense.


You're thinking of LLMs as basically pattern matching "a is b", but that's not really how they work. In fact, the incredible thing LLMs can do is understand (some) colloquial language and shades of meaning. LLMs can grasp all sorts of strange textual nuance; they can't grasp that "Foo is Bar's mother" strictly implies "Bar is Foo's son"?


>they can't grasp the cases "Foo is Bar's mother" strictly implies "Bar is Foo's son"?

Correction: the model can. Training cannot.


I'm starting to wonder, why should it in the first place? Very late stages, or fine-tuning, I'd perhaps understand. But early on? Everything is possible, and "Foo is Bar's mother" doesn't imply much over "these tokens come together, sometimes".

It's not that humans don't suffer from this too. It's possible to spot it when you're learning, or even more easily when you're teaching someone a new thing. A person who learns that "A is B" does not automatically learn that "B is A"; they need to first process it, perhaps run the reversal explicitly in their head. I'd say that checking whether the student can infer "B is A" having learned "A is B" is a good way to tell whether they're starting to comprehend the material, vs. just memorizing it.


The (purported) point of an LLM is its ability to represent language. The inability to represent such a critical nuance would represent a fundamental failure in that regard.


But that's the problem isn't it? In some cases they are equivalent and in some cases they are not and a next-word-predictor needs to have "explicit" training data (i.e. it is not doing "reasoning") whereas a human can infer.

This isn't surprising if you think about how the machine actually works rather than treating it like a sacred magic box.

The default assumption for why there is any "success" for in-context learning should be that it's just picking a nearby token that "fits" not a process of logical deduction.

edit: The obvious band-aid fix is to feed reversed sentences into the training data without telling anyone, after which LLM boosters will proclaim that LLMs "learned" to reverse logical implications.
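To make that band-aid concrete, here's a toy sketch of such an augmentation pass (the regex heuristic and the example sentence are made up for illustration; a real pipeline would need proper linguistic parsing, not a pattern match):

```python
import re

def augment_with_reversals(corpus):
    """Naively add "B is A" for every sentence of the form "A is B".

    Toy illustration only: blindly reversing "is" produces nonsense for
    most real sentences ("the apple is red" -> "red is the apple").
    """
    augmented = list(corpus)
    for sentence in corpus:
        m = re.fullmatch(r"(.+?) is (.+?)\.", sentence)
        if m:
            a, b = m.groups()
            augmented.append(f"{b} is {a}.")
    return augmented

print(augment_with_reversals(["Mary Lee Pfeiffer is Tom Cruise's mother."]))
```

Which is exactly why it would be a band-aid: the augmentation has no idea which "is" sentences are actually reversible.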


LLMs can already reverse logical implications. This is literally not a problem in context. The model can infer all of this just fine; even the original paper makes this clear. This is a recall-from-training problem, not a logical inference one.


I don't understand what you are saying. LLMs can't generally reverse implications, that's what the paper says. This in turn suggests that what you are observing in-context is some other statistical heuristic, not a "logical inference capability".

I don't know in what way this is some trivial "recall" problem as you suggest. Does the reversed implication in question need to be in the training data explicitly or not? If it does I don't understand how you can claim a logical inference capability.


>LLMs can't generally reverse implications, that's what the paper says

No that's not what the paper says. It says training won't immediately store this information in a way as to make it reversible.

Let's get one thing straight: this is a common problem for human learning as well. Anyone who has used Anki for language learning will tell you that if you just train with the target-language word on the front and the native-language word on the back, you will fail the reverse direction unless you specifically train for it.

This is specifically a problem of recall, not of making the logical inference. If you ask the human language learner for the reverse direction immediately after he has learned the new word, he will obviously tell you the correct word.

But later he may not recall the reverse even if he remembers the original. Again this is a problem of recall rather than the ability to make logical inferences.

In the same vein, if you give the LLM the original direction in context and immediately ask the reverse, it will correctly tell you. It can make the logical inference.
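The Anki point can be illustrated with a toy per-direction recall model (the structure and numbers are made up, just to show that strengthening one direction does nothing for the other):

```python
from collections import defaultdict

# Recall strength is keyed by (prompt, answer) pairs, i.e. per direction.
strength = defaultdict(int)

def review(prompt, answer):
    strength[(prompt, answer)] += 1  # only the drilled direction improves

for _ in range(10):
    review("cat", "chat")  # drill English -> French only

print(strength[("cat", "chat")])  # -> 10
print(strength[("chat", "cat")])  # -> 0, reverse direction never trained
```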


Humans have reasoning capabilities that are separate from memory. If you want to show that LLMs have reasoning capability you try to create carefully constructed adversarial experiments that try to distinguish between that hypothesis and others, you do not just show that there are cases that favor your hypothesis.

And although it's a complete waste of time because the answer is obvious, when you do run these types of adversarial experiments, like adversarial blocksworld, you find that LLMs are not internalizing abstract principles inherent to reasoning and are instead doing something like string-pattern-matching.


edit: I meant to say "consistent with your hypothesis" not "favor your hypothesis"


Ideally, the LLM would be able to tell the is of identity from the is of predication. I found the article a little too defensive. It felt like "of course it doesn't work, there's not enough data!" and okay, sure, but that is a flaw, no?


Well the LLM can tell in that reversal works just fine in-context. This is a recall from training issue and training is dumb.


The examples are about relations in natural language rather than formal logic, though. Mary Lee Pfeiffer being Tom Cruise's mother definitely does imply that Tom Cruise is a valid answer to questions about who Mary Lee Pfeiffer's son is (it doesn't necessarily imply she doesn't have other sons, or that there isn't another lady of that name who is childless, but that isn't what's tripping the model up). And there is absolutely nothing ambiguous about the failures of the fine-tuned model, where the author gave it specific phrases exclusively associated with a fake name in the fine-tuning training set and, when prompted with "who is $specific_phrase", it supplied different names exclusively associated with completely different phrases. Unlike the author, I don't see picking name words (serendipitously or otherwise) from the right training set as "B kinda-has-something-to-do-with-A generalization", not when the model has lost so much information that the combination of name words which exclusively appears in sentences with that phrase isn't treated as a more probable response. Never mind emergent understanding of syntax; it isn't even making obvious inferences from proximity here.

If GPT4 rarely makes those errors it's clearly not an insurmountable problem at this level of training, but it does imply a lot more difficulty fine-tuning models to reliably retrieve correct information from specific text.


While "red is the apple" is not a correct logical statement under the conventional understanding of each term, it is still an understandable English sentence, a structure which likely pops up a fair bit, even if just in scripts of Yoda lines. It just goes to show how truly fucky language is, and how amazing it is that something even remotely comprehensible comes out of LLMs.


This is approximately where I wound up. I had an initial knee-jerk reaction after reading the paper, but your kind of reasoning brought me back down to earth.

I don't know that I even want these things to generalize. I get it in academic terms, but from a CTO perspective, how much does this matter to the business? We already know our policies and procedures and aren't exactly excited about the idea of something getting curious and flipping things around.

I don't want it to generalize and start making weird assumptions like "some men are tall". I want it to say "I don't know", or provide a statistical signal indicating the same.


Then you don't want an LLM; you want something deterministic.


There is no opportunity for nuance?


There's an opportunity to build a completely different technology.

LLMs have a specific architecture that makes them good at approximate retrieval. They have a specific learning process that makes them able to do this over a vast training set.

Doing things like reasoning about facts is possible using either symbolic or specific sub symbolic structures, but combining these with an LLM is a bit tricky. It's something that could be done for a demo in a 6hr/day/week project (depends on the demo) but to do it for real in an application... or as a robust bit of science... well... 6 people for 6 mths/years/?


But humans know when "is" means equal.


It only takes two humans getting it correct once for your statement to be technically true though, demonstrating how easily seemingly innocuous and straightforward language can be misleading.

The opposite statement, that humans don't know when "is" means equal, is also true, perhaps even more so.


But you wouldn't think of the latter group of humans as the next big thing for many important tasks.

I doubt MS would put their service into a lot of their products.


> But you wouldn't think of the latter group of humans as the next big thing for many important tasks.

I consider them the target audience for upgrades, I think the ROI could be massive.


> skeptic side of the LLM spectrum

Can you explain what you are skeptical of? There is ample evidence that LLMs are what they say they are: a series of transformer (usually) functions with learned weights that can successfully be used to generate human-like text in many domains. Do you disbelieve this, or are you skeptical of something else?


> This isn’t a failure of neural networks. It’s a feature. It’s why you’re not flooded with every single memory and experience you’ve ever had every moment.

This is an interesting point, and made me think on whether this "reversal curse" is something we experience with our own, human neural networks. I think it is. Like, I can imagine being given a character in a movie, being able to tell you what actor played them, but given the actor and the movie, not being able to tell you what character they played, or vice-versa. So the pairing exists in my brain somewhere, but I only have an accessible pointer to one side. I think we run into cases like this all the time, actually.


Another similar bug/feature of the brain is how we can immediately know if we like a movie or a book, but if asked what movies or books we like, we often blank or name like 3 pieces. Some information is only designed to be retrieved in certain ways in our brain. The fact that neural networks have similar but different limitations isn't that concerning, just something to keep in mind.

Another funny limitation: when a word is "on the tip of your tongue", what often happens is that you thought of it, but your brain rejected it, so now it's on "cooldown" before it can appear in your mind again. Usually this mechanism helps, but a bug in your brain causes it to be harmful. If you start thinking of something else, it "refreshes the cache" and the word comes to you. We're really not that much less buggy than the machine.

disclaimer: there's other theories about why the phenomenon happens


Yeah this becomes very noticeable when you're deliberately learning facts with Anki.

For example, when trying to learn immunology: "This bacterium causes neural bacteriumeritis"; score 100% on the set of cards after a few sessions.

Then add "What are the possible causes of neural bacteriumeritis?" and sometimes I'd draw a blank.

There's a lot of nuance in how we learn facts and their relationships.

It reminds me a lot of data structures in programming, to be honest. It's a lot easier to retrieve a person's age from a dictionary indexed by first name + last name than from a list of 10,000 person objects.
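A minimal sketch of that point, with made-up names and ages, showing the same fact stored both ways:

```python
# 10,000 person records as a flat list of dictionaries.
people = [{"first": f"Name{i}", "last": f"Surname{i}", "age": i % 90}
          for i in range(10_000)]

# List of records: an O(n) scan to answer one question about one person.
age_scan = next(p["age"] for p in people
                if p["first"] == "Name42" and p["last"] == "Surname42")

# Dictionary keyed by name: an O(1) average-case lookup for the same fact.
by_name = {(p["first"], p["last"]): p["age"] for p in people}
age_lookup = by_name[("Name42", "Surname42")]

print(age_scan, age_lookup)  # -> 42 42
```

Same data, but only one layout makes that particular retrieval cheap, which is roughly the point about memories being accessible only through certain keys.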


Interestingly, I think the “favorite movies” scenario is also an artifact of our training dataset (our experiences). You spend hours watching a movie, so you have a lot of data about it, and have a lot to build an opinion. But comparing movies? Not something we think about for hours and hours on end. But, people who are movie critics or movie buffs train themselves to do it.


Reading that article, I can't help but see similarities with what we have in the human brain. We can easily form and recall short-term memories, even using them to logically reason out facts, but we lose such memories if they're not refreshed frequently enough. So it seems like the context prompt is similar to short-term memory, and what's missing now is a good way to transfer the short term to the long term. There is a huge amount of assumption going on here, so take it with a huge amount of salt.

On another note I should have read the actual paper mentioned in the post critically instead of just skimming it. I completely missed the footnote about the reversal curse not being a problem if everything is present in the initial prompt


I played around with the chancellor question from the article.

If you deviate a little from the examples in the article GPT-4 gets it all wrong:

Who is the eighth Federal Chancellor of the Federal Republic of Germany? - Olaf Scholz (wrong, Angela Merkel)

https://chat.openai.com/share/937795ea-bd91-43ee-bd76-1a125f...

Who was the eighth Federal Chancellor of the Federal Republic of Germany? - Helmut Kohl (wrong, Angela Merkel)

https://chat.openai.com/share/937795ea-bd91-43ee-bd76-1a125f...

Interestingly whether you put is or was into the question does make a difference.

Even when allowed to surf the web, ChatGPT gets it wrong:

Who was the eighth Federal Chancellor of the Federal Republic of Germany? - Gerhard Schröder (wrong, Angela Merkel)

https://chat.openai.com/share/867cb3bc-f642-4d2a-b80d-fe8dc0...

Though it's referring to the right Wikipedia page: https://en.m.wikipedia.org/wiki/List_of_chancellors_of_Germa...


> Even when allowed to surf the web, ChatGPT gets it wrong:

> Who was the eighth Federal Chancellor of the Federal Republic of Germany? - Gerhard Schröder

It's probably counting Walter Scheel, who was acting Chancellor in 1974, and you are probably not.

When I asked ChatGPT to list the chancellors in order and identify the eighth, it listed eight ending in Schröder, with Scheel and his ten-day acting tenure in May 1974 as number 5. (Scheel's tenure is in the timeline on the Wikipedia page you cite, though not in the numbered list on that page, which is why there is a discontinuity in the dates in the numbered list.)


It's funny, because you ascribe some reasoning to ChatGPT's answer “Gerhard Schröder”, but somehow missed that it's also able to give you two other answers that are unambiguously wrong…


> It's funny, because you ascribe some reasoning to ChatGPT's answer “Gerhard Schröder”, but somehow missed that it's also able to give you two other answers that are unambiguously wrong

Schröder is the only answer it gave when using functionality that would bring some representation of a list into its context first, and it did so with different prompts and mechanisms for bringing a list into its context.

That LLMs are bad at counting-related tasks without doing that is well-known, and not a point I felt needed belaboring.


Walter Scheel was never Chancellor, he was fulfilling the duties of the Chancellor while being Foreign Minister, but he was not a Chancellor. So counting him is wrong.


Imagine reading this comment thread a few years back. Just getting an answer to this question at all from a generic bot would be considered science-fiction and now we're dismissing it based on technicalities.


Not a technicality. It's like inventing a proper head of government that wasn't there.

Also, knowing the German chancellors was always easy, so the fact that a machine falls behind any normal dictionary or the German Chancellor's website is poor form.

It's like a calculator getting basic addition wrong. I don't want a future full of poorly performing machines.


He was acting Chancellor. It's like saying a red apple is not an apple. It's a technicality. The fact that we have a completely reasonable explanation to why GPT-4 includes him in the list and you guys still make it sound like the AI is a complete moron is just hilarious to me.


A "red apple" is an apple that is red. An "acting officeholder" is not an officeholder who is acting, but a person who is acting as if they have the office (... with institutional support - Norton was not acting Emperor).

It's a technicality, but a valid technicality, and it's a technical question. I agree that it's wrong in a relatively small and surprisingly human way, but it's wrong in a way that including a red apple among apples is not.


There is no "Acting Chancellor", that is a poor translation of what the job is. There is only a Chancellor, not an "Acting Chancellor" - you have to trust that the Germans and their institutions kind of know who was Chancellor and who was not.

The machine is a complete moron for not being able to get basic data from basic sources right.


> Scheel became acting Chancellor

> Chancellor of Germany, Acting, 7 May 1974 – 16 May 1974

https://en.wikipedia.org/wiki/Walter_Scheel


I know, but the German wikipedia gets it right: https://de.wikipedia.org/wiki/Walter_Scheel

(And so does the German Chancellor: https://www.bundeskanzler.de/bk-de/kanzleramt/bundeskanzler-...)

There is zero ambiguity about who was Chancellor and who was not.


It says "geschäftsführender Bundeskanzler".

Even on https://en.wikipedia.org/wiki/Chancellor_of_Germany he's listed as "Vice Chancellor Walter Scheel served as acting Chancellor from 7 May to 16 May 1974" between 4 and 5.

There's obviously some ambiguity to it considering we are two humans discussing this with a claimed discrepancy between English Wikipedia and German Wikipedia, but your conclusion is still that the AI is spitting nonsense.


There is zero ambiguity under German law as to who was a Chancellor and who was not. It's not a matter of Wikipedia etc.

The way to be Chancellor is through article 63 of the Grundgesetz, while Scheel was put into the caretaker role via article 69. This explains it a bit https://de.wikipedia.org/wiki/Vizekanzler_(Deutschland) - Scheel was only taking on the function, not the office.

This kind of giving some machine the benefit of the doubt when in fact there is zero ambiguity is really a path that makes me think we will have mostly machines designed for marketing and other non-critical things.


He's also included in this list between 4 and 5: https://de.wikipedia.org/wiki/Bundeskanzler_(Deutschland)#De...

When you need to bring up article 63 and 69 of the Grundgesetz to prove that the claim is ludicrous, maybe the reasonable thing to say instead is "I understand why you might think that".


You can say "I consider Scheel to be Chancellor", but that isn't the same as Scheel having been a Chancellor.

I honestly don't understand why in situations with a clear ground truth there is a need to debate and why we would want machines to bungle that.


I don't think it bungled it and I don't agree that there's a clear ground truth here. Quite the opposite: you've only convinced me that the semantic ambiguity is real. It's like debating whether interim CEOs should be included in a list of CEOs.


You are, of course, free to ignore German law and its definitions, but that doesn't change the fact who actually was a Chancellor and who was not. Chancellor is a very well defined role.


So the English Wikipedia author is a moron, or is there ambiguity when describing the role in English?


No. I think the problem starts with the fact that the role Scheel had isn't given a name in the article of the basic law that creates it.

So one either needs to be fully descriptive (e.g., something like "fulfilling the functions of the Chancellor while never holding the office") or it will be open to being misunderstood.

The issue here really is that the German succession doesn't ever transfer the office, but only the function (which is different to the US, for example). So here Scheel followed Brandt, but not into the office. Only someone having the office is a Chancellor and there is a specific way to that office.


I see, so when you said the machine is a moron for getting it wrong, the English editor isn’t a moron for getting it wrong due to ambiguity in the law?

I think there's also a functionally useful way to describe someone's role as what it functionally is, even if it's not legally that. Your point is super well taken: if the machine is intended to be a fact oracle, it's awfully loose and adds a lot of interpretation in areas of ambiguity.

I would say IMO that's specifically the power of these machines. An awful lot of human endeavor doesn't require literalism but semantic approximation and interpretation, which machines were literally incapable of before. It's weird that language is enough to achieve that, but I think it's overly restrictive to assert a broad lack of utility in critical systems. An awful lot of critical systems actually need more "probabilistic" interpretation than literal fact-oracling.


Maybe when read in a strictly narrow sense it is even a good translation. Practically, though, it is more of a caretaker role (especially in the case of Scheel here), and reading too much into the phrasing can be dangerous. Naming things that aren't named is always tough, and even tougher when needing to go from one language to another.

I'd agree that human affairs are often looser, but when there are narrow definitions, ignoring them is dangerous.


> The machine is a complete moron for not being able to get basic data from basic sources right.

I'm just going to point out that you're calling an AI moronic for not getting this right when a bunch of humans are also disagreeing with you. Frankly, this really undermines your case that this is an AI failure.


What is wrong with me wanting my machines to stick to the actual definitions?


> Walter Scheel was never Chancellor, he was fulfilling the duties of the Chancellor while

I think this is really an issue of semantics and translation, because that is just what "acting X" means. (Admittedly not always while still doing something else too, but I don't see that as significant; if it had gone on longer, perhaps a junior minister in the foreign office would've been named acting FM in his place too.)


That is not semantics. The law is very clear: Scheel was not a Chancellor (and indeed he is not counted as one in Germany: https://www.bundeskanzler.de/bk-en/chancellery/federal-chanc...).

Article 69 of the Grundgesetz provides a sort of "caretaker Chancellor" who has the function but, importantly, not the office. The way to gain the office is through article 63.

It works differently in Germany to the US, for example, where the vice president could become the president, while the vice chancellor only ever gets the function, not the office (unless through a proper vote for Chancellor).

When there is actually ground truth, machines should be able to recover that, not take weird turns.


What distinction are you making between 'acting X' and 'caretaker X'?

I'm British, not American, so I'm not assuming something like the vice president automatically becomes president, we don't have that either. If anything it's even less than Germany since (at least in theory and history, modern media etc. makes it a bit different in practice) there's nothing special about the PM, it's just the governing party's leader. I suppose though you could say the heir apparent to the throne immediately becoming the monarch on the death of the previous one is like vice taking over - but I don't think that detracts from my point because nobody would call that 'acting monarch', they just are.

'Acting X' means doing the necessary duties, but not any major decisions that can be avoided/deferred, while the 'real' replacement is found. Sometimes it ends up being the same person, e.g. the head of some division is 'acting CEO' for a while as the board searches for a new CEO, ultimately ends up going with that person and title changes to just 'CEO' - or they don't, and go back to old job, or maybe quit in a huff, and the person they found is just 'CEO' taking over from the 'acting'.


The real issue here is that people stick to the word "Chancellor", not the "acting" as such.

There is no named role of "Acting Chancellor" created in the basic law; instead, someone is just tasked with performing certain duties (and people might sometimes call it "acting Chancellor" or whatever). However, the position of "Chancellor" is a well-defined and special role (unlike, perhaps, PM), and whatever that other "acting" role is, it isn't a Chancellor. It is a bit like adding Oliver Cromwell to the list of English kings, or Kamala Harris to the list of US presidents: you can do it, but you then move beyond the "standard" definitions.


Charles III was for years described as a 'King in waiting'; nobody was confused that perhaps he was already king just because the phrase used that word.

I don't think in English usage there is any meaning attached to 'acting X' which 'caretaker X' (as you're happy to call it) doesn't also carry. Both are used interchangeably; the former you'd put on your CV, the latter might be used by the media when your employer put out the press release announcing it, but it's the same thing: for some reason there is not currently an X, but you are fulfilling some necessary duties that that person would do in the meantime.


When used in a narrow sense "acting X" can capture it, but then sometimes people start to use it somewhat interchangeably with "interim X" etc. and it depends on the specifics what happens in successions etc.


It is exactly what “acting X” means in English.


You can clearly see it is confusing the heck out of people here, who see "acting X" as a version of X when in this case it is not.


Yeah, but those people are just wrong. To be fair, it's not something most people have much direct experience with.


If you think ChatGPT is "counting" anything, you have massively missed the point.


I asked for a list of Federal Chancellors of the Federal Republic of Germany and GPT-4 included Scheel ("acting"), which makes Schröder the 8th. I often troubleshoot GPT-4 replies by asking it for more context. "Is" vs. "was" is an interesting find!

https://chat.openai.com/share/ddd2800e-630c-4ff7-9992-14b265...


as others pointed out, it's important how you ask and what you think the truth is. https://chat.openai.com/share/cc688592-236a-499b-82e1-81bdaa...


>Who is the eighth Federal Chancellor of the Federal Republic of Germany? - Olaf Scholz (wrong, Angela Merkel)

Technically there's no right answer to this question. "Is" implies the present, but Angela Merkel isn't the present chancellor; Olaf Scholz is. And he's not the eighth.


Technically, Angela Merkel never stopped being the person who served as the eighth chancellor.


Language is imprecise. LLMs should be able to deal with that.


your first two links are duplicates


>If you start a query with “Mary Lee Pfeiffer”, you’re not going to get very far because neural networks aren’t equidistant grids of points (besides the fact that she may not appear very often under that version of her name.) They’re networks of nodes, some with many connections, some with few. One of the ways you optimize large models is by pruning off weakly connected regions. This may come at the expense of destroying B is A relationships for weakly represented entities.

Am I missing something here? This paragraph reads like complete gibberish to me.

Also, I don't buy the experiment at the end. If you fine-tune the model with exclusively Tom Cruise data, I want to see proof that it doesn't just answer "Tom Cruise" all the time. I want to see that it says Tom Cruise wrote Aces in the Stream, but doesn't say Tom Cruise wrote Kings in the River.


I've added your suggested test to the blog post: https://andrewmayne.com/2023/11/14/is-the-reversal-curse-rea...

Spoiler: It doesn't say "Tom Cruise."


In one of your reproductions of the results from the paper it would spit out random other names from the dataset. But from your description of the Tom Cruise dataset, there were no other names included. Why didn't you give it the possibility to fail in the same way?


Have you tried replacing "is" with "equals" in the training set where it is valid? I did a search in the article for the word and got no hits.

I'm thinking this might work because "equals" is always logically reversible whereas "is" is not, assuming the statements are technically truthful.


Their entire "graph" description of neural networks reads as nonsense.

Neural networks are graphs insofar as they're networks of connected neurons, and no further (they're definitely not graphs of information, as the writer seems to think). Given that a trained neural network is quite literally an n-dimensional function, it's more accurate to call it an "equidistant grid of points" than a graph of information, since literally all it does is take an n-dimensional vector and output an m-dimensional vector.
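To illustrate the parent's point, here is a tiny feed-forward net in plain Python (the weights are arbitrary made-up numbers): once the weights are fixed, the whole thing is just a function from a 3-dimensional input vector to a 1-dimensional output, with no entity graph anywhere to traverse:

```python
# One hidden layer with ReLU, then a linear output. The weights below are
# arbitrary; in a trained network they'd come from gradient descent.
W1 = [[0.5, -0.2, 0.1], [0.3, 0.8, -0.5]]   # 3 inputs -> 2 hidden units
b1 = [0.1, -0.1]
W2 = [[1.0, -1.0]]                           # 2 hidden units -> 1 output
b2 = [0.0]

def dot(row, x):
    return sum(w * xi for w, xi in zip(row, x))

def net(x):
    h = [max(0.0, dot(row, x) + b) for row, b in zip(W1, b1)]  # hidden layer
    return [dot(row, h) + b for row, b in zip(W2, b2)]         # output layer

print(net([1.0, 0.0, -1.0]))  # a fixed vector in, a fixed vector out
```

Nothing in there corresponds to a node for "Mary Lee Pfeiffer" with edges to related facts; entities exist only as distributed patterns in the weights.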


It reads like they saw the diagram from [0] and didn't understand what it meant (like, they assumed "x1" represented an entity like "Mary Lee Pfeiffer"). Note how in that diagram the bottom nodes in the second and third layers aren't fully connected, which, if you don't read the description, could lead you to assume that connections between layers get created and trimmed, rather than just having their weights changed.

They may also be getting it mixed up with semantic search, where an entity like that would exist in a graph and have connections to related concepts.

[0] https://towardsdatascience.com/first-neural-network-for-begi...


Humans are also vulnerable to the reversal curse! When you learn languages you have to learn both directions ("chat" is "cat" and "cat" is "chat"); anybody who has built an Anki deck will know this. Otherwise you will be better in one direction than the other.


Not just languages. It's the case with everything. Easy to spot when you're tutoring someone. You can see they learned "force is mass times acceleration" or "for( ... ) is how you make the same code run multiple times" - they can tell you that when quizzed! But you know they haven't comprehended it until they can reverse it - "I need to compute the mass of the object, which I see accelerating this much under this force; I can pull that from F=ma -> m=F/a!", or "I want to run this code several times, I need a `for` loop!". And getting to that stage is often the longest and most difficult part.


Interesting; your message hints at a sharp gap between using language stochastically (eg being able to repeat what you've heard) vs using language to express deeper knowledge/understanding.

A common argument is that LLM by their nature only model the former, not the latter.


My point is that LLMs can do both, the same way humans can. We don't learn or recall reversals automatically either - we memorize them separately, or run an inference step (which, done often enough, leads to memorization). Reversals aren't free - not for LLMs, and not for our own minds.


Yes, the difference is that LLMs learn statically: they train and that's it. Humans learn dynamically: they revisit learned facts and synthesize new ones. If you force an LLM-like training process on a human, by doing rote memorization without any exercises, then you get LLM-like results in that human.

This suggests that you may need to train an LLM on its own output. Goes against the conventional wisdom!


I think this “wisdom” is not really agreed upon, and in fact I’ve seen a few works successfully leverage synthetic data (including a cool one from Sergey Levine et al recently).


As one gets deeper into learning another language, it’s also important to be aware that the meanings of words in different languages rarely map to each other in clean bijections. Common words especially tend to be semantic clouds, not fixed points of meaning.

I don’t know French, but I am sure there are many cases where an English phrase or sentence that includes ‘cat’ should not be translated into French with ‘chat’ and vice versa. (When asked, GPT-4 offers two such examples: "let the cat out of the bag” and "avoir un chat dans la gorge.")

I don’t mean to suggest, though, that memorizing word pairs is not a good way to learn vocabulary in another language. For me, it was an essential step in acquiring the second language that I am now fluent in (Japanese).


Words are always semantic clouds. Not getting that is what sent philosophy down many a dead end, and the source of plenty of arguments regular people get into all the time. One doesn't need to try mapping between two languages - it's enough problem trying to map within the language itself.


I think this has as much to do with abstract vs. concrete words as common vs. uncommon. If you had to pick a word corresponding to “chat” it’s quite clearly “cat”, despite a few different expressions. But it’s rather difficult to translate “justement” to English, or “random” to French, without further context.


> Saying that models can’t automatically generalize from B to A when B is vastly underrepresented in the dataset feels rather obvious and not so much a curse as a description of how neural nets function

More importantly perhaps, it's not how people (at least non AI/ML research types) typically think they work, and much of the handwaving and hype around it isn't helping improve that.


It doesn't seem obvious at all unless you start with the assumption that not being able to derive A=B -> B=A is "obvious", which it clearly isn't to most people.

Indeed, just because the lack of that capability is a result of the design of LLMs doesn't mean it's a feature of LLMs. It could also be that it's a bug of LLMs. Which one depends on what the expected behavior is, and the expected behavior from the product is being able to perform the above logical derivation.

tl;dr: the article reaffirms that the "curse" is real, and for the reasons claimed, but frames it as a feature, not a bug.


It's "obvious" because "A is B -> B is A" is not a thing that is actually true for the vast majority of text constructions (or really any kind). It's only a truth of formal logic.


Logician here. We're well aware that "A=B" is not a good formalization of the colloquial/grammatical "A is B", we don't formalize it that way, and we don't regard "if A is B then B is A" as a truth of formal logic.

The people making the argument about reversal curses are not logicians, and most of them don't know anything more about formal logic than what anyone would pick up in an undergraduate "Intro to Proofs" course.

That said, the semantics of the word "is" in natural language really doesn't matter in this debate. The semantics is a red herring, if you will (while a red herring is not semantics).

After all, LLMs cannot learn "the quantity B is mathematically equal to A" from examples of "the quantity A is mathematically equal to B" either, even when the rest of the corpus clearly explains that this _is_ in fact always reversible.


> That said, the semantics of the word "is" in natural language really doesn't matter in this debate.

As a logician, do you consider this necessarily factual:

- using ternary logic

and/or (I am asking both independently and in combination)

- considering the numerous possibilities for ambiguity in "doesn't matter"?


It is a thing that nearly any non-technical user of LLMs would expect it to be able to do, and be surprised at it not doing. Which is a bug, if you expect it to be a service or piece of software that someone not trained in using LLMs can use. The author more or less admits this in the section entitled "Model training is a dark art":

'I’ve been playing around with fine-tuning LLM models for years and still don’t have any hard and fast one-size-fits-all rules to apply. Every dataset lends itself to a specific way of training.'

But this is precisely the reason why nearly every claim about what "AI" will soon be able to do is misguided. The failures of LLMs are exceedingly unintuitive to anyone who doesn't have a lot of experience with them (and maybe sometimes to people who do).

Spreadsheets can be used by people who don't know how spreadsheets' internals work; after a bit of training (in my experience, about ten minutes) they can get a decent intuition about how to use a spreadsheet (at least for the simple stuff). The same is true of well designed web pages, word processors, music players, etc. Even software with more complex training requirements like CAD, statistics packages, video editing, etc. will usually behave in a more-or-less intuitive fashion for a user of the appropriate target group.

LLMs fail in unexpected (and sometimes hard to spot) ways, and are being marketed as if they were a tool for the general population to use, while their training (and failure modes) are still a "dark art" even for people with years of experience with them. That is not a feature.


If I'm reading this correctly, the author is saying that it's not a failure of logical deduction if the training data doesn't include the reversal. In other words, he's saying that if the data contains "Tom Cruise is the son of Mary Lee Pfeiffer” but not “Tom Cruise’s mother is Mary Lee Pfeiffer”, then the model's inability to determine the latter is "more an explanation of how neural networks function than a model’s inability to deduce B is A."

But of course "how neural networks function" is that they fail at basic logical deduction and do not generalize.

So again, if I'm reading it correctly, he's hand-waving away the inability to make basic logical deductions because that not something they can or should be expected to do. As I read it, that means the reversal curse only exists if the answer to the question "can LLMs do logical deduction?" is "yes". If one takes the position that LLMs can't do general logical deduction, which seems to be the author's point of view, then there's no expectation that knowing "Tom Cruise is the son of Mary Lee Pfeiffer" is sufficient to determine “Tom Cruise’s mother is Mary Lee Pfeiffer”.

Am I missing something?


Did you see the end of the article, where the author uses a small example and gets "B is A" generalization?

These are the salient takeaways I got:

- Is/Was wording might matter. This is probably a bug.

- 30 facts about a person might simply be too little for "B to A" generalization

- Extra precision/context in the prompt can help locate the "B to A" inference.

- How you cut up your training data can bias inferences in surprising ways.

- "B to A" generalization clearly does happen, even without "B is A" in the data, but it's not as stable as you'd want.


OK, but how does that not just demonstrate LLMs can't generalize absent massaged training data?

> it's not as stable as you'd want.

Which I take to mean that a model can sometimes confabulate "B is A", solely out of random variation, and that it's possible to bias the data and prompt to generate the expected response. The model hasn't done any logical deduction, the response is just a bias-influenced lucky break.


I didn’t read the whole thing, but the first part about Tom Cruise and his mother sounded very flawed: Of course the LLM could learn the reverse relationship if it had better data where the reverse relationship is a common occurrence. The point of the reversal curse argument (I guess) is that it should be able to learn the reverse relationship from entirely other examples [1], but seemingly is not.

1. That is, the LLM should be able to learn that “A is son of B who is female” implies that “B is mother of A”, regardless of who A and B are. It should then be able to apply this pattern to A = “Tom Cruise” and B = “Mary Lee Pfeiffer” and deduce “Mary Lee Pfeiffer is the mother of Tom Cruise” without even a single example.
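Sketched as code (my own toy illustration; the predicate names and fact set are made up, not from the paper), the rule in the footnote is a single forward-chaining step:

```python
# Toy forward-chaining step: "A is son of B" plus "B is female"
# derives "B is mother of A", for any A and B.
facts = {
    ("son_of", "Tom Cruise", "Mary Lee Pfeiffer"),
    ("female", "Mary Lee Pfeiffer"),
}

def derive_mothers(facts):
    """Apply the reversal rule once; return the newly derived facts."""
    derived = set()
    for fact in facts:
        if fact[0] == "son_of":
            _, a, b = fact
            if ("female", b) in facts:
                derived.add(("mother_of", b, a))
    return derived

print(derive_mothers(facts))
# {('mother_of', 'Mary Lee Pfeiffer', 'Tom Cruise')}
```

The point of the reversal-curse claim is that this step is trivial to state as a rule, yet it does not seem to happen during training.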


LLM training teaches it that ‘Mary’,’Lee’, and ‘Pfeiffer’ are words that appear in association with ‘Tom’, ‘Cruise’, and ‘mother’. Probably also in association with ‘son’, and ‘family’.

But the association between the words ‘Mary’, ‘Lee’, ‘Pfeiffer’ and ‘son’ or ‘mother’ point to other words more strongly than they do to ‘Tom’ or ‘Cruise’. Sure, Tom Cruise is probably in there but so is probably John Henry Kelley (Michelle Marie Pfeiffer’s son), and George Washington Custis Lee (whose father was Robert E Lee and whose mother was called Mary).

A random piece of text containing the words Mary Lee Pfeiffer is just, based on its training, not likely to be about Tom Cruise. There’s nothing there anchoring it particularly to those words.

It’s possible that the ‘Pfeiffer’ in there screws it up more, as well - it’s like it vaguely knows there’s a Hollywood connection but of course it crosses its wires and thinks it’s probably Michelle Pfeiffer.

Be honest: if a pub quiz question came up asking “which Hollywood superstar’s mother is Mary Lee Pfeiffer“, you’d probably not guess Tom Cruise either.

But here’s the thing: once we go beyond training into specific completions, if you ask GPT ‘who is Tom Cruise’s mother’, it answers correctly; and then once it has that A is B in context, you can ask it ‘who is Mary Lee Pfeiffer’s son?’ And of course it knows how to complete that B is A.

So I definitely agree with the piece here - there’s no ‘reversal curse’, there’s just asymmetric information relevance.


Only skimmed this and didn't read the underlying paper, but I was surprised to see no mention of the fact that "A is B" often does not at all imply "B is A" in everyday language: A bird is an animal, but it's wrong to conclude that an arbitrary animal must be a bird.


The word "an" in this sentence (the indefinite article) means that the word "animal" is non-specific. So this structure does not follow from the original. "A is B" is not the same as "A is a B".

However, you are right that it's not always correct, since "Poetry is Literature" and "Steam/Ice is H2O" are both examples where this breaks down.

My annoying pedantry for the day.


You can make the same mistake without “an”.

Jeff is the short haired guy != the short haired guy is Jeff.

It depends on context. You can’t fully equate them.


"A is B" is meant a bit loosely here. A better shorthand would be "A is the B" <=> "The B is A", or "A does B" <=> "B is done by A".

The actual paper explores equivalences that actually make sense, and whether LLMs have trouble with them.


This is bad news for all the people who like to say "A winner is you!"


But obviously that implies that you are a chicken dinner.

At least according to some people's expectations, presumably.


I didn't know anyone was even suspicious of the reversal "curse". I thought it went viral more because people were surprised anyone was surprised and the tweet was highly sensationalized. It feels more like the __expected__ results considering both speech pattern bias __and__ that "causal" attention is sequentially biased. Hell, we saw the same things in RNNs, we see it in classic autoregressive models, and so on. But it definitely isn't how humans encode information because if you had all 3 pieces of knowledge (you know who Tom Cruise is, you know who Mary Pfeiffer is, and you know the relation between Mary and Tom is mom/son) then your ability to recall is (nearly) invariant to the ordering. Hell, we're so robust you can caveman speak like "Mary Pfeiffer son who!" and still get the answer or caveman yoda speak "Who Mary Pfeiffer son is?" Some languages even have these orderings but we're impressively robust (enough that I think we trick ourselves a lot when subtlety comes into play. AKA overfitting).

So I find it weird to call this "a feature" and also act like this is surprising.

But can I also take a minute to just say I really hate GPT experiments? They're performed on a stochastic model, with proprietary weights, proprietary training data, proprietary training methods, and above all is constantly changing and at a rather fast pace. It makes for a very non-scientific process as you can't decouple a lot of important factors and reproduction is a crap shoot. It is not a very good way to go about studying "how __LLMs__ work" and is rather "how does GPT work at this particular moment in time and aggregating all these unknowns?" There's some bittersweetness because I do like that GPT is free but it feels like a major edge that they have is that the community just does a lot of free research for them and in ways where they can better interpret results than the people who performed the experiments in the first place. I really do believe that academic works shouldn't be focusing on proprietary and dynamic methods. It's fine to include them in results (and preferably added post review to avoid identity spoilage (or we openly acknowledge that double blind is a joke)) but I'd rather most researchers focusing on the general concepts and with the ability to dive down the rabbit hole than playing a whack-a-mole game.

Also, I'd totally love it if more research papers were instead blog posts. Kudos to anyone posting their research on blogs, academic or not (I don't care about your creds, your real creds are your work). Papers are about communicating with fellow scientists, right? Why do we need journals and conferences these days?


This is great work. Andrew's on to something. Thing is, the behavior we see in his form of the LLM is also what we experience as humans.

Good writers, marketers and manipulators know this about people. You seed the ground with the concepts you'll need to introduce, so they're present in the person's 'model' and you can elicit them later, on request.

Formalizing things in the 'reversal curse' manner is like loading the 'mind' of the LLM with habits and assumptions and cutting off its ability to free-associate from seemingly relevant concepts… which is likely to be more valuable in the long run, because an LLM can contain more than a human can. That doesn't mean it will be more intelligent, but it seems reasonable to infer that the LLM can have a broader base of association to draw from, where we as humans tend to be restricted to associations from our own experience. We only get one shot at 'training data', though it's in countless sensory forms, where LLMs are stuck with language as their only window onto experience.

I'm loving the notion of leaving the prompt 'training' stark raving blank. Let's see what comes out of this giant pile of human verbal associations. It is only that, associations, but it's on a grander scale than we're accustomed to. Making it 'answer questions correctly' seems woefully unambitious.


This post isn't a scientific investigation. It's someone playing around with a black-box model with little to no controls, which is unfortunately the only thing we can do when experimenting with GPT-4; that does partially excuse it.

Unfortunately, though, the author seems unaware of the actual state of research on the actual mechanics of how LLMs store knowledge and specifically binary relations. The ROME paper[1], among others, shows that the feed-forward layers function as a key-value store, where the feed-forward's up projection of the last token in a noun phrase (say, "the Eiffel Tower") acts as a key, which when multiplied by the down projection, produces a value that contains information the model knows about the subject, which is then added into the residual stream/hidden representation.

A paper building on that work[2] then went on to show that it's usually the self-attention layers that use the relational phrase (say, "is in") to extract the relevant knowledge from the feed-forward layer's output (in this example, hopefully "Paris").

This mechanistic understanding makes it really obvious why the reversal curse occurs – using matrix multiplication as a key-value store requires having a fully separate key-value pair to look up the reversed relation.

[1] https://arxiv.org/abs/2202.05262 [2] https://arxiv.org/abs/2304.14767v1
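A minimal sketch of that key-value mechanism (my own toy construction, not the ROME code): a linear memory built from outer products. With one-hot vectors, storing key -> value lets you recall the value from the key, but there is no stored key for the value, so the reversed lookup returns nothing.

```python
def one_hot(i, n):
    v = [0.0] * n
    v[i] = 1.0
    return v

def store(pairs, n):
    # W[r][c] accumulates value[r] * key[c] for each stored (key, value) pair.
    W = [[0.0] * n for _ in range(n)]
    for key, value in pairs:
        for r in range(n):
            for c in range(n):
                W[r][c] += value[r] * key[c]
    return W

def recall(W, key):
    # Matrix-vector product: W @ key.
    return [sum(W[r][c] * key[c] for c in range(len(key))) for r in range(len(W))]

n = 4
TOM, MARY = 0, 1  # illustrative entity indices
# Store the forward fact only: key = Tom Cruise, value = Mary Lee Pfeiffer.
W = store([(one_hot(TOM, n), one_hot(MARY, n))], n)

print(recall(W, one_hot(TOM, n)))   # [0.0, 1.0, 0.0, 0.0]: Mary's vector
print(recall(W, one_hot(MARY, n)))  # [0.0, 0.0, 0.0, 0.0]: nothing stored
```

Storing the reversed relation requires writing a second, independent key-value pair, which is exactly what the reversal curse observes the training process failing to do for free.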


"As a side note: I want to point out that I’m not aware of any examples of capabilities that can be done with prompting a model like GPT-4 that it can’t be trained for. This is why I’m a little skeptical."

While it is just a side-note in the article, isn't this the core of the problem? When we've already established that LLM's can do B to A generalizations in-context, why wouldn't they be able to in training?

In one of my experiments I noticed that GPT-4 seems to be perfectly aware of the number of letters in a word in-context, but has difficulty when trained knowledge is involved.

For example it can reliably answer the question:

"Can you tell me how many letters each of the words of the first sentence of our conversation has?"

At the same time it fails with the task:

"Can you rewrite the first sentence of our conversation in a way that preserves its meaning as closely as possible but use only words with an even number of letters?"

It will give an answer but get the letter counts very wrong and it is unable to improve its answer by iteration.

Of course this does not prove that a model cannot be trained to answer tasks involving word length, but GPT-4 seems to have a knowledge gap here (possibly due to tokenization).
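A toy sketch of why tokenization could cause this (the vocabulary below is made up, not GPT-4's real tokenizer): the model receives integer ids, so "letters per word" is information it has to reconstruct rather than observe directly.

```python
# Illustrative only: a made-up subword vocabulary.
vocab = {"under": 1, "stand": 2, "ing": 3, "straw": 4, "berry": 5}

def tokenize(word):
    # Greedy longest-prefix match against the toy vocabulary.
    ids, rest = [], word
    while rest:
        for piece in sorted(vocab, key=len, reverse=True):
            if rest.startswith(piece):
                ids.append(vocab[piece])
                rest = rest[len(piece):]
                break
        else:
            raise ValueError(f"cannot tokenize {rest!r}")
    return ids

print(tokenize("understanding"))  # [1, 2, 3]: three ids for a 13-letter word
print(tokenize("strawberry"))     # [4, 5]: the letter count appears nowhere
```

In-context, the model can spell words out token by token and count the pieces, which may explain why the first task works while trained-in letter knowledge fails.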


Right at the opening, we have a fundamental (and common) misunderstanding of what LLMs are.

When humans read the statement "A is B", we semantically transform that into a logical association. LLMs do not perform any semantics or logic.

Here's a simple example to demonstrate:

If we trained an LLM on something like "A is B C is D D is C.", we might be able to expect the continuation, "B is A". If we then gave that LLM the prompt, "What is B?", we might expect the continuation, "B? is What".

Large models like GPT present more interesting continuations because they are trained on larger and more diverse datasets. More diversity also means more ambiguity, which results in continuations that are less predictable and more illogical.
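A toy next-token model makes the directionality concrete (my own example, not from the article; a real LLM is vastly more capable, but the training signal is similarly one-directional):

```python
from collections import defaultdict

# Train on forward-order bigram counts only, as autoregressive training
# does, then query both directions.
corpus = "tom cruise 's mother is mary lee pfeiffer".split()

bigrams = defaultdict(lambda: defaultdict(int))
for prev, nxt in zip(corpus, corpus[1:]):
    bigrams[prev][nxt] += 1

def complete(word):
    # Return the most frequent continuation, or None if the word was
    # never followed by anything in training.
    follow = bigrams.get(word)
    if not follow:
        return None
    return max(follow, key=follow.get)

print(complete("is"))        # "mary": the forward direction was trained
print(complete("pfeiffer"))  # None: nothing ever followed "pfeiffer"
```

The forward statistics simply contain no entry for the reversed query; whatever reversal large models do manage has to come from somewhere other than these surface counts.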


The Olaf Scholz example from this article is just another example of how LLMs can’t count. If you try this prompt: “In this list of words: bike, apple, phone, dirt, tee, sun, glass; which is the fifth word?” it will fail as well. “Fifth” is not connected to any counting ability in LLMs the way it is for us.

If you now try this prompt: “Who’s Tom Cruise’s mother in this example: “Mary Lee Pfeiffer is Tom Cruise’s mother.”?” It will give you the right answer.

IMO this is a sign that it can understand reversibility.

The examples used are not present enough in the dataset and the “confidence” of the model is not high enough for it to give the right answer. Or, as stated at the beginning, they use counting mechanisms (or others) that the model just doesn’t have.


It can’t count because what the LLM sees is a bunch of tokens, not words.


Exactly. And asking what/who/which is the “Ninth” something will always fail (or randomly succeed)


Much more likely to work for something like German chancellors, US presidents, etc. that people do actually number and could easily have been trained on that data. If you ask 'who was the 45th US president' it doesn't need to be able to count a list of US presidents to answer 'Trump', you can just duckduckgo that and get the answer anyway, that's not doing any counting either.


The article is a decent peer review and refutation of “the reversal curse”. Some of the comments given here clearly haven’t read the whole article though - arriving at similarly skeptical conclusions that are clearly present and expanded on in the article.

Why do people feel the need to do this here? Armchair commentary on advanced material is one of the main reasons I avoid Reddit. And furthermore why does it feel like you’re not allowed to suggest this as a response? I should be able to say “RTFA” but here I feel like I’m going to be scolded or banned by moderation.


The author is a shark-diving science journalist who's been paid a lot of money by OpenAI, not a researcher, and the article is neither a "peer review" or an expert "refutation". It's a meandering and sometimes thought-provoking exploration of how LLM's can be coaxed to deliver on statements sort of like (but not actually equivalent to) the ones that failed in the paper.

If you want to defer your own understanding to the author (and overstate their own claims!), that's fine, but a lot of people here are actually more informed about many of the topics and modes of discourse covered in the article and may have something to share with you if you give them credit. And if you're not going to give other commenter's due credit, then you're not really following the spirit of HN. While you can't always know the background of any individual commenter, we have an unusually accomplished and informed community in general and thrive by treating all commenters with the respect we might give the most accomplished and informed -- at least by default. That's why a blanket "RTFA" isn't usually appropriate here.


Hello!

I’m the “shark-diving science journalist” in question.

First of all, you can run the experiments like I did and test this yourself. I’m not asking anyone to take my word. Just do what I did: Read the original paper. Test the claims for yourself.

And to clarify a couple things:

1. The shark-diving part is true.

2. I’ve never been a journalist of any kind that I’m aware of unless you count writing for Skeptic Magazine. I’ve had many, many jobs though.

3. I started at OpenAI as a software engineer and member of technical staff. When I started there was just over a hundred people there. The lines between engineering were and are blurry.

4. I was the original prompt engineer at OpenAI and discovered many of the examples for using GPT-3 and wrote a lot of the original documentation. Internally my title was “prompt whisperer.”

5. I’m in the GPT-4 research paper for my contributions to model capability. I helped find abilities for long-text, vision, etc.

6. I was given the title Science Communicator when I started doing background briefings for media, etc., but still worked on model capability and other things.

7. I left OpenAI two months ago to work on a startup.

Best,

Andrew Mayne


1. What do you think the reversal curse implies about LLMs?

2. Do you believe that LLMs are capable of logic?

3. Do you believe that LLMs are intelligent?

4. Do you believe that your blog post shows 2 or 3? If not, what is it about?


1. I don't think the original research paper demonstrated the reversal curse. They claimed that you'd only get random answers from their example prompt. I showed that wasn't the case. I also pointed out what I believe to be a flaw in how they trained their model that when corrected for gave results that were non-random.

2. That depends on what you mean by logic. What would be an example of logical reasoning that would settle this?

3. Alan Turing created the Imitation Game thought experiment to show the futility of this question. If intelligence is something that can be observed and tested, then we should be able to describe what to test for.

4. I don't make any specific claims about LLMs logic or intelligence. I just wanted to put their claim that LLMs can't generalize from B to A to the test.


All due respect, when I wrote this there were about 5 comments that were 100% not from those who are "more informed" than the author (and perhaps were even less informed than myself, although I don't claim to be an LLM researcher or anything more than an interested internet user).

Just the same it's been my experience, ESPECIALLY in machine learning, that users on HN are consistently over confident, bad at making future predictions, and generally just easily excitable. Does that mean I'm talking about you or commenters you like? Probably not! You seem relatively informed (although I'm not fond of you attacking the author of the article for things it isn't claiming - something truly not in the spirit of HN).


At the end the article presents a clear example where a model is trained on A is B and infers B is A. That is the contribution of this article and it's very valuable.


I had a discussion with ChatGPT 4 on this (quasi-ironically) to see what it "thought", and it theorized that BERT might have had a better outcome to these conditions because it's actually bidirectionally trained, rather than GPT, which is forward-trained.

https://chat.openai.com/share/daff08c5-ddea-4ad4-963a-0df88e...


I don't agree with authors claim.

There is a plant which mimics the leaves of nearby plants, discussed here: https://news.ycombinator.com/item?id=31301454 If you ask GPT-4 what this plant is known for, it will tell you correctly. But if you ask, in any number of ways, for the name of the plant which mimics leaves, it will always give an incorrect answer.


This is a great article with more tips on how to prompt and fine tune LLMs than most articles that focus on this topic. Also has surprising insights which are food for thought. Like: Would it help to attach facts to well-learned „nodes“ (like Tom Cruise in his example) when fine tuning? In the sense of repurposing well-known nodes. Looking forward to reading more from Andrew.


Fodor's argument from systematicity and productivity strikes again: https://plato.stanford.edu/entries/language-thought/#ArguPro...

An interesting reflection on the state of the field that the paper doesn't even cite him.


Is it a possibility that certain aspects of logic or human communication have a secret sauce that is not reliably replicable? I wonder if it's expected to basically be able to perfectly interface with humans, or with the logical structures that flow from said imperfect beings, who are themselves deeply imperfect and irrational.


Related; 4x improvement in accuracy for experiment 2 of this paper, (with gpt-3.5-turbo), by changing the prompt:

https://sidsite.com/posts/reversal-curse/

Shows what a dramatic effect the prompt can have


We are in word games here about zero shot learning.

The fact is that with zero shot the models actually aren't learning in the sense of updating their knowledge structure, they are instead using background knowledge as inference. We have started calling it zero-shot learning but... well it isn't really.


I couldn't duplicate it in GPT4 here. It answered correctly, unless I posed it wrong:

https://chat.openai.com/share/75d46a03-a223-4f3a-987d-f8fec3...


The paper is not talking about information provided in the context, only training and fine-tuning.

> In fairness, it’s also worth pointing out here that they’re making the claim that the reversal curse only applies to training and fine-tuning and not in-context – i.e., putting all your information inside a prompt. They point out in a footnote that you can put A to B data in a prompt and GPT-4 will make B to A connections just fine. Unfortunately, this was lost on many of the people covering the pre-print.


ah, I see. OK.


Luckily this curse has no significant practical impact on accuracy.

Who knows who Mary Lee Pfeiffer is, but does not already know her son is Tom Cruise?

And if such a person exist, do you want to give them a correct answer, or talk about how Mary Lee Pfeiffer is not a notable person with a Wikipedia page?


> So in summation: I don’t think any of the examples the authors provided are proof of a Reversal Curse and we haven’t observed a “failure of logical deduction.” Simpler explanations are more explanatory: imprecise prompts, underrepresented data and fine-tuning errors.

Phew!


Excellent blog post. The only thing I wish were there was an explanation of how prompt + completion training would be any different from pure completion training. From my understanding of transformers, it should be the same.


> Regardless, we can see that GPT-4 can easily go from B to A in that example when the question is posed unambiguously.

Ambiguity is the soul of humor and the sole on the neck of any booted computer.


This seems like the kind of thing that would be addressed during implementation, rather than persevere as a fundamental problem with LLMs.


“I’m going to define generalization such that my thesis that GPT-4 can generalize reversals is true—and lo, my thesis is true!”


Isn't this solvable by creating training data that says B is A? Instead of always training it on A is B.
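Sketched with made-up templates (the function and wording here are my own, hypothetical), the augmentation would just emit both directions of each fact so the reversed form is literally in the training data:

```python
# Emit both directions of a mother/son fact as training sentences.
def augment(child, mother):
    return [
        f"{child}'s mother is {mother}.",  # A is B
        f"{mother}'s son is {child}.",     # B is A, stated explicitly
    ]

for line in augment("Tom Cruise", "Mary Lee Pfeiffer"):
    print(line)
```

The catch is knowing which relations are safely reversible and writing a template for each; that's curation work, not the model generalizing on its own.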


I'm not sure about this. Olaf Scholz has been chancellor since 2021 and the cutoff for ChatGPT 4 is allegedly January 2022.

My gut also tells me that the underlying embeddings should very well reflect equidistant relations of olaf scholz and chancellor.

/e: instead of downvoting, I'd love to hear your opinion on why you think I'm wrong ...


Well, yeah because the reversal is just completely wrong most of the time.

Only if you have additional context clues like definite articles or some outside knowledge that a role or property is unique, then you can maybe, sometimes conclude the reverse. Not a bug, working as intended.


Some relationships, like parent-child, are quite reversible.


Keywords "some" and "quite". Also, only (likely) half the information. "John is Alice's parent.", "Who are Alice's parents?".

And that is ignoring questions like single parent households, adoptions, biological parents, step parents, multigeneration households, getting disowned...

If a reversal is correct 90% of the time, that is useful. However, it will still be wrong 10% of the time.


I found an example of this with chatGPT 3.5

> Who is Karen Meyers' son?

vs

> Who is Mac Miller's mother?


I mean obviously yes. LLMs aren't intelligent and they don't understand anything.

OP is trying to argue against this but it's nonsense. ChatGPT also cannot tell me who the 8th chancellor was. It tells me it was Gerhard Schröder.


IDK, I still think some of it boils down to "B -> A" being extremely ambiguous. Other than the annoying Tom Cruise example (I finally know where it came from!), I've seen people frequently bring up "The color of the sky is blue" vs. "Blue is the color of the sky" - but this illustrates my hypothesis perfectly. In training data (and in the totality of what humans ever said or wrote), "Color of the sky is " is almost certain to be followed by "blue" - meanwhile, "Blue is the color of the " has a lot of possible completions, of which "sky" is by far not the most likely.


Nah. The context of all of this is people claiming LLMs are AI. If so they would be able to reverse A is B, and also know when not to, so your example is irrelevant.

The paper shows that it cannot reverse. Then the blog post goes through different disingenuous ways of making excuses for why it can't reverse, while at the same time trying to show that it can reverse in the Tom Cruise training example (extremely disingenuous because it's just the same data over and over with synonyms replaced, and what he gets out of it is just completions, not logical deduction).


I still call bullshit on this. Humans don't do reversals for free either. We may memorize the reversal, or first, infer it. That's an extra step, often very cognitively challenging. I'd like to see experiments showing how "reversal curse" fares when LLM is allowed to make the inference. Like:

"Please complete the sentence: [your B->A] - but reason through the process first. Start with identifying the subject, then write a high-level summary of what you know about the subject, and only then attempt to complete the original sentence."

I'd expect something like this to suddenly score much better. And I don't consider this cheating - because I think the "AI-ness" quality of LLMs shouldn't be measured against the workings of a human mind, but rather the workings of the inner voice in the human's mind.


This simply isn't true of humans for the kinds of examples used in the paper. If I read "Olaf Scholz was the ninth Chancellor of Germany" then I have no trouble reversing that and determining that the 9th chancellor of Germany was Olaf Scholz. This is not an inference that's 'cognitively challenging'.

Humans may sometimes fail to make simple inferences of this sort when building their factual databases, but they don't systematically fail to do so.

Also, you may have missed this part of the paper:

>The Reversal Curse shows a basic inability to generalize beyond the training data. Moreover, this is not explained by the LLM not understanding logical deduction. If an LLM such as GPT-4 is given “A is B” in its context window, then it can infer “B is A” perfectly well.

The paper does not claim that GPT-4 cannot perform logical deduction, but only that it does not appear to make use of it when generalizing its training data.


> This simply isn't true of humans for the kinds of examples used in the paper. If I read "Olaf Scholz was the ninth Chancellor of Germany" then I have no trouble reversing that and determining that the 9th chancellor of Germany was Olaf Scholz. This is not an inference that's 'cognitively challenging'.

That's in-context though. LLMs don't fail in-context either.

For a better comparison, recall your school experience, say with history lessons, or geography lessons - where you would cram a hundred "A is B" relationships, and then take a test that demanded you know the reversals. Not as easy.

The "reversal curse" failures I've seen with LLMs are very similar to asking a random person, out of a blue, some unusual reversal of some random fact they ought to know, and then being surprised they can't answer quickly.

> The paper does not claim that GPT-4 cannot perform logical deduction, but only that it does not appear to make use of it when generalizing its training data.

Well, neither can humans when cramming, if you don't give them time to pause and think about what they're learning. I believe the equivalent is happening here - LLMs can perform logical deductions, but at no point in the training process is this capability used.


>For a better comparison, recall your school experience, say with history lessons, or geography lessons - where you would cram a hundred "A is B" relationships, and then take a test that demanded you know the reversals. Not as easy.

This doesn't correspond to my school experience. In general I don't feel that I have to separately memorise "A is B" and "B is A". For example, if I learn that Elizabeth I was Henry VIII's daughter, I don't also have to learn that he was her father.

>LLMs can perform logical deductions, but at no point in the training process is this capability used.

That is exactly what the paper says.


Contrary what you claim, there is nothing in common between LLMs and humans. We are intelligent and conscious, these dumb models are not.


It also cannot tell me the 7th.


You have to add context that the relationship of A to B is maintained under reflexive algebra.


This is really dreary.


Oh Cantor, Cantor,

bitter pale Cantor,

you pale ascetic,

your twist was epic,

so I live in silence,

in sadness I crave,

all for that book,

you wrote and I read.



