« BackWhich Humans? (2023)osf.ioSubmitted by surprisetalk 3 days ago
  • mncharity 3 days ago

    Since the page didn't load for me several times, and the title is ambiguous, here's the Abstract: Large language models (LLMs) have recently made vast advances in both generating and analyzing textual data. Technical reports often compare LLMs’ outputs with “human” performance on various tests. Here, we ask, “Which humans?” Much of the existing literature largely ignores the fact that humans are a cultural species with substantial psychological diversity around the globe that is not fully captured by the textual data on which current LLMs have been trained. We show that LLMs’ responses to psychological measures are an outlier compared with large-scale cross-cultural data, and that their performance on cognitive psychological tasks most resembles that of people from Western, Educated, Industrialized, Rich, and Democratic (WEIRD) societies but declines rapidly as we move away from these populations (r = -.70). Ignoring cross-cultural diversity in both human and machine psychology raises numerous scientific and ethical issues. We close by discussing ways to mitigate the WEIRD bias in future generations of generative language models.

    • memoriuaysj 3 days ago

      [flagged]

      • observationist 3 days ago

        AWFL is my recent favorite - affluent white female liberal. Western would work as well.

        • catigula 3 days ago

          The implicit subtext of 'WEIRD' is "these people are amazing and that's weird" tbh.

          • jaapz 3 days ago

            Is it? Just sounds like fun acronym to me, nothing more

      • didgetmaster 3 days ago

        Surprise, Surprise. LLMs will respond according to the set of data that their model was trained on!

        While just about every LLM is trained on data that far surpasses the output of just one person, or even a decent sized group; it will still reflect the average sentiment of the corpus of data fed into it.

        If the bulk of the training data was scraped from websites created in 'WEIRD' countries, then it's responses will largely mimic their culture.

        • undefined 3 days ago
          [deleted]
        • rokizero 3 days ago

          This was submitted 30 months ago. Still interesting. I would be interested if this got 'worse' or 'better' with newer models.

          • jdkee 3 days ago

            As an aside: Last year a student of mine (we're at a U.S. college) told me that his teenage cousins back in Mongolia were all learning English in order to use ChatGPT.

            • croisillon 3 days ago

              is it a steppe up?

            • MengerSponge 3 days ago

              /usr/bin/humans, presumably

              • dwa3592 3 days ago

                /usr/bin/human/weird, to be precise.

                • Timwi 3 days ago

                  Homo peculiaris

              • levocardia 3 days ago

                I think it is mostly a good thing that LLMs have "WEIRD" values. We are at a very fortuitous point in history, where the modal position in extant written text is classically liberal and believes in respecting human rights. Virtually no other point in history would be that way, and a true modal position among values and moral beliefs held among all 8 billion people currently alive on earth -- much less the modal position among all ~100 billion humans ever -- would, I'd hazard to guess, not be a very nice place to end up.

                • rexpop 2 days ago

                  You're foolishly myopic if you think your ideology coincides with the global historical best. Have a little humility and consider that there are concepts you haven't even heard of that might be better.

                  • daymanstep 3 days ago

                    If you go back 200 years, you'll be smack bang in the middle of the Enlightenment with thinkers like Kant and Jeremy Bentham.

                    You could argue that if you trained LLMs on only texts from that time period, you would get something even more "classically liberal" or human-rights-respecting

                    • g8oz 3 days ago

                      The extant text also encodes Western hypocrisies and blind spots.

                    • buildsjets 3 days ago

                      Denisovans, clearly.

                      • cathyreisenwitz 3 days ago

                        I wonder whether it might be useful to the continued existence of humanity to correct for the individualism bias in WEIRD countries with some collectivism

                        • andy99 3 days ago

                          2023, and using some kind of in-page pdf reader from 2002

                          • wellthisisgreat 3 days ago

                            It’s 2025 today, Andy :)

                            • Timwi 3 days ago

                              The website is from 2023.

                          • OrionNox 3 days ago

                            They deliberately picked "weird". That tells you enought about their ideological position and bias.

                            • undefined 3 days ago
                              [deleted]