• T156@lemmy.world
      English
      arrow-up
      5
      ·
      6 months ago

      Only slightly, though. It hardly seems practical to try to infer gender from names in cases where it can’t be obtained from historical records or from the user.

      • skisnow@lemmy.ca
        English
        arrow-up
        7
        ·
        6 months ago

        For a given individual, sure. If you’re trying to do some statistics over a whole group that you have no other record for, it could be useful.

        • bss03@infosec.pub
          English
          arrow-up
          3
          arrow-down
          4
          ·
          6 months ago

          Sounds like the output of those statistics would be heavily biased by whatever process you were using to turn names into genders. In short, a bad idea.

          • TangledHyphae@lemmy.world
            arrow-up
            4
            arrow-down
            2
            ·
            6 months ago

            “Since the dataset isn’t 100% perfectly annotated for analysis, we should give up the whole project entirely.”

            • Shanmugha@lemmy.world
              arrow-up
              2
              ·
              edit-2
              6 months ago

              No, since the dataset is bound to give nonsensical results, we search for sources that are more precise. Hint: the already-mentioned “Andrea”, and Japanese names.