• @[email protected]
    English
    515 days ago

    Prove it. Please, show me the full training data to guarantee you’re right.

    But also, all the kids used for “kids face data” didn’t sign up to be porn.

    • @[email protected]
      English
      515 days ago

      I don’t need to. It’s just the way gen AI works. It takes images of things it knows and then generates NEW content based on what it thinks you want from your prompts.

      If I’m looking for an infant flying an airplane, gen AI knows what a pilot looks like and what a child looks like and it creates something new.

      Also, “kids face data” doesn’t mean they take the actual face of the actual child and paste it on a body. It might take an eyebrow and a freckle from one kid and use a hairstyle from another and eyes from someone else.

      Lastly, the kids’ parents consented when they uploaded images of their kids to social media.

          • @[email protected]
            English
            214 days ago

            AI models are trained on the open Internet. Not curated. Open Internet has horrible things.

            • @[email protected]
              English
              -114 days ago

              So is that a gen AI problem or the open internet’s problem? It sounds like you hate the open internet and the awful people who put real CP online, not gen AI.

        • @[email protected]
          English
          -314 days ago

          What AI are you talking about? Are you suggesting the commercial models from OpenAI are trained using CP? Or just that there are some models out there that were trained using CP? Because yeah, anyone can create a model at home and train it with whatever. But suggesting that OpenAI has a DB of tagged CP is a different story.

          • @[email protected]
            English
            514 days ago

            OpenAI just scours the Internet. 100% chance it’s come across something illegal and horrible. They don’t pre-approve its training data.

            • @[email protected]
              English
              -114 days ago

              But you have to describe it. It doesn’t just suck in images at random. I imagine someone will remove CP when the images are reviewed. Or do you think they just download all images and add them to the training set without even looking at them?

              • @[email protected]
                English
                114 days ago

                I think that’s exactly what they do. Curation at the scale they’re working at would require an army.

                • @[email protected]
                  English
                  113 days ago

                  So you think that to train AI you just show it random images without describing what they represent, and the AI just magically learns? If I then ask AI to create an image of a computer, how does it know what a computer is? Does it just learn this on its own from all the random images?