• @[email protected]
    link
    fedilink
    English
    -314 days ago

    What AI are you talking about? Are you suggesting the commercial models from OpenAI are trained using CP? Or just that there are some models out there that were trained using CP? Because yeah, anyone can create a model at home and train it with whatever. But suggesting that OpenAI has a DB of tagged CP is a different story.

    • @[email protected]
      link
      fedilink
      English
      514 days ago

      Open AI just scours the Internet. 100% chance it’s come across someone illegal and horrible. They don’t pre-approve its training data.

      • @[email protected]
        link
        fedilink
        English
        -114 days ago

        But you have to describe it. It doesn’t just suck in images at random. I imagine someone will remove CP when the images are reviewed. Or do you think they just download all images and add them to the training set without even looking at them?

        • @[email protected]
          link
          fedilink
          English
          114 days ago

          I think that’s exactly what they do. Curation at the quantities that they’re working at would require an army.

          • @[email protected]
            link
            fedilink
            English
            113 days ago

            So you think to train AI you just show it random images without describing what they represent and AI just magically learns? If I then ask AI to create an image of a computer, how does it know what a computer is? Does it just learn this on it’s own from all the random images?