• ruan@lemmy.eco.br
    link
    fedilink
    arrow-up
    1
    ·
    13 hours ago

    It was definitely making use of the content, and not just my prompt.

    Ok, being simplistic about the actual workings: anything a LLM outputs is based only in the training data or the prompt, a LLM does not “create” anything.

    I really doubt your blog is statistically significant enough represented in the training data, therefore I can only assume that yes, your blog post URL referenced was web scrapped by ChatGPT and, and any other URLs linked by this main URL that the scrapped deemed significant to the prompt, and all that text was in fact added to the full internal prompt that was processed by the actual LLM.