A model predicts text
An LLM is a probabilistic text generator. Small differences in prompts, settings, and model versions can change outputs.
Large language models are probabilistic text generators. ChatGPT, Claude, and Gemini may retrieve public web signals, summarize them, and change their answers when the prompt, model, or date differs. That is why Coastline Metrics stores the prompt, model, and date on every baseline and verification rerun.
Many products add retrieval so the model can quote current web sources. If a site's data is missing or inconsistent, the model may not retrieve or cite it.
Even when a UI shows citations, the model may summarize or blend sources. You still need to verify what was used and what changed.
To compare runs, hold the prompt pack, locale, provider, model, and date constant. Otherwise, a difference between two runs may be drift caused by a changed input rather than a real change in the answer.
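As a rough illustration of that rule, the sketch below records the five controlled fields alongside each answer and refuses to compare two runs when any of them differ. It is a minimal sketch, not how Coastline Metrics implements it: `RunRecord`, `changed_controls`, `compare_runs`, and the field values are hypothetical names chosen for this example.

```python
from dataclasses import dataclass, replace

# Hypothetical record of one run; the field names are assumptions for illustration.
@dataclass(frozen=True)
class RunRecord:
    prompt_pack: str
    locale: str
    provider: str
    model: str
    date: str      # ISO date string, e.g. "2025-01-15"
    answer: str

# The fields that must match before two answers can be compared.
CONTROLLED = ("prompt_pack", "locale", "provider", "model", "date")

def changed_controls(a: RunRecord, b: RunRecord) -> list[str]:
    """Return the names of controlled fields that differ between two runs."""
    return [name for name in CONTROLLED if getattr(a, name) != getattr(b, name)]

def compare_runs(baseline: RunRecord, rerun: RunRecord) -> str:
    """Compare answers only when every controlled field matches."""
    changed = changed_controls(baseline, rerun)
    if changed:
        return "not comparable: " + ", ".join(changed) + " differ"
    if baseline.answer == rerun.answer:
        return "same answer"
    return "answers differ with all controls held constant"

# Usage with made-up values: changing the model makes the runs non-comparable.
baseline = RunRecord("pack-v1", "en-US", "example-provider",
                     "example-model", "2025-01-15", "Answer A")
rerun = replace(baseline, model="other-model", answer="Answer B")
print(compare_runs(baseline, rerun))
```

Reporting which fields changed, instead of only returning a yes or no, makes it easier to see why a difference cannot yet be attributed to the model's answer.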