Learn

How LLMs work — visibility basics.

Large language models are probabilistic text generators: ChatGPT, Claude, and Gemini may retrieve public web signals, summarize them, and change answers when prompts, models, or dates differ — which is why Coastline Metrics stores prompt, model, and date on every baseline and verification rerun.

See an example All learn articles

Transparent measurement, inspectable evidence, and comparable reruns.
Baseline → Diagnose → Improve → Verify
Full methodology and limits on dedicated pages — not repeated on every section.

Limitations in plain language →

Large language models are probabilistic text generators: ChatGPT, Claude, and Gemini may retrieve public web signals, summarize them, and change answers when prompts, models, or dates differ — which is why Coastline Metrics stores prompt, model, and date on every baseline and verification rerun.

A model predicts text

An LLM is a probabilistic text generator. Small differences in prompts, settings, and model versions can change outputs.

Retrieval changes the answer

Many products add retrieval so the model can quote current web sources. Missing or inconsistent site data may not be retrieved or cited.

Citations are signals, not guarantees

Even when a UI shows citations, the model may summarize or blend sources. You still need to verify what was used and what changed.

Verification requires comparability

To compare runs, hold constant: prompt pack, locale, provider, model, and date. Otherwise differences may be drift.

What we store on every run

Provider and model
Prompt pack identity
Date and time
Warnings about partial failures

Selected sources

Learn

How LLMs work — visibility basics.

See an example All learn articles

Transparent measurement, inspectable evidence, and comparable reruns.
Baseline → Diagnose → Improve → Verify
Full methodology and limits on dedicated pages — not repeated on every section.

Limitations in plain language →

Large language models are probabilistic text generators: ChatGPT, Claude, and Gemini may retrieve public web signals, summarize them, and change answers when prompts, models, or dates differ — which is why Coastline Metrics stores prompt, model, and date on every baseline and verification rerun.

A model predicts text

An LLM is a probabilistic text generator. Small differences in prompts, settings, and model versions can change outputs.

Retrieval changes the answer

Many products add retrieval so the model can quote current web sources. Missing or inconsistent site data may not be retrieved or cited.

Citations are signals, not guarantees

Even when a UI shows citations, the model may summarize or blend sources. You still need to verify what was used and what changed.

Verification requires comparability

To compare runs, hold constant: prompt pack, locale, provider, model, and date. Otherwise differences may be drift.

What we store on every run

Provider and model
Prompt pack identity
Date and time
Warnings about partial failures