Skip to content

Commit

Permalink
Hides figures from AT with aria-hidden instead of role=presentation
Browse files Browse the repository at this point in the history
  • Loading branch information
jcarstairs-scottlogic committed Aug 30, 2024
1 parent 9f24e61 commit 5eadd87
Showing 1 changed file with 35 additions and 42 deletions.
77 changes: 35 additions & 42 deletions _posts/2024-08-29-llms-dont-hallucinate.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,17 +14,16 @@ LLMs can confidently tell you all about the winners of non-existent sporting
events[^7], They can invent legal cases[^8], and fabricate fake science[^9].
These kinds of outputs are often called **hallucinations**.

<p>
<figure role="presentation">
<p aria-hidden="true">
<figure>
<img
src="{{ site.github.url }}/jcarstairs/assets/mirage.webp"
alt=""
role="presentation"
/>
<figcaption role="presentation">
<figcaption>
A desert mirage.
<a role="presentation" href="https://commons.wikimedia.org/wiki/File:Desert_mirage,_Egypt.jpg">Photo credit: Ashabot, Wikimedia</a>,
<a role="presentation" href="https://creativecommons.org/licenses/by-sa/4.0/deed.en">CC BY-SA 4.0</a>
<a href="https://commons.wikimedia.org/wiki/File:Desert_mirage,_Egypt.jpg">Photo credit: Ashabot, Wikimedia</a>,
<a href="https://creativecommons.org/licenses/by-sa/4.0/deed.en">CC BY-SA 4.0</a>
</figcaption>
</figure>
</p>
Expand All @@ -46,17 +45,16 @@ But is it so simple? A growing chorus of academics, engineers and journalists
are calling this narrative into question.[^3] **Maybe hallucination isn't a
solvable problem at all.**

<p>
<figure role="presentation">
<p aria-hidden="true">
<figure>
<img
src="{{ site.github.url }}/jcarstairs/assets/ai-safety-summit.webp"
alt=""
role="presentation"
/>
<figcaption role="presentation">
<figcaption>
Taken at the AI Safety Summit hosted at Bletchley Park, Nov 2023.
<a role="presentation" href="https://www.flickr.com/photos/ukgov/53302448690/in/album-72177720312359636/">Photo credit: Marcel Grabowski</a>,
<a role="presentation" href="https://creativecommons.org/licenses/by-nc-nd/2.0/">CC BY-NC-ND 2.0</a>
<a href="https://www.flickr.com/photos/ukgov/53302448690/in/album-72177720312359636/">Photo credit: Marcel Grabowski</a>,
<a href="https://creativecommons.org/licenses/by-nc-nd/2.0/">CC BY-NC-ND 2.0</a>
</figcaption>
</figure>
</p>
Expand Down Expand Up @@ -132,16 +130,15 @@ These points are pretty subtle, so let's explore a couple of examples.
Think about dice. I could roll six sixes in a row, and that would be really
unusual behaviour for the die.

<p>
<figure role="presentation">
<p aria-hidden="true">
<figure>
<img
src="{{ site.github.url }}/jcarstairs/assets/dice.webp"
alt=""
role="presentation"
/>
<figcaption role="presentation">
<a role="presentation" href="https://www.deviantart.com/barefootliam-stock/art/five-ivory-dice-97476774">Photo credit: barefootliam-stock</a>,
<a role="presentation" href="https://creativecommons.org/licenses/by/3.0/">CC BY 3.0</a>
<figcaption>
<a href="https://www.deviantart.com/barefootliam-stock/art/five-ivory-dice-97476774">Photo credit: barefootliam-stock</a>,
<a href="https://creativecommons.org/licenses/by/3.0/">CC BY 3.0</a>
</figcaption>
</figure>
</p>
Expand All @@ -159,18 +156,17 @@ sixes in a row.
In contrast, if I secure a picture to the wall with a picture hook, and it
falls off, I can reasonably expect there to be an explanation.

<p>
<figure role="presentation">
<p aria-hidden="true">
<figure>
<img
src="{{ site.github.url }}/jcarstairs/assets/picture-falling-off-wall.webp"
alt=""
role="presentation"
/>
<figcaption role="presentation">
<figcaption>
A picture hook behaving abnormally. Mind you, the chap on the ladder
appears to have bigger problems right now.
<a role="presentation" href="https://www.lookandlearn.com/history-images/YW039819V/Episodes-illustrating-forces-in-physics-dynamics-and-mechanics">Photo credit: lookandlearn.com</a>,
<a role="presentation" href="https://creativecommons.org/licenses/by/4.0/">CC BY 4.0</a>
<a href="https://www.lookandlearn.com/history-images/YW039819V/Episodes-illustrating-forces-in-physics-dynamics-and-mechanics">Photo credit: lookandlearn.com</a>,
<a href="https://creativecommons.org/licenses/by/4.0/">CC BY 4.0</a>
</figcaption>
</figure>
</p>
Expand Down Expand Up @@ -220,17 +216,16 @@ faithfulness. They are not trained to target truth. They are trained to guess
the most plausible next word in the given sequence. So the case where the LLM
'hallucinates' and the case where it doesn't are causally indistinguishable.

<p>
<figure role="presentation">
<p aria-hidden="true">
<figure>
<img
src="{{ site.github.url }}/jcarstairs/assets/predictive-text.webp"
alt=""
role="presentation"
/>
<figcaption role="presentation">
<figcaption>
The solution given by LLMs is new. The problem of predictive text is not.
<a role="presentation" href="https://www.flickr.com/photos/bike/4474418938">Photo credit: Richard Masoner</a>,
<a role="presentation" href="https://creativecommons.org/licenses/by-sa/2.0/deed.en">CC BY-SA 2.0</a>
<a href="https://www.flickr.com/photos/bike/4474418938">Photo credit: Richard Masoner</a>,
<a href="https://creativecommons.org/licenses/by-sa/2.0/deed.en">CC BY-SA 2.0</a>
</figcaption>
</figure>
</p>
Expand Down Expand Up @@ -307,19 +302,18 @@ humans who were really responsible. And where there's a lack of accountability,
organisations are unable to make good decisions or learn from their
mistakes[^2].

<p>
<figure role="presentation">
<p aria-hidden="true">
<figure>
<img
src="{{ site.github.url }}/jcarstairs/assets/crumple-zone.webp"
alt=""
role="presentation"
/>
<figcaption role="presentation">
<figcaption>
Just as crumple zones in a car is designed to absorb the impact of a
collision to protect its driver, so misplaced and misunderstood LLMs
could act to absorb responsibility from decision-makers.
<a role="presentation" href="https://commons.wikimedia.org/wiki/File:Escort_wreck_006.jpg">Photo credit: Karrmann, Wikimedia</a>,
<a role="presentation" href="https://creativecommons.org/licenses/by-sa/3.0/deed.en">CC BY-SA 3.0</a>
<a href="https://commons.wikimedia.org/wiki/File:Escort_wreck_006.jpg">Photo credit: Karrmann, Wikimedia</a>,
<a href="https://creativecommons.org/licenses/by-sa/3.0/deed.en">CC BY-SA 3.0</a>
</figcaption>
</figure>
</p>
Expand Down Expand Up @@ -385,17 +379,16 @@ can convince my crush that I'm a sophisticated gentleman that knows his Islays
from his Arrans, she just might consider going on a date with me. Actually
saying something true or meaningful about whisky is not the task at hand.

<p>
<figure role="presentation">
<p aria-hidden="true">
<figure>
<img
src="{{ site.github.url }}/jcarstairs/assets/whisky.webp"
alt=""
role="presentation"
/>
<figcaption role="presentation">
<figcaption>
I like the long notes of smoke and spices in this one. Mmm, gingery.
<a role="presentation" href="https://timelessmoon.getarchive.net/amp/media/whisky-alcohol-drink-food-drink-e8ec93">Photo credit: Pixabay</a>,
<a role="presentation" href="https://creativecommons.org/publicdomain/zero/1.0">CC CC0 1.0</a>
<a href="https://timelessmoon.getarchive.net/amp/media/whisky-alcohol-drink-food-drink-e8ec93">Photo credit: Pixabay</a>,
<a href="https://creativecommons.org/publicdomain/zero/1.0">CC CC0 1.0</a>
</figcaption>
</figure>
</p>
Expand Down

0 comments on commit 5eadd87

Please sign in to comment.