Skip to content

Commit

Permalink
Added Arxiv link.
Browse files Browse the repository at this point in the history
  • Loading branch information
luk-s committed Jun 19, 2023
1 parent 7c42845 commit c8d36a5
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

![main figure](./docs/main_figure.png "Testing the consistency of superhuman AI via consistency checks")

This repository contains the code for the paper [Evaluating Superhuman Models with Consistency Checks](https://arxiv.org/TODO) by [Lukas Fluri](https://www.linkedin.com/in/lukas-fluri-0b4721112), [Daniel Paleka](https://danielpaleka.com/), and [Florian Tramèr](https://floriantramer.com/).
This repository contains the code for the paper [Evaluating Superhuman Models with Consistency Checks](https://arxiv.org/abs/2306.09983) by [Lukas Fluri](https://www.linkedin.com/in/lukas-fluri-0b4721112), [Daniel Paleka](https://danielpaleka.com/), and [Florian Tramèr](https://floriantramer.com/).

## tl;dr
If machine learning models were to achieve *superhuman* abilities at various reasoning or decision-making tasks,
Expand Down Expand Up @@ -66,4 +66,4 @@ In particular, we show violations of the following consistency constraints:
ways of “ranking” different outcomes. We consider an extreme example here, where we test whether
a bail-decision model could favorably switch its decision if the defendant commits more crimes.

The code for our experiments is available in the [legal-ai-testing](./legal-ai-testing) directory. The data files are available in [release v1.0.0](https://github.com/ethz-privsec/superhuman-ai-consistency/releases/tag/v1.0.0)..
The code for our experiments is available in the [legal-ai-testing](./legal-ai-testing) directory. The data files are available in [release v1.0.0](https://github.com/ethz-privsec/superhuman-ai-consistency/releases/tag/v1.0.0).

0 comments on commit c8d36a5

Please sign in to comment.