Skip to content

Commit

Permalink
Add security and privacy questionnaire
Browse files Browse the repository at this point in the history
  • Loading branch information
domenic committed Apr 24, 2024
1 parent 91af943 commit f662f53
Show file tree
Hide file tree
Showing 2 changed files with 107 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -277,6 +277,8 @@ As a first step, we require that detecting the availability of translation for a

Another way in which this API might enhance the web's fingerprinting surface is if translation and language detection models are updated separately from browser versions. In that case, differing results from different versions of the model provide additional fingerprinting bits beyond those already provided by the browser's major version number. Mandating that older browser versions not receive updates or be able to download models from too far into the future might be a possible remediation for this.

Finally, we intend to prohibit (in the specification) any use of user-specific information in producing the translations. For example, it would not be permissible to fine-tune the translation model based on information the user has entered into the browser in the past.

## Alternatives considered and under consideration

### Streaming input support
Expand Down
105 changes: 105 additions & 0 deletions security-privacy-questionnaire.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,105 @@
# [Self-Review Questionnaire: Security and Privacy](https://w3ctag.github.io/security-questionnaire/)

> 01. What information does this feature expose,
> and for what purposes?
This feature exposes two main pieces of information:

- The availability information for each `{ sourceLanguage, targetLanguage }` pair, so that web developers know what translations are possible and whether such translations will require the user to download a potentially-large language pack.

- The actual results of translations, which can be dependent on the language packs or translation models.

> 02. Do features in your specification expose the minimum amount of information
> necessary to implement the intended functionality?
We believe so. It's possible that we could remove the exposure of the availability information. However, it would almost certainly be inferrable via timing side-channels. (I.e., if downloading a language pack is required, then the web developer can observe the first translation taking longer.)

> 03. Do the features in your specification expose personal information,
> personally-identifiable information (PII), or information derived from
> either?
No. Although it's imaginable that the translation models could be fine-tuned on PII to give more accurate-to-this-user translations, we intend to disallow this in the specification.

> 04. How do the features in your specification deal with sensitive information?
We do not deal with sensitive information.

> 05. Do the features in your specification introduce state
> that persists across browsing sessions?
Yes. The downloading of language packs and translation models persists across browsing sessions.

> 06. Do the features in your specification expose information about the
> underlying platform to origins?
Possibly. If a browser does not bundle its own translation models and language packs, but instead uses the operating system's functionality, it is possible for a web developer to infer information about such operating system functionality.

> 07. Does this specification allow an origin to send data to the underlying
> platform?
Possibly. Again, in the scenario where translation is done via operating system functionality, such data would pass through OS libraries.

> 08. Do features in this specification enable access to device sensors?
No.

> 09. Do features in this specification enable new script execution/loading
> mechanisms?
No.

> 10. Do features in this specification allow an origin to access other devices?
No.

> 11. Do features in this specification allow an origin some measure of control over
> a user agent's native UI?
No.

> 12. What temporary identifiers do the features in this specification create or
> expose to the web?
None.

> 13. How does this specification distinguish between behavior in first-party and
> third-party contexts?
We are not yet sure. Our default course of action is to give the same capabilities to both first- and third-party contexts. It is easy to imagine use cases where this could be useful, e.g. a third party customer-support widget that provides translation functionality.

However, it seems likely that some of the mitigations for the [anti-fingerprinting considerations](./README.md#privacy-considerations) will require some sort of distinction between first- and third-party contexts. For example, partitioning download status, or only using the top-level site's detected language, or similar.

> 14. How do the features in this specification work in the context of a browser’s
> Private Browsing or Incognito mode?
We are not yet sure. It is likely that some behavior will be different to deal with the [anti-fingerprinting considerations](./README.md#privacy-considerations). For example, if storage partitioning infrastructure is used, then this would be automatic.

Another possible area of discussion here is whether cloud-based translation APIs make sense in such modes, or whether they should be disabled.

> 15. Does this specification have both "Security Considerations" and "Privacy
> Considerations" sections?
There is no specification yet, but there is a [privacy considerations](./README.md#privacy-considerations) section in the explainer.

We do not anticipate significant security risks for this feature at this time.

> 16. Do features in your specification enable origins to downgrade default
> security protections?
No.

> 17. What happens when a document that uses your feature is kept alive in BFCache
> (instead of getting destroyed) after navigation, and potentially gets reused
> on future navigations back to the document?
Ideally, nothing special should happen. In particular, `LanguageTranslator` and `LanguageDetector` objects should still be usable without interruption after navigating back. We'll need to add web platform tests to confirm this, as it's easy to imagine implementation architectures in which keeping these objects alive while the `Document` is in the back/forward cache is difficult.

(For such implementations, failing to bfcache `Document`s with active `LanguageTranslator` or `LanguageDetector` objects would a simple way of being spec-compliant.)

> 18. What happens when a document that uses your feature gets disconnected?
As with the previous question, nothing special should happen. As with the previous question, it's easy to imagine implementations where this is difficult to implement. We may need to add a check in the specification to prevent such usage, if prototyping shows that the difficulty is significant.

> 19. What should this questionnaire have asked?
Seems fine.

0 comments on commit f662f53

Please sign in to comment.