[w3ctag/design-reviews] Writing Assistance APIs (Issue #991)

こんにちは TAG-さん!

I'm requesting an early TAG design review of the writing assistance APIs.

Browsers and operating systems are increasingly expected to gain access to a language model. ([Example](https://developer.chrome.com/docs/ai/built-in), [example](https://blogs.windows.com/windowsdeveloper/2024/05/21/unlock-a-new-era-of-innovation-with-windows-copilot-runtime-and-copilot-pcs/), [example](https://www.apple.com/apple-intelligence/).) Web applications can benefit from using language models for a variety of [use cases](#use-cases).

We're proposing a group of APIs that use language models to give web developers high-level assistance with writing. Specifically:

* The **summarizer** API produces summaries of input text;
* The **writer** API writes new material, given a writing task prompt;
* The **rewriter** API transforms and rephrases input text in the requested ways.

Because these APIs share underlying infrastructure and API shape, and have many cross-cutting concerns, we include them all in one explainer, to avoid repeating ourselves across three repositories. However, they are separate API proposals, and can be evaluated independently.

  - Explainer: https://github.com/WICG/writing-assistance-apis/blob/main/README.md
  - User research: based on a series of prototyping sessions with partners, many separate applications were created using these APIs or their predecessors. These prototyping sessions were conducted confidentially (so as not to leak specific product plans), but from them we extracted the [use cases](https://github.com/WICG/writing-assistance-apis/blob/main/README.md#use-cases) listed in the explainer.
  - Security and Privacy self-review: https://github.com/WICG/writing-assistance-apis/blob/main/security-privacy-questionnaire.md
  - GitHub repo: https://github.com/WICG/writing-assistance-apis
  - Primary contacts:
      - Domenic Denicola (@domenic), Google, editor
  - Organization/project driving the design: Google
  - Multi-stakeholder feedback:
    - Chromium comments: We are excited to start trialing these APIs with developers through origin trials and behind-a-flag experiments.
    - Mozilla comments: TODO, will edit shortly
    - WebKit comments: TODO, will edit shortly
    - Web developers:
      - As mentioned above, based on a series of prototyping sessions we have heard significant excitement for using these APIs.
      - Public feedback on https://github.com/WICG/proposals/issues/163 was mixed. To summarize, some themes we saw include: asking for more capabilities (e.g. full prompting of a language model instead of higher-level APIs ([our response](https://github.com/WICG/writing-assistance-apis/blob/main/README.md#directly-exposing-a-prompt-api); multimodal support); desire to make sure the API actually works robustly in many real-world use cases; removal of any safety/ethical safeguards; and confusion about on-device vs. cloud APIs.

Further details:

  - [x] I have reviewed the TAG's [Web Platform Design Principles](https://www.w3.org/TR/design-principles/)
  - The group where the incubation/design work on this is being done (or is intended to be done in the future): WICG
  - The group where standardization of this work is intended to be done ("unknown" if not known): not completely known, but we are discussing the APIs with the Web Machine Learning Working Group at TPAC, and it is possible a future version of their charter would welcome us.
  - Existing major pieces of multi-implementer review or discussion of this design: see above.
  - Major unresolved issues with or opposition to this design:
    - We are aware of previous TAG feedback (in https://github.com/w3ctag/design-reviews/issues/948) regarding API surface details, and have captured that [in the explainer](https://github.com/WICG/writing-assistance-apis/blob/main/README.md#alternative-api-spellings).
    - As with the translator/language detector APIs (#948), there is a tension between interoperability and exposing whether the model is on-device or cloud-based; we discuss this a bit more [in the explainer](https://github.com/WICG/writing-assistance-apis/blob/main/README.md#shared-goals).
    - As with the translator/language detector APIs (#948), there are several privacy concerns, discussed [in the explainer](https://github.com/WICG/writing-assistance-apis/blob/main/README.md#privacy-considerations). We believe there are reasonable mitigations possible there, but will need to do some experimentation to find the best ones.
  - This work is being funded by: Google

You should also know that...

[This is not a generic prompt API](https://github.com/WICG/writing-assistance-apis/blob/main/README.md#directly-exposing-a-prompt-api).

-- 
Reply to this email directly or view it on GitHub:
https://github.com/w3ctag/design-reviews/issues/991
You are receiving this because you are subscribed to this thread.

Message ID: <w3ctag/design-reviews/issues/991@github.com>

Received on Tuesday, 10 September 2024 06:43:13 UTC