Skip to content

Commit

Permalink
Clean up and modernize AIP-151: Long-running operations.
Browse files Browse the repository at this point in the history
The changes are pretty broad, and include:

1) Refer to operations rather than status monitors.  This is based on discussion in Slack with Luke Sneeringer, who originally adopted this AEP.
2) Add references to the common components repo (aep-dev/aep) and perform corresponding cleanup in the examples.

This PR corresponds to aep-dev/aep-components#8.
  • Loading branch information
rofrankel committed Mar 8, 2024
1 parent 9a8f676 commit 8ccde42
Show file tree
Hide file tree
Showing 3 changed files with 62 additions and 72 deletions.
109 changes: 43 additions & 66 deletions aep/general/0151/aep.md.j2
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
# Long-running requests
# Long-running operations

Occasionally, a service may need to expose an operation that takes a
significant amount of time to complete. In these situations, it is often a poor
Expand All @@ -13,77 +13,50 @@ can be used to track progress and retrieve the result.
## Guidance

Operations that might take a significant amount of time to complete **should**
return a `202 Accepted` response along with an identifier that can be used to
track the status of the request and ultimately retrieve the result.
return a `202 Accepted` response along with an `Operation` resource that can be
used to track the status of the request and ultimately retrieve the result.

Any single operation defined in an API surface **must** either _always_ return
`202 Accepted` along with a request identifier, or _never_ do so. A service
**must not** return a `200 OK` response with the result if it is "fast enough",
and `202 Accepted` if it is not fast enough, because such behavior adds
significant burdens for clients.
`202 Accepted` along with an `Operation`, or _never_ do so. A service **must
not** return a `200 OK` response with the result if it is "fast enough", and
`202 Accepted` if it is not fast enough, because such behavior adds significant
burdens for clients.

**Note:** User expectations can vary on what is considered "a significant
amount of time" depending on what work is being done. A good rule of thumb is
10 seconds.

### Status monitor representation
### Operation representation

The response to a long-running request **should** be a "status monitor" having
the following common format:
The response to a long-running request **must** be an [`Operation`][].

```typescript
interface StatusMonitor {
// The identifier for this status monitor.
id: string;
{% tab proto %}

// Whether the request is done.
done: boolean;
Protocol buffer APIs **must** use the common component [`aep.api.Operation`][].

// The result of the request.
// Only populated if the request is done and was successful.
response: any;
{% tab oas %}

// The error that arose from the request.
// Only populated if the request is done and was unsuccessful.
error: Error;
OpenAPI services **must** use this [`JSON Schema Operation`][] schema.

// Metadata associated with the request.
// Populated throughout the life of the request, including after
// it completes.
metadata: any;
}
```
{% endtabs %}

- If the `done` field is `true`, then one and exactly one of the `response` and
`error` fields **must** be populated.
- If the `done` field is `false`, then the `response` and `error` fields
**must not** be populated.
- The `response` and `metadata` fields **may** be any type that the service
determines to be appropriate, but **must** always be the same type for any
particular operation.
- The `response` and `metadata` types **should** be defined in the same API
surface as the operation itself.
- The `response` and `metadata` types that need no data **should** use a
custom-defined empty struct rather than a common void or empty type, to
permit future extensibility.

### Querying a status monitor
### Querying an operation

The service **must** provide an endpoint to query the status of the operation,
which **must** accept the operation identifier and **should not** include other
which **must** accept the operation path and **should not** include other
parameters:

```http
GET /v1/statusMonitors/{status_monitor} HTTP/2
Host: library.googleapis.com
GET /v1/operations/{operation} HTTP/2
Host: library.example.com
Accept: application/json
```

The endpoint **must** return a `StatusMonitor` as described above.
The endpoint **must** return a `Operation` as described above.

### Standard methods

APIs **may** return an `StatusMonitor` from the [`Create`][aip-133],
APIs **may** return an `Operation` from the [`Create`][aip-133],
[`Update`][aip-134], or [`Delete`][aip-135] standard methods if appropriate. In
this case, the `response` field **must** be the standard and expected response
type for that standard method.
Expand All @@ -107,10 +80,10 @@ but is not obligated to do so:

### Expiration

APIs **may** allow their status monitor resources to expire after sufficient
time has elapsed after the request completed.
APIs **may** allow their operation resources to expire after sufficient time
has elapsed after the request completed.

**Note:** A good rule of thumb for status monitor expiry is 30 days.
**Note:** A good rule of thumb for operation expiry is 30 days.

### Errors

Expand All @@ -125,18 +98,17 @@ canonical error object.

{% tab proto %}

When using protocol buffers, the well-known type `google.longrunning.Operation`
is used.

**Note:** For historical reasons, Google uses the term `Operation` to represent
what this document describes as a `StatusMonitor`.
When using protocol buffers, the common component [`aep.api.Operation`][] is
used.

{% sample 'lro.proto', 'rpc WriteBook' %}

- The response type **must** be `google.longrunning.Operation`. The `Operation`
proto definition **must not** be copied into individual APIs.
- The response type **must** be `aep.api.Operation`. The `Operation` proto
definition **should not** be copied into individual APIs; prefer to use a
single copy (in monorepo code bases), or remote dependencies via a tool like
[Buf][buf.build].
- The response **must not** be a streaming response.
- The method **must** include a `google.longrunning.operation_info` annotation,
- The method **must** include a [`aep.api.operation_info`][lro] annotation,
which **must** define both response and metadata types.
- The response and metadata types **must** be defined in the file where the
RPC appears, or a file imported by that file.
Expand All @@ -152,8 +124,10 @@ what this document describes as a `StatusMonitor`.
metadata will _never_ be needed. If metadata might be added in the future,
define an empty message for the RPC metadata and use that.
- APIs with messages that return `Operation` **must** implement the
[`Operations`][lro] service. Individual APIs **must not** define their own
interfaces for long-running operations to avoid inconsistency.
`GetOperation` method of the [`Operations`][lro] service, and **may**
implement the other methods defined in that service. Individual APIs **must
not** define their own interfaces for long-running operations to avoid
inconsistency.

{% tab oas %}

Expand All @@ -162,8 +136,8 @@ what this document describes as a `StatusMonitor`.
- `202` **must** be the only success status code defined.
- The `202` response **must** define an `application/json` response body and no
other response content types.
- The response body schema **must** be an object with `name`, `done`, and
`result` properties as described above for a StatusMonitor
- The response body schema **must** be an object with `path`, `done`, `error`,
and `response` properties as described above for an `Operation`.
- The response body schema **may** contain an object property named `metadata`
to hold service-specific metadata associated with the operation, for example
progress information and common metadata such as create time. The service
Expand All @@ -176,15 +150,18 @@ what this document describes as a `StatusMonitor`.
as an empty object schema. For a standard `Get/Create/Update` operation,
`response` should be a representation of the resource.
- If a service has any long running operations, the service **must** define an
`StatusMonitor` resource with a `list` operation to retrieve a potentially
filtered list of status monitors and a `get` operation to retrieve a specific
status monitor by its `name`.
`Operation` resource with a `list` operation to retrieve a potentially
filtered list of operations and a `get` operation to retrieve a specific
operation by its `path`.

{% endtabs %}

<!-- prettier-ignore-start -->
[google.rpc.Status]: https://github.com/googleapis/api-common-protos/blob/master/google/rpc/S.proto
[lro]: https://github.com/googleapis/api-common-protos/blob/master/google/longrunning/operations.proto
[lro]: https://github.com/aep-dev/aep/blob/main/proto/aep-api/aep/api/operations.proto
[node.js promise]: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Using_promises
[future]: https://docs.python.org/3/library/concurrent.futures.html#concurrent.futures.Future
[`Operation`]: https://github.com/aep-dev/aep/blob/main/schemas/api/operation.md
[`aep.api.Operation`]: https://buf.build/aep/api/docs/main:aep.api#aep.api.Operation
[`JSON Schema Operation`]: https://github.com/aep-dev/aep/blob/main/json_schema/api/operation.yaml
<!-- prettier-ignore-end -->
21 changes: 17 additions & 4 deletions aep/general/0151/lro.oas.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -17,13 +17,13 @@ paths:

components:
schemas:
StatusMonitor:
Operation:
description: The status of a long running operation.
properties:
name:
path:
type: string
description:
The server-assigned name, which is only unique within the same
The server-assigned path, which is only unique within the same
service that originally returns it.
done:
type: boolean
Expand All @@ -33,8 +33,21 @@ components:
is available.
error:
$ref: '#/components/schemas/Error'
metadata:
type: object
description: >-
Service-specific metadata associated with the operation. It typically
contains progress information and common metadata such as create time.
Some services might not provide such metadata. Any method that returns a
long-running operation should document the metadata type, if any.
response:
type: object
description: >-
The normal response of the operation in case of success.
Depending on the method, this may be the empty object, the resource on which
the operation is performed, or a custom response.
required:
- name
- path
- done

WriteBookStatus:
Expand Down
4 changes: 2 additions & 2 deletions aep/general/0151/lro.proto
Original file line number Diff line number Diff line change
Expand Up @@ -21,12 +21,12 @@ import "google/protobuf/timestamp.proto";

service Library {
// Write a book.
rpc WriteBook(WriteBookRequest) returns (google.longrunning.Operation) {
rpc WriteBook(WriteBookRequest) returns (aep.api.Operation) {
option (google.api.http) = {
post: "/v1/{parent=publishers/*}/books:write"
body: "*"
};
option (google.longrunning.operation_info) = {
option (aep.api.operation_info) = {
response_type: "WriteBookResponse"
metadata_type: "WriteBookMetadata"
};
Expand Down

0 comments on commit 8ccde42

Please sign in to comment.