Skip to content

Commit

Permalink
fix: update Chicago Taxi Dataset URL to Zenodo (#3357)
Browse files Browse the repository at this point in the history
Co-authored-by: Ianna Osborne <[email protected]>
  • Loading branch information
jpivarski and ianna authored Jan 9, 2025
1 parent c51c58c commit c909669
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 4 deletions.
6 changes: 3 additions & 3 deletions docs/getting-started/10-minutes-to-awkward-array.md
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@ In this guide, we'll look at how to manipulate a jagged dataset to plot taxi rou

## Loading the dataset

Our dataset is formatted as a 611 MB [Apache Parquet](https://parquet.apache.org/) file, provided [here](https://pivarski-princeton.s3.amazonaws.com/chicago-taxi.parquet). Alongside JSON, and raw buffers, Awkward can also read Parquet files and Arrow tables.
Our dataset is formatted as a 611 MB [Apache Parquet](https://parquet.apache.org/) file, provided [here](https://zenodo.org/records/14537442/files/chicago-taxi.parquet). Alongside JSON, and raw buffers, Awkward can also read Parquet files and Arrow tables.

Given that this file is so large, let's first look at the *metadata* with `ak.metadata_from_parquet` to see what we're working with:

Expand All @@ -43,7 +43,7 @@ import numpy as np
import awkward as ak
metadata = ak.metadata_from_parquet(
"https://pivarski-princeton.s3.amazonaws.com/chicago-taxi.parquet"
"https://zenodo.org/records/14537442/files/chicago-taxi.parquet"
)
```

Expand All @@ -59,7 +59,7 @@ There are a lot of different columns here (`trip.sec`, `trip.begin.lon`, `trip.p

```{code-cell} ipython3
taxi = ak.from_parquet(
"https://pivarski-princeton.s3.amazonaws.com/chicago-taxi.parquet",
"https://zenodo.org/records/14537442/files/chicago-taxi.parquet",
row_groups=[0],
columns=["trip.km", "trip.begin.l*", "trip.end.l*", "trip.path.*"],
)
Expand Down
2 changes: 1 addition & 1 deletion docs/user-guide/how-to-examine-single-item.md
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,7 @@ First, let's load the dataset using the {func}`ak.from_parquet` function. We wil
```{code-cell} ipython3
import awkward as ak
url = "https://pivarski-princeton.s3.amazonaws.com/chicago-taxi.parquet"
url = "https://zenodo.org/records/14537442/files/chicago-taxi.parquet"
taxi = ak.from_parquet(
url,
row_groups=[0],
Expand Down

0 comments on commit c909669

Please sign in to comment.