Bring back GCS ops. #1229

Open
wants to merge 85 commits into
base: master
59ddbc4
Add a standalone binary build for GCS ops
michaelbanfield Jun 18, 2020
ca8e327
Revert "Deprecate gcs-config (#1024)"
michaelbanfield Dec 14, 2020
cb8ffe7
Rebase change
michaelbanfield Dec 14, 2020
3536338
Clean up merge
michaelbanfield Dec 14, 2020
11c8a51
Fix lint errors
michaelbanfield Dec 14, 2020
3560ac6
Skip tests for windows
michaelbanfield Mar 29, 2021
42c2867
Move build target to exclude windows
michaelbanfield Mar 29, 2021
7c04ee3
Update the API Compatibility test to include tf-nightly vs. tensorflo…
yongtang Dec 15, 2020
760dd25
Bump Apache Arrow to 2.0.0 (#1231)
yongtang Dec 16, 2020
9a3663c
Bump Avro to 1.10.1 (#1235)
yongtang Dec 17, 2020
2e6936f
Add emulator for gcs (#1234)
vnghia Dec 18, 2020
04d6913
fix nightly build because of missing `google-cloud-storage` (#1238)
vnghia Dec 18, 2020
aa1a95d
Remove the CI build for CentOS 8 (#1237)
yongtang Dec 18, 2020
6c29813
[MongoDB] update API docstrings (#1243)
kvignesh1420 Dec 22, 2020
371877e
Remove redundant output of dataset.element_spec in PostgreSQL tutoria…
terrytangyuan Dec 22, 2020
88b9d8d
add tf-c-header rule (#1244)
vnghia Dec 22, 2020
d695644
Add `fail-fast: false` to API Compatibility GitHub Actions (#1246)
yongtang Dec 23, 2020
b0ffa2e
Skip tf-nightly:tensorflow-io==0.17.0 on API compatibility test (#1247)
yongtang Dec 23, 2020
a68633e
S3 Improvements (#1248)
vnghia Dec 23, 2020
e29e8a3
Add missed function RecursivelyCreateDir for hdfs file system impleme…
yongtang Dec 24, 2020
fd7bc1a
update bazel version to 3.7.2 (#1251)
kvignesh1420 Dec 26, 2020
7bb3d30
[audio] cleanup vorbis file after usage (#1249)
kvignesh1420 Dec 27, 2020
9966944
[s3] add support for testing on macOS (#1253)
kvignesh1420 Jan 4, 2021
0342a16
add notebook formatting instruction in README (#1256)
burgerkingeater Jan 5, 2021
b56ad5d
[docs] Restructure README.md content (#1257)
kvignesh1420 Jan 5, 2021
ab9004d
Update libtiff/libgeotiff dependency (#1258)
yongtang Jan 5, 2021
0381b91
Update openjpeg to 2.4.0 (#1259)
yongtang Jan 5, 2021
ac8da58
[arrow] using eager exec for examples in README.md (#1261)
kvignesh1420 Jan 6, 2021
3641d2a
remove unstable elasticsearch test setup on macOS (#1263)
kvignesh1420 Jan 6, 2021
4f340e0
Exposes num_parallel_reads and num_parallel_calls (#1232)
i-ony Jan 7, 2021
0663d38
Fix incomplete row reading issue in parquet files (#1262)
yongtang Jan 7, 2021
7cb6b0f
Tests to train a keras model using MongoDBIODataset (#1264)
kvignesh1420 Jan 7, 2021
cebc613
add avro tutorial testing data (#1267)
burgerkingeater Jan 11, 2021
fc8d472
Update Kafka tutorial to work with Apache Kafka (#1266)
dalelane Jan 12, 2021
5df32c6
Update pulsar download link. (#1270)
yongtang Jan 14, 2021
2bbdd40
add github workflow for performance benchmarking (#1269)
kvignesh1420 Jan 14, 2021
f8efb10
handle missing dependencies while benchmarking (#1271)
kvignesh1420 Jan 18, 2021
544740a
Disable s3 macOS for now as docker is not working on GitHub Actions f…
yongtang Jan 19, 2021
b71fcb3
rename testing data files (#1278)
burgerkingeater Jan 19, 2021
337ef96
Add tutorial for avro dataset API (#1250)
burgerkingeater Jan 19, 2021
c0d56ee
remove docker based mongodb tests in macos (#1279)
kvignesh1420 Jan 20, 2021
f16c613
trigger benchmarks workflow only on commits (#1282)
kvignesh1420 Jan 26, 2021
ef2927c
Bump Apache Arrow to 3.0.0 (#1285)
yongtang Jan 27, 2021
03b77de
Add bazel cache (#1287)
yongtang Jan 29, 2021
f492bc8
Add initial bigtable stub test (#1286)
yongtang Jan 29, 2021
2808ac3
Update azure lite v0.3.0 (#1288)
yongtang Jan 30, 2021
5a42d1e
Add reference to github-pages benchmarks in README (#1289)
kvignesh1420 Jan 31, 2021
ff6245a
Update _toc.yaml (#1290)
burgerkingeater Jan 31, 2021
8a1fead
Clear outputs (#1292)
MarkDaoust Feb 2, 2021
b07d1ab
fix kafka online-learning section in tutorial notebook (#1274)
kvignesh1420 Feb 2, 2021
5299d14
Only enable bazel caching writes for tensorflow/io github actions (#1…
yongtang Feb 2, 2021
f02af15
Enable ready-only bazel cache (#1294)
yongtang Feb 2, 2021
84bba4c
Update xz to 5.2.5, and switch the download link. (#1296)
yongtang Feb 3, 2021
0a7c5a2
Enable bazel remote cache for kokoro tests (#1295)
yongtang Feb 3, 2021
880c8b3
Rename tests (#1297)
yongtang Feb 3, 2021
7010a48
Combine Ubuntu 20.04 and CentOS 7 tests into one GitHub jobs (#1299)
yongtang Feb 4, 2021
5091b94
Update names of api tests (#1300)
yongtang Feb 4, 2021
79ccf5e
Fix wrong benchmark tests names (#1301)
yongtang Feb 4, 2021
7945ec5
Patch arrow to temporarily resolve the ARROW-11518 issue (#1304)
yongtang Feb 8, 2021
a398d26
Avoid error if plugins .so module is not available (#1302)
yongtang Feb 8, 2021
df04e37
Remove AWS headers from tensorflow, and use headers from third_party …
yongtang Feb 8, 2021
cc93afa
Fix docstring. (#1305)
MarkDaoust Feb 8, 2021
f34d193
Switch to use github to download libgeotiff (#1307)
yongtang Feb 9, 2021
801569f
Add @com_google_absl//absl/strings:cord (#1308)
yongtang Feb 9, 2021
a871e52
Switch to modular file system for hdfs (#1309)
yongtang Feb 11, 2021
5b77f96
Disable test_write_kafka test for now. (#1310)
yongtang Feb 12, 2021
53c9a71
Modify --plat-name for macosx wheels (#1311)
kvignesh1420 Feb 13, 2021
3f7f292
Switch to modular file system for s3 (#1312)
yongtang Feb 13, 2021
33fca56
Update to enable python 3.9 building on Linux (#1314)
yongtang Feb 26, 2021
314f406
Add python 3.9 on Windows (#1316)
yongtang Feb 27, 2021
fb5cab8
Use `-p 9000:9000` (and hide 8088) when launch hadoop (#1317)
yongtang Mar 2, 2021
221e221
Experimental: Add initial wavefront/obj parser for vertices (#1315)
yongtang Mar 5, 2021
3b81b85
update `protobuf` version to `3.11.4` to match tensorflow-nightly (#1…
vnghia Mar 7, 2021
1c85b77
Revert "update `protobuf` version to `3.11.4` to match tensorflow-nig…
vnghia Mar 12, 2021
ef46f8c
Enable python 3.9 build on macOS (#1324)
yongtang Mar 13, 2021
3121308
switch mnist dataset mirror to a more reliable one (#1327)
vnghia Mar 14, 2021
57d840b
remove flaky centos 7 based build action (#1328)
kvignesh1420 Mar 14, 2021
c7e99a5
Adds AVRO_PARSER_NUM_MINIBATCH to override num_minibatches and logs t…
i-ony Mar 18, 2021
de54c3c
Super Serial- automatically save and load TFRecords from Tensorflow d…
markemus Mar 18, 2021
9644be3
Fix link in avro reader notebook (#1333)
oliverhu Mar 21, 2021
8d7d28f
Bump abseil-cpp to 6f9d96a1f41439ac172ee2ef7ccd8edf0e5d068c (#1336)
yongtang Mar 23, 2021
3de431d
Release nightly even if test fails (#1339)
yongtang Mar 23, 2021
ef8a5d5
remove unused/stale azure_ops (#1338)
kvignesh1420 Mar 23, 2021
4154a2c
gcs switch to env (#1319)
vnghia Mar 24, 2021
64eb761
improvements for `s3` environements variables (#1343)
vnghia Mar 29, 2021
2 changes: 1 addition & 1 deletion .bazelversion
@@ -1 +1 @@
-3.1.0
+3.7.2
31 changes: 18 additions & 13 deletions .github/workflows/api.yml
@@ -1,6 +1,9 @@
 name: API Compatibility

 on:
+  push:
+    branches:
+      - master
   schedule:
     - cron: "0 12 * * *"

@@ -13,12 +16,12 @@ jobs:
     name: macOS ${{ matrix.python }} + ${{ matrix.version }}
     runs-on: macos-latest
     strategy:
+      fail-fast: false
       matrix:
         python: ['3.8']
-        version: ['tensorflow==2.4.0rc4:tensorflow-io-nightly', 'tf-nightly:tensorflow-io-nightly']
+        version: ['tensorflow==2.4.0:tensorflow-io-nightly', 'tf-nightly:tensorflow-io==0.17.0', 'tf-nightly:tensorflow-io-nightly']
     steps:
       - uses: actions/checkout@v2
-      - uses: docker-practice/actions-setup-docker@v1
       - uses: actions/setup-python@v1
         with:
           python-version: ${{ matrix.python }}
@@ -29,9 +32,7 @@ jobs:
       - name: Setup macOS
         run: |
           set -x -e
-          docker version
           bash -x -e tests/test_azure/start_azure.sh
-          bash -x -e tests/test_aws/aws_test.sh
       - name: Test macOS
         run: |
           set -x -e
@@ -40,21 +41,23 @@ jobs:
           rm -rf tensorflow_io
           echo ${{ matrix.version }} | awk -F: '{print $1}' | xargs python -m pip install -U
           echo ${{ matrix.version }} | awk -F: '{print $2}' | xargs python -m pip install --no-deps -U
-          python -m pip install pytest-benchmark boto3
+          python -m pip install pytest-benchmark boto3 google-cloud-storage==1.32.0
           python -m pip freeze
           python -c 'import tensorflow as tf; print(tf.version.VERSION)'
           python -c 'import tensorflow_io as tfio; print(tfio.version.VERSION)'
-          python -m pytest -s -v tests/test_http_eager.py
-          python -m pytest -s -v tests/test_s3_eager.py
+          python -m pytest -s -v tests/test_http.py
+          python -m pytest -s -v tests/test_s3.py
           python -m pytest -s -v tests/test_azure.py
+          python -m pytest -s -v tests/test_gcs.py

   linux:
     name: Linux ${{ matrix.python }} + ${{ matrix.version }}
     runs-on: ubuntu-20.04
     strategy:
+      fail-fast: false
       matrix:
         python: ['3.8']
-        version: ['tensorflow==2.4.0rc4:tensorflow-io-nightly', 'tf-nightly:tensorflow-io-nightly']
+        version: ['tensorflow==2.4.0:tensorflow-io-nightly', 'tf-nightly:tensorflow-io==0.17.0', 'tf-nightly:tensorflow-io-nightly']
     steps:
       - uses: actions/checkout@v2
       - uses: actions/setup-python@v1
@@ -75,21 +78,23 @@ jobs:
           rm -rf tensorflow_io
           echo ${{ matrix.version }} | awk -F: '{print $1}' | xargs python -m pip install -U
           echo ${{ matrix.version }} | awk -F: '{print $2}' | xargs python -m pip install --no-deps -U
-          python -m pip install pytest-benchmark boto3
+          python -m pip install pytest-benchmark boto3 google-cloud-storage==1.32.0
           python -m pip freeze
           python -c 'import tensorflow as tf; print(tf.version.VERSION)'
           python -c 'import tensorflow_io as tfio; print(tfio.version.VERSION)'
-          python -m pytest -s -v tests/test_http_eager.py
-          python -m pytest -s -v tests/test_s3_eager.py
+          python -m pytest -s -v tests/test_http.py
+          python -m pytest -s -v tests/test_s3.py
           python -m pytest -s -v tests/test_azure.py
+          if [[ "${{ matrix.version }}" != "tf-nightly:tensorflow-io==0.17.0" ]]; then python -m pytest -s -v tests/test_gcs.py ; fi

   windows:
     name: Windows ${{ matrix.python }} + ${{ matrix.version }}
     runs-on: windows-latest
     strategy:
+      fail-fast: false
       matrix:
         python: ['3.8']
-        version: ['tensorflow==2.4.0rc4:tensorflow-io-nightly', 'tf-nightly:tensorflow-io-nightly']
+        version: ['tensorflow==2.4.0:tensorflow-io-nightly', 'tf-nightly:tensorflow-io==0.17.0', 'tf-nightly:tensorflow-io-nightly']
     steps:
       - uses: actions/checkout@v2
       - uses: actions/setup-python@v1
@@ -115,4 +120,4 @@ jobs:
           python -m pip freeze
           python -c 'import tensorflow as tf; print(tf.version.VERSION)'
           python -c 'import tensorflow_io as tfio; print(tfio.version.VERSION)'
-          python -m pytest -s -v tests/test_http_eager.py -k remote
+          python -m pytest -s -v tests/test_http.py -k remote
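
Note on the matrix entries in api.yml: each `version` value packs two pip requirements into one string separated by a colon, and the job splits it with `awk -F:`. The sketch below reproduces that split outside of GitHub Actions, together with the Linux job's guard that skips `tests/test_gcs.py` for the pinned `tensorflow-io==0.17.0` entry. The function names `split_matrix_version` and `should_run_gcs_tests` are hypothetical helpers introduced for illustration, and the workflow's `[[ ... ]]` test is written as POSIX `[ ... ]` here so the sketch also runs under plain `sh`.

```shell
set -e

# Split a "<tensorflow spec>:<tensorflow-io spec>" matrix entry on the colon,
# the same way the workflow does with awk -F:.
# (split_matrix_version is a hypothetical helper, not part of the PR.)
split_matrix_version() {
  tf_pkg=$(echo "$1" | awk -F: '{print $1}')
  tfio_pkg=$(echo "$1" | awk -F: '{print $2}')
}

# Mirror of the Linux job's guard: GCS tests are skipped only for the
# tf-nightly + tensorflow-io==0.17.0 combination.
# (should_run_gcs_tests is a hypothetical helper, not part of the PR.)
should_run_gcs_tests() {
  [ "$1" != "tf-nightly:tensorflow-io==0.17.0" ]
}

for entry in \
  'tensorflow==2.4.0:tensorflow-io-nightly' \
  'tf-nightly:tensorflow-io==0.17.0' \
  'tf-nightly:tensorflow-io-nightly'; do
  split_matrix_version "$entry"
  if should_run_gcs_tests "$entry"; then gcs=run; else gcs=skip; fi
  echo "tf=$tf_pkg tfio=$tfio_pkg gcs=$gcs"
done
```

In the workflow the first field is installed with `pip install -U` and the second with `pip install --no-deps -U`, so the tensorflow-io package does not pull in its own TensorFlow dependency over the one chosen by the matrix.
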
81 changes: 81 additions & 0 deletions .github/workflows/benchmarks.yml
@@ -0,0 +1,81 @@
+name: API Performance Benchmarks
+
+on:
+  push:
+    branches:
+      - master
+
+jobs:
+  macos:
+    name: macOS ${{ matrix.python }} + ${{ matrix.version }}
+    runs-on: macos-latest
+    strategy:
+      fail-fast: false
+      matrix:
+        python: ['3.8']
+        version: ['tensorflow==2.4.0:tensorflow-io-nightly', 'tensorflow==2.4.0:tensorflow-io']
+    steps:
+      - uses: actions/checkout@v2
+      - uses: actions/setup-python@v1
+        with:
+          python-version: ${{ matrix.python }}
+      - name: Setup macOS
+        run: |
+          set -x -e
+          python -m pip install -U wheel setuptools
+          python --version
+      - name: Benchmark on macOS
+        run: |
+          set -x -e
+          python --version
+          df -h
+          rm -rf tensorflow_io
+          echo ${{ matrix.version }} | awk -F: '{print $1}' | xargs python -m pip install -U
+          echo ${{ matrix.version }} | awk -F: '{print $2}' | xargs python -m pip install --no-deps -U
+          python -m pip install -q scikit-image pytest pytest-benchmark boto3 fastavro avro-python3 scikit-image pandas pyarrow==2.0.0 google-cloud-pubsub==2.1.0 google-cloud-bigquery-storage==1.1.0 google-cloud-bigquery==2.3.1 google-cloud-storage==1.32.0
+          python -m pip freeze
+          python -c 'import tensorflow as tf; print(tf.version.VERSION)'
+          python -c 'import tensorflow_io as tfio; print(tfio.version.VERSION)'
+          python -m pytest --benchmark-only -v --import-mode=append $(find . -type f \( -iname "test_*.py" ! \( -iname "test_*_v1.py" -o -iname "test_bigquery.py" \) \))
+
+  linux:
+    name: Linux ${{ matrix.python }} + ${{ matrix.version }}
+    runs-on: ubuntu-20.04
+    strategy:
+      fail-fast: false
+      matrix:
+        python: ['3.8']
+        version: ['tensorflow==2.4.0:tensorflow-io-nightly', 'tensorflow==2.4.0:tensorflow-io']
+    steps:
+      - uses: actions/checkout@v2
+        with:
+          fetch-depth: 0
+      - uses: actions/setup-python@v1
+        with:
+          python-version: ${{ matrix.python }}
+      - name: Setup Linux
+        run: |
+          set -x -e
+          bash -x -e .github/workflows/build.space.sh
+          bash -x -e tests/test_sql/sql_test.sh
+      - name: Benchmark on Linux
+        run: |
+          set -x -e
+          python --version
+          df -h
+          rm -rf tensorflow_io
+          echo ${{ matrix.version }} | awk -F: '{print $1}' | xargs python -m pip install -U
+          echo ${{ matrix.version }} | awk -F: '{print $2}' | xargs python -m pip install --no-deps -U
+          python -m pip install -q scikit-image pytest pytest-benchmark boto3 fastavro avro-python3 scikit-image pandas pyarrow==2.0.0 google-cloud-pubsub==2.1.0 google-cloud-bigquery-storage==1.1.0 google-cloud-bigquery==2.3.1 google-cloud-storage==1.32.0
+          python -m pip freeze
+          python -c 'import tensorflow as tf; print(tf.version.VERSION)'
+          python -c 'import tensorflow_io as tfio; print(tfio.version.VERSION)'
+          python -m pytest --benchmark-only --benchmark-json benchmark.json -v --import-mode=append $(find . -type f \( -iname "test_*.py" ! \( -iname "test_*_v1.py" -o -iname "test_bigquery.py" \) \))
+      - name: Store benchmark result
+        uses: rhysd/github-action-benchmark@v1
+        with:
+          name: Tensorflow-IO Benchmarks
+          tool: 'pytest'
+          output-file-path: benchmark.json
+          github-token: ${{ secrets.GITHUB_TOKEN }}
+          auto-push: true
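
Note on the test selection in benchmarks.yml: both benchmark steps pass pytest the output of a `find` expression that keeps `test_*.py` files while excluding `test_*_v1.py` and `test_bigquery.py`. The sketch below runs the same predicate against a throwaway directory to show which files survive. The file names created here are placeholders chosen for illustration (only `test_bigquery.py` and the `test_gcs.py`/`test_s3.py` names appear in this PR), and `-iname` is assumed to be available, as it is in GNU and BSD `find`.

```shell
set -e

# Scratch tree with a few placeholder test files (names are illustrative).
workdir=$(mktemp -d)
mkdir -p "$workdir/tests"
for f in test_gcs.py test_s3.py test_audio_v1.py test_bigquery.py helper.py; do
  : > "$workdir/tests/$f"
done

# Same predicate as the workflow: keep test_*.py, drop *_v1.py and test_bigquery.py.
selected=$(cd "$workdir" && find . -type f \( -iname "test_*.py" ! \( -iname "test_*_v1.py" -o -iname "test_bigquery.py" \) \) | LC_ALL=C sort)

echo "$selected"
rm -rf "$workdir"
```

With these placeholder files, only `./tests/test_gcs.py` and `./tests/test_s3.py` are selected: `helper.py` fails the `test_*.py` match, and the other two are removed by the negated group.
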
10 changes: 5 additions & 5 deletions .github/workflows/build.wheel.sh
@@ -6,11 +6,11 @@ run_test() {
   entry=$1
   CPYTHON_VERSION=$($entry -c 'import sys; print(str(sys.version_info[0])+str(sys.version_info[1]))')
   (cd wheelhouse && $entry -m pip install tensorflow_io-*-cp${CPYTHON_VERSION}-*.whl)
-  $entry -m pip install -q pytest pytest-benchmark boto3 fastavro avro-python3 scikit-image pandas pyarrow==0.16.0 google-cloud-pubsub==2.1.0 google-cloud-bigquery-storage==1.1.0 google-cloud-bigquery==2.3.1 google-cloud-storage==1.32.0
-  (cd tests && $entry -m pytest --benchmark-disable -v --import-mode=append $(find . -type f \( -iname "test_*.py" ! \( -iname "test_*_eager.py" \) \)))
-  (cd tests && $entry -m pytest --benchmark-disable -v --import-mode=append $(find . -type f \( -iname "test_*_eager.py" ! \( -iname "test_bigquery_eager.py" \) \)))
-  # GRPC and test_bigquery_eager tests have to be executed separately because of https://github.com/grpc/grpc/issues/20034
-  (cd tests && $entry -m pytest --benchmark-disable -v --import-mode=append $(find . -type f \( -iname "test_bigquery_eager.py" \)))
+  $entry -m pip install -q pytest pytest-benchmark boto3 fastavro avro-python3 scikit-image pandas pyarrow==3.0.0 google-cloud-pubsub==2.1.0 google-cloud-bigtable==1.6.0 google-cloud-bigquery-storage==1.1.0 google-cloud-bigquery==2.3.1 google-cloud-storage==1.32.0 PyYAML==5.3.1
+  (cd tests && $entry -m pytest --benchmark-disable -v --import-mode=append $(find . -type f \( -iname "test_*_v1.py" \)))
+  (cd tests && $entry -m pytest --benchmark-disable -v --import-mode=append $(find . -type f \( -iname "test_*.py" ! \( -iname "test_*_v1.py" -o -iname "test_bigquery.py" \) \)))
+  # GRPC and test_bigquery tests have to be executed separately because of https://github.com/grpc/grpc/issues/20034
+  (cd tests && $entry -m pytest --benchmark-disable -v --import-mode=append $(find . -type f \( -iname "test_bigquery.py" \)))
 }

 PYTHON_VERSION=python
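
Note on the wheel selection in build.wheel.sh: `run_test` concatenates the interpreter's major and minor version into `CPYTHON_VERSION` and uses it in the glob `tensorflow_io-*-cp${CPYTHON_VERSION}-*.whl` to pick the matching wheel. The sketch below imitates that matching in plain shell, without invoking Python. `cpython_tag` and `wheel_matches` are hypothetical helpers, and the two wheel file names are made-up examples, not artifacts taken from this PR.

```shell
set -e

# Build the tag the script derives from sys.version_info: major followed by minor.
# (cpython_tag is a hypothetical helper.)
cpython_tag() {
  printf '%s%s' "$1" "$2"
}

# Succeed when a wheel file name fits tensorflow_io-*-cp<tag>-*.whl.
# (wheel_matches is a hypothetical helper.)
wheel_matches() {
  case "$1" in
    tensorflow_io-*-cp"$2"-*.whl) return 0 ;;
    *) return 1 ;;
  esac
}

tag=$(cpython_tag 3 8)
# Example file names only; real wheel names depend on the build.
for whl in \
  tensorflow_io-0.17.0-cp38-cp38-manylinux2010_x86_64.whl \
  tensorflow_io-0.17.0-cp39-cp39-manylinux2010_x86_64.whl; do
  if wheel_matches "$whl" "$tag"; then
    echo "install $whl"
  else
    echo "skip $whl"
  fi
done
```

Because the tag is a plain concatenation, Python 3.9 yields `39`, which is what lets the same script serve the Python 3.9 builds added elsewhere in this commit range.
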