PR: Add libpng and libjpeg-turbo requirement into conda recipe #2301

andfoy · 2020-06-08T22:31:48Z

fmassa · 2020-06-09T13:46:21Z

Could you try rebasing your changes on top of #1881 and #1909 to test how the changes perform when the dependencies on libpng and libjpeg-turbo are enabled?

setup.py

fmassa

Thanks a lot, it's great that all tests pass!

I have a few questions / comments.
In particular, I'm thinking if using the raw libjpeg API would enable us to fix some of the incompatibilities that arises from the other libraries -- IIRC, libjpeg-turbo had the same ABI as libjpeg, so one could switch to use it at runtime.

Also, could you add a note somewhere (maybe in the README?) explaining what are the steps that the user should do to get the image extensions compiled depending on their system?

fmassa · 2020-06-24T14:09:00Z

packaging/torchvision/meta.yaml

+    # Pillow introduces unwanted conflicts with libjpeg-turbo, as it depends on jpeg
+    # The fix depends on https://github.com/conda-forge/conda-forge.github.io/issues/673


Could you explain when / how these conflicts materialize?
Could that lead to segfaults when the user imports torchvision and PIL?

The conflict here is that we're substituting the libjpeg library, which is used primarily by Pillow, however this may not affect us as the ABI between jpegturbo and libjpeg is the same AFAIK.

Most Linux distributions distinguish between libturbojpeg (the library itself) and libjpeg-turbo (the libjpeg version using turbojpeg), thus, they do not have this problem, as we link against libturbojpeg, rather than against libjpeg-turbo.

In conda-forge, we have this conflict because the recipe for libjpeg-turbo produces both the turbo flavored libjpeg and libturbojpeg. I spoke to @isuruf, one of the maintainers in conda-forge, and he told me that they are working towards a solution in conda-forge/conda-forge.github.io#673, but right now there are no alternatives to this conflict.

I guess we are not having any problem with this conflicting installation, as the tests are passing. However, we should encourage users to install PyTorch and torchvision on a separate environment, so we prevent other errors that could be caused as part of this conflict and we're not aware of

Hum, this makes me a bit worried, because AFAIK we don't have tests running on OSX, only Linux and Windows (we do compile on OSX though).
So if there is a problem happening in OSX we wouldn't be able to see it in CI.

Do you think that, if we were to use libjpeg API, everything would be safer?

Aren't they running on the binary_macos_conda_*_pyxx CircleCI pipelines?

Oh ok, yeah, we were not running tests for OSX with wheels, only conda.

But still, do you think if we were to use libjpeg API it would make things safer?

I think right now we don't have much trouble on our setup, as we are linking against libturbojpeg and not libjpeg directly. If we were compilling against libjpeg, then we would be in trouble. Right now the only conflict that we have is the one on conda-forge.

A provisional solution would be compilling libturbojpeg (Without libjpeg) ourselves and publish it into the conda pytorch channel until conda-forge/conda-forge.github.io#673 is fixed. What do you think about this?

packaging/torchvision/meta.yaml

fmassa · 2020-06-24T14:12:15Z

packaging/torchvision/conda_build_config.yaml

@@ -1,3 +1,6 @@
+channel_sources:
+  - defaults,conda-forge


is it now safe to use conda-forge as well? At some point we had issues with it.

It seems only libturbojpeg is being pulled from there, so all other packages are being pulled from defaults or the main PyTorch channel

.travis.yml

fmassa

Thanks a lot @andfoy !

peterjc123 · 2020-06-30T18:15:11Z

Caused failure on master:

_________________________ ImageTester.test_decode_jpeg _________________________
 
RuntimeError: No such operator image::decode_jpeg
 

 
During handling of the above exception, another exception occurred:
 

 
self = <test_image.ImageTester testMethod=test_decode_jpeg>
 

 
    def test_decode_jpeg(self):
 
        for img_path in get_images(IMAGE_ROOT, "jpg"):
 
            img_pil = torch.from_numpy(np.array(Image.open(img_path)))
 
            size = os.path.getsize(img_path)
 
            img_ljpeg = decode_jpeg(torch.from_file(img_path, dtype=torch.uint8, size=size))
 
    
 
            norm = img_ljpeg.shape[0] * img_ljpeg.shape[1] * img_ljpeg.shape[2] * 255
 
            err = torch.abs(img_ljpeg.flatten().float() - img_pil.flatten().float()).sum().float() / (norm)
 
    
 
            self.assertLessEqual(err, 1e-2)
 
    
 
        with self.assertRaisesRegex(ValueError, "Expected a non empty 1-dimensional tensor."):
 
            decode_jpeg(torch.empty((100, 1), dtype=torch.uint8))
 
    
 
        with self.assertRaisesRegex(ValueError, "Expected a torch.uint8 tensor."):
 
            decode_jpeg(torch.empty((100, ), dtype=torch.float16))
 
    
 
        with self.assertRaisesRegex(RuntimeError, "Error while reading jpeg headers"):
 
>           decode_jpeg(torch.empty((100), dtype=torch.uint8))
 
E           AssertionError: "Error while reading jpeg headers" does not match "No such operator image::decode_jpeg"

ezyang · 2020-06-30T19:28:56Z

This PR has broken the doc push job on PyTorch main repo:


Jun 30 18:53:59                  from /var/lib/jenkins/workspace/vision/torchvision/csrc/cpu/image/readpng_cpu.cpp:12:
Jun 30 18:53:59 /usr/include/pngconf.h:383:12: error: '__pngconf' does not name a type
Jun 30 18:53:59             __pngconf.h__ in libpng already includes setjmp.h;
Jun 30 18:53:59             ^
Jun 30 18:53:59 /usr/include/pngconf.h:384:12: error: '__dont__' does not name a type
Jun 30 18:53:59             __dont__ include it again.;
Jun 30 18:53:59             ^
Jun 30 18:53:59 /var/lib/jenkins/workspace/vision/torchvision/csrc/cpu/image/readpng_cpu.cpp: In function 'at::Tensor decodePNG(const at::Tensor&)':
Jun 30 18:53:59 /var/lib/jenkins/workspace/vision/torchvision/csrc/cpu/image/readpng_cpu.cpp:35:5: error: 'png_const_bytep' does not name a type
Jun 30 18:53:59      png_const_bytep ptr;
Jun 30 18:53:59      ^
Jun 30 18:53:59 /var/lib/jenkins/workspace/vision/torchvision/csrc/cpu/image/readpng_cpu.cpp:37:10: error: 'struct decodePNG(const at::Tensor&)::Reader' has no member named 'ptr'
Jun 30 18:53:59    reader.ptr = png_const_bytep(datap) + 8;
Jun 30 18:53:59           ^
Jun 30 18:53:59 /var/lib/jenkins/workspace/vision/torchvision/csrc/cpu/image/readpng_cpu.cpp:37:37: error: 'png_const_bytep' was not declared in this scope
Jun 30 18:53:59    reader.ptr = png_const_bytep(datap) + 8;
Jun 30 18:53:59                                      ^
Jun 30 18:53:59 /var/lib/jenkins/workspace/vision/torchvision/csrc/cpu/image/readpng_cpu.cpp: In lambda function:
Jun 30 18:53:59 /var/lib/jenkins/workspace/vision/torchvision/csrc/cpu/image/readpng_cpu.cpp:42:27: error: 'struct decodePNG(const at::Tensor&)::Reader' has no member named 'ptr'
Jun 30 18:53:59          std::copy(reader->ptr, reader->ptr + bytes, output);
Jun 30 18:53:59                            ^
Jun 30 18:53:59 /var/lib/jenkins/workspace/vision/torchvision/csrc/cpu/image/readpng_cpu.cpp:42:40: error: 'struct decodePNG(const at::Tensor&)::Reader' has no member named 'ptr'
Jun 30 18:53:59          std::copy(reader->ptr, reader->ptr + bytes, output);
Jun 30 18:53:59                                         ^
Jun 30 18:53:59 /var/lib/jenkins/workspace/vision/torchvision/csrc/cpu/image/readpng_cpu.cpp:43:17: error: 'struct decodePNG(const at::Tensor&)::Reader' has no member named 'ptr'
Jun 30 18:53:59          reader->ptr += bytes;
Jun 30 18:53:59                  ^
Jun 30 18:53:59 error: command 'gcc' failed with exit status 1

I'm going to hotfix it by pinning doc push to an older version of PyTorch, but this will need to get fixed eventually.

fmassa · 2020-06-30T21:38:57Z

@ezyang can you point to the location of the location of where the doc push job is defined? This issue arises because the image might have an older version of libpng. We should guard against this case (I thought we did, but looks like we missed something)

ezyang · 2020-06-30T21:40:36Z

pytorch/pytorch@9ac0feb

fmassa · 2020-06-30T21:42:25Z

@andfoy the errors pointed out by @peterjc123 are due to the latest CI changes that were merged just before yours, in #2328

#2301)" This reverts commit 766721b.

andfoy · 2020-06-30T23:01:28Z

@fmassa @ezyang , I'll revert this PR, rebase again and open it again with the doc fixes as well

#2301)" (#2375) This reverts commit 766721b.

andfoy · 2020-07-01T16:54:50Z

This PR has broken the doc push job on PyTorch main repo:

@ezyang, is it possible to get access to the full log of that build?

fmassa · 2020-07-01T17:06:40Z

@andfoy here it is https://app.circleci.com/pipelines/github/pytorch/pytorch/186575/workflows/8ccd8fe9-4d3b-4b67-a06a-a1eeb12c18f7/jobs/6064458/steps

…ch#2301) * Add libpng requirement into conda recipe * Try to install libjpeg-turbo * Add PNG reading capabilities * Remove newline * Add image extension to compilation instructions * Include png functions as part of the main library * Update CMakeLists * Detect if building on conda-build * Debug * More debug messages * Print globbed libreries * Print globbed libreries * Point to correct PNG path * Remove libJPEG preventively * Debug extension loading * Link libpng explicitly * Link with PNG * Add PNG reading capabilities * Add libpng requirement into conda recipe * Try to install libjpeg-turbo * Remove newline * Add image extension to compilation instructions * Include png functions as part of the main library * Update CMakeLists * Detect if building on conda-build * Debug * More debug messages * Print globbed libreries * Print globbed libreries * Point to correct PNG path * Remove libJPEG preventively * Debug extension loading * Link libpng explicitly * Link with PNG * Install libpng on conda-based wheel distributions * Add -y flag * Add -y flag to yum * Locate LibPNG on windows conda * Remove empty else * Copy libpng16.so * Copy dylib on Mac * Improve check on Windows * Try to install ninja using conda on windows * Use libpng on Windows * Package lib on windows wheel * Point library to the correct place * Include binaries as part of wheel * Copy libpng.so on linux * Look for png.h on Windows when using conda-build * Do not skip png tests on Mac/Win * Restore libjpeg-turbo * Install jpeg-turbo on wheel distributions * Install libjpeg-turbo from conda-forge on wheel distributions * Do not pull av on conda-build * Add pillow disclaimer * Vendors libjpeg-turbo 2.0.4 * Merge JPEG work * Remove submodules * Regenerate circle config * Fix style issues * Fix C++ style issues * More style corrections * Add JPEG-turbo to linking libraries * More style corrections * More style corrections * More style corrections * Install libjpeg-turbo-devel * Install libturbo-jpeg on typing pipeline * Update Circle template * Windows and Unix turbojpeg have the same linking name * Install turbojpeg-devel instead of libjpeg-turbo * Copy TurboJPEG binaries to wheel * Move test image * Move back test image * Update JPEG test path * Remove dot from extension * Move image functions to extension * Use stdout arg in subprocess * Disable image extension if libpng or turbojpeg are not found * Append libpng stdout * Prevent list appending on lists * Minor path correction * Minor error correction * Add linking flags * Style issues correction * Address minor review corrections * Refactor library search * Restore access index * Fix JPEG tests * Update libpng version in Travis * Add -y flag * Remove dot * Update libpng using apt * Check libpng version * Change libturbojpeg binary * Update import * Change call * Restore av in conda recipe * Minor error correction * Remove unused comment in travis.yml * Update README * Fix missing links * Remove fixes for 16.04 Co-authored-by: Ryad ZENINE <[email protected]>

pytorch#2301)" (pytorch#2375) This reverts commit 766721b.

Add libpng requirement into conda recipe

2b78943

andfoy changed the title ~~PR: Add libpng requirement into conda recipe~~ PR: Add libpng and libjpeg-turbo requirement into conda recipe Jun 8, 2020

Try to install libjpeg-turbo

23255aa

r-zenine and others added 4 commits June 11, 2020 11:46

Add PNG reading capabilities

006ab0c

Remove newline

7b9ec24

Add image extension to compilation instructions

f97a9f0

Include png functions as part of the main library

ac6d26e

andfoy force-pushed the add_libpng branch from 10559e8 to ac6d26e Compare June 11, 2020 19:52

andfoy and others added 21 commits June 11, 2020 17:28

Update CMakeLists

b14912e

Detect if building on conda-build

770cea5

Debug

0861b80

More debug messages

a42a029

Print globbed libreries

b7a19ea

Print globbed libreries

1afde4d

Point to correct PNG path

386fd5b

Remove libJPEG preventively

2b5c469

Debug extension loading

0341aa5

Link libpng explicitly

721e5e3

Link with PNG

2186d68

Add libpng requirement into conda recipe

b80fb08

Try to install libjpeg-turbo

3d153f0

Add PNG reading capabilities

36b0a8f

Remove newline

9d14d9e

Add image extension to compilation instructions

852a289

Include png functions as part of the main library

3e86f49

Update CMakeLists

021e767

Detect if building on conda-build

e734175

Debug

58c6524

More debug messages

b9295c1

andfoy added 2 commits June 23, 2020 12:46

Update import

051425b

Change call

3bc7323

fmassa reviewed Jun 24, 2020

View reviewed changes

setup.py Show resolved Hide resolved

fmassa reviewed Jun 24, 2020

View reviewed changes

Restore av in conda recipe

6b87895

seemethere approved these changes Jun 29, 2020

View reviewed changes

andfoy added 6 commits June 29, 2020 15:52

Merge with master

af61f94

Minor error correction

123fd3f

Remove unused comment in travis.yml

831749c

Update README

558b0cb

Fix missing links

8b2f507

Remove fixes for 16.04

b560227

fmassa approved these changes Jun 30, 2020

View reviewed changes

fmassa merged commit 766721b into pytorch:master Jun 30, 2020

andfoy deleted the add_libpng branch June 30, 2020 17:23

andfoy added a commit that referenced this pull request Jun 30, 2020

Revert "PR: Add libpng and libjpeg-turbo requirement into conda recipe (

139cb16

#2301)" This reverts commit 766721b.

andfoy mentioned this pull request Jun 30, 2020

Revert "PR: Add libpng and libjpeg-turbo requirement into conda recipe" #2375

Merged

fmassa pushed a commit that referenced this pull request Jul 1, 2020

Revert "PR: Add libpng and libjpeg-turbo requirement into conda recipe (

fa6af6d

#2301)" (#2375) This reverts commit 766721b.

andfoy mentioned this pull request Jul 1, 2020

PR: Enable libPNG support #2379

Merged

fmassa mentioned this pull request Jul 10, 2020

PR: Improve calls to libpng-config on Ubuntu/Debian #2398

Merged

de-vri-es pushed a commit to fizyr-forks/torchvision that referenced this pull request Aug 4, 2020

Revert "PR: Add libpng and libjpeg-turbo requirement into conda recipe (

d1fb19b

pytorch#2301)" (pytorch#2375) This reverts commit 766721b.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PR: Add libpng and libjpeg-turbo requirement into conda recipe #2301

PR: Add libpng and libjpeg-turbo requirement into conda recipe #2301

andfoy commented Jun 8, 2020

fmassa commented Jun 9, 2020

fmassa left a comment

fmassa Jun 24, 2020

andfoy Jun 24, 2020 •

edited

Loading

fmassa Jun 24, 2020

andfoy Jun 24, 2020

fmassa Jun 24, 2020

andfoy Jun 24, 2020

fmassa Jun 24, 2020

andfoy Jun 24, 2020

fmassa left a comment

peterjc123 commented Jun 30, 2020 •

edited

Loading

ezyang commented Jun 30, 2020

fmassa commented Jun 30, 2020

ezyang commented Jun 30, 2020

fmassa commented Jun 30, 2020

andfoy commented Jun 30, 2020

andfoy commented Jul 1, 2020

fmassa commented Jul 1, 2020

		# Pillow introduces unwanted conflicts with libjpeg-turbo, as it depends on jpeg
		# The fix depends on https://github.com/conda-forge/conda-forge.github.io/issues/673

PR: Add libpng and libjpeg-turbo requirement into conda recipe #2301

PR: Add libpng and libjpeg-turbo requirement into conda recipe #2301

Conversation

andfoy commented Jun 8, 2020

fmassa commented Jun 9, 2020

fmassa left a comment

Choose a reason for hiding this comment

fmassa Jun 24, 2020

Choose a reason for hiding this comment

andfoy Jun 24, 2020 • edited Loading

Choose a reason for hiding this comment

fmassa Jun 24, 2020

Choose a reason for hiding this comment

andfoy Jun 24, 2020

Choose a reason for hiding this comment

fmassa Jun 24, 2020

Choose a reason for hiding this comment

andfoy Jun 24, 2020

Choose a reason for hiding this comment

fmassa Jun 24, 2020

Choose a reason for hiding this comment

andfoy Jun 24, 2020

Choose a reason for hiding this comment

fmassa left a comment

Choose a reason for hiding this comment

peterjc123 commented Jun 30, 2020 • edited Loading

ezyang commented Jun 30, 2020

fmassa commented Jun 30, 2020

ezyang commented Jun 30, 2020

fmassa commented Jun 30, 2020

andfoy commented Jun 30, 2020

andfoy commented Jul 1, 2020

fmassa commented Jul 1, 2020

andfoy Jun 24, 2020 •

edited

Loading

peterjc123 commented Jun 30, 2020 •

edited

Loading