You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There are 5 cases in which the gold patch fails on Verified:
astropy__astropy-7606
astropy__astropy-8707
astropy__astropy-8872
matplotlib__matplotlib-20488
django__django-10097
The reason for why astropy__astropy-7606 fails seems to be that Verified was not update (see #223). It works when using princeton-nlp/SWE-bench instead.
I was looking into it. The matplotlib problem could be something on my end:
Test failing: lib/matplotlib/tests/test_image.py::test_https_imread_smoketest
Reason: urllib.error.HTTPError: HTTP Error 403: Forbidden
This is a simple test trying to read https://matplotlib.org/1.5.0/_static/logo2.png Things work fine on my machine when trying to set up the test manually. Could be that the urllib request gets blocked. This has happened previously already with django (this was fixable though since it was part of swebench code, not the benchmark itself)
Describe the bug
There are 5 cases in which the gold patch fails on Verified:
The reason for why
astropy__astropy-7606
fails seems to be that Verified was not update (see #223). It works when usingprinceton-nlp/SWE-bench
instead.Steps/Code to Reproduce
Expected Results
All 5 problems are resolved
Actual Results
None of them is resolved
System Information
latest version on main
The text was updated successfully, but these errors were encountered: