Errors in swebench.harness.run_evaluation in Windows #260

3N4N · 2024-11-26T16:20:53Z

Describe the bug

Running the evaluation script swebench.harness.run_evaluation on Windows, against predictions already generated by a model, fails with several errors. This is because SWE-bench is not completely OS-independent.

Steps/Code to Reproduce

 python -m swebench.harness.run_evaluation --predictions_path output/predictions-from-model.jsonl --max_workers 1 --run_id validate-predictions

Expected Results

The evaluation runs without error.

Actual Results

Several errors occur.

Traceback (most recent call last):
  File "<frozen runpy>", line 189, in _run_module_as_main
  File "<frozen runpy>", line 112, in _get_module_details
  File "C:\Users\enan\projects\SWE-bench\swebench\__init__.py", line 46, in <module>
    from swebench.harness.run_evaluation import (
  File "C:\Users\enan\projects\SWE-bench\swebench\harness\run_evaluation.py", line 5, in <module>
    import resource
ModuleNotFoundError: No module named 'resource'

This import error can be fixed easily by commenting out the import, but other OS-dependent issues then pop up.
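As an alternative to commenting the line out, the import could be guarded so the module still loads on Windows. The following is only a minimal sketch, not the change made in the harness: `get_open_file_limit` is a hypothetical helper, and the assumption that `resource` is used for the open-file limit is mine, not something stated in this report.

```python
# The `resource` module is part of the standard library only on Unix-like
# systems, so the import is wrapped instead of being executed unconditionally.
try:
    import resource
except ImportError:  # ModuleNotFoundError is a subclass of ImportError
    resource = None


def get_open_file_limit():
    """Hypothetical helper: return the soft open-file limit, or None on
    platforms (such as Windows) where `resource` is unavailable."""
    if resource is None:
        return None
    soft, _hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    return soft


if __name__ == "__main__":
    limit = get_open_file_limit()
    if limit is None:
        print("resource module unavailable; skipping limit handling")
    else:
        print(f"soft open-file limit: {limit}")
```

Any code that later touches `resource` would need the same `resource is None` check, which is why the fallback is kept in one helper here.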

System Information

Windows 11, Python 3.12, swebench git+7501f0993193

3N4N added the bug (Something isn't working) label Nov 26, 2024

3N4N mentioned this issue Nov 26, 2024

Fix: issues of 'harness' in Windows #261

Merged

john-b-yang closed this as completed in #261 Dec 10, 2024