Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to download wiki. Exiting with code [2] #2112

Open
Ay1tsMe opened this issue Dec 8, 2024 · 5 comments
Open

Unable to download wiki. Exiting with code [2] #2112

Ay1tsMe opened this issue Dec 8, 2024 · 5 comments
Assignees

Comments

@Ay1tsMe
Copy link

Ay1tsMe commented Dec 8, 2024

I'm trying to scrape this wiki but it keeps failing with exit code 2. It doesnt give much in the logs for me to understand the problem. I've used a similar command to scrape other wikis in the past and I have had success but this one just fails. Does this mean that the wiki isn't configured correctly and cant be converted to a zim.

Logs:

❯ sudo docker run -it --rm -v ~/Documents/zims:/output ghcr.io/openzim/mwoffliner:dev mwoffliner --mwUrl=https://lit.trainroll.xyz [email protected] --outputDirectory=/output --verbose=log --mwActionApiPath=/api.php --mwRestApiPath=/rest.php --mwModulePath=/load.php
starting redis-server in the background…
[log] [2024-12-08T03:42:55.695Z] Starting mwoffliner v1.14.0-dev...
[log] [2024-12-08T03:42:55.699Z] Getting text direction...
[log] [2024-12-08T03:42:55.699Z] Getting site info...
[log] [2024-12-08T03:42:55.699Z] Getting sub-title...
[log] [2024-12-08T03:42:55.704Z] closing sanitize redis DB
[log] [2024-12-08T03:42:57.632Z] Text direction is [ltr]
[log] [2024-12-08T03:43:02.108Z] Using output directory /output
[log] [2024-12-08T03:43:02.109Z] Using temporary directory /tmp/mwoffliner-1733629382108
[log] [2024-12-08T03:43:02.410Z] Worker [0] getting article range [0-1] of [1] [100%]
[warn] [2024-12-08T03:43:02.718Z] Got warning from MW Query {
	"main": {
		"warnings": "Unrecognized parameter: colimit."
	},
	"query": {
		"warnings": "Unrecognized values for parameter \"prop\": pageimages, coordinates"
	}
}
[warn] [2024-12-08T03:43:03.378Z] Got warning from MW Query {
	"main": {
		"warnings": "Unrecognized parameter: colimit."
	},
	"query": {
		"warnings": "Unrecognized value for parameter \"prop\": coordinates"
	}
}
[log] [2024-12-08T03:43:03.380Z] A total of [81] articles has been found in namespace [0]
[log] [2024-12-08T03:43:03.380Z] Got ArticleIDs in 0.971 seconds
[log] [2024-12-08T03:43:03.380Z] Total articles found in Redis: 82
[log] [2024-12-08T03:43:03.380Z] Doing dump
[log] [2024-12-08T03:43:03.381Z] Writing zim to [/output/trainroll_en_all_2024-12.zim]
[log] [2024-12-08T03:43:05.039Z] Found [2] stylesheets to download
[log] [2024-12-08T03:43:05.039Z] Downloading stylesheets and populating media queue
[warn] [2024-12-08T03:43:05.922Z] Failed to get CSS from [https://lit.trainroll.xyz/wiki/?title=Mediawiki%253Aoffline.css&action=raw]
[warn] [2024-12-08T03:43:05.945Z] Failed to get CSS from [https://lit.trainroll.xyz/load.php?lang=en&modules=ext.visualEditor.desktopArticleTarget.noscript%7Cskins.monobook.styles&only=styles&skin=monobook]
[log] [2024-12-08T03:43:05.945Z] Downloaded stylesheets
[log] [2024-12-08T03:43:05.945Z] Saving favicon.png...
[log] [2024-12-08T03:43:05.945Z] Getting Main Page
[log] [2024-12-08T03:43:05.945Z] Create main page redirection from [index] to [A/Main_Page]
[log] [2024-12-08T03:43:05.952Z] Getting articles
[log] [2024-12-08T03:43:06.656Z] Progress downloading articles [0/82] [0.0%]
[error] [2024-12-08T03:43:07.507Z] Error downloading article /lit/_guide_to_learning_ancient_Greek
[error] [2024-12-08T03:43:07.507Z] Failed to run mwoffliner after [14s]: {}
[log] [2024-12-08T03:43:07.507Z] Exiting with code [2]
[log] [2024-12-08T03:43:07.507Z] Deleting temporary directory [/tmp/mwoffliner-1733629382108]
@Ay1tsMe
Copy link
Author

Ay1tsMe commented Dec 8, 2024

I'm running version 1.14.0 in a docker container

❯ sudo docker run -it --rm -v ~/Documents/zims:/output ghcr.io/openzim/mwoffliner:dev mwoffliner --version     
[sudo] password for adamw: 
starting redis-server in the background…
1.14.0-dev

@kelson42
Copy link
Collaborator

kelson42 commented Dec 10, 2024

@Ay1tsMe Thank you for your bug report. I should try to reproduce the issue ourself. Pretty sure there is sone kind of problem around the / character in the title. Will try do to so this week end.

@kelson42
Copy link
Collaborator

kelson42 commented Dec 12, 2024

Might be the same as #2105

@audiodude
Copy link
Member

@kelson42 I don't think that's the issue you wanted to link?

@kelson42
Copy link
Collaborator

Indeed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants