You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm trying to scrape this wiki but it keeps failing with exit code 2. It doesnt give much in the logs for me to understand the problem. I've used a similar command to scrape other wikis in the past and I have had success but this one just fails. Does this mean that the wiki isn't configured correctly and cant be converted to a zim.
Logs:
❯ sudo docker run -it --rm -v ~/Documents/zims:/output ghcr.io/openzim/mwoffliner:dev mwoffliner --mwUrl=https://lit.trainroll.xyz [email protected] --outputDirectory=/output --verbose=log --mwActionApiPath=/api.php --mwRestApiPath=/rest.php --mwModulePath=/load.php
starting redis-server in the background…
[log] [2024-12-08T03:42:55.695Z] Starting mwoffliner v1.14.0-dev...
[log] [2024-12-08T03:42:55.699Z] Getting text direction...
[log] [2024-12-08T03:42:55.699Z] Getting site info...
[log] [2024-12-08T03:42:55.699Z] Getting sub-title...
[log] [2024-12-08T03:42:55.704Z] closing sanitize redis DB
[log] [2024-12-08T03:42:57.632Z] Text direction is [ltr]
[log] [2024-12-08T03:43:02.108Z] Using output directory /output
[log] [2024-12-08T03:43:02.109Z] Using temporary directory /tmp/mwoffliner-1733629382108
[log] [2024-12-08T03:43:02.410Z] Worker [0] getting article range [0-1] of [1] [100%]
[warn] [2024-12-08T03:43:02.718Z] Got warning from MW Query {
"main": {
"warnings": "Unrecognized parameter: colimit."
},
"query": {
"warnings": "Unrecognized values for parameter \"prop\": pageimages, coordinates"
}
}
[warn] [2024-12-08T03:43:03.378Z] Got warning from MW Query {
"main": {
"warnings": "Unrecognized parameter: colimit."
},
"query": {
"warnings": "Unrecognized value for parameter \"prop\": coordinates"
}
}
[log] [2024-12-08T03:43:03.380Z] A total of [81] articles has been found in namespace [0]
[log] [2024-12-08T03:43:03.380Z] Got ArticleIDs in 0.971 seconds
[log] [2024-12-08T03:43:03.380Z] Total articles found in Redis: 82
[log] [2024-12-08T03:43:03.380Z] Doing dump
[log] [2024-12-08T03:43:03.381Z] Writing zim to [/output/trainroll_en_all_2024-12.zim]
[log] [2024-12-08T03:43:05.039Z] Found [2] stylesheets to download
[log] [2024-12-08T03:43:05.039Z] Downloading stylesheets and populating media queue
[warn] [2024-12-08T03:43:05.922Z] Failed to get CSS from [https://lit.trainroll.xyz/wiki/?title=Mediawiki%253Aoffline.css&action=raw]
[warn] [2024-12-08T03:43:05.945Z] Failed to get CSS from [https://lit.trainroll.xyz/load.php?lang=en&modules=ext.visualEditor.desktopArticleTarget.noscript%7Cskins.monobook.styles&only=styles&skin=monobook]
[log] [2024-12-08T03:43:05.945Z] Downloaded stylesheets
[log] [2024-12-08T03:43:05.945Z] Saving favicon.png...
[log] [2024-12-08T03:43:05.945Z] Getting Main Page
[log] [2024-12-08T03:43:05.945Z] Create main page redirection from [index] to [A/Main_Page]
[log] [2024-12-08T03:43:05.952Z] Getting articles
[log] [2024-12-08T03:43:06.656Z] Progress downloading articles [0/82] [0.0%]
[error] [2024-12-08T03:43:07.507Z] Error downloading article /lit/_guide_to_learning_ancient_Greek
[error] [2024-12-08T03:43:07.507Z] Failed to run mwoffliner after [14s]: {}
[log] [2024-12-08T03:43:07.507Z] Exiting with code [2]
[log] [2024-12-08T03:43:07.507Z] Deleting temporary directory [/tmp/mwoffliner-1733629382108]
The text was updated successfully, but these errors were encountered:
@Ay1tsMe Thank you for your bug report. I should try to reproduce the issue ourself. Pretty sure there is sone kind of problem around the / character in the title. Will try do to so this week end.
I'm trying to scrape this wiki but it keeps failing with exit code 2. It doesnt give much in the logs for me to understand the problem. I've used a similar command to scrape other wikis in the past and I have had success but this one just fails. Does this mean that the wiki isn't configured correctly and cant be converted to a zim.
Logs:
The text was updated successfully, but these errors were encountered: