Skip to content

An extensible crawler for downloading Android applications in third-party markets.

Notifications You must be signed in to change notification settings

pbsf/android-apps-crawler

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

66 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Android Apps Crawler

Overview

Android Apps Crawler is an extensible crawler for downloading Android applications in the third-party markets. It can crawl the download url addresses of applications and automatically download applications into repository.

Requirements

  • Python 2.6 or up
  • Scrapy 0.22 or up: http://scrapy.org (didn't have a full test in lower version.)
  • Works on Linux, Windows, Mac OSX, BSD
  • Currently, downloader cannot work on Windows.
  • For Ubuntu users, "Don't use the python-scrapy package provided by Ubuntu, they are typically too old and slow to catch up with latest Scrapy. Instead, use the official Ubuntu Packages."

Usage

  • Set the third-party markets you want to crawl in settings.py
  • Set the proxy if you have
  • Start crawler:
./crawl.sh <market name>
  • Start downloader:
./downloader.py <database file path> <output directory>

Settings

You can set proxy, user-agen, database name, etc in crawler/android_apps_crawler/settings.py file.

Supported Third-party Markets (market names used in crawl.sh)

More Android Markets

See: https://github.com/mssun/android-markets-list

TODO

  • Windows support for downloader.
  • Crawl apps from shared cloud storage link (e.g, pan.baidu.com, dbank.com).

About

An extensible crawler for downloading Android applications in third-party markets.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 96.1%
  • Shell 3.9%