This crate contains queries gathering contents of Python 2 files. It finds commits older than December 2008 (date of introduction of Python 3) and retrieves the contents of changed Python files (identified by extension). There are also auxiliary queries gathering general information about Python projects in the queried dataset.
Djanco is a query system for querying GitHub datasets downloaded by as part of the CodeDJ project.
To generate a harness for executing the queries in this crate install :
cargo install --git https://github.com/PRL-PRG/cargo-djanco
Then, generate the harness:
cargo djanco
This generates s source file src/bin/djanco.rs
, which you can run to execute all your queries:
cargo run --bin djanco --release -- --dataset-path DATASET_LIVES_HERE --output-path WRITE_RESULTS_HERE
The template file for the python-2 crate comes from .
To create a new crate from the template, first install :
cargo install cargo-generate
Then, use the generate
command in cargo to create a new crate from the
template:
cargo generate --git https://github.com/PRL-PRG/djanco-query-template --name my-query-crate
Then you can add your query functions. There's an example function in
src/lib.rs
to get you started.