- Go to Datasets
- For real datasets, run:

      bash get_data.sh

- For synthetic datasets, run:

      # Generate the XOR-10 dataset
      python generate_data.py --data_type 0entropy --markovity 10 --file_name files_to_be_compressed/xor10.txt
      # Generate the HMM-10 dataset
      python generate_data.py --data_type HMM --markovity 10 --file_name files_to_be_compressed/hmm10.txt

- This will generate a folder named files_to_be_compressed. This folder contains the parsed files, which can be used to recreate the results in our paper.
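
Once the steps above have finished, a quick sanity check can confirm that the expected files are in place. The following is only a minimal sketch and not part of the repository: the helper name `summarize_files` is hypothetical, and it assumes the generated files sit directly inside files_to_be_compressed. For each file it reports the size in bytes and the number of distinct byte values, which gives a rough idea of the alphabet size.

```python
from pathlib import Path


def summarize_files(folder="files_to_be_compressed"):
    """Return {file name: (size in bytes, number of distinct byte values)}.

    Hypothetical helper for illustration; the folder name is taken from
    the steps above, everything else is an assumption.
    """
    root = Path(folder)
    if not root.is_dir():
        # Nothing has been generated yet (or we are in the wrong directory).
        return {}
    summary = {}
    for path in sorted(root.iterdir()):
        if path.is_file():
            data = path.read_bytes()
            summary[path.name] = (len(data), len(set(data)))
    return summary


if __name__ == "__main__":
    for name, (size, alphabet) in summarize_files().items():
        print(f"{name}: {size} bytes, {alphabet} distinct byte values")
```

Run it from the same directory in which the commands above were executed. If nothing is printed, the folder is missing or empty.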