Collatz Conjecture vs. Compression

Introduction

Based on the provided tests, a Collatz Conjecture sequence can be just as challenging to compress as random data.

This makes Collatz sequences a candidate for "worst-case" compression benchmarks that cost significantly less computing power and time to generate.

Analysis

Many times, I’ve found myself needing to measure the effectiveness of a compression algorithm, especially in database design.

Random values usually provide the best "worst-case" scenario and a good estimate of how much space compression will save.

However, random values are slow to generate: producing billions of them takes a long time when you need test data.

This is where this repository comes in. It aims to demonstrate that pseudo-random values taken from a Collatz Conjecture [1] sequence require significantly less computational power and time to produce, because they are easier to calculate, while the result is just as challenging for compression algorithms.
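The speed claim can be checked on your own machine with a small timing harness. The following is only a sketch: the helper names (collatzValues, randomValues, timeMs) and the scheme of chaining sequences from successive seeds are assumptions made for illustration, not code from this repository, and the measured times will vary by runtime and hardware.

```typescript
// Fills an array with `count` values by chaining Collatz sequences from
// successive seeds. The seeding scheme is an assumption for illustration;
// the terminal 1 of each sequence is skipped.
function collatzValues(count: number, firstSeed = 27): number[] {
    const out: number[] = [];
    let seed = firstSeed;
    while (out.length < count) {
        let n = seed++;
        while (n !== 1 && out.length < count) {
            out.push(n);
            n = n % 2 === 0 ? n / 2 : 3 * n + 1;
        }
    }
    return out;
}

// Fills an array with `count` integers in [10, 99] from Math.random().
function randomValues(count: number): number[] {
    const out: number[] = [];
    for (let i = 0; i < count; i++) {
        out.push(10 + Math.floor(Math.random() * 90));
    }
    return out;
}

// Returns the wall-clock milliseconds a generator needs for `count` values.
function timeMs(generate: (count: number) => number[], count: number): number {
    const started = Date.now();
    generate(count);
    return Date.now() - started;
}

const sampleSize = 1000000;
console.log("random :", timeMs(randomValues, sampleSize), "ms");
console.log("collatz:", timeMs(collatzValues, sampleSize), "ms");
```

A single run is noisy, so repeating the measurement several times and comparing medians gives a fairer picture.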

To see how easy a Collatz sequence is to generate, consider its definition:

1. Start with any positive integer n.
2. If n is even, divide it by 2.
3. If n is odd, multiply it by 3 and add 1.
4. Repeat the process with the resulting number until it becomes 1.
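To get a feel for how irregular these sequences are, the sketch below applies the rules above and reports how many steps a starting value needs and how high it climbs on the way. The helper collatzStats is hypothetical and not part of this repository.

```typescript
// Hypothetical helper (not part of the repository): counts the steps a
// starting value needs to reach 1 and records the largest value seen.
function collatzStats(start: number): { steps: number; peak: number } {
    if (!Number.isSafeInteger(start) || start < 1) {
        throw new RangeError("start must be a positive safe integer");
    }
    let n = start;
    let steps = 0;
    let peak = start;
    while (n !== 1) {
        n = n % 2 === 0 ? n / 2 : 3 * n + 1;
        if (n > peak) peak = n;
        steps++;
    }
    return { steps, peak };
}

console.log(collatzStats(6));  // { steps: 8, peak: 16 }
console.log(collatzStats(27)); // { steps: 111, peak: 9232 }
```

Starting from 6 the sequence is 6, 3, 10, 5, 16, 8, 4, 2, 1, while the nearby value 27 wanders for 111 steps and climbs to 9232 before it falls back to 1.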

Every starting number tested so far eventually reaches 1. However, no one has yet proven that this holds for every positive integer.

Until this is proven, compression algorithms appear to struggle to find a better pattern than simply undoing the described process (/2 or *3+1). This can sometimes result in a compression rate that is even worse than with random numbers, which, in computing, are only pseudo-random.

In the res.json file, you can find an example of the output of this program, or you can run it yourself by executing npm i and node index.js.

The last time I ran it, a simple arithmetic sequence (n=99;n-1) achieved a compression effectiveness of 66% with Zstandard (zstd) compression. Random numbers achieved between 66% and 69% depending on the run, and the Collatz sequence achieved 69%.

Method	Data	Effectiveness	Stable across runs
Zstd	Arithmetic (n=99;n-1)	66%	True
Zstd	Random (10-99, JS Math.random())	66% - 69%	False
Zstd	Collatz	69%	True
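A comparison like the one in the table can be sketched with nothing but Node.js built-ins. Several things here are assumptions: DEFLATE from Node's zlib module stands in for Zstandard, the values are serialized as comma-separated text, the Collatz seed is 27, each data set holds 99 values, and compressedRatio (compressed size divided by raw size) is only one possible reading of the "Effectiveness" column. Expect the absolute numbers to differ from the table.

```typescript
import { deflateSync } from "node:zlib";

// Serializes numbers as comma-separated decimal text. The repository's real
// data format is not shown in this README, so this encoding is assumed.
function toBytes(values: number[]): Uint8Array {
    return new TextEncoder().encode(values.join(","));
}

// Compressed size divided by raw size; smaller means better compression.
// This is one possible definition, not necessarily the table's metric.
function compressedRatio(values: number[]): number {
    const raw = toBytes(values);
    if (raw.length === 0) {
        throw new RangeError("values must not be empty");
    }
    return deflateSync(raw).length / raw.length;
}

// Arithmetic data, assumed to run 99, 98, ..., 1.
const arithmetic = Array.from({ length: 99 }, (_, i) => 99 - i);

// Random integers in [10, 99], matching the range given in the table.
const random = Array.from({ length: 99 }, () => 10 + Math.floor(Math.random() * 90));

// Collatz sequence from a hypothetical seed of 27, cut off at 99 values.
const collatz: number[] = [];
for (let n = 27; collatz.length < 99; n = n % 2 === 0 ? n / 2 : 3 * n + 1) {
    collatz.push(n);
}

console.log("arithmetic:", compressedRatio(arithmetic).toFixed(2));
console.log("random    :", compressedRatio(random).toFixed(2));
console.log("collatz   :", compressedRatio(collatz).toFixed(2));
```

Because the random data changes on every run while the arithmetic and Collatz data do not, only the random ratio should fluctuate between runs.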

I would like to note that this idea came to mind only recently.

Whether this theory turns out to be mistaken or correct, please feel free to reach out. I’d love to discuss it.

Source of inspiration

The source of inspiration was Prof. Gregor Kiczales's "How to Code: Complex Data" course.

There, the Collatz Conjecture is referenced as a generative recursion that is expected to stop, although it has not yet been proven that it always will. Based on experience, we believe it stops, because no starting number has been found whose sequence fails to reach 1.

"How to Code: Complex Data" [2] is an amazing course. You can find it in the links or you can read the book "How to Design Programs (HtDP)" by Matthias Felleisen, Robert Bruce Findler, Matthew Flatt, and Shriram Krishnamurthi.

Collatz Conjecture in TS

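function generateCollatzSequence(n: number): number[] {
    // Reject 0, negatives, and non-integers, for which the loop below
    // may never terminate.
    if (!Number.isInteger(n) || n < 1) {
        throw new RangeError("n must be a positive integer");
    }
    const sequence: number[] = [];
    while (n !== 1) {
        sequence.push(n);
        if (n % 2 === 0) {
            n /= 2; // even: halve
        } else {
            n = 3 * n + 1; // odd: triple and add one
        }
    }
    sequence.push(1); // Add the final 1
    return sequence;
}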

Links

[1] "Wikipedia : Collatz Conjecture" https://en.wikipedia.org/wiki/Collatz_conjecture

[2] "EdX : How to Code : Complex Data" https://www.edx.org/learn/coding/university-of-british-columbia-how-to-code-complex-data?objectID=course-c2a2d51b-4841-46aa-b149-e7d1a64b6298
