Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Info: historical graph of the number of domains in this list #53

Open
StevenBlack opened this issue Jul 30, 2022 · 3 comments
Open

Info: historical graph of the number of domains in this list #53

StevenBlack opened this issue Jul 30, 2022 · 3 comments

Comments

@StevenBlack
Copy link

Andrey @ameshkov this issue is for information only. I think you'll find this interesting.

I'm developing a variety of helper tools, still to early for prime-time, but this is a preliminary finding from analyzing my hosts repo history.

The data for your list is here has the granularity of my commits, whose frequency has varied over the years.

This bash file is what I used on my repo to get data. This is easily adaptable for your use; feel free. I use Mathematica for the plot, but many other things can produce graphs like this from CSV data.

KADhosts

Issue 2014 in my repo (scroll-scroll-scroll) contains recent graphs of all the sources for my amalgamated lists...

@StevenBlack
Copy link
Author

Andrey @ameshkov small clarification: I paused refreshing data from AdguardTeam/cname-trackers a few days ago which is why this graph appears recently flat at 39,985 domains.

I paused refreshing AdguardTeam/cname-trackers because a recent list was over 60k domains, which raises my concern for MY overall list size. I'm sensitive to that, because I want my amalgamated lists to be usable on all devices, including tiny ones, and including Microsoft Windows, which starts choking long before *nix OSes do.

This graph shows that AdguardTeam/cname-trackers list size has started to vary significantly from release to release.

All that said, fuck these cname trackers, amirite? 🙄

@ameshkov
Copy link
Member

Hi Steven,

It seems that most of these come from a single source: https://github.com/AdguardTeam/cname-trackers/blob/master/trackers/ingenious-technologies.txt

And this source is basically a bunch of subdomains of zooplus.*.

Not sure what to do with this information though:) In theory you can amend your script and ignore zooplus.* domains.

@TPS
Copy link

TPS commented Aug 1, 2022

Whenever #51 is implemented, that might help.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants