-
Notifications
You must be signed in to change notification settings - Fork 0
/
data_viz_hidden_gems.html
100 lines (90 loc) · 10.3 KB
/
data_viz_hidden_gems.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8" />
<meta name="generator" content="Pelican" />
<title>My 6 Gems on Data Visualization</title>
<link rel="stylesheet" href="/theme/css/main.css" />
<meta name="description" content="I have been working quite some time with charts and business intelligence in the last 5 years. When you spend time building business reports, you..." />
</head>
<body id="index" class="home">
<header id="banner" class="body">
<h1><a href="/">Marco Santoni</a></h1>
<nav><ul>
<li><a href="/pages/about.html">about</a></li>
<li><a href="/pages/bookshelf.html">bookshelf</a></li>
<li class="active"><a href="/category/posts.html">posts</a></li>
</ul></nav>
</header><!-- /#banner -->
<section id="content" class="body">
<article>
<header>
<h1 class="entry-title">
<a href="/data_viz_hidden_gems.html" rel="bookmark"
title="Permalink to My 6 Gems on Data Visualization">My 6 Gems on Data Visualization</a></h1>
</header>
<div class="entry-content">
<footer class="post-info">
<abbr class="published" title="2022-02-26T19:35:00+01:00">
Published: Sat 26 February 2022
</abbr>
<address class="vcard author">
By <a class="url fn" href="/author/marco-santoni.html">Marco Santoni</a>
</address>
<p>In <a href="/category/posts.html">posts</a>.</p>
</footer><!-- /.post-info --> <p>I have been working quite some time with charts and business intelligence in the last 5 years. When you spend time building business reports, you may perceive data visualization as a cold technical and business tool. However, there are <strong>6 hidden gems</strong> in data visualization that I found by chance. I realized data visualization is not as cold as I thought. Let me recap for you these 6 gems.</p>
<h2>1) The first chart ever</h2>
<p>William Playfair was a Scottish engineer and political scientist from the 18th century. He is considered as the author of the very first chart:</p>
<p><img alt="By William Playfair - The Commercial and Political Atlas, 1786 (3th ed. edition 1801), Public Domain" src="/images/datavizhiddengems/playfair_first_chart_800.jpg"></p>
<p>The chart was published back in 1786. It shows the volumes of imports and exports of Scotland over one year on a scale of 10k pounds. Each country is given two bars: one for volume of imports, one for volume of exports.</p>
<p>I am so used to seeing bar charts that I never asked myself who was the inventor or when they first appeared. It's nice to find out that the have been invented way before the invention of calculators and that they have changed so little since then.</p>
<h2>2) The best graphic ever</h2>
<p>Charles Minard represented 6 types of data about Napoleon's 1812 Russia campaign in one single chart. This visual was considered by <a href="https://www.nationalgeographic.com/culture/article/charles-minard-cartography-infographics-history">Edward Tufte</a> as "<em>the best statistical graphic ever produced</em>".</p>
<p><img alt="By Charles Minard (1869): map of Napoleon's disastrous Russian campaign of 1812" src="/images/datavizhiddengems/minardnapoleon_800.png"></p>
<p>Minard represented in two dimensions <a href="https://ageofrevolution.org/200-object/flow-map-of-napoleons-invasion-of-russia/">six types</a> of data: the number of Napoleon's troops; distance; temperature; the latitude and longitude; direction of travel; and location relative to specific dates.</p>
<h2>3) Non-neutrality: the Legarithmic scale</h2>
<p>Is data visualization a neutral discipline? Not really. Basic decisions like the choice of scale or of the limit of axes might change radically the information perceived by the reader. Take a look at the following tweet by Matteo Salvini (leader of "Lega" party) about results of a poll on popularity of Italian politicians:</p>
<blockquote class="twitter-tweet"><p lang="it" dir="ltr">Nonostante menzogne, attacchi e processi, milioni di Italiani credono, sperano, confidano nella Lega. <br>Eh già, e siamo ancora qua…<br>Non si molla mai, GRAZIE! <a href="https://t.co/DFMecxPFzC">pic.twitter.com/DFMecxPFzC</a></p>— Matteo Salvini (@matteosalvinimi) <a href="https://twitter.com/matteosalvinimi/status/1436662148709629952?ref_src=twsrc%5Etfw">September 11, 2021</a></blockquote>
<p><script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script></p>
<p>Do you notice anything wrong with the chart? The y axis looks a bit tweaked. The difference between the axis does not follow any reasonable scale (perhapse a "Legarithmic" scale?) since the difference between the 3 bars is not consistent. Here is how the same data looks when plotted in Excel.</p>
<p><img alt="Unbiased chart of the same data shown in Matteo Salvini's tweet" src="/images/datavizhiddengems/realchartfromtweet_800.png"></p>
<p>However, the effect on the reader is not the same, isn't it?</p>
<h2>4) Beyond shapes: infographics</h2>
<p>Otto Neurath was one of the main contributor to the <em>picture language</em>, aka ISOTYPE (International System of Typographic Picture Education). This method consists of replacing classic shapes in data visualization (eg bars, circles, etc) with a set of standardized symbols. Quantities are represented by repeating the same symbol over and over proportionally to the measure. Consider the following example by Otto Neurath from 1930.</p>
<p><img alt="Otto Neurath, Residential density in big cities - 1930" src="/images/datavizhiddengems/isotypeexample_800.png"></p>
<p>The chart represents the density of population in different cities. The information is represented as the number of persons that would live in a flat of 200 m2. The count of persons is not represented by a digit or by a bar, but it is represented by the repetition of a symbol as many time as the count of persons for that city. The result is effective. Density is no more a number, and you can <em>feel</em> the size of the measure. Infographics can turn cold numbers into tangible perceptions of a phenomenon.</p>
<h2>5) Pie charts: bad by definition</h2>
<p>"Bad by definition" is the title of one of my <a href="https://www.data-to-viz.com/caveat/pie.html">favourite blog posts</a> about data visualization. This article is a clean explanation of why you should not use pie charts for most of the use cases. The article starts with this example.</p>
<p><img alt="Yan Holtz - The issue with pie chart" src="/images/datavizhiddengems/piechart_400.png"></p>
<p>Can you rank the slices of the pie by size? You'd probably struggle a bit trying to answer. The reason is that our brain is not used to measure and compare angles. It's funny to see pie charts being used every now and then in business reports. Most of the times, a basic bar chart would be way more effective to let the user understand the numbers behind. However, it seems that pie charts are now endemic in corporations, and the way is still long before getting rid of it 😁</p>
<h2>6) What is data visualization?</h2>
<p>Is data visualization a branch of computer science? It turns out that data visualization is broader discipline, and it is part of <a href="https://visme.co/blog/information-design/">information design</a>. Information design is the practice of presenting information in a way that fosters an efficient and effective understanding of the information.</p>
<p><img alt="Plain text representation of data" src="/images/datavizhiddengems/irpef_table_800.png"></p>
<p>Can the same data of a bar chart be represented in plain text? Yes.</p>
<p>Would plain text require us the same effort to understand the information behind the numbers? Probably not.</p>
<p>Would we even be able to get such information from plain text? Probably not because visualizing information helps our brain to perceive what's going on.</p>
<p><img alt="Same representation of data via line chart" src="/images/datavizhiddengems/irpef_chart.png"></p>
<p>I recently wrote about <a href="https://medium.com/@marcosantoni_39266/riforma-irpef-i-grafici-che-avrei-voluto-vedere-7a69f7577bc3">an article</a> on the impact of information design on journalism. The article starts from a recent tax reform in Italy. Most information media have kept showing tables about the new tax rates, however I found quite hard to get a clear and full picture of the reform. I was not able to find online a single data visualization about the data behind the reform. So, I have done it by myself, and it turned out the article was quite appreciated (with more than 2.3k reads at the time of this writing and plenty of positive feedbacks on social networks).</p>
<p>The reason why the article was so viral is that one single line chart was able to describe the reform way more effectively than the textual tables you could find online. I find this a decent example of "<em>efficient and effective understanding of information</em>" that is the overall goal of information design.</p>
<h2>References</h2>
<p>This article is a collection of notes I took in the last couple of years. Historical charts are inspired by talks by <a href="https://twitter.com/pciuccarelli">Paolo Ciuccarelli</a>. The ideas behind the critics to pie charts is inspired by the article of <a href="https://www.data-to-viz.com/caveat/pie.html">Yan Holtz</a>. Plenty of details are of course from Wikipedia.</p>
</div><!-- /.entry-content -->
</article>
</section>
<section id="extras" class="body">
<div class="social">
<h2>social</h2>
<ul>
<li><a href="https://linkedin.com/in/msantoni">linkedin</a></li>
<li><a href="https://twitter.com/mrsantoni">twitter</a></li>
</ul>
</div><!-- /.social -->
</section><!-- /#extras -->
<footer id="contentinfo" class="body">
<address id="about" class="vcard body">
Proudly powered by <a href="https://getpelican.com/">Pelican</a>, which takes great advantage of <a href="https://www.python.org/">Python</a>.
</address><!-- /#about -->
<p>The theme is by <a href="https://www.smashingmagazine.com/2009/08/designing-a-html-5-layout-from-scratch/">Smashing Magazine</a>, thanks!</p>
</footer><!-- /#contentinfo -->
</body>
</html>