“You don’t need to be an professional to deceive somebody, although you may want some experience to reliably acknowledge if you find yourself being deceived.”
When my co-instructor and I begin our quarterly lesson on misleading visualizations for the info visualization course we train on the College of Washington, he emphasizes the purpose above to our college students. With the appearance of contemporary know-how, growing fairly and convincing claims about knowledge is less complicated than ever. Anybody could make one thing that appears satisfactory, however comprises oversights that render it inaccurate and even dangerous. Moreover, there are additionally malicious actors who actively need to deceive you, and who’ve studied a few of the greatest methods to do it.
I typically begin this lecture with a little bit of a quip, wanting severely at my college students and asking two questions:
- “Is it a great factor if somebody is gaslighting you?”
- After the final murmur of confusion adopted by settlement that gaslighting is certainly unhealthy, I ask the second query: “What’s one of the simplest ways to make sure nobody ever gaslights you?”
The scholars typically ponder that second query for a bit longer, earlier than chuckling a bit and realizing the reply: It’s to learn the way individuals gaslight within the first place. Not so you’ll be able to reap the benefits of others, however so you’ll be able to stop others from making the most of you.
The identical applies within the realm of misinformation and disinformation. Individuals who need to mislead with knowledge are empowered with a number of instruments, from high-speed web to social media to, most just lately, generative AI and huge language fashions. To guard your self from being misled, you must be taught their methods.
On this article, I’ve taken the important thing concepts from my knowledge visualization course’s unit on deception–drawn from Alberto Cairo’s glorious e book How Charts Lie–and broadened them into some common ideas about deception and knowledge. My hope is that you simply learn it, internalize it, and take it with you to arm your self in opposition to the onslaught of lies perpetuated by ill-intentioned individuals powered with knowledge.
People Can not Interpret Space
No less than, not in addition to we interpret different visible cues. Let’s illustrate this with an instance. Say now we have an very simple numerical knowledge set; it’s one dimensional and consists of simply two values: 50 and 100. One strategy to characterize this visually is through the size of bars, as follows:

That is true to the underlying knowledge. Size is a one-dimensional amount, and now we have doubled it in an effort to point out a doubling of worth. However what occurs if we need to characterize the identical knowledge with circles? Effectively, circles aren’t actually outlined by a size or width. One choice is to double the radius:

Hmm. The primary circle has a radius of 100 pixels, and the second has a radius of fifty pixels–so that is technically appropriate if we wished to double the radius. Nonetheless, due to the way in which that space is calculated (πr²), we’ve far more than doubled the realm. So what if we tried simply doing that, because it appears extra visually correct? Here’s a revised model:

Now now we have a distinct drawback. The bigger circle is mathematically twice the realm of the smaller one, but it surely not seems that means. In different phrases, though it’s a visually correct comparability of a doubled amount, human eyes have issue perceiving it.
The difficulty right here is attempting to make use of space as a visible marker within the first place. It’s not essentially improper, however it’s complicated. We’re growing a one-dimensional worth, however space is a two-dimensional amount. To the human eye, it’s all the time going to be troublesome to interpret precisely, particularly when put next with a extra pure visible illustration like bars.
Now, this will seem to be it’s not an enormous deal–however let’s check out what occurs whenever you prolong this to an precise knowledge set. Under, I’ve pasted two photos of charts I made in Altair (a Python-based visualization bundle). Every chart exhibits the utmost temperature (in Celsius) throughout the first week of 2012 in Seattle, USA. The primary one makes use of bar lengths to make the comparability, and the second makes use of circle areas.


Which one makes it simpler to see the variations? The legend helps in the second, but when we’re being sincere, it’s a misplaced trigger. It’s a lot simpler to make exact comparisons with the bars, even in a setting the place now we have such restricted knowledge.
Do not forget that the purpose of a visualization is to make clear knowledge–to make hidden tendencies simpler to see for the typical individual. To attain this objective, it’s greatest to make use of visible cues that simplify the method of creating that distinction.
Beware Political Headlines (In Any Path)
There’s a small trick query I typically ask my college students on a homework task across the fourth week of sophistication. The task largely entails producing visualizations in Python–however for the final query, I give them a chart I actually generated accompanied by a single query:

Query: There may be one factor egregiously improper with the chart above, an unforgivable error in Information Visualization. What’s it?
Most assume it has one thing to do with the axes, marks, or another visible side, typically suggesting enhancements like filling within the circles or making the axis labels extra informative. These are high quality strategies, however not probably the most urgent.
Essentially the most flawed trait (or lack thereof, somewhat) within the chart above is the lacking title. A title is essential to an efficient knowledge visualization. With out it, how are we presupposed to know what this visualization is even about? As of now, we are able to solely confirm that it should vaguely have one thing to do with carbon dioxide ranges throughout a span of years. That isn’t a lot.
Many people, feeling this requirement is just too stringent, argue {that a} visualization is usually meant to be understood in context, as half of a bigger article or press launch or different accompanying piece of textual content. Sadly, this line of considering is much too idealistic; in actuality, a visualization should stand alone, as a result of it’s going to typically be the one factor individuals have a look at–and in social media blow-up circumstances, the one factor that will get shared extensively. Consequently, it ought to have a title to elucidate itself.
After all, the title of this very subsection tells you to be cautious of such headlines. That’s true. Whereas they’re mandatory, they’re a double-edged sword. Since visualization designers know viewers will take note of the title, ill-meaning ones also can use it to sway individuals in less-than-accurate instructions. Let’s have a look at an instance:
The above is a image shared by the White Home’s public Twitter account in 2017. The image can be referenced by Alberto Cairo in his e book, which emphasizes most of the factors I’ll now make.
First issues first. The phrase “chain migration,” referring to what’s formally often known as family-based migration (the place an immigrant might sponsor relations to come back to the US), has been criticized by many who argue that it’s needlessly aggressive and makes authorized immigrants sound threatening for no purpose.
After all, politics is by its very nature divisive, and it’s doable for any aspect to make a heated argument. The first difficulty right here is definitely a data-related one–particularly, what the usage of the phrase “chain” implies within the context of the chart shared with the tweet. “Chain” migration appears to point that individuals can immigrate one after the opposite, in a seemingly countless stream, uninhibited and unperturbed by the space of household relations. The fact, in fact, is that a single immigrant can largely simply sponsor speedy relations, and even that takes fairly a little bit of time. However when one reads the phrase “chain migration” after which instantly seems at a seemingly smart chart depicting it, it’s straightforward to consider that a person can in actual fact spawn further immigrants at a base-3 exponential development price.
That is the problem with any sort of political headline–it makes it far too straightforward to hide dishonest, inaccurate workings with precise knowledge processing, evaluation, and visualization.
There may be no knowledge underlying the chart above. None. Zero. It’s fully random, and that isn’t okay for a chart that’s purposefully made to look as whether it is exhibiting one thing significant and quantitative.
As a enjoyable little rabbit gap to go down which highlights the hazards of political headlining inside knowledge, here’s a hyperlink to FloorCharts, a Twitter account that posts probably the most absurd graphics proven on the U.S. Congress flooring.
Don’t Use 3D. Please.
I’ll finish this text on a barely lighter subject–however nonetheless an vital one. Not at all–none in any respect–do you have to ever make the most of a 3D chart. And in the event you’re within the footwear of the viewer–that’s, in the event you’re a 3D pie chart made by another person–don’t belief it.
The rationale for that is easy, and connects again to what I mentioned with circles and rectangles: a 3rd dimension severely distorts the reality behind what are often one-dimensional measures. Space was already exhausting to interpret–how properly do you actually assume the human eye does with quantity?
Here’s a 3D pie chart I generated with random numbers:

Now, right here is the very same pie chart, however in two dimensions:

Discover how the blue is just not fairly as dominant because the 3D model appears to counsel, and that the pink and orange are nearer to at least one one other in measurement than initially portrayed. I additionally eliminated the proportion labels deliberately (technically unhealthy follow) in an effort to emphasize how even with the labels current within the first one, our eyes mechanically pay extra consideration to the extra drastic visible variations. For those who’re studying this text with an analytical eye, maybe you assume it doesn’t make that a lot of a distinction. However the reality is, you’ll typically see such charts within the information or on social media, and a fast look is all they’ll ever get.
It is very important be certain that the story informed by that fast look is a truthful one.
Remaining Ideas
Information science is usually touted as the proper synthesis of Statistics, computing, and society, a strategy to get hold of and share deep and significant insights about an information-heavy world. That is true–however because the capability to extensively share such insights expands, so should our common capacity to interpret them precisely. It’s my hope that in gentle of that, you’ve discovered this primer to be useful.
Keep tuned for Half 2, by which I’ll discuss just a few misleading strategies a bit extra concerned in nature–together with base proportions, (un)reliable statistical measures, and measures of correlation.
Within the meantime, strive to not get deceived.