Hey you all!
Itβs Josep here, one more week! ππ»
Now that it's August, summer has officially begun! π₯ I hope those of you on vacation are having an amazing time, and for those still working, I hope your vacations are just around the corner!
I have some exciting news to share about the coming weeks...
On August 14th, I'll be traveling through North America for two weeks, followed by a month in Japan throughout September. During this time, I'll continue posting my DataBites issues as usual, but I might be a bit less active on social networks.
I promise to return full of energy, but right now, I'm incredibly excited about this travel opportunity!
The most important news of the week?
I've become a digital advocate for Pieces, a fantastic product (completely free) that's helping me optimize my coding workflow. Over the next few weeks, I'll be sharing more details about it.
And what else is on the horizon?
I recently had my first collaboration with
(and hopefully many more to come) to create a list of resources for getting started with ML. It was a success, and if you missed it, you can check it out below ππ»Now that weβve caught up on life updates, let's dive into the important stuff π¨π»βπ».
Building on my previous DataBites issue, today I want to explore the intuition behind Data Visualization and charts.
Why does this matter?
We've already recognized the importance of DataViz, but many people still don't understand why visual elements are more intuitive than raw numbers.
Many technical experts often overlook how profoundly our understanding of the world is shaped by the workings of our brains.
Today, let's reconnect with our human side and explore why some charts resonate more intuitively with us than others π₯
Breaking DataViz to Its Basics
There are two main theories behind the intuitiveness of charts - and a final technique that can be quite useful to know! π
1. Gestalt TheoryΒ
People who work with technology sometimes forget about the human side of things. The Gestalt Principles are rules from psychology that explain precisely this: How our brain sees patterns.Β
Some of these rules help us understand why we group things that look alike or notice things that stand out.Β
#1 Similarity
Gestalt similarity means our brain groups things that look alike. This can be because of their position, shape, color, or size.
This is extensively used in heat maps or scatter plots.Β
#2 Closure
Objects inside a border-like a line or a shared color-look as they belong together. This makes them stand out from other things we see.
We often use borders or colors in tables and graphs to group data.
#3 Continuity
When individual elements are connected, our eyes think they belong together. Even if they look different, the line makes us see them as a group. This is extensively used in line charts.
#4 Proximity
We think things are in the same group if they are close to each other. To show things belong together, put them close. Using a little space can help separate different groups. This is commonly used in scattering plots or node-link diagrams.Β
So the Gestalt principles and their interactions are important to consider when making visualizations.
2. The Principle of Proportional InkΒ
The second rule involves the Principle of Proportional Ink. In many different visualization scenarios, we represent data values by the extent of a graphical element.Β
It is common practice to use the word ink to refer to any part of a visualization that deviates from the background color. This includes lines, bars, points, shared areas, and text.
For example, in a bar plot, we draw bars that begin at 0 and end at the data value they represent. In this case, the data value is not only encoded in the endpoint of the bar but also in the height or length of the bar.Β
If we drew a bar that started at a different value than 0, then the length of the bar and the bar endpoint would convey contradicting information.
In all these cases, we need to make sure that there is no inconsistency. This concept has been termed the principle of proportional ink by Bergstrom and West.
βWhen a shaded region is used to represent a numerical value, the area of that shaded region should be directly proportional to the corresponding value.β
Violations of this principle are quite common when trying to manipulate data, in particular in the popular press and in the world of finance.
Similar issues will happen whenever we use graphical elements such as rectangles, shaded areas of arbitrary shape, or any other elements that have a defined visual extent that can be either consistent or inconsistent with the data value shown.
3. The Pop-Out effect
The pop-out effect stands out as a key technique for highlighting specific data points amidst a sea of information.
This visual phenomenon ensures that certain elements in a dataset instantly grab the observer's attention due to their distinctive features, such as color, shape, or size.
By leveraging this effect, data analysts can draw focus to unusual data behaviors or important outliers that might otherwise be overlooked in a complex visual.
To effectively implement the pop-out effect in your charts:
Use muted colors to represent general trends.
Select a vibrant, contrasting hue for the key data you want to emphasize.
This approach not only enhances the visual appeal of your charts but also aids in quicker data interpretation and decision-making processes.
Whether in a business presentation or an academic report, mastering the pop-out effect can significantly boost the communicative power of your visual data representations.Β
As data lovers, itβs our art, our language, and our bridge to the whole world.
So next time you craft a chart, rememberβ¦
You're not just a data professional, You're a storyteller!
Are you still here? π§
ππ» I want this newsletter to be useful for everyone, soβ¦
Let me know any preference for future content.
If you have any suggestions or preferences for the newsletter to be more useful, feel free to let me know!
My latest articles π
5 Simple Steps to Automate Data Cleaning with Python in KDnuggets.
A Simple to Implement End-to-End Project with HuggingFace in KDnuggets.
The Complete Guide to File Storage on Azure with Blob & Files in DataCamp.
The Transformers Architecture - Whatβs the magic behind LLMs? in AIgents.
Recommendations (my weekly favs)! β₯
I Spent a Week Diving Into LLMs (for the First Time) as a Data Scientist by
How I Became a Data Scientist at Meta β No Stats Degree; No Bootcamp by
Want to get more of my content? ππ»ββοΈ
Reach me on:
LinkedIn, X (Twitter), orΒ Threads to get daily posts about Data Science.
My MediumΒ Blog to learn more about Data Science, Machine Learning, and AI.
Just email me at rfeers@gmail.com for any inquiries or to ask for help! π€
Remember now that DataBites has an official X (Twitter) account and LinkedIn page. Follow us there to stay updated and help spread the word! ππ»
Great article Josep ! If anyone can write on visualization its you with so much experience in creating clean and simple visuals.
Where are you traveling to North America btw ? :)
This was great Josep! I love anything related to UX and data visualisation so this one hit the spot.
Thank you for the shoutout and enjoy your travels. Iβm looking forward to more articles.