The Open Knowledge Foundation’s Open Data Index attempts to score the degree a country has opened several key datasets. In the United States, this is promulgated through data.gov site. The index measures ten categories—ranging from transportation timetables to releasing legislation—and nine criteria within each, such as whether the data exists through the timeliness.
The visualization resembles DNA sequencing. A simple color schema shows whether a criteria was met and an option to click for more information. The entire index, which consists of 60 countries, has a quality to be quickly scannable.
Other indexes, such as democracy index, is usually displayed as a global chloropleth, which seems to marginalize the smaller geographic areas or reinforce regional grouping. The OKFN visualization provides the same space and opportunity to each country, instead of tying it to its geography.
Ah, 1880. An era so magical it was called guilded. To be rich was magical, to not be was soul-crushing. The United States was also coming from the Civil War and reconstruction, with the roots of federalism starting to take hold. As part of that, nationwide Census and statistical maps were a budding and remarkable practice.
It was also the year of the Garfield and Hancock election. Garfield handily won the electoral college, but by the narrowest popular vote margin of only 1,898 votes of 8.9 million votes. Two-hundred days later, Garfield was assassinated, despite revolutionary efforts of using a metal detector built by Alexander Graham Bell to locate the bullet and one of the first air conditioners ever built.
These maps are, according to Susan Schulten, are the first modern electoral maps. This map provides several techniques, now considered mundane, by providing granular electoral returns by county and by showing the degree of victory. The technique is crucial to begin displaying, through data, underlying cleavages in the society at the time—the assassination, however, was not politically motivated. As a result, there is a fine gradient showing the slow changing political positioning of the country from the north to the reconstructed south.
Illinois’ is an excellent example which highlights the republican leanings of the north compared to the democratic favor of southerners. The smaller graphs breaking-down the electoral and popular votes show Garfield’s marginal wins within the most populous states—driving a robust electoral college victory—but the substantial wins in the smaller, southern states.
The series of small graphs are a good touch by putting the main electoral map in context of the returns. Like other graphs of the era, the lithograph and slight imperfections only adds to the aesthetic quality.
Animated maps of tweets across the world, in regions, within urban areas, and within specific domains are appearing in the wild more frequently. Those maps usually emphasize dynamic visualizations, but even for fans of those animations, it is difficult to discern the important information, such as where and when they occurred.
However, the colors on the map correspond to intensity of time, instead of overall intensity. Assuming tweets exist, the intensity shows when an area becomes active. Some insights include the tweets around the university peak during typical lecture hours; the business district during, obviously, business hours; the soccer stadium preceding the match, but not during.
But those tweets during mid-day represent a portion of tweets. In a simple bar graph, Giorgia also shows the simple distribution of tweets by 3-hour segments. Almost 40% of a day’s tweets are after work and during the late-night hours.
Using 45-degree angles is a great way to add perspective to scatterplots. For a different context, Arnold Kling recommended a similar approach for school performance.
Public perceptions of US death rates for selected causes: Comparison of public opinion survey estimates versus empirical data shows how people overestimate of rare deaths
The need to compare cities seems to be innate, from best sports teams to the highest quality of life. Visually, the comparisons can be tough. ESRI seems to be willing to take a stab by releasing the Urban Observatory in an attempt to compare urban areas. The visualization shows cities—16 to choose from—side-by-side at the same altitude. You can choose from a list of 16 data points to display on the map.
Efforts like this always need to be evaluated in two parts: the technical success and the practical. The former refers to the technical feat of standardizing data, create side-by-side maps, and the visualization effort. The latter referring to the usefulness of information.
The technical success is interesting. The UI is intuitive and the display is engaging. The synchronous zoom is fun.
The usefulness of the display is debatable. It’s quick and relatively easy to see the landscape of, for instance, parks.But it is more difficult to discern any answer from the maps. Simple questions, such as, “who has more park space” is left unanswered. More nuanced questions are also harder to discern.
ESRI may continue to invest in this prototype by allowing more exploration of maps and data.
Regression analysis is probably the most common form of parametric statistical analysis. Usually, regressions are visualized as slope and intercepts since it immediately relates to the interpretation of the coefficients. Unfortunately, the focus on slope makes it hard to display several coefficients on the same plane.
At Code a la Mode, Konstantin Kashin shows how to graph coefficients with standard errors in R.
pd <- position_dodge(width=0.2,height=NULL)
ggplot(treisman, aes(specification,coef, color=method)) +
geom_point(aes(shape=method),size=4, position=pd) +
scale_x_continuous("Specification", breaks=1:length(specification), labels=specification) +
scale_y_continuous("Estimated effect of being a former British colony or the UK on TI98") +
Always a good idea.
Dataviz doodling: I’ve sketched these findings 4 different ways and still despise every option… Back to the drawing board tomorrow!
P.S. If you work with charts, it’s always a good idea to keep a few markers next to your desk for easy sketches.
How does trash impact the ocean? This infographic paints a pretty picture on an ugly topic. It’s not easy to make a swirl of trash in the mid Pacific look interesting, but this infographic pulls it off swimmingly. The entire infographic is a loose federation of different topics: how trash gets to the ocean, the type of trash, effected animals, and the impact on food chain.
Most infographics would focus too much on walking the reader through each bit, but the organization lets the reader explore the topic without an annoying, forced flow. It is also a pleasant combination of dashboard-style numbers, minimalist bar charts, map, and illustrations. Using bubble plots is always dicey, but works nicely in this circumstance since it’s used sparingly.
I couldn’t find a higher resolution image, a drawback at this point. This infographic has a low data-to-ink ratio, but I think the spacing and illustration restraint is evident here.
Two things can be said about Nobel prize winners with almost complete certainty: they’re old and are at top-tier universities. So I thought, but my knowledge of Nobel prizes are heavily weighted to economics and the physical sciences. An infographic by Giorgia Lupi, Federica Fragapane, and Francesco Majino helped me realized I was wrong.
Recipients in physics were comparatively young at the prize’s inception. While most recipients in the sciences, medicine, and economics came from Ph.D. granting universities—mostly from a select few colleges—Peace and Literature prizes were awarded to non-academics.
It’s infograph is far from perfect, however. The infographic comes with instructions on how to read the it—a bad omen for data visualizations. But the infographic can still be understood, and with some patience, highly appreciable.