Statistics and Computing

Series Editors: J. Chambers, D. Hand, W. Härdle

For further volumes: http://www.springer.com/series/3022

Graham Wills

Visualizing Time
Designing Graphical Representations for Statistical Data

Graham Wills
Hidden Spring Dr 1128
60540-4112 Naperville, Illinois
USA
graham@spss.com

Series Editors:

J. Chambers
Department of Statistics
Sequoia Hall
390 Serra Mall
Stanford University
Stanford, CA 94305-4065

D. Hand
Department of Mathematics
Imperial College London, South Kensington Campus
London SW7 2AZ
United Kingdom

W. Härdle
C.A.S.E. Centre for Applied Statistics and Economics
School of Business and Economics
Humboldt-Universität zu Berlin
Unter den Linden
10099 Berlin
Germany

ISSN 1431-8784
ISBN 978-0-387-77906-5
e-ISBN 978-0-387-77907-2
DOI 10.1007/978-0-387-77907-2
Springer New York Dordrecht Heidelberg London
Library of Congress Control Number: 2011940977

© Springer Science+Business Media, LLC 2012
All rights reserved. This work may not be translated or copied in whole or in part without the written permission of the publisher (Springer Science+Business Media, LLC, 233 Spring Street, New York, NY 10013, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed is forbidden. The use in this publication of trade names, trademarks, service marks, and similar terms, even if they are not identified as such, is not to be taken as an expression of opinion as to whether or not they are subject to proprietary rights.

Printed on acid-free paper

Springer is part of Springer Science+Business Media (www.springer.com)

Although this book contains tributes to famous men and women who have invented unique and novel visualizations, and to experts who have distilled knowledge and advanced the science of information visualization, this book is dedicated to
those people who designed reports, published charts, and created visualizations and were not content to use the defaults but instead took the extra effort to make their work more truthful, more beautiful, and more useful. Thank you!

Preface

Art or science? Which of these is the right way to think of the field of visualization? This is not an easy question to answer, even for those who have many years of experience in making graphical depictions of data with a view to helping people understand them and take action. When we look at beautiful hand-drawn pictures of data, carefully composed by talented individuals, we are drawn to the artistic side. In some ways those charts are discouraging; their artistic elegance implies that the creation of good visualizations is not an option for most of us.

There are books that provide rules and advice on how to draw graphs. Some give general advice, suggesting that such and such is good, but this other is bad. Others give specific advice such as requiring all charts to have a title or all axes to go to zero, but these are often tied to specific visualizations and so are not general enough to qualify as scientific principles. They are valuable for describing existing visualizations, but not general enough to provide guidance for future visualizations. If you are designing something new, advice on a bar chart is not especially helpful.

In this book I want to bridge the gap and not simply give rules and advice but base these on general principles and provide a clear path between them, so that the rules and guidance fall into place naturally, due to knowledge of those principles. In terms of the art/science split, I want to advance the scientific component. There are excellent books describing artistically superb plots; however, my goal is not simply to be descriptive, but to be prescriptive – to allow people to start with a goal in mind and design a visualization that fulfills that goal clearly, truthfully, and actionably.

Because I have an
essentially scientific direction in mind, I will concentrate on reproducibility. A chart that is wonderful for exactly one data set is of little interest. It can be appreciated and enjoyed, but the important question must always be: What can I learn from this graphic that I can apply to other data? With this in mind, the examples in this book have been chosen to be realistic rather than exemplary. I have made a definite attempt not to choose data that make a picture look good, but rather to choose data for which a chart should be applicable. If the result is not perfect, I prefer to present imperfection and explore remedies rather than look for a different data source.

This book is concerned with the graphical representation of time data. Time is special – it doesn’t behave quite like other variables. It has an inherent direction and determines causality. Time can be recorded in many ways: it can be linear or cyclic, categorical or continuous. Even the written format of a piece of time data can be curiously baroque; mixtures of words, numbers, and special symbols make up the time “Monday the 13th of October, 11:45 am.” What other form of data might occur in so obscure a format?
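As a rough illustration of the point about formats, the sketch below reads the example string with Python's standard library and then derives linear, cyclic, and categorical readings of the same moment. The function names `parse_written_time` and `time_views` are hypothetical, and the year 2008 is an assumption made only because the written form omits one (13 October 2008 did fall on a Monday). It also assumes an English locale for day, month, and am/pm names.

```python
import re
from datetime import datetime


def parse_written_time(text, year):
    """Parse a time written like 'Monday the 13th of October, 11:45 am.'

    The written form carries no year, so the caller must supply one
    (an assumption of this sketch, not something the text provides).
    """
    # Strip ordinal suffixes: "13th" -> "13".
    cleaned = re.sub(r"(\d+)(st|nd|rd|th)\b", r"\1", text)
    # Drop filler words and the trailing full stop.
    cleaned = cleaned.replace(" the ", " ").replace(" of ", " ").rstrip(".")
    # cleaned now looks like "Monday 13 October, 11:45 am".
    # %A, %B and %p rely on English names in the current locale.
    return datetime.strptime(f"{cleaned} {year}", "%A %d %B, %I:%M %p %Y")


def time_views(moment):
    """Return linear, cyclic, and categorical readings of one moment."""
    return {
        # Linear: a day count along a single, ever-increasing line.
        "linear_day_number": moment.toordinal(),
        # Cyclic: position within the day, wrapping every 24 hours.
        "cyclic_hour_of_day": moment.hour + moment.minute / 60,
        # Cyclic: position within the week, 0 = Monday ... 6 = Sunday.
        "cyclic_day_of_week": moment.weekday(),
        # Categorical: the month treated as a named category.
        "categorical_month": moment.strftime("%B"),
    }


moment = parse_written_time("Monday the 13th of October, 11:45 am.", year=2008)
views = time_views(moment)
```

The same recorded moment yields several variables with different measurement properties, which is one reason time needs more care than an ordinary numeric column.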

All data are recorded at a certain time, and so all data have a time component, even if it has been removed or deemed a priori as uninteresting. This makes time data both unique and universal, so understanding how best to portray them not only is challenging but has wide applicability.

The portrayal of time data is ubiquitous. Any newspaper will feature time-based plots; any company report will show historical data as charts. Even the gas bill for my home invites me to compare a time series of the cost of heating my home against one of average monthly temperature.

Because of this generality, I have written this book to cover a range of different users. A visualization expert designing tools for displaying time will find it valuable, but so also should a financier assembling a report in a spreadsheet or a medical researcher trying to display gene sequences using a commercial statistical package.

You have data, you have a goal in mind. Now all you need are the tools to graph the data and so achieve the goal. Read on!

Graham Wills

Acknowledgements

The only way to know the effort needed to write a book is to do so yourself, and only authors know the debt of gratitude they owe to others. Warm thanks are due to many people, broadly classified as shown in the diagram below. Any errors and mistakes within the book are entirely my own.

Fig. A modified Venn diagram showing where acknowledgement is due; it shows the major sources but does not include everyone who has helped or influenced my thinking or who has taught me or argued with me over the years. The sum total of those contributions would be large; as this book will indicate, it is often small contributions that make or break any major endeavor.

10 Topics In Time

10.2 Time Lines and Linked Events

Fig. 10.9 An example timeline by Joseph Priestley, showing the time period leading up to AD. Note the faceting into two sections, one on top for Men of Learning, and one below for Statesmen.

[Figure 10.10: one labeled interval per war, drawn along a time axis marked from 1825 to 1950]

Fig. 10.10 The timeline here shows the spans of major wars over the timeline covered by Richardson’s analysis of deadly quarrels. The color of the interval element and, redundantly, the label color indicate the magnitude of the war as measured by the number of casualties.

… designer to compensate for this problem. In this book we are trying to establish automatic rules, and so Fig. 10.10 shows a more typical result, where an automatic label location algorithm has been used. Figure 10.10 shows essentially the same timeline display, with the addition of a color aesthetic. The data shown are the
statistics of deadly quarrels used in Chap. The use of color in this figure is worth a side note. Because the elements may be of very short span (e.g., the Spanish-American War), the color of the element might not be visible. Thus the label was used redundantly to encode the color. An astute observer will then note that although the hues of the elements and their labels match, the labels are darker than the elements themselves. This was done so as to ensure the text is legible on a white background (this page). Always be careful when designing visualizations to plan for extreme data, and ensure the chart will be useful in such circumstances.

Again, comparison with Fig. 10.9 is informative. The variation in lengths of wars is much greater than the variation in lengths of lives of important people. It is rare to see a great man like Alexander, who lived less than 33 years; life spans for this chart have a min:max ratio of about 1:3. In contrast, some deadly quarrels are very quick (the Spanish-American War took months), whereas some are longer (the Achinese War lasted from 1873 to 1904), for a ratio of about 1:100. Different statistical properties of the data lead to different design decisions.

Fig. 10.11 Timeline of Stonehenge exploration. This figure is taken from the book Solving Stonehenge: The New Key to an Ancient Enigma [64], used with permission. A detailed explanation of the figure is given in the text.

In the two and a half centuries since their invention, countless versions of the timeline have been created, with many levels of detail and organization. The basic structure is of data ranges aligned on one dimension, labeled with information that allows an expert to apply their domain knowledge, and using aesthetics and faceting to add further information. Figure 10.11 is an excellent example, packing a wealth of concentrated information into the timeline format. The core data are coded as pastel-shaded
rectangular ranges showing when various exploration activities took place, with the type of activity color coded and faceted to provide groupings. In addition to these base ranges for activities, multiple other elements are present:

Small red “subrange” elements. These show critical events that are often point events, and almost always lie within a range of the base element. One feature we can note is that the landmark events occur in the latter half of the activity. As we might expect, papers and books are published when the work is nearing completion. Adding a second labeled element causes label layout issues, however, so the designer has mitigated this by using distinct typefaces for the two labelings, in much the same way a cartographer would use a different font for labeling regions as opposed to cities.

Low-information ranges. Although most activities are well documented, a considerable amount of unorchestrated digging and robbing took place over the earlier years, and these have been displayed with ranges that are shown in a style that makes them more of a background element than a major feature. A single label for a set of such low-information activities has been used, also deemphasizing their importance.

Icons. Restoration activities are not shown as ranges of times, but as simple events. Free from the constraints of showing a range, icons can be used to distinguish between simple stone straightening and trilithon fixing. Color is used to show bad events in red and good events in green.

This figure is a complex, hand-designed chart intended for reflective study. The design and layout invite us to make time comparisons and help explain features of importance to researchers in this domain, answering questions such as: What differences would we expect between Loggan’s engravings and Sir Henry James’ photographs?
As well as pattern discovery, this visualization has an important use as a reference: When we need to understand one person’s contribution (such as Gibson’s Britannica article), we can see his contemporaries and parallel activities. This chart is valuable for both discovery and search goals.

10.3 Summary

Most of this book is targeted at presenting general data. In this chapter we have examined a couple of specific areas that need visualizations designed to highlight important features of these data. The examples were chosen to show the extremes in terms of data quantity. The section on large data shows how to apply the principles described in this book to data of arbitrarily large volume, the focus being on aggregation, filtering, and showing features within the aggregation. The section on timelines shows a different extreme – very few items of data, but rich information on each item.

For large data, general aggregation techniques were shown, as well as techniques for augmenting standard displays to cope with large data volumes. One important lesson is that graphs and tables are not necessarily competing techniques for displaying summary data, but are different points along a continuum of possible visualizations. The Grammar of Graphics [135] provides building blocks for defining this continuum, as discussed in the framework Chap.

For linked events and timelines very different techniques are required. The details are too important to be aggregated over, and the design goals become focused on maximizing the available information on single charts for reflective study. Issues of labeling and graphic design become less of an individual style preference and more important for enhancing clarity. These charts are not easy to design and may take many design iterations before they are useful.

10.4 Further Exploration

The American Statistical Association site for this Airline Data Expo [1] contains a number of posters showing visualizations crafted to explore this data set. As with
any such competition, it is valuable to look through them all, see what works, and try to draw out the successful principles. It is also interesting to see how details of presentation make a big difference to clarity of results. Which axis-drawing algorithms work best? Which fonts are better for labeling? Is it better to have a dark or a light background for a chart? By exploring a set of graphs for the same data set, we can learn more about what makes visualizations successful.

It is interesting to compare the Stonehenge exploration timeline (Fig. 10.11) with the figures created by the LifeLines project [58, 86]. The latter figures are for a very different domain, showing events in the treatment of health issues for medical patients, but the goals are similar and the end results show a remarkable similarity of design.

Linked event diagrams and timelines have a free dimension (in our examples and also typically, the vertical dimension). Items can be moved around within their faceting and should be placed so as to make best use of the space available, avoid overlap and, with linked events, avoid too many crossings. This is an important constrained layout problem, and the field of graph layout provides help on how to design algorithms to solve this problem. The Web site http://graphdrawing.org provides references to a number of books at various levels of complexity.
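
One common way to approach the overlap part of this layout problem is a greedy interval-partitioning pass: sort the spans by start time and drop each one into a lane that has already become free, opening a new lane only when none is. The sketch below illustrates that idea and is not the algorithm used to draw the figures in this chapter. The function name `assign_lanes` and its `gap` parameter are hypothetical, and the year spans are approximate values entered by hand for illustration (the 1873 to 1904 span is the one quoted earlier in this chapter). It ignores label placement and link crossings.

```python
import heapq


def assign_lanes(spans, gap=0):
    """Place (label, start, end) spans into lanes without overlap.

    Spans are processed in order of start time. A span reuses the lane
    that became free earliest if that lane ended at least `gap` time
    units before the span starts; otherwise a new lane is opened.
    With gap=0, spans that merely touch may share a lane.
    Labels are assumed unique. Returns {label: lane index}, 0 = first lane.
    """
    lanes = {}
    free_at = []      # min-heap of (end time of last span in lane, lane index)
    lane_count = 0
    for label, start, end in sorted(spans, key=lambda s: (s[1], s[2])):
        if free_at and free_at[0][0] + gap <= start:
            # The earliest-free lane is available: reuse it.
            _, lane = heapq.heappop(free_at)
        else:
            # Even the earliest-free lane is still busy: open another.
            lane = lane_count
            lane_count += 1
        lanes[label] = lane
        heapq.heappush(free_at, (end, lane))
    return lanes


# Approximate year spans, entered by hand for illustration only.
wars = [
    ("Crimean War", 1853, 1856),
    ("Franco-Prussian War", 1870, 1871),
    ("War in Achin", 1873, 1904),
    ("Spanish-American War", 1898, 1898),
    ("World War I", 1914, 1918),
    ("Spanish Civil War", 1936, 1939),
    ("World War II", 1939, 1945),
]
lanes = assign_lanes(wars)
```

The lane index can then be mapped to the vertical position of each interval. Because a new lane is opened only when every existing lane is occupied, the number of lanes equals the largest number of spans active at once, which keeps the use of vertical space compact.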

References

1. ASA Sections on Statistical Computing and Statistical Graphics: Airline on-time performance. http://stat-computing.org/dataexpo/2009 (2009)
2. American Statistical Association: The American Statistical Association Section on Statistical Graphics. http://stat-graphics.org/graphics/
3. Basili, V.R., Caldiera, G., Rombach, D.H.: The Goal Question Metrics Approach. In: Encyclopedia of Software Engineering, vol I, 1st edn., pp 528–532. Wiley (1994)
4. Battista, G.D., Eades, P., Tamassia, R., Tollis, I.G.: Graph Drawing: Algorithms for the Visualization of Graphs, 1st edn. Prentice Hall, Upper Saddle River (1998)
5. Becker, R.A., Cleveland, W.S., Shyu, M.J.: The visual design and control of trellis display. J Comput Graph Stat 5, 123–155 (1996)
6. Becker, R.A., Cleveland, W.S., Wilks, A.R.: Dynamic graphics for data analysis (c/r: p50-72). Dyn Graph Stat 0(0), 1–50 (1988)
7. Becker, R.A., Eick, S.G., Wilks, A.R.: Visualizing network data. IEEE Trans Vis Comput Graph 1, 16–28 (1995)
8. Bertin, J.: Sémiologie Graphique.
Mouton-Gauthier-Villars, Paris (1967)
9. Bertin, J.: Semiology of Graphics. University of Wisconsin Press, Madison (1983)
10. Boehm, B., Rombach, H.D., Zelkowitz, M.V.: Foundations of Empirical Software Engineering: The Legacy of Victor R. Basili. Springer, Secaucus (2005)
11. Brewer, C.A.: Color use guidelines for mapping and visualization. In: A. MacEachren, D. Taylor (eds.) Visualization in Modern Cartography, chap 7, pp 123–147. Elsevier, Tarrytown, NY (1994)
12. Brewer, C.A.: Guidelines for use of the perceptual dimensions of color for mapping and visualization. In: Color Hard Copy and Graphic Arts III, Proceedings of the International Society for Optical Engineering (SPIE), San Jose, vol 2171, pp 54–63 (1994)
13. Brewer, C.A.: ColorBrewer. http://colorbrewer2.org/ (2003)
14. Brewer, C.A., Hachard, G.W., Harrower, M.A.: Colorbrewer in print: a catalog of color schemes for maps. Cartogr Geogr Inf Sci 30(1), 5–32 (2003)
15. Buja, A., Cook, D., Swayne, D.: Interactive High-Dimensional Data Visualization. J Comput Graph Stat 5(1), 78–99 (1996)
16. Card, S.K., Mackinlay, J., Shneiderman, B.: Readings in Information Visualization: Using Vision to Think. Series in Interactive Technologies. The Morgan Kaufmann, Waltham (1999)
17. Chatfield, C.: The Analysis of Time Series: An Introduction, 6th edn. Chapman & Hall/CRC, London, UK (2003)
18. Chen, C.h., Härdle, W., Unwin, A.: Handbook of Data Visualization (Springer Handbooks of Computational Statistics), edn. Springer TELOS, Santa Clara (2008)
19. Chen, J., MacEachren, A.M.: Supporting the process of exploring and interpreting space-time multivariate patterns: The visual inquiry toolkit. Cartogr Geogr Inf Sci 35, 33–50 (2008)
20. Chernoff, H.: The use of faces to represent points in k-dimensional space graphically. J Am Stat Assoc 68(0), 361–368
(1973)
21. Cleveland, W.: The Elements of Graphing Data. Hobart, Lafayette, IN (1985)
22. Cleveland, W.: Visualizing data. Hobart, Lafayette, IN (1993)
23. Cleveland, W.C., McGill, M.E.: Dynamic Graphics for Statistics. CRC, Boca Raton (1988)
24. Cleveland, W.S.: Graphical methods for data presentation: Full scale breaks, dot charts, and multibased logging. Am Stat 38(4), 270–280 (1984)
25. Cleveland, W.S.: A model for studying display methods of statistical graphics (with discussion). J Comput Stat Graph 2, 323–364 (1993)
26. Cleveland, W.S., Devlin, S.J.: Locally weighted regression: an approach to regression analysis by local fitting. J Am Stat Assoc 83(0), 596–610 (1988)
27. Cleveland, W.S., McGill, M.E., McGill, R.: The shape parameter of a two-variable graph. J Am Stat Assoc 83(402), 289–300 (1988)
28. Cleveland, W.S., McGill, R.: The many faces of a scatterplot. J Am Stat Assoc 79(388), 807–822 (1984)
29. Cook, D.: The UCI KDD Archive. http://kdd.ics.uci.edu/; University of California, Department of Information and Computer Science
30. Cook, D., Swayne, D.F.: Interactive and Dynamic Graphics for Data Analysis: With Examples Using R and GGobi. Springer, Berlin Heidelberg New York (2007)
31. Cox, T.F., Cox, M.A.: Multidimensional Scaling. Chapman & Hall, London, UK (2001)
32. Craig, P., Haslett, J., Unwin, A., Wills, G.: Moving statistics - an extension of “brushing” for spatial data. In: Berk, Malone (eds.)
Proceedings of the 21st Symposium on the Interface, pp 170–174 (1989)
33. Di Battista, G., Eades, P., Tamassia, R., Tollis, I.G.: Algorithms for drawing graphs: an annotated bibliography. Comput Geom Theory Appl 4(5), 235–282 (1994)
34. Donoho, A.W., Donoho, D.L., Gasko, M.: Macspin: Dynamic graphics on a desktop computer. IEEE Comput Graph Appl 8(4), 51–58 (1988)
35. Evarts, H.F.: Introduction to PERT. Allyn and Bacon series in quantitative methods for business and economics. Allyn and Bacon, Boston (1964)
36. Ewing, J.: On a new seismograph for horizontal motion. Trans Seismol Soc Jpn 2, 45–49 (1880)
37. Few, S.: Show Me the Numbers: Designing Tables and Graphs to Enlighten. Analytics, Oakland (2004)
38. Few, S.: Now You See It: Simple Visualization Techniques for Quantitative Analysis. Analytics, Oakland (2009)
39. Fisherkeller, M., Friedman, J., Tukey, J.: Prim-s, an interactive multidimensional data display and analysis system. In: Dynamic Graphics for Statistics, pp 91–109. Wadsworth, Pacific Grove (1975)
40. Fisherkeller, M.A., Friedman, J.H., Tukey, J.W.: Prim-9: An interactive multi-dimensional data display and analysis system. In: ACM Pacific, pp 140–145 (1975)
41. Forta, B.: Sams Teach Yourself SQL in 10 Minutes, 3rd edn. Sams, Indianapolis (2004)
42. Friedman, J.: Exploratory projection pursuit. J Am Stat Assoc 82, 249–266 (1987)
43. Friedman, J., Tukey, J.: A projection pursuit algorithm for exploratory data analysis. IEEE Trans Comput C 23, 881–889 (1974)
44. Friendly, M.: Re-visions of minard. http://www.math.yorku.ca/SCS/Gallery/re-minard.html (2001)
45. Friendly, M.: Visions and Re-Visions of Charles Joseph Minard. J Educ Behav Stat 27(1), 31–51 (2002)
46. Friendly, M., Denis, D.J.: Milestones in the history of thematic cartography, statistical graphics, and data visualization. http://datavis.ca/milestones/ (2008)
47. Gershon, N., Page, W.: What storytelling can do for information visualization. Commun ACM 44(8), 31–37 (2001)
48. Government, U.: UK Met Office.
http://www.metoffice.gov.uk/
49. van Ham, F., Wattenberg, M., Viégas, F.B.: Mapping text with phrase nets. IEEE Trans Vis Comput Graph 15(6), 1169–1176 (2009)
50. Haslett, J., Bradley, R., Craig, P., Unwin, A., Wills, G.: Dynamic graphics for exploring spatial data with application to locating global and local anomalies. Am Stat 45(0), 234–242 (1991)
51. Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd edn. Springer Series in Statistics. Springer, Berlin Heidelberg New York (2009)
52. Havre, S., Hetzler, E., Whitney, P., Nowell, L.: Themeriver: Visualizing thematic changes in large document collections. IEEE Trans Vis Comput Graph 8(1), 9–20 (2002). DOI http://dx.doi.org/10.1109/2945.981848
53. Hayes, B.: Group Theory in the Bedroom, and Other Mathematical Diversions. MacMillan, New York (2009)
54. Hearst, M.A.: Search User Interfaces, edn. Cambridge University Press, Cambridge, UK (2009)
55. Heer, J., Agrawala, M.: Multi-scale banking to 45 degrees. IEEE Trans Vis Comput Graph 12(5), 701–708 (2006)
56. Holford-Strevens, L.: A Short History of Time. The Folio Society, London, UK (2007)
57. Huff, D.: How to Lie With Statistics. Norton, New York (1993)
58. Human-Computer Interaction Lab, University of Maryland: Lifelines for visualizing patient records. http://www.cs.umd.edu/hcil/lifelines/ (1998)
59. Hustwit, G.: Helvetica. Documentary Video (2007)
60. Indulska, M., Orlowska, M.E.: On aggregation issues in spatial data management. In: ADC ’02: Proceedings of the 13th Australasian database conference, pp 75–84. Australian Computer Society, Darlinghurst, Australia (2002)
61. Inselberg, A.: The Plane with Parallel Coordinates. Vis Comput 1, 69–91 (1985)
62. Inselberg, A.: Parallel Coordinates: Visual Multidimensional Geometry and Its Applications. Springer, Berlin Heidelberg New York (2009)
63. International Organization for Standardization: ISO 8601 Data elements and interchange formats – Information interchange –
Representation of dates and times. International Organization for Standardization, Geneva, Switzerland (1988). See also 1-page correction, ISO 8601:1988/Cor 1:1991
64. Johnson, A.: Solving Stonehenge: The New Key to an Ancient Enigma. Thames and Hudson, London, UK (2008)
65. Kaplan, A.: From Krakow to Krypton: Jews and Comic Books. Jewish Publication Society of America, Philadelphia (2008)
66. Kendall, M.: A new measure of rank correlation. Biometrika 30(1/2), 81–93 (1938)
67. Kipling, R.: Jungle book. MacMillan, London, UK (1894)
68. Kohonen, T.: Self-Organizing Maps, Springer Series in Information Sciences, vol 30. Springer, Berlin, Heidelberg (1995) (3rd extended edn 2001)
69. Koike, K.: The Assassin’s Road. No in Lone Wolf and Cub. Dark Horse, Milwaukie, OR (2000)
70. Kuhlthau, C.C.: Inside the search process: information seeking from the user’s perspective. J Am Soc Inf Sci 42(5), 361–371 (1999)
71. Leung, Y.K., Aerley, M.D.: A review and taxonomy of distortion-oriented presentation techniques. ACM Trans Comput.-Hum Interact 1(2), 126–160 (1994)
72. Levkowitz, H.: Perceptual steps along color scales. Int J Imag Syst Technol pp 97–101 (1996)
73. Lie, H.W., Bos, B.: Cascading Style Sheets: Designing for the Web. Addison-Wesley Longman, Boston (1997)
74. Mallet, R.: Great Neapolitan Earthquake of 1857: The First Principles of Observational Seismology as Developed in the Report to the Royal Society of London of the Expedition Made by Command of the Society Into the Interior of the Kingdom of Naples, to Investigate the Circumstances of the Great Earthquake of 1857. Chapman & Hall, London, UK (1862)
75. Marden, J.I.: Analyzing and Modeling Rank Data. Chapman & Hall, London, UK (1995)
76. Martin, A., Ward, M.: High dimensional brushing for interactive exploration of multivariate data. In: Visualization, 1995. Visualization ’95. Proceedings, IEEE Conference on, pp 271–. Los Alamitos (1995)
77. McLachlan, R.: The earthquake. Nature 30 (1884)
78. Miller, J.E.: The Chicago Guide to Writing
About Numbers 0226526313 University of Chicago Press, Chicago (2004) 79 Minard, C.J.: Carte figurative des pertes successives en hommes de l’Arm´ee Franc¸aise dans la campagne de Russie 1812-1813 self-published (1861) 80 Mosteller, F., Tukey, J.: Data Analysis and Regression: A Second Course in Statistics Addison-Wesley, Reading (1977) 81 Moyer, G.: Luigi lilio and the gregorian reform of the calendar Sky and Telescope (1982) 82 Nightingale, F.: Diagram of the causes of mortality in the army in the East Private publication (1858) 83 Ogata, Y.: Space-time point-process models for earthquake occurrences Ann Inst Stat Math 50(2), 379–402 (1998) 84 Orton, H., Sanderson, S., Widdowson, J (eds.): The Linguistic Atlas of England Routledge, New York (1978) 85 Pinker, S.: A theory of graph comprehension In: R Freedle (ed.) Artificial Intelligence and the Future of Testing (1990) 86 Plaisant, C., Mushlin, R., Snyder, A., Li, J., Heller, D., Shneiderman, B.: Lifelines: Using visualization to enhance navigation and analysis of patient records In: American Medical Informatic Association Annual Fall Symposium (Orlando, 9–11 Nov 1998), pp 76–80 AMIA (1998) 87 Playfair, W.: Commercial and Political Atlas: Representing, by Copper-Plate Charts, the Progress of the Commerce, Revenues, Expenditure, and Debts of England, during the Whole of the Eighteenth Century Corry, London (1786) 88 Priestley, J.: A Description of a New Chart of History J Johnson, London, UK (1769) 89 Radford, M.: Photograph of stonehenge http://www.flickr.com/photos/misterrad/ (2009) 90 Richard Weber, J.M.: Business intelligence competition, bi cup 2006 In: Talleres de Ingenier´ıa de Sistemas 2007 (2006) 91 Richardson, L.: Statistics of Deadly Quarrels Boxwood, Pittsburgh (1960) 92 Robbins, N.: Creating More Effective Graphs Wiley-Interscience, New York (2004) 93 Russell, M.A.: Dojo: The Definitive Guide O’Reilly Media, Sebastopol, CA (2008) 94 Sattler, K.U., Schallehn, E.: A data preparation framework based on a 
multidatabase language In: IDEAS ’01: Proceedings of the International Database Engineering & Applications Symposium, pp 219–228 IEEE Computer Society, Washington, DC (2001) 95 Schervish, M.J.: P values: What they are and what they are not Am Stat 50(3), 203–206 (1996) 96 Schuster, A.: On the investigation of hidden periodicities with application to a supposed 26 day period of meteorological phenomena Terr Magn Atmos Electr 3, 13–41 (1898) 97 Scott, D.W.: On optimal and data-based histograms Biometrika 66(0), 605–610 (1979) 98 Shneiderman, B.: The eyes have it: A task by data type taxonomy for information visualizations In: IEEE Visual Languages, UMCP-CSD CS-TR-3665, pp 336–343 College Park (1996) 99 Silverman, B.W.: Density Estimation for Statistics and Data Analysis Chapman & Hall, London, UK (1986) 100 Slingsby, A., Dykes1, J., Wood, J.: Using treemaps for variable selection in spatio-temporal visualisation Inf Vis 7, 210–224 (2008) 101 Smith, E.E.: Triplanetary Amazing Stories Jan - Apr (1934) 102 Sobel, D.: Longitude Penguin, London, UK (1995) 103 Sobel, D.: Longitude: The True Story of a Lone Genius Who Solved the Greatest Scientific Problem of His Time Penguin (1996) 104 Software, T.: Tableau software url: http://www.tableausoftware.com References 251 105 Stevens, S.S.: On the theory of scales of measurement Science 103, 677–680 (1946) 106 Stoker, B.: Dracula Constable, London, UK (1897) 107 Sullivan, L.H.: The tall office building artistically considered Lippincott’s Mag (1896) 108 Swayne, D.F., Cook, D., Buja, A.: XGobi: Interactive Dynamic Graphics in the X Window System with a Link to S In: American Statistical Association 1991 Proceedings of the Section on Statistical Graphics, pp 1–8 American Statistical Association, Alexandria (1992) 109 Theus, M.: Trellis displays vs interactive graphics Comput Stat 10, 113–127 (1995) 110 Titchener, E.B.: Experimental Psychology: A Manual of Laboratory Practice University of Michigan Press, Ann Arbor (1901) 111 
Tufte, E.R.: The visual display of quantitative information, edn Graphics Press, Chesire, CT (2001) 112 Tukey, J.W.: Exploratory data analysis Addison Wesley, Boston (1977) 113 Tukey, J.W., Tukey, P.: Strips displaying empirical distributions: textured dot strips Tech rep., Bellcore (1990) 114 University of Reading: The Soldier in Later Medieval England Online Database url: http://medievalsoldier.org (2009) 115 Unwin, A., Volinsky, C., Winkler, S.: Parallel coordinates for exploratory modelling analysis Comput Stat Data Anal 43(4), 553–564 (2003) 116 Unwin, A.R., Hawkins, G., Hofmann, H., Siegl, B.: Interactive Graphics for Data Sets with Missing Values - MANET J Comput Graph Stat 5(2), 113–122 (1996) 117 U.S Government: Popular Baby Names http://www.ssa.gov/babynames 118 Velleman, P.F.: Data desk The New Power of Statistical Vision Data Description Inc (1992) 119 Velleman, P.F., Wilkinson, L.: Nominal, ordinal, interval, and ratio typologies are misleading (c/r: 93v47 p314-316; com: 94v48 p61-62) Am Stat 47(0), 65–72 (1993) 120 Vi´egas, F., Wattenberg, M.M.: ManyEyes http://manyeyes.alphaworks.ibm.com (2007) 121 Viegas, F.B., Wattenberg, M., van Ham, F., Kriss, J., McKeon, M.: ManyEyes: a Site for Visualization at Internet Scale IEEE Trans Vis Comput Graph 13(6), 1121–1128 (2007) DOI 10.1109/TVCG.2007.70577 122 Wainer, H.: Graphic Discovery: A Trout in the Milk and Other Visual Adventures Princeton University Press, Princeton (2004) 123 Wainer, H., Spence, I (eds.): The Commercial and Political Atlas and Statistical Breviary Cambridge University Press, Cambridge, UK (2005) 124 Ware, C.: Visual Thinking for Design, illustrated edn Morgan Kaufmann, Waltham (2008) 125 Wattenberg, L.: Baby Name Wizard http://www.babynamewizard.com/voyager 126 Wattenberg, M.: Arc diagrams: visualizing structure in strings In: Information Visualization, 2002, pp 110–116 (2002) 127 Wattenberg, M.: Baby names, visualization, and social data analysis In: IEEE Symposium on Information 
Visualization (InfoVis 2005) IEEE Computer Society, Los Alamitos (2005) 128 Wegman, E.: Hyperdimensional Data Analysis Using Parallel Coordinates J Am Stat Assoc 85, 664–675 (1990) 129 Wegman, E.J.: Hyperdimensional data analysis using parallel coordinates J Am Stat Assoc 85(411), 664–675 (1990) 130 Wickham, H.: ggplot: an implementation of the grammar of graphics in r In: UseR Conference, Vienna (2006) 131 Wickham, H.: ggplot: Elegant Graphics for Data Analysis Springer, Berlin Heidelberg New York (2009) ISBN: 978-0-98140-6 132 Wilhelm, A.: Interactive Statistical Graphics: The Paradigm of Linked Views Universităat Augsburg, Augsburg, Germany (1999) 133 Wilkinson, D.O.: Deadly Quarrels: Lewis F Richardson and the Statistical Study of War University of California Press, Berkeley (1980) 134 Wilkinson, L.: The Grammar of Graphics Statistics and Computing Springer, Berlin Heidelberg New York (1999) 135 Wilkinson, L.: The Grammar of Graphics Statistics and Computing Springer, Berlin Heidelberg New York (2005) 252 References 136 Wilkinson, L., Anand, A., Grossman, R.: Graph-theoretic scagnostics In: IEEE Symposium on Information Visualization, pp 157–164 IEEE Computer Society, Los Alamitos (2005) 137 Wilkinson, L., Wills, G.: Scagnostics distributions J Comput Graph Stat 17(2), 473–491 (2008) DOI 10.1198/106186008 138 Wills, G.: Linked data views In: A.U Chun-houh Chen Wolfgang Hăardle (ed.) 
Handbook of Data Visualization (Springer Handbooks of Computational Statistics), chap II.9, pp 216–241 Springer, Santa Clara (2008) 139 Wills, G., Wilkinson, L.: Autovis: automatic visualization Inf Vis 9, 47–69 (2010) 140 Wills, G.J.: Natural selection: Interactive subset creation J Comput Graph Stat 9(3) (2000) 141 Wills, G.J., Keim, D.: Data visualization for domain exploration: interactive statistical graphics In: Handbook of Data Mining and Knowledge Discovery, pp 226–232 Oxford University Press, Inc., New York (2002) 142 Wright, W.: Simcity Computer Game (1988) 143 Young, F.: Visual Statistics: Seeing Data with Dynamic Interactive Graphics WileyInterscience, New York (2006) 144 Young, F.W., Hamer, R.M.: Multidimensional Scaling: History, Theory, and Applications Erlbaum, Mahwah, NJ (1987) ... Chambers D Hand W Hăardle For further volumes: http://www.springer.com/series/3022 Graham Wills Visualizing Time Designing Graphical Representations for Statistical Data 123 Graham Wills Hidden Spring... different calendar systems that have been employed across G Wills, Visualizing Time: Designing Graphical Representations for Statistical Data, Statistics and Computing, DOI 10.1007/978-0-387-77907-2... portrayal of time data is ubiquitous Any newspaper will feature time- based plots; any company report will show historical data as charts Even the gas bill for my home invites me to compare a time series