{"id":1426,"date":"2010-03-19T07:06:00","date_gmt":"2010-03-19T07:06:00","guid":{"rendered":"http:\/\/www.smartdatacollective.com\/index.php\/post\/25594\/"},"modified":"2010-03-19T07:06:00","modified_gmt":"2010-03-19T07:06:00","slug":"25594","status":"publish","type":"post","link":"https:\/\/www.smartdatacollective.com\/25594\/","title":{"rendered":"Looking at Trees to Understand the Forest"},"content":{"rendered":"<p>David Simon (of <a href=\"http:\/\/en.wikipedia.org\/wiki\/The_Wire\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">The Wire<\/a> fame) has sucked me into another brilliant television series with&nbsp;<a href=\"http:\/\/en.wikipedia.org\/wiki\/Generation_Kill_%28TV_series%29\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">Generation Kill<\/a>. It is&nbsp;the story of a Marine recon unit at the beginning of the Iraq war. 
At the heart of all the action, the seven-part miniseries offers intimate and honest profiles of individual Marines.<\/p>\n<p><object height=\"405\" width=\"500\"><param name=\"movie\" value=\"http:\/\/www.youtube.com\/v\/aSLAIKjT7y8&amp;hl=en_US&amp;fs=1&amp;rel=0&amp;color1=0x006699&amp;color2=0x54abd6&amp;border=1\"><param name=\"allowFullScreen\" value=\"true\"><param name=\"allowscriptaccess\" value=\"always\"><embed src=\"http:\/\/www.youtube.com\/v\/aSLAIKjT7y8&amp;hl=en_US&amp;fs=1&amp;rel=0&amp;color1=0x006699&amp;color2=0x54abd6&amp;border=1\" type=\"application\/x-shockwave-flash\" allowscriptaccess=\"always\" allowfullscreen=\"true\" height=\"405\" width=\"500\"><\/object><\/p>\n<p>The characters don&#8217;t so much displace stereotypes as reveal texture and insight about the unique qualities of individual Marines.<\/p>\n<p>The series got me thinking once again about different ways to analyze data. Almost four years ago, I wrote a couple of blog posts (<a href=\"http:\/\/www.juiceanalytics.com\/writing\/customer-sparklines\/\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">Part 1<\/a> and <a href=\"http:\/\/www.juiceanalytics.com\/writing\/a-missing-link-in-business-analytics-part-2\/\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">Part 2<\/a>)&nbsp;making a case for analyzing and visualizing data at a granular level to uncover patterns and behaviors. 
Generation Kill is a case study in looking closely at the individual trees to understand the forest.<\/p>\n<p>Analytics is a journey of exploration&#8211;a continuous series of iterations with the goal of deeper understanding based on better questions and more targeted analyses.&nbsp;Einstein said:<\/p>\n<blockquote><p>&#8220;To raise new questions, new possibilities, to regard old problems from a new angle, requires creative imagination and marks real advance in science.&#8221;<\/p><\/blockquote>\n<p>How do we arrive at new questions?<\/p>\n<p>In the previous blog post, I described examples from online learning, credit card usage, and football film study to show how granular analysis can spur new questions. I&#8217;ve stumbled across a series of new examples recently:<\/p>\n<p><strong>Surveys.<\/strong> Survey analysis is hard work&#8211;just ask Ken, who recently presented&nbsp;<a href=\"http:\/\/www.juiceanalytics.com\/writing\/survey-results-are-viz-pundits-really-helping\/\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">results<\/a>&nbsp;from Juice&#8217;s survey on the practice of information visualization in organizations. If a survey is mostly about understanding your audience, rolling up responses by question can&#8217;t be the only approach (though it is the most common).&nbsp;<a href=\"http:\/\/en.wikipedia.org\/wiki\/Cross_tabulation\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">Cross tabs<\/a>&nbsp;(&#8220;displays the joint distribution of two or more variables&#8221;) are one direction to go. 
Another approach is to look for people who share common characteristics or patterns in their responses.<\/p>\n<p>Macrofocus&#8217;&nbsp;<a href=\"http:\/\/macrofocus.com\/public\/products\/surveyvisualizer\/\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">SurveyVisualizer<\/a>&nbsp;is the most innovative survey analysis tool I&#8217;ve seen, and it emphasizes data at a granular level.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/media.juiceanalytics.com\/images\/SurveyVisualizer.jpg\" height=\"340\" width=\"450\"><\/p>\n<p>&#8220;All the analysis elements&nbsp;are always shown as grey lines in the background. This provides an overview of the ranges and spreads of the individual values for each node, and facilitates the detection of outliers.&#8221; (from&nbsp;Visualization of Large-Scale Customer Satisfaction Surveys Using a Parallel Coordinate Tree)<\/p>\n<p><strong>Medical research.<\/strong>&nbsp;Research studies are conducted against carefully defined target and control populations, and their conclusions rest on aggregate statistics across those populations. However, the ability to review the patterns of diagnoses and procedures at the individual patient level can help test assumptions about the target population and refine the parameters of a study. Better model inputs; better results.<\/p>\n<p><strong>Speech analytics.<\/strong> Michel Guillet at&nbsp;<a href=\"http:\/\/www.nexidia.com\/\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">Nexidia<\/a>&nbsp;recently told me about their approach to speech data:<\/p>\n<blockquote><p>Nexidia\u2019s speech analytics can mine thousands of hours of audio to categorize, correlate or spot trends. However, it is quite often in identifying and listening to a lone outlier that the application provides its most valuable insights. 
Some examples of outliers can be the very long call of a particular call type, the extremely abrupt one, the one with the most languages spoken or the one where no one is speaking at all. An outlier can change your hypotheses and put you in a different direction\u2026perhaps a better one. Nexidia\u2019s reporting and analysis tools offer many different methodologies including histograms, analysis of means charts and flexible filtration by meta-data to identify outliers in large amounts of data. In addition, Nexidia\u2019s ad-hoc search functionality allows users to search an entire body of audio content at any time, which is often helpful to find the \u201csmoking gun\u201d or a single recording which can make or break an argument.<\/p><\/blockquote>\n<p>Of course you can&#8217;t be assured of a full or accurate picture when looking at granular data, but somewhere between standard aggregation-based analysis and granular views lies the truth.<\/p>\n<p>\n&nbsp;<a href=\"http:\/\/www.juiceanalytics.com\/writing\/looking-at-trees-to-understand\/\" title=\"http:\/\/www.juiceanalytics.com\/writing\/looking-at-trees-to-understand\/\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">Link to original post<\/a> <\/p>\n","protected":false},"excerpt":{"rendered":"<p>David Simon (of The Wire fame) has sucked me into another brilliant television series with&nbsp;Generation Kill. It is&nbsp;the story of a Marine recon unit at the beginning of the Iraq war. At the heart of all the action, the seven-part miniseries offers intimate and honest profiles of individual Marines. 
The characters don&#8217;t so much [&hellip;]<\/p>\n","protected":false},"author":94,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_seopress_robots_primary_cat":"","_seopress_titles_title":"","_seopress_titles_desc":"","_seopress_robots_index":"","footnotes":""},"categories":[6,8],"tags":[116,282,577],"class_list":{"0":"post-1426","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-data-visualization","7":"category-predictive-analytics","8":"tag-analytics","9":"tag-data-visualization","10":"tag-survey-analysis"},"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/posts\/1426","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/users\/94"}],"replies":[{"embeddable":true,"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/comments?post=1426"}],"version-history":[{"count":0,"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/posts\/1426\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/media?parent=1426"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/categories?post=1426"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/tags?post=1426"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}