{"id":7021,"date":"2012-05-31T19:55:52","date_gmt":"2012-05-31T19:55:52","guid":{"rendered":"http:\/\/www.smartdatacollective.com\/index.php\/post\/does-it-take-scientist-find-gold-big-data\/"},"modified":"2012-05-31T19:55:52","modified_gmt":"2012-05-31T19:55:52","slug":"does-it-take-scientist-find-gold-big-data","status":"publish","type":"post","link":"https:\/\/www.smartdatacollective.com\/does-it-take-scientist-find-gold-big-data\/","title":{"rendered":"Does It Take a Scientist to Find Gold in Big Data?"},"content":{"rendered":"<p><a href=\"http:\/\/bit.ly\/L8t3vm\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\"><img loading=\"lazy\" loading=\"lazy\" decoding=\"async\" class=\"alignleft  wp-image-12467\" style=\"float: left; margin-top: 5px; margin-bottom: 5px; margin-left: 15px; margin-right: 15px;\" title=\"big-data-gold\" src=\"http:\/\/spotfireblog.tibco.com\/wp-content\/uploads\/big-data-gold-150x150.jpg\" alt=\"big data gold 150x150 photo (enterprise data scientist 2 data analytics business intelligence big data )\" width=\"200\" height=\"200\" \/><\/a>In a recent&nbsp;<a href=\"http:\/\/news.cnet.com\/8301-1001_3-57434736-92\/big-data-is-worth-nothing-without-big-science\/\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">CNET article<\/a>, Alex Yoder (<a h\n<!--more--><\/p>\n<p><a href=\"http:\/\/bit.ly\/L8t3vm\" target=\"_blank\"><img loading=\"lazy\" loading=\"lazy\" decoding=\"async\" class=\"alignleft  wp-image-12467\" style=\"float: left; margin-top: 5px; margin-bottom: 5px; margin-left: 15px; margin-right: 15px;\" title=\"big-data-gold\" src=\"http:\/\/spotfireblog.tibco.com\/wp-content\/uploads\/big-data-gold-150x150.jpg\" alt=\"big data gold 150x150 photo (enterprise data scientist 2 data analytics business intelligence big data )\" width=\"200\" height=\"200\" \/><\/a>In a recent&nbsp;<a href=\"http:\/\/news.cnet.com\/8301-1001_3-57434736-92\/big-data-is-worth-nothing-without-big-science\/\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">CNET article<\/a>, Alex Yoder (<a href=\"http:\/\/www.twitter.com\/yodera\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">@yodera<\/a>) compared data to the value of other can\u2019t-live-without assets \u2013 oil and gold. However, he says that big data has a major differentiator \u2013 \u201cdata has no intrinsic value.\u201d<\/p>\n<p>He\u2019s right. The value of data comes from its refinement.<\/p>\n<p>Yoder writes, \u201cGold requires mining and processing before it fines its way into our jewelry, electronics and even the Fort Knox vault. Oil requires extraction and refinement before it becomes the gasoline that fuels our vehicles.&nbsp;Likewise, data requires collection, mining and, finally, analysis before we can realize its true value for businesses, governments and individuals alike.\u201d<span><\/span><\/p>\n<p>What\u2019s interesting about this comparison is that behind the mining, processing and analysis is big science. It takes a scientific approach and processes to find the value in data and precious commodities. Yoder defines the analysis portion of extracting value from data as the science.<\/p>\n<p><strong>The Science of Finding Gold in Data<\/strong><\/p>\n<p>As Yoder points out, there\u2019s also science in the mining and processing of the data as well&nbsp;\u2013&nbsp;just go back to our <a href=\"http:\/\/spotfireblog.tibco.com\/?p=10811\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">interview<\/a> with Gregory Piatetsky-Shapiro (<a href=\"http:\/\/www.twitter.com\/kdnuggets\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">@KDNuggets<\/a>) on the topic of data science and predictive analytics.<\/p>\n<p>Piatetsky-Shapiro says, \u201cWhat we want to find in data is some understandable knowledge and not just incomprehensible patterns. He adds, \u201c[A] better understanding of predictive models will contribute to increased trust for such models.\u201d<\/p>\n<p>Piatetsky-Shapiro is a living example of how science can extract value from data. He\u2019s developed models to help change the world. In our interview, he told us he has developed models for life-changing predictive analytics such as predicting child support non-payment and attrition to predicting drug effectiveness to developing methods for identifying fraud in online auctions.<\/p>\n<p><strong>Didn\u2019t Gold Mining Start with a Simple Quest for Something More?<\/strong><\/p>\n<p>While Yoder and Piatetsky-Shapiro both point to the methods, models and software as keys to refining value, they also recognize the human element. Yoder says, \u201cIt takes complex algorithms, powerful computing and perhaps most of all, human analysts to build and administer the big science that turns the \u2018then and now\u2019 nature of big data into \u2018when.\u2019\u201d<\/p>\n<p>He points to a projection from <a href=\"http:\/\/www.mckinsey.com\/Features\/Big_Data\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">McKinsey Global<\/a> that the US needs 140,000 to 190,000 additional deep analytical minds working to derive the value of data. Often labeled as data scientists (quite fitting with the topic of big data being backed by big science), Yoder says these experts are thought to be the next class of \u201cdigital and corporate geniuses.\u201d<\/p>\n<p>Piatetsky-Shapiro\u2019s take is that the data scientist role is much like an engineer or wizard with a skill set that combines \u201ccode tuning, business insight and knowing how to extract business value from data.\u201d<\/p>\n<p><strong>Our Take: Give Your People Gold Fever<\/strong><\/p>\n<p>We think there\u2019s a place and a definite need for these data wranglers. However, if an organization is going to join the gold rush that big data promises, don\u2019t you think we have to start with the users? And give them the tools they need \u2013 the picks, the pans and the drive or \u201cgold fever\u201d to \u201chead for the hills?\u201d<\/p>\n<p>Their tools are analytics systems (maps) and the \u201cquest for knowledge\u201d that allows them to inspect the data landscape and find the gold nugget-filled streams and hills.<\/p>\n<p>Mark Lorion (<a href=\"http:\/\/www.twitter.com\/mark_lorion\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">@mark_lorion<\/a>) alludes to why the tools and the questions matter most in a <a title=\"Defining Business Analytics \u2013 Recap of the May 17 DM Radio Show Featuring Spotfire\u2019s Mark Lorion\" href=\"http:\/\/spotfireblog.tibco.com\/?p=12260\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">recent interview<\/a> on DM radio. He says, \u201cIf you measure it, you\u2019ll change it. What\u2019s hiding in your data that you haven\u2019t measured?\u201d<\/p>\n<p>For example, he says when a business user has questions, the tools often get in the way of the answers. He says it\u2019s hard for a user to find the right answers in a spreadsheet or other tools when the presentation is hiding what he\u2019s trying to see. It\u2019s frustrating because the user knows he can beat the competition if he can get reliable information from the data.<\/p>\n<p>Maybe the gold or \u201cknowledge discovery\u201d (as Piatetsky-Shapiro says) starts with the right tools and the right questions and a little bit of fever.<\/p>\n<p><strong>Next Steps:<\/strong>&nbsp;See how Spotfire version 4.5 empowers users to \u201cfind the gold\u201d insights hidden in&nbsp;big data&nbsp;and unstructured information in our&nbsp;<a href=\"http:\/\/bit.ly\/L8t3vm\" target=\"_blank\" data-wpel-link=\"external\" rel=\"external noopener noreferrer ugc\">webcast<\/a>, \u201cWhat\u2019s New with Spotfire 4.5,\u201d taking place today, May 31, at 1 p.m. Eastern.<\/p>\n<p>Amanda Brandon<br \/> Spotfire Blogging Team<\/p>\n<p><img loading=\"lazy\" loading=\"lazy\" decoding=\"async\" src=\"http:\/\/feeds.feedburner.com\/~r\/tibco\/mRBO\/~4\/Rhuq_M2wnKM\" alt=\"\" width=\"1\" height=\"1\" \/><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In a recent&nbsp;CNET article, Alex Yoder (<\/p>\n","protected":false},"author":33,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_seopress_robots_primary_cat":"","_seopress_titles_title":"","_seopress_titles_desc":"","_seopress_robots_index":"","footnotes":""},"categories":[15,4,29,16],"tags":[252],"class_list":{"0":"post-7021","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-analytics","7":"category-data-mining","8":"category-text-analytics","9":"category-unstructured-data","10":"tag-big-data"},"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/posts\/7021","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/users\/33"}],"replies":[{"embeddable":true,"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/comments?post=7021"}],"version-history":[{"count":0,"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/posts\/7021\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/media?parent=7021"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/categories?post=7021"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.smartdatacollective.com\/wp-json\/wp\/v2\/tags?post=7021"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}