{"id":2788,"date":"2015-10-20T10:34:54","date_gmt":"2015-10-20T17:34:54","guid":{"rendered":"http:\/\/mozscienceblog.wpengine.com\/?p=2788"},"modified":"2015-10-20T10:34:54","modified_gmt":"2015-10-20T17:34:54","slug":"drinking-from-the-firehose","status":"publish","type":"post","link":"https:\/\/blog.mozilla.org\/foundation-archive\/mozilla-science\/drinking-from-the-firehose\/","title":{"rendered":"Drinking from the Firehose: Jumping into Open Data"},"content":{"rendered":"<p>It\u2019s now been <del>three<\/del> four weeks since I left academia to join the <a title=\"Mozilla Science Lab\" href=\"https:\/\/www.mozillascience.org\/\">Mozilla Science Lab<\/a> team as the inaugural <a title=\"Welcoming Our First Open Data Training Lead\" href=\"https:\/\/www.mozillascience.org\/welcoming-our-first-open-data-training-lead\">Open Data Training Lead<\/a>. \u00a0Being the first person in a newly created position isn\u2019t new to me. \u00a0I was the first Data Services Coordinator position at the University of Washington Libraries and built the Research Data Services unit there. \u00a0Working at a non-profit foundation isn\u2019t new to me either. \u00a0I spent a couple years at the Bill &amp; Melinda Gates Foundation traveling the country as a computing trainer in libraries. \u00a0That being said, I still feel like this is a whole new world to me\u2026 a fun, engaging and highly collaborative world.<\/p>\n<div id=\"attachment_2789\" style=\"width: 310px\" class=\"wp-caption alignright\"><a href=\"https:\/\/mozscienceblog.wpengine.com\/wp-content\/uploads\/2015\/10\/201510FellowsEscapeRoom.jpg\"><img aria-describedby=\"caption-attachment-2789\" decoding=\"async\" loading=\"lazy\" class=\"size-medium wp-image-2789\" src=\"https:\/\/mozscienceblog.wpengine.com\/wp-content\/uploads\/2015\/10\/201510FellowsEscapeRoom-300x169.jpg\" alt=\"We escaped\" width=\"300\" height=\"169\" \/><\/a><p id=\"caption-attachment-2789\" class=\"wp-caption-text\">Mozilla Science Lab staff and fellows Escape the Room!<\/p><\/div>\n<p><strong>There has been so much going on this month!<\/strong> \u00a0We\u2019ve had a <a title=\"Mozilla Open Science Leadership Summit\" href=\"https:\/\/www.mozillascience.org\/open-science-summit-2015\">Mozilla Open Science Leadership Summit<\/a> in Toronto, participated in the <a title=\"Open Source &amp; Feelings\" href=\"http:\/\/osfeels.com\/\">Open Source &amp; Feelings<\/a> conference in Seattle, and brought on new <a title=\"Mozilla Fellows for Science\" href=\"https:\/\/www.mozillascience.org\/announcing-our-2015-mozilla-fellows-for-science\">Mozilla Fellows for Science<\/a> and mingled with <a title=\"Data &amp; Society\" href=\"http:\/\/www.datasociety.net\/\">Data &amp; Society<\/a> folks in Brooklyn. \u00a0The team is also getting things ready for <a title=\"Mozilla Festival\" href=\"https:\/\/2015.mozillafestival.org\/\">MozFest<\/a> in London in a few weeks. \u00a0Busy, busy, busy! Even with all that, I\u2019ve had some time to reflect on the experiences as well as think of some plans for leading our open data training going forward.<\/p>\n<p>Before I begin, I want to take what I learned about jargon-busting at the Leadership Summit mentioned above to define \u201copen data\u201d. \u00a0(You can read about and play along with the jargon-busting exercise <a title=\"Jargon-busting exercise\" href=\"http:\/\/mozillascience.github.io\/open-science-leadership-workshop\/de-jargonizing_activity.html\">here<\/a>.) The idea behind this exercise is to only use the 1,000 most popular words in the English language to communicate what you\u2019re trying to say. \u00a0After some trial and error, here\u2019s what I came up with: <strong>open data = groups of facts and figures shared in a way that allows other people to use them to make new understanding of how the world works.<\/strong> \u00a0If you have a different definition you like, try it out here and let me know: <a title=\"Upgoer 5\" href=\"http:\/\/splasho.com\/upgoer5\/\">http:\/\/splasho.com\/upgoer5\/<\/a>.<\/p>\n<h2>Value of open data<\/h2>\n<p>So far most of my experience with data in my professional life has been in the world of academia and research data. \u00a0These last two weeks have opened my eyes to how relevant open data is beyond that narrow focus: from the elementary student who wants to get her &#8220;hands dirty&#8221; with a science project to the octogenarian looking for health resources in his neighborhood. \u00a0I was excited to stumble upon projects such as <a title=\"Smart Chicago\" href=\"http:\/\/www.smartchicagocollaborative.org\/\">Smart Chicago<\/a> and one of their projects, <a title=\"Chicago Early Learning Portal\" href=\"http:\/\/www.smartchicagocollaborative.org\/work\/education\/chicago-early-learning-portal\/\">Chicago Early Learning Portal<\/a>, which provides open data allowing residents to find and compare early learning programs in their neighborhoods.<\/p>\n<p><strong>Open data encourages transparency, feedback, collaboration and advances knowledge in ways that can\u2019t be done when it is kept on a hard drive in a research lab for a small group of people. <\/strong>\u00a0To quote an old African proverb (which also happens to be the frequently stated motto of <a title=\"Project Hydra\" href=\"http:\/\/projecthydra.org\/\">an open source repository framework<\/a>): &#8220;If you want to go fast, go alone. \u00a0If you want to go far, go together.&#8221;\u00a0 I could go on for ages about this but it really deserves its own post, so I&#8217;ll move on.<\/p>\n<h2>How to make it open<\/h2>\n<p>Open data is more than just placing a final research product on a personal website. \u00a0It requires context and sustainable formats to make it usable. \u00a0Think of a book on a shelf in a library. \u00a0If there wasn\u2019t information in a catalog (physical or electronic), how would you know what the book was about or where to find it? \u00a0What if it was an e-book? \u00a0If that book was saved on a <a title=\"Wikipedia entry for &quot;Zip Drive&quot;\" href=\"https:\/\/en.wikipedia.org\/wiki\/Zip_drive\">Zip disk<\/a>, would you be able to open it and read it?<\/p>\n<p>Open data follows the same rules. \u00a0<strong>We need to work with and train researchers and other data producers to identify what information about a dataset (metadata) needs to be collected and maintained with the dataset to make it findable and reusable. \u00a0<\/strong>(The Digital Curation Centre in the UK has some <a title=\"Digital Curation Centre info on metadata standards\" href=\"http:\/\/www.dcc.ac.uk\/resources\/briefing-papers\/standards-watch-papers\/using-metadata-standards\">great resources for that<\/a>, by the way.) \u00a0There are also places where data can be stored where it is much more likely to be found and expertly maintained than on a personal website. \u00a0Check out <a title=\"re3data repository catalog\" href=\"http:\/\/re3data.org\/\">re3data.org<\/a> for a catalog of data repositories. \u00a0For an example of how to do great open research, check out the <a title=\"Roberts Lab\" href=\"https:\/\/faculty.washington.edu\/sr320\/\">Roberts Lab<\/a> at the University of Washington, particularly their <a title=\"Roberts Lab Data &amp; Resource Sharing Plan\" href=\"https:\/\/github.com\/sr320\/LabDocs\/blob\/master\/DataSharePlan.md\">Data and Resource Sharing Plan<\/a>.<\/p>\n<h2>A focus on re-use<\/h2>\n<p><strong>The benefits of open data are not realized unless that data is used and reused by others. \u00a0\u00a0<\/strong>Undersea data collected in the 1960s assisted today\u2019s researchers in <a title=\"1960's data used in new earthquake research\" href=\"http:\/\/www.washington.edu\/news\/2014\/08\/15\/research-from-1960s-shakes-up-understanding-of-west-coast-earthquakes\/\">understanding earthquakes in the PNW<\/a>.\u00a0 <a title=\"New York data used for public good during Hurricane Sandy\" href=\"http:\/\/techcrunch.com\/2013\/01\/11\/data-and-digital-saved-lives-in-nyc-during-hurricane-sandy\/\">New York City data made freely available<\/a> during and after Hurricane Sandy allowed for the creation of tools to help the public find shelters, gas and food.\u00a0 <a title=\"Student project from Data Science for Social Good in Chicago\" href=\"http:\/\/dssg.io\/2014\/12\/10\/mexico-distilling-data-policy.html\">Students in the Data Science for the Social Good<\/a> program at the University of Chicago combined several datasets from Mexico to develop a plan to reduce maternal mortality there.<\/p>\n<p><strong>Work like this should be encouraged and rewarded. It should be taught as part of the research process early in a researcher\u2019s career so it is thought of as the norm.<\/strong> \u00a0There has already been <a title=\"&quot;Data reuse and the open data citation advantage&quot;\" href=\"https:\/\/peerj.com\/articles\/175\/\">research indicating there is a long-lasting effect on citation and reuse<\/a> for open and citable data, long after a paper is published. \u00a0We need to encourage funders to reward the reuse of data in grant proposals and work with institutions to acknowledge data sharing and reuse in the tenure and promotion process.<\/p>\n<p><a title=\"Open Knowledge Foundation\" href=\"https:\/\/okfn.org\/\">Open Knowledge Foundation<\/a> founder Rufus Pollock once said \u201cThe best thing to do with your data will be thought of by someone else.\u201d This isn\u2019t a statement to be afraid of, it should be embraced. \u00a0By sharing your data and using data from others, you too can be that \u201csomeone else\u201d.<\/p>\n<h2>Jump into the water with me<\/h2>\n<div style=\"width: 435px\" class=\"wp-caption alignleft\"><img decoding=\"async\" loading=\"lazy\" class=\"\" src=\"http:\/\/memberhub.com\/wp-content\/uploads\/2009\/06\/firehose-girl.jpg\" alt=\"Girl drinking from firehose\" width=\"425\" height=\"282\" \/><p class=\"wp-caption-text\">Drinking from the firehose<\/p><\/div>\n<p>This program will be more successful with contributions from the community it is meant to serve. \u00a0Feedback is both welcome and wanted. \u00a0Are there other areas that should be a priority as we develop plans for the open data training program? More programs and projects you think I should be investigating? \u00a0Tools and resources of which we should be aware? \u00a0I want to hear it. \u00a0You can leave a comment here , find me on Twitter <a title=\"my Twitter handle\" href=\"https:\/\/twitter.com\/shefw\">@shefw<\/a>, or <a href=\"mailto:%20stephanie@mozillafoundation.org\">drop me an email<\/a>. \u00a0I will also be starting a repository on GitHub for development of our Open Data Training plan and sharing our work there as it evolves. \u00a0I\u2019ll get that link out when it happens\u2026 stay tuned!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>It\u2019s now been three four weeks since I left academia to join the Mozilla Science Lab team as the inaugural Open Data Training Lead. \u00a0Being the first person in a newly created position isn\u2019t new to me. \u00a0I was the &hellip; <a class=\"go\" href=\"https:\/\/blog.mozilla.org\/foundation-archive\/mozilla-science\/drinking-from-the-firehose\/\">Continue reading<\/a><\/p>\n","protected":false},"author":144,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[290376],"tags":[],"_links":{"self":[{"href":"https:\/\/blog.mozilla.org\/foundation-archive\/wp-json\/wp\/v2\/posts\/2788"}],"collection":[{"href":"https:\/\/blog.mozilla.org\/foundation-archive\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.mozilla.org\/foundation-archive\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.mozilla.org\/foundation-archive\/wp-json\/wp\/v2\/users\/144"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.mozilla.org\/foundation-archive\/wp-json\/wp\/v2\/comments?post=2788"}],"version-history":[{"count":0,"href":"https:\/\/blog.mozilla.org\/foundation-archive\/wp-json\/wp\/v2\/posts\/2788\/revisions"}],"wp:attachment":[{"href":"https:\/\/blog.mozilla.org\/foundation-archive\/wp-json\/wp\/v2\/media?parent=2788"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.mozilla.org\/foundation-archive\/wp-json\/wp\/v2\/categories?post=2788"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.mozilla.org\/foundation-archive\/wp-json\/wp\/v2\/tags?post=2788"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}