Why Do Scientists Want to Learn About Code Review?

A month ago, we launched a second pilot study of code review for scientific software as a follow-on to the first study reported here in November. Where the first round had professional software developers review the software associated with papers published in PLOS Computational Biology, this round paired experienced scientific programmers with small groups of less-experienced ones to explore ways of transferring the practice itself.

Ten groups ranging in size from a couple of people to half a dozen are taking part in this study. In interviews and early discussions, they identified four main reasons why they think they ought to do code review. The first is rigor: scientists want to get the right answer, and they want other scientists and their future selves to be able to reproduce that answer. Most recognize that correctness and reproducibility aren’t the same thing, i.e., that it’s possible for a wrong answer to be 100% reproducible. If given the choice, most would probably choose the former over the latter, but the goal is to achieve both, reliably.

Many are also acutely aware of how their understanding of their own code dissipates over time and of the cost downstream of either recovering understanding or re-coding. As once said, “one month later, it’s too much work to impose order… six months later, you just throw it away and start from scratch”. They may not know how code review will help with that, but they hope that it will.

The second motivation is reusability. Scientists suspect that they spend a lot of time reinventing the wheel, and that their wheels are continually being reinvented by colleagues. They believe (or at least hope) that doing code reviews will help them create software that others will be able to use. Some hope that, as a result, results may be aggregated more easily. However, they also recognize that packaging and distribution for re-use will be relatively easy, compared to shifting their colleagues to a culture of re-use. Part of what they hope to learn through this study is what the actual costs and benefits of making something reusable are, so that they can tell whether that effort will be worthwhile.

A third reason for taking part in this study is that it’s an opportunity for collaboration: they can use code review as an excuse for talking about code with their colleagues. More than one has said that these conversations simply don’t happen right now, and that as a result, a lot of sharing doesn’t happen either. Some believe that coding as part of a collaborative team is a “better way to work”, both for the code and for the science, because the collaboration will foster better testing, will encourage scientists to produce code that is easier to understand and share, and will make team members more aware of each other’s research. It’s not enough to develop rigorous and reusable code — achieving the full benefits of re-use requires good social mechanisms for promoting awareness and sharing.

And finally, many participants are using the study as an opportunity to learn about more than just code review, e.g., “I have a lot to learn about better coding”. They all feel that there must be a better way to build programs than what they’re doing right now; taking part in this study is, for them, a way to get mentoring from someone who can answer questions they don’t know to ask. They also want to “share best practice with other scientists”.

One thing that we didn’t hear in our interviews is people saying that they wanted to do code reviews in order to learn about the science embodied in that code. Rightly or wrongly, scientists seem to feel that “what it does” can be learned in others ways. We’ll talk about a few other surprises in our next posts, which will look at what mentors think the payoff from code review is.

Changes to Thimble URLs: goodbye naming conflicts, hello bit.ly


Creating web pages in Thimble? You may notice some changes to your Thimble links.

What’s different?

Your Thimble URLs now have an extra bit in the middle: a unique identifier made up of random letters. So for example, in the past your URL might have looked like this:

Why did we do this?

To prevent naming conflicts. Previously, the lack of any project identifier meant you could only ever create one page with a specific title.
Say, for example, you needed two pages with the title “my projects” or “test.” Or maybe you made a page called “School project” a year ago, and now you want to make a new one with that same title. Before, you couldn’t do that, because the URLs would have confllicted. Now you can.
We’ve basically taken the same strategy we were already using for Popcorn and applied it to Thimble. And following the same convention as sites like Twitter, Medium and You Tube.

Coming soon: shorter, shinier links with bit.ly

We know these URLs are longer, and maybe a little harder to share. That’s why we’re working on bit.ly URL shortening next, so that you can publish your page and get a super short URL for sharing.
The new short URLs will use “mzl.la,” and look like http://mzl.la/shortcode — so that everyone can see your URL is special, and spread some Mozilla pride. :)
We’re also working on fixing up Thimble sharing for Twitter. If you have underscores or spaces in your Webmaker account name, you may have noticed problems when you try to Tweet about your makes. This problem has to do with how Twitter parses URLs. Luckily, this issue will disappear once we get bit.ly integrated, so we should have a fix coming for you soon!

Questions or comments?

rOpenHack, dat and publishing like the web – Thurs, April 10 (11 ET)

Our next community call will take place this Thursday, April 10. The call is open to the public and will start at 11 am ET. Call in details can be found on the call etherpad (where you can also find notes and the agenda) and on the wiki. (If you have trouble with the toll-free number, try one of the numbers at the bottom of this post.)

The Science Lab meeting is our community call, taking place each month, highlighting recent developments and work of the community relevant to science and the web. Join us to hear more about current projects, find out how you can get involved, and hear from others about their work in and around open research.

This month, it’s all about data, rOpenSci’s recent hackathon and new tools for collaboration in research.

We’ll be hearing more about rOpenSci‘s recent hackbreak (affectionately referred to as “rOpenHack” on Twitter), their first major developer meeting and open science hackathon. You might know rOpenSci from some of their educational work, leading trainings in R, as well as their work to create open source packages and tools for open science. Scott Chamberlain will be joining us to tell us more about the event, helf at GitHub HQ, as well as some of the projects that came out of the weekend. Want a sneak peek? He’s added some project pointers to the etherpad. Have a look …

We’ll also be joined by Max Ogden, who’ll be telling us about his project dat, a system for real-time replication, transformation, and versioning of large tabular datasets which recently received support from the Sloan Foundation (note: they also support the Science Lab). Interested in what it means for data to move freely on the web, like the web? Do tune in. Max will also be talking about how this ties into database package management,  his plans for the project moving forward, and how something like dat could help address some of the bottlenecks in research.

And last but not least, we’ll hear from Tiffany Bogich, a former disease ecologist and co-founder of Standard Analytics, an open source tool for organizing, publishing, and sharing research. Tiffany will be speaking about some of their work thus far, and their approach to better packaging together research components for collaboration and reproducibility.

Have a project, blog post or event you’d like to share relevant to open science? Add it to the etherpad (see line 121). It’s a great way to share what you’re working on and/or interested in with the community. Don’t be shy. Have a look at last month’s notes for an idea of what others contributed to the conversation.

Mark your calendars, tune in and help us spread the word. Our first few calls have hit record participation (and stretched the limits of open software solutions). Let’s see if we can drum up the same turnout, and be sure to join us a few minutes before 11 ET to secure a spot on the line. For call-in details and links to the etherpad, visit our wiki page. We hope you’ll join us.

Note: Having trouble dialing in? Try one of these numbers. (Note that they are toll calls and you’ll be charged by your telephone company if the number is long-distance.)

After you enter the extension, you’ll be asked for the conference ID, which is 7677.

  • US/California/Mountain View: +1 650 903 0800, extension 92
  • US/California/San Francisco: +1 415 762 5700, extension 92
  • US/Oregon/Portland: +1 971 544 8000, extension 92
  • CA/Vancouver: +1 778 785 1540, extension 92
  • CA/Toronto: +1 416 848 3114, extension 92
  • UK/London: +44 (0)207 855 3000, extension 92
  • FR/Paris: +33 1 44 79 34 80, extension 92

What can you do with a gig? Mozilla’s Gigabit Fund tells us with 20 proposed pilot projects in KC

Silicon Prairie News just released a two-part piece exploring the impact of Google Fiber on

Google Fiberspace / photo credit Silicon Prairie News

Kansas City, now 3 years since the original announcement that declared Kansas City, Kan. the first city in America to get it.  SPN dives into how Kansas City scored the biggest technological coup of the century, and charts the past, present and future implications of being the first in a long of line of ambitious cities to come on board. It also dishes on Kansas City’s struggle to wield a power that is largely misunderstood and certainly underused, all while trying to predict where and when outcomes of successful next-generation applications using gigabit technology can be seen and measured.

Like Google Fiber, Mozilla chose Kansas City, and in February launched a fund that supports innovation in education by leveraging gigabit technology.  Mozilla is interested in how technology can serve educational systems, programs and learning.  Mozilla has deployed the Gigabit Community Fund to champion transformative innovation in classrooms and in after-school programs or informal education spaces like Science City’s Maker Studio, Arts Tech, Upper Room, and in our public libraries.

When I started working for Mozilla in mid January, I had no idea how deeply vested and mission-driven Mozilla was when it came to education, web literacy, connected learning principles, and in building a learning network here in Kansas City.  So, it’s not surprising that Mozilla chose the lens of education and learning in which to focus this project to fund innovation on KC’s fiber-fast Internet.

When the application window for our first round of funding opened in late February, I was blown away by Kansas City’s resounding response to the question “What good is a gig?”  Twenty concepts put forward by schools, nonprofits, businesses, libraries and agencies were submitted for funding consideration.  Grantees will be announced here on April 14th.  And while these applications are being evaluated, we are preparing to put funded projects into practice inside live classrooms and spaces in Kansas City.  These funded projects will report their progress with blog updates, videos and use Mozilla’s Webmaker tools to share their success and feedback from the front lines.  These pilots will be evidence of our 3-year gig journey, and test its impact in real ways that we can see and measure.

Moving forward, the Gigabit Fund is how Mozilla plans to build a big tent for education and connected learning in KC.  We’ve worked to both grow our two foundational Hive Learning Networks (Chicago and NYC), helped to onboard HLNs in Toronto and Pittsburgh, and we’re now fostering the growth of new HLNs in Kansas City, Chattanooga, and the Bay Area, among others.  This energy, this “Hiveyness,” has led to a three-tiered engagement ladder that outlines what it means to contribute to the Hive Learning Network project.

 The Gigabit Fund is how we will create a community of practice, launching successful pilots with teachers and educators, funding scalable and remixable programs and technology to serve our community and beyond.  These inaugural pilots will be done by the Fourth of July – so maybe we can send up some fireworks to showcase real progress and projects built for education, using gig tech, right here in KC.

That’s how you build a gig city.  And that’s how we’ll build Hive KC.

Software Carpentry Week in Review: March 31-April 6, 2014

What’s happening next week

  • A workshop at the Scripps Institute in San Diego needs instructors and helpers. It will be sometime in the second half of May and cannot happen without your help. If you are not into San Diego’s perfect weather, the ocean breeze bothers you, or, simply, cannot make it, scroll down for other amazing workshops looking for instructors and helpers.
  • There are still a few open spaces at the Pycon bootcamps. Librarians, aspiring R programmers, bioinformaticians, and programming teachers can find a matching event. Register here.
  • Know people in bioinformatics? There will be two Software Carpentry bootcamps for them in May in Canada: one in Vancouver, one in Toronto. Seats still available on both.

Workshops

Three workshops happened this week:

Upcoming workshops

  • April, 07: University Corporation for Atmospheric Research, Boulder CO, USA.
  • April, 14: PyCon Bootcamps, Montreal QC, Canada.
  • April, 14: Women in Science and Engineering, Lawrence Berkeley National Lab, Berkeley CA, USA.
  • April, 21: George Washington University, Washington DC, USA.
  • April, 23: University of Bergen, Bergen, Norway.

Lesson development

10 pull requests were merged, 10 issues were closed, and 8 issues were opened. In addition to the merges, we had 41 commits made by Rob Beagrie, Greg Wilson, Ethan White, Stephen Turner, Raniere Silva, Trevor W. King, Brian Miles, Daniel Chen, Erik Bray, John Blischak, Marianne Corvellec, and Philipp Bayer.

Other news

A workshop needs you

  • US and Canada
    • If you’re close to North Carolina, they need an instructor for a workshop at Duke University in June. In adittion to the workshop in San Diego in May, a lot of instructors are needed in the West Coast: four at the University of California at Davis, four in total for two workshops in Stanford, and another four in Portland. In the East Coast, a workshop at the University of Delaware needs instructors. Helpers are needed for Vancouver in Canada and Atlanta in Georgia.
  • Europe and Middle East
    • A double workshop, first in Cyprus and one week later in Jordan, needs two intructors. In addition, instructors are needed for a workshop in Pisa, Italy, in June, and another one in Kiel, Germany.
  • Oceania
    • A workshop in Brisbane, Australia, needs instructors and helpers.

Please check the details in this Etherpad or contact Amy and Arliss.

Planning Super Mentor Badges to encourage contribution within the Webmaker community

This year Webmaker is using the Open Badges Infrastructure (OBI) – and the newly-minted BadgeKit – to issue Webmaker badges. The first of these will be a Super Mentor badge.
(note that the badge won’t look like the image to the left!)
The aim with Super Mentor badges is to empower and equip those people teaching the web within the Mozilla community. Super Mentors will work with paid contributors (aka staff) to badge Mentors, who in turn will be able to issue Web Literacy badges.
A recent community consultation suggested that Super Mentors should be required to provide evidence of the skills and expertise listed on this Mozilla wiki page.
We have taken these skills and created the following metadata which would be included a Super Mentor badge. Please note that this is subject to review and only a first pass.


Badge Name

Super Mentor badge

Description

This badge is issued by Mozilla to ‘Super Mentors’ within the Mozilla community.
Super Mentors are very active community leaders. They are very knowledgeable and experienced in web literacy, how people learn and have strategies to teach skills to others.

Criteria

The earner of this badge:

  • has a solid understanding of several competencies featured on the Web Literacy Map (https://webmaker.org/literacy)
  • uses Webmaker tools extensively
  • is familiar with other web literacy and digital making methods and tools
  • has a solid understanding of key learning methods and concepts
  • remixes, or creates teaching kits to help teach learners
  • teaches Mentors how to teach web literacy
  • organizes Webmaker events and helps others organize as well
  • contributes to the strategy and core offerings of Webmaker through testing and working in the open
  • recruits, coaches, and organizes Mentors in their community

Issuer

Mozilla

Evidence

(link to Webmaker portfolio)

Date Issued

(date)

Tags

#supermentor, #mentor, #contribution, #mozilla, #community, #webliteracy, #learning, #education


We’d love your feedback on this, so please do leave a comment below or in the Webmaker Google+ community.

Software Carpentry Week in Review: March 24-30, 2014

What’s happening this week

Workshops

Upcoming workshops

  • 04/02: University of Victoria, Victoria BC CAN.
  • 04/03: University of Southern Denmark, Odense, Denmark.
  • 04/04: University of Oklahoma, Norman OK USA.
  • 04/07: University Corporation for Atmospheric Research, Boulder CO USA.
  • 04/14: PyCon Bootcamps, Montreal QC CAN.
  • 04/14: Women in Science and Engineering, Lawrence Berkeley National Lab, Berkeley CA USA.

Lesson development

6 pull requests were merged, 8 issues were closed, and 2 issues were opened. In addition to the merges, we had 42 commits made by Sarah Supp, uiuc-cse, Daniel Chen, David Rio, Stephen Turner, Diego Barneche, Philipp Bayer, Erik Bray, Raniere Silva, and Greg Wilson.

Other news

A workshop needs you

  • US and Canada
    • If you’re close to North Carolina, they need an instructor for a workshop at Duke University in June. A workshop in San Diego, California, needs instructors and helpers in May. In addition, instructors are needed for Ithaca in New York, the University of California at Davis, and the Federal Reserve in Washington. Helpers are needed for Toronto and Vancouver in Canada, and Atlanta in Georgia.
  • Europe and Middle East
    • A double workshop, first in Cyprus and one week later in Jordan, needs two instructors. In addition, instructors are needed for two different workshops in Pisa, Italy, in June; one in Kiel, Germany, and another one in Paris, France.
  • Oceania
    • A workshop in Brisbane, Australia, needs instructors and helpers.

Please check the details in this Etherpad or contact Amy and Arliss.

New Report Reinforces Gig City Focus on Access and Inclusion

Earlier this month, the office of Chattanooga Mayor Andy Berke released the first report by the Chattanooga Forward Gig, Technology, and Entrepreneurship Task Force.  This report is the product of many months of studying how Chattanooga has responded to and grown from its status as the city with the fastest, most pervasive, and least expensive gigabit service in the western hemisphere.  The report points to many successes Chattanooga has experienced in becoming the “Gig City” – including the city’s partnership with Mozilla on the Gigabit Community Fund – but also lays out some key challenges for our community going forward, most notably digital access and inclusion for all Chattanoogans.  The report also recommends the creation of a public-private partnership to direct Chattanooga’s gigabit efforts and to better communicate what our next-generation network can do for our local innovation economy.

It’s exciting that as a result of this report, a new organization is being formed to help shape the important ongoing conversation about what a gig can really do for a city.  Since we launched the Gigabit Community Fund in early February, I’ve been meeting with Chattanoogans every day to have exactly this conversation as we explore what the gig can mean for our community’s classrooms and informal education spaces.  These meetings resulted in the 20+ applications that were submitted by Chattanoogans during the Gigabit Community Fund’s spring application window, which closed last Friday night.  Though these proposals have yet to be fully digested, scored, and reviewed, all could potentially help us to answer the task force report’s call to create “actual, practical demonstrations and applications of [Chattanooga’s] digital assets to help people understand.”

Not all of the applications we received ask for funding to support the next showstopping gig app or to host the newest high-profile tech demo.  In fact, a lot of the proposed projects are about applying existing technology in a new setting or thinking about a common tech tool in a new way, and that’s awesome because as we begin to search for new, practical demonstrations of our digital assets, it’s not only about finding cool ways to use 1024 MB/second but also about finding ways to do old things better, more inclusively, and more openly without ever having to worry about the limits of our connectivity.  The proposed projects are also awesome because, by taking place in classrooms and in informal learning organizations,  they’re exposing whole new groups of Chattanoogans to our next-generation network.  In so doing, they begin to address the task force report’s important call for increased digital access and inclusion and build critical capacity for Chattanooga’s technology future.

Though project funding decisions won’t be announced until mid-April, the conversation about what the gig can and should mean for Chattanooga’s educators and students continues, especially as the larger community conversation about our innovation economy grows louder.  Our city is becoming a living laboratory in which to explore how gig networks impact lives and learning in classrooms, in libraries, in boardrooms, and in every other space in which Chattanoogans work, connect, and learn.  Chattanooga’s Gigabit Hive Learning Community is creating a network through which to spur innovation in this living lab and develop connected learning opportunities that cross the boundaries of formal and informal learning spaces in the Gig City.  Through this community, we can spread the impact of Mozilla Gigabit Community Fund projects beyond their pilot audiences and begin to create opportunities for all Chattanoogans to benefit from the gig as they create and remix on an unlimited web.   Your voice is needed in this vibrant learning laboratory as we pilot, play, and test.  Come to a meetup, share what you’re doing and learning with us on Twitter, and get in touch.

March Break Report from SBL

This is a cross-post from Hive Toronto member Success Beyond Limits (SBL). The post originally appeared on the SBL site and we include it here with permission.

In 2010, in response to youth in our community expressing a need for support in securing employment interest in getting exposure to greater career options, we launched our March Break “Employment Readiness and Career Exploration” program for 30 students with support from Westview Centennial and the Toronto District School Board. The focus at that time was on important items such as resumes, cover letters, interview skills, goal-setting, time-management and financial literacy.

Late in 2013, Success Beyond Limits joined the Hive Toronto (@HiveToronto) and this got us thinking about what career exploration and employment readiness means for youth in 2014. That opened up issues about the location of the program (bringing it into the core of the city) and the types of skills and experiences that we needed to offer.

The program is ran from March 10th to 14th with 60 students (20 being paid through the TDSB Focus on Youth Program, and 10 being paid through SBL), and was delivered out of the Mozilla community space. You can get a sense of how amazing it was in the pages that follow!

SBL March Break Report 2014 PDF

The scrum never stops: building a better workbench for Webmaker

As we head into Q2, let’s build a better “workbench” and online scrum board for Webmaker.

TLDR version:

  • Check out the freshly udpated Webmaker Wiki. It’s one-stop shopping for key roadmaps, planning documents, tickets, and scrums. And will help provide more transparency and co-ordination across teams going forward.
  • It clearly lays out Webmaker’s key components. And how they all fit together. From teaching kits and training to localization, badges, and Maker Party. These will provide the main tracks for our scrum board as we go forward.
  • Shifting to real time production, instead of static documentation. The new wiki is a production document. The main goal is to provide a clear view of *what we’re building now.* Week by week, quarter by quarter. Instead of a static repository for documentation that quickly goes out of date.
  • Project scrum boards. To that end, each individual project page will lead with a virtual scrum board at the top of the page. We’ll embed Bugzilla tickets to do this, and use Bugzilla components and whiteboard tags to do it in a smart, automated way. (If you don’t use Bugzilla, just include links to wherever you’re tracking the work.) Continue reading …