Good spreadsheets

Spreadsheetin’: making data usable for computers

A colleague from the CDS days recently asked for some good resources on spreadsheets / tabular data. I used to run “spreadsheetin’” sessions where I tried (through hands-on examples) to share some of the principles or practices I’ve stumbled into over the years.1

Important fundamental assumption alert! In assembling this assortment of links, I worked from this assumption: good spreadsheet / tabular data practices amount, at their core, to storing data in a way that’s usable for computers. That may not be the only purpose for good spreadsheets, of course, but I find it’s a frequent and worthy one. (With the caveat that sometimes we make things less intuitively usable for people by making data more immediately usable for computers, which is a tradeoff worth interrogating depending on the work at hand! See the end of the post for some links to readings on critical data studies.)

With that in mind:

From there, if you wanted to get more into applied data practices (using a scripting language), you could follow my path into R for Data Science and the tidyverse, or, more recently, into working with Observable notebooks (check out trending notebooks for examples of what they can do).

But good spreadsheets (and / or other forms of tabular data storage!) will get you far on your way.


BONUS CONTENT!!

If you’d like to interrogate the idea of data from a critical perspective (because who wouldn’t want to do that while gaining some practical skills!?), some good readings:

  1. At its core, I remember the advice boiled down to “don’t use column or row highlighting alone to convey meaning”, from which we can extrapolate a bunch of good practices. That to say, these principles and practices were hardly novel!

    Really, I’d been making so-so spreadsheets for years, then read R for Data Science years ago (there’s a new version, cool!). Reading the tidy data chapter and applying the tidy philosophy clarified the better practices underpinning those so-so spreadsheets, and I started making better ones. 

  2. August 11, 2024 update: This book is not open access! My bad! If others know of an equivalent-ish suggestion, I’d be glad to share it here.