For a couple of years I’ve been harvesting datasets created or published by Australian GLAM organisations through government data portals. I’ve just completed the latest harvest, and there’s now 369 datasets, containing 983 files, from 23 GLAM organisations. 628 of these files are in CSV (spreadsheet) format.
With this harvest I’ve added a new way of searching and filtering the harvested data using Datasette running on Glitch. This interface lets you narrow your queries by field or facet, and search text fields for keywords.
But what’s actually in all those CSV files? If you’d like to start exploring the content of the datasets, then give my GLAM CSV Explorer a go! The CSV Explorer looks at each column in the dataset and tries to identify the type of data inside. It then attempts to tell you something useful about it.
For all the details, links, and harvesting code, see the #GLAMWorkbench. #dhhacks