Quick notebook to harvest GLAM datasets via the new(ish) @datagovau API. Includes 447 CSVs from 19 institutions.
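
As a rough illustration of the harvesting step, here is a minimal sketch that groups CSV download links by institution. The record shape (`publisher`, `files`, `format`, `url`) and the function name are hypothetical stand-ins for whatever the API returns; they are not the actual data.gov.au field names.

```python
from collections import defaultdict


def csv_files_by_institution(datasets):
    """Group CSV download URLs by publishing institution.

    `datasets` is a list of dicts in a HYPOTHETICAL shape:
    {"publisher": str, "files": [{"format": str, "url": str}, ...]}
    """
    grouped = defaultdict(list)
    for dataset in datasets:
        publisher = dataset.get("publisher") or "Unknown"
        for item in dataset.get("files", []):
            # Format labels are often inconsistent ("CSV", "csv ", ".csv")
            file_format = (item.get("format") or "").strip().lower().lstrip(".")
            url = item.get("url")
            if file_format == "csv" and url:
                grouped[publisher].append(url)
    return dict(grouped)
```

Counting the keys gives the number of institutions, and summing the list lengths gives the number of CSVs.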

This is why we can’t have nice things…

Still plenty to do, but my CSV Explorer is taking shape… (coming soon to @TDHASSN & elsewhere!)

I’ll be giving a demo at the @HumanitiesAU data summit on Friday.

Now with animated gif…

Doesn’t take much to show when there’s a problem with dates in metadata… (Yep, post 1900 dates have all jumped 100 years into the future.)
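
One plausible cause of this kind of jump (an assumption, not a diagnosis of this particular dataset) is two-digit years being expanded with a fixed pivot. Python's `%y` follows the POSIX convention, so `05` becomes 2005, not 1905. A minimal sketch of reproducing and flagging the problem, with hypothetical function names and cut-off date:

```python
from datetime import date, datetime


def parse_two_digit(text):
    """Parse a dd/mm/yy string; %y maps 00-68 to 2000-2068 and 69-99 to 1969-1999."""
    return datetime.strptime(text, "%d/%m/%y").date()


def flag_implausible(dates, latest_plausible):
    """Return the dates that fall after the latest date the collection could contain."""
    return [d for d in dates if d > latest_plausible]


def shift_back_century(d):
    """Move a date back 100 years (raises ValueError for 29 Feb if the target year is not a leap year)."""
    return d.replace(year=d.year - 100)
```

Plotting the year distribution, as in the notebook, makes the same problem visible at a glance; a check like `flag_implausible` just turns it into a list of rows to fix.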

Fun day talking to the @dhpanu team at ANU about digital history possibilities. Slides/links are all online.

Currently working on a CSV explorer to give researchers an overview of the contents of GLAM datasets. Sort of like WTFCSV, but in a Jupyter notebook…
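
To give a sense of what such an overview might report, here is a minimal standard-library sketch that summarises each column of a CSV. The function name and the chosen statistics (filled cells, unique values, whether everything parses as a number) are illustrative assumptions, not the explorer's actual design.

```python
import csv
import io


def summarise_columns(csv_text):
    """Return per-column counts of filled cells and unique values, plus a numeric flag."""
    reader = csv.DictReader(io.StringIO(csv_text))
    stats = {
        name: {"filled": 0, "values": set(), "numeric": True}
        for name in (reader.fieldnames or [])
    }
    for row in reader:
        for name, col in stats.items():
            # Short rows yield None for missing cells; treat those as empty
            value = (row.get(name) or "").strip()
            if not value:
                continue
            col["filled"] += 1
            col["values"].add(value)
            if col["numeric"]:
                try:
                    float(value)
                except ValueError:
                    col["numeric"] = False
    return {
        name: {
            "filled": col["filled"],
            "unique": len(col["values"]),
            # A column with no values at all is not reported as numeric
            "numeric": col["numeric"] and col["filled"] > 0,
        }
        for name, col in stats.items()
    }
```

In a notebook, the returned dictionary could be passed straight to a table or chart widget.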

A bit more progress. Having found columns with OpenCV, I can use Tesseract to help me find the rows…

After much OpenCV fiddling & tweaking, sorry… iteration, I’m pretty pleased with this. Columns and headers being detected accurately despite lots of variation in the images.

So right around now I think I’m talking (via video) about my adventures with #HistoricHansard for the ‘Between Cyberutopia and Cyberphobia’ workshop at @witswiser in South Africa. You can follow along: vimeo.com/321657685

More updates! Latest data and images from The Real Face of White Australia transcription project are up on GitHub. #dhhacks

I’ve finally updated the @TroveAustralia API Console to use version 2 of the API & https by default. (Also updated to Python3 & latest Heroku stack.) More examples coming soon… #dhhacks

Lots of exciting new stuff has been added to @TroveAustralia’s digitised journals in the last few months. To explore it all, head here and click on the ‘New titles’ button. #dhhacks

Art & Architecture: the journal of the Institute of Architects NSW, 51 issues from 1905 to 1912 now on @TroveAustralia.

Only 12 issues, but check out the fabulous covers on The New Triad from 1927-8. Now on @TroveAustralia.

Want some arts? 130 issues of RealTime from 1994 to 2016 now on @TroveAustralia.

Hey #ozhist, 295 issues of the journal of the Royal Australian Historical Society from 1918 to 1954 are now on @TroveAustralia.

Also amongst the latest batch of digitised journals on @TroveAustralia, 39 issues of Camp Ink from 1970 to 1977.

There are more literary journals digitised in @TroveAustralia as well, including 18 issues of the Bookfellow from 1907.

But wait, there’s more — the KCC Kennel Gazette was renamed, wait for it, Dogs. Another 94 issues from 1962 to 1969 in @TroveAustralia here.

People, you need to know that @TroveAustralia has digitised 360 issues of the KCC Kennel Gazette from 1932 to 1962. 13/10 would browse.
