Current status — extracting data from Library and Archives Canada’s 1915-1946 naturalisation database. Coming soon to my GLAM Workbench…

The full text of ‘Who belongs? Reading identity, ownership, and legitimacy’, my talk for #text2data last week, is now online. Includes slides, code, data & more… #dhhacks

My talk for #text2data at the National Library of Sweden looks at occurence of the words ‘aliens’ & ‘immigrants’ in @TroveAustralia newspapers, The Bulletin, & Hansard. The slides, code & data are online. #dhhacks

Back to school report — what I did on my holidays…

Another slide for Sweden — this one comparing words appearing before ‘aliens’ in The Bulletin and Commonwealth Hansard (1901-1980).

Working on my slides for From Text to Data in Stockholm this week…

I’ve added a ‘save chart’ option to the QueryPic app in my GLAM Workbench. Visualise your searches in @TroveAustralia newspapers, then save the results as HTML for easy download. #dhhacks

Pleased and proud to see the chapter @baibi & I wrote on the Real Face of White Australia now published as part of an awesome collection. Buy now or read the CC-BY version online!

5CBECBFE-75D3-400E-9274-C3BE0B2357F5.jpg

In a bit over a week I’ll be heading to Stockholm for the ‘From text to data’ conference. Preparing myself for the 40 degree temperature difference…

BDAFC715-2AE0-4186-9211-ABA97D5E09DC.jpg

Talking about 'immigrants' in Trove's digitised newspapers

I’m giving a talk in a week or so (eep!) that looks at some of the changing contexts in which the word ‘aliens’ has been used in Australia. I thought, by way of comparison, it would be useful to do the same for ‘immigrants’. While I was playing around with the data last night, I came across something interesting, so here’s a sneak preview… Getting the data Using my TroveHarvester I downloaded the full text of all newspaper articles in Trove that included the word ‘immigrants’.

Continue reading →

In case you’re wondering, it took about 13 hours to download the metadata and full text of more than 2,000,000 @TroveAustralia articles including the word ‘Chinese’ using my Trove Newspaper Harvester. You can try it here.

Ok, so let’s see how I go harvesting 2 million newspaper articles from @TroveAustralia conatining the word ‘Chinese’…

30,000+ occurences of the word ‘Chinese’ in the OCRd full text of The Bulletin, 1880-1968.

One more and I’m done for the night… New GLAM Workbench page for the ‘Trove API introduction’ notebooks.

I’ve finished putting details of all the current GLAM Workbench repositories into the new documentation site. Still a few notebooks to migrate from the original workbench, but getting there! There’s about 50 Jupyter notebooks so far. #dhhacks

Added a ‘data’ section to the GLAM Workbench docs, with info on harvests from government data portals, as well as series from @naagovau relating to ASIO and the White Australia Policy.

And now a GLAM Workbench page for @Te_Papa…

Added a page for @ArchivesNZ’s Archway to the GLAM Workbench docs…

So here’s some fun things to do with @TroveAustralia newspapers… (via GLAM Workbench)

Ok, more documentation for you — page for the @DigitalNZ API in GLAM Workbench updated!