Glamworkbench

2022-09-22: Do you want your Trove newspaper articles in bulk? Meet the new Trove Newspaper Harvester Python package! The Trove Newspaper Harvester has been around in different forms for more than a decade. It helps …

2022-09-15: From 48 PDFs to one searchable database – opening up the Tasmanian Post Office Directories with the GLAM Workbench A few weeks ago I created a new search interface to the NSW Post Office Directories from 1886 to …

2022-09-05: Fresh harvest of OCRd text from Trove's digitised periodicals – 9gb of text to explore and analyse! I’ve updated the GLAM Workbench’s harvest of OCRd text from Trove’s digitised periodicals. …

2022-09-05: Explore Trove's digitised newspapers by place I’ve updated my map displaying places where Trove digitised newspapers were published or …

2022-09-01: Making NSW Postal Directories (and other digitised directories) easier to search with the GLAM Workbench and Datasette As part of my work on the Everyday Heritage project I’m looking at how we can make better use of …

2022-08-29: Interested in Victorian shipwrecks? Kim Doyle and Mitchell Harrop have added a new notebook to the …

2022-08-29: Updates! troveharvester Python package updated to v0.5.1: github.com/wragge/tr… Trove …

2022-08-25: Minor update to RecordSearch Data Scraper – now captures ‘institution title’ for …

2022-08-16: Many thanks to the British Library – sponsors of the GLAM Workbench’s web archives section! You might have noticed some changes to the web archives section of the GLAM Workbench. I’m very …

2022-08-15: New GLAM data to search, visualise and explore using the GLAM Workbench! There’s lots of GLAM data out there if you know where to look! For the past few years I’ve been …

2022-08-09: Zotero now saves links to digitised items in Trove from the NLA catalogue! I’ve made a small change to the Zotero translator for the National Library of …

2022-08-01: View embedded JSON metadata for Trove's digitised books and journals The metadata for digitised books and journals in Trove can seem a bit sparse, but there’s …

2022-07-29: Where did all those NSW articles go? Trove Newspapers Data Dashboard update! I was looking at my Trove Newspapers Data Dashboard again last night trying to figure out why the …

2022-07-28: Catching up – some recent GLAM Workbench updates! There’s been lots of small updates to the GLAM Workbench over the last couple of months and I’ve …

2022-07-14: Calling all Tasmanian historians – you can now save resources from Libraries Tasmania into Zotero! I’ve created a Zotero translator for the Libraries Tasmania catalogue. Using it, you can save …

2022-07-13: Updated dataset! Harvests of Trove list metadata from 2018, 2020, and 2022 are now available on …

2022-07-10: Updated dataset! Details of 2,201,090 unique public tags added to 9,370,614 resources in Trove …

2022-07-09: Ok, I’ve created a Zenodo community for datasets documenting changes in the content and …

2022-07-09: Coz I love making work for myself, I’ve started pulling datasets out of #GLAMWorkbench code …

2022-06-28: Ahead of my session at #OzHA2022 tomorrow, I’ve updated the NAA section of the #GLAMWorkbench. …

2022-06-26: 55,633 items digitised by the National Archives of Australia last week. Including: Bonegilla name …

2022-06-26: Newspapers added to Trove last week: Freelance (WA), The Standard (WA), Berrigan Advocate (NSW) …

2022-06-26: Noticed that QueryPic was having a problem with some date queries. Should be fixed in the latest …

2022-06-24: The Trove Newspapers section of the #GLAMWorkbench has been updated! Voilà was causing a problem in …

2022-06-24: Some more #GLAMWorkbench maintenance – this app to download high-res page images from Trove …

2022-06-23: The Trove Newspaper and Gazette Harvester section of the #GLAMWorkbench has been updated! No major …

2022-06-19: Main changes to individual Trove newspapers last week: +19,862 articles in Daily News (WA), +10,822 …

2022-06-19: Changes to Trove newspapers last week: +86,761 articles, +16,367 articles with corrections, +5,355 …

2022-06-19: 42,472 files were digitised by the National Archives of Australia last week. 36,238 of these were …

2022-06-16: I wrote up something for the #GLAMWorkbook on ‘Empty searches and hacking urls’: …

2022-06-16: Under development – a Zotero translator for Libraries Tasmania! I’ve created a Zotero translator for the Libraries Tasmania catalogue. Using it, you can save …

2022-06-13: Ok, I’ve submitted my Libraries Tasmania translator to the Zotero repository for inclusion. No …

2022-06-05: Getting to work migrating the Real Face of White Australia transcription site from scribeAPI (no …

2022-06-01: It’s getting there – new Real Face of White Australia site using Datasette, IIIF, and …

2022-06-01: Ordering some #GLAMWorkbench stickers…

2022-05-29: After much faffing about today I’ve got the latest version of the UniversalViewer building …

2022-05-29: Files digitised by the National Archives of Australia this week: 25,981. Top series: +14,025 in B884 …

2022-05-29: New in Trove this week: Berrigan Advocate (NSW), Tingha Spectator & North Western Journal (NSW) …

2022-05-29: Major changes to Trove newspaper titles this week: +30,713 in Queanbeyan Age (NSW, end date now …

2022-05-29: Added to Trove newspapers this week: +43,015 articles, +10,263 articles with corrections, +7,499 …

2022-05-26: Using Datasette on Nectar If you have a dataset that you want to share as a searchable online database then check out …

2022-05-22: This week the National Archives of Australia digitised 21,488 files: +10,305 in B884 (CMF personnel …

2022-05-22: Major changes to individual Trove newspapers this week: +52,543 articles from The Daily News (WA) …

2022-05-22: Changes to Trove newspapers this week: +74,859 articles, +15,852 articles with corrections, +8,668 …

2022-05-20: Convert your Trove newspaper searches to an API query with just one click! I’m thinking about the Trove Researcher Platform discussions & ways of integrating Trove …

2022-05-15: Also on tonight’s episode of ‘In GLAM This Week’, the NAA digitised 20,517 files. …

2022-05-15: This week’s changes in Trove newspapers: +7,451 articles, +16,628 articles with corrections …

2022-05-15: Another overdue maintenance task completed! The Tung Wah Newspaper index has been migrated from a …

2022-05-12: TIL you can do date maths in Trove. Searching for date:[NOW-10YEAR TO NOW] in newspapers & …

2022-05-11: My Trove researcher platform wishlist The ARDC is collecting user requirements for the Trove researcher platform for advanced research. …

2022-05-10: Spending the evening updating the NAA section of the #GLAMWorkbench. Here’s a fresh harvest of …

2022-05-10: This morning has been all bug hunting. But at least I’ve now found & fixed the problem, …

2022-05-08: Changes in Trove newspaper titles in the last week: +26,731 articles from Daily News (WA), +88,485 …

2022-05-08: Changes in Trove newspapers in the past week: +124,404 articles available, +17,306 articles with …

2022-05-07: Somewhat unexpectedly the US National Archives & Records Administration catalogue includes some …

2022-05-07: I’ve got a site, I suppose I need to add some content now… glam-workbook.net …

2022-05-02: Working with Trove data – a collection of tools and resources The ARDC is organising a couple of public forums to help gather researcher requirements for the …

2022-05-01: New articles available this week from specific Trove newspapers: +39,296 from Daily News (WA) …

2022-05-01: Changes to Trove newspapers in the last week: +211,393 articles available, +15,738 articles with …

2022-04-30: And so it starts… #GLAMWorkbench

2022-04-29: Followed up my last FOI request about HASS research infrastructure to find out why the appendix was …

2022-04-28: And with that I think I’ll call it quits for #DayofDH2022 in this part of the world. Time to …

2022-04-28: Ok, I’ve created a new #GLAMWorkbench meta issue to try and bring together all the things …

2022-04-28: A couple of hours of #DayofDH2022 left – feeling a bit uninspired, so I’m going to do some …

2022-04-28: Hmm, didn’t get any writing done and now it’s time to cook dinner for the family. Bit of …

2022-04-28: Workspace photo for #DayofDH2022. Old iMac for the socials & meetings. Newish laptop running …

2022-04-28: Now after coding, bug chasing, meeting, & prototype demo, I have to try and get into a headspace …

2022-04-28: Meeting and demo done. Love working with ANU Archives, & really excited about the latest version …

2022-04-28: I’ve hit a bit of a brick wall with my Datasette deployment. I’m probably missing …

2022-04-28: And Cloudstor is down for maintenance. I suppose I won’t be doing that demo then… …

2022-04-28: Perhaps appropriately for #DayofDH2022, I’ve spent most of the morning trying to hunt down a …

2022-04-26: Micro.blog offers another alternative for people wanting more control over their socials. I’m …

2022-04-20: Tracking Trove changes over time I’ve been doing a bit of cleaning up, trying to make some old datasets more easily available. …

2022-03-03: Adventures in FOI – HASS RDC Scoping Studies So my FOI request to release the scoping studies that informed investments in the current round of …

2022-03-02: The GLAM Workbench wants you! Over the past few months I’ve been doing a lot of behind-the-scenes work on the GLAM Workbench …

2022-02-17: Omeka S Tools – new Python package Over the last couple of years I've been fiddling with bits of Python code to work with the Omeka S …

2022-01-29: Zotero support in Australian GLAMs Last year I started compiling information about the level of Zotero integration provided by …

2022-01-28: Testing, testing... I regularly update the Python packages used in the different sections of the GLAM Workbench; though …

2021-12-09: Some big pictures of newspapers in Trove and DigitalNZ One of the things I really like about Jupyter is the fact that I can share notebooks in a variety of …

2021-12-09: Exploring GLAM data at ResBaz The video of my key story presentation at ResBaz Queensland (simulcast via ResBaz Sydney) is now …

2021-12-01: GLAM Workbench Nectar Cloud Application updated! The newly-updated DigitalNZ and Te Papa sections of the GLAM Workbench have been added to the list …

2021-12-01: DigitalNZ & Te Papa sections of the GLAMWorkbench updated! In preparation for my talk at ResBaz Aotearoa, I updated the DigitalNZ and Te Papa sections of the …

2021-11-11: A template for GLAM Workbench development I’m hoping that the GLAM Workbench will encourage GLAM organisations and GLAM data nerds (like me) …

2021-11-08: More thoughts on the Trove researcher platform for advanced research Previously on ‘What could we do with $2.3 million?’, the National Library of Australia produced a …

2021-11-04: Coming up! GLAM Workbench at ResBaz(s) Want a bit of added GLAM with your digital research skills? You’re in luck, as I’ll be speaking at …

2021-11-01: New video – using the Trove Newspaper & Gazette Harvester The latest help video for the GLAM Workbench walks through the web app version of the Trove …

2021-11-01: Harvest newspaper issues as PDFs An inquiry on Twitter prompted me to put together a notebook that you can use to download all …

2021-10-21: GLAM Workbench now in the Nectar Research Cloud! The GLAM Workbench isn’t dependent on one big piece of technological infrastructure. It’s basically …

2021-10-18: More GLAM Name Index updates from Queensland State Archives and SLWA A new version of the GLAM Name Index Search is available. An additional 49 indexes have been added, …

2021-10-15: Getting data about newspaper issues in Trove When you search Trove’s newspapers, you find articles – these articles are grouped by page, …

2021-10-15: GLAM Workbench at eResearch Australasia 2021 Way back in 2013, I went to the eResearch Australasia conference as the manager of Trove to talk …

2021-10-05: New Python package to download Trove newspaper images There’s no reliable way of downloading an image of a Trove newspaper article from the web …

2021-09-29: More records for the GLAM Name Index Search Two more datasets have been added to the GLAM Name Index Search! From the History Trust of South …

2021-09-29: New preprint – ‘More than newspapers’ Here’s the preprint version of an article, ‘More than newspapers’, that I’ve submitted for a forum …

2021-09-29: More QueryPic in action Recently I created a list of publications that made use of QueryPic, my tool to visualise searches …

2021-09-10: Some thoughts on the ‘Trove Researcher Platform for Advanced Research’ draft plan Late last year the Federal Government announced it was making an $8.9 million investment in HASS and …

2021-08-30: Some research projects that have used QueryPic A Twitter thread about some of the research uses of QueryPic… QueryPic, my tool for …

2021-08-30: Government publications in Trove Over the last few weeks I’ve been updating my harvests of OCRd text from digitised books and …

2021-08-26: GLAM Workbench – a platform for digital HASS research We’re in the midst of planning for the HASS Research Data Commons, which will deliver some …

2021-08-23: A Family History Month experiment – search millions of name records from GLAM organisations There’s a lot of rich historical data contained within the indexes that Australian GLAM …

2021-08-16: Explore Trove’s digitised books The Trove books section of the GLAM Workbench has been updated! There’s freshly-harvested data, as …

2021-08-13: A miscellany of ephemera, oddities, & estrays I’m just in the midst of updating my harvest of OCRd text from Trove’s digitised books (more about …

2021-08-09: Everyday heritage and the GLAM Workbench Some good news on the funding front with the success of the Everyday Heritage project in the latest …

2021-08-06: Recent GLAM Workbench presentations So far this year I’ve given eight workshops or presentations relating to the GLAM Workbench, with …

2021-08-06: Updated! Lots and lots of text freshly harvested from Trove periodicals For a few years now I’ve been harvesting downloadable text from digitised periodicals in Trove and …

2021-08-02: New dataset – Politicians talking about COVID The Trove Journals section of the GLAM Workbench includes a notebook that helps you download press …

2021-07-14: 8 million Trove tags to explore! I’ve always been interested in the way people add value to resources in Trove. OCR correction tends …

2021-07-01: Integrating GLAM Workbench news and discussion I’ve spent a lot of time this year working on ways of improving the GLAM Workbench’s documentation …

2021-07-01: GLAM Workbench now on YouTube! I’ve started creating short videos to introduce or explain various components of the GLAM Workbench. …

2021-06-28: GLAM Workbench office hours To help you make use of the GLAM Workbench, I’ve set up an ‘office hours’ time slot every Friday …

2021-06-28: ‘Missing Links’ – new open access article! An article written by Kate Bagnall and me has just been published in a special issue of the Journal …

2021-06-21: QueryPic: The Next Generation QueryPic is a tool to visualise searches in Trove’s digitised newspapers. I created the first …

2021-06-21: Everyone gets a Lab! I recently took part in a panel at the IIPC Web Archiving Conference discussing ‘Research use of web …

2021-06-14: Minor change to Reclaim Cloud config When the 1-click installer for Reclaim Cloud works its magic and turns GLAM Workbench repositories …

2021-06-14: Preprint! The limits and affordances of online collections I’ve been working on an essay for publication in a forthcoming edited collection. I wanted to …

2021-06-14: Trove Query Parser Here’s a new little Python package that you might find useful. It simply takes a search url from …

2021-06-13: Some GLAM Workbench stats I deliberately don’t keep any stats about GLAM Workbench visits, because I think they’re pretty …

2021-06-13: More Reclaim Cloud integrations! Five of the GLAM Workbench repositories now have automatically built Docker images and 1-click …

2021-06-13: Get your GLAM datasets here! I’ve updated my harvest of Australian GLAM datasets from state/national government open data …

2021-05-24: NAA RecordSearch section of the GLAM Workbench updated! If you work with the collections of the National Archives of Australia, you might find the …

2021-05-17: Web archives section of GLAM Workbench updated! My program of rolling out new features and integrations across the GLAM Workbench continues. The …

2021-05-12: Using web archives to find out when newspapers were added to Trove There’s no doubt that Trove’s digitised newspapers have had a significant impact on the practice of …

2021-05-12: GLAM Jupyter Resources To make it easier for people to suggest additions, I’ve created a GitHub repository for my list of …

2021-05-12: Running notebooks – a sign of things to come in the GLAM Workbench I recently made some changes in the GLAM Workbench’s Help documentation, adding a new Running …

2021-05-12: Sponsor my work on GitHub! As I foreshadowed some weeks ago, I’ve shut down my Patreon page. Thanks to everyone who has …

2021-05-12: Updates to the Trove Newspapers section of GLAM Workbench I’ve updated, refreshed, and reorganised the Trove newspapers section of the GLAM Workbench. There’s …

2021-04-27: Introducing the new, improved RecordSearch Data Scraper! It was way back in 2009 that I created my first scraper for getting machine-readable data out of the …

2021-04-21: Secrets and lives Here’s the video of my presentation, ‘Secrets and lives’, for the (Re)create symposium at the …

2021-03-29: Recently digitised files in the National Archives of Australia I’m interested in understanding what gets digitised and when by our cultural institutions, but …

2021-03-26: Moving on from Patreon... Over the last few years, I’ve been very grateful for the support of my Patreon subscribers. …

2021-03-25: What can you do with the GLAM Workbench? You might have noticed some changes to the GLAM Workbench home page recently. One of the …

2021-03-25: Reclaim Cloud integration coming soon to the GLAM Workbench I’ve been doing a bit of work behind the scenes lately to prepare for a major update to the GLAM …

2021-03-25: Some recent GLAM Workbench presentations I’ve given a couple of talks lately on the GLAM Workbench and some of my other work relating to the …

2021-03-08: Some GLAM Workbench datasets to explore for Open Data Day It was Open Data Day on Saturday 6 March – here’s some of the ready-to-go datasets you can find in …

2021-02-22: Zotero translator for NAA RecordSearch updated The recent change of labels from ‘Barcode’ to ‘ItemID’ in the National Archives of …

2021-02-22: TroveNewsBot upgraded – now sharing articles published 'on this day'! @TroveNewsBot has been sharing Trove newspaper articles on Twitter for over 7 years. With its latest …

2021-02-11: The NAA recently changed field labels in RecordSearch, so that ‘Barcode’ is now ‘Item ID’. …

2021-02-11: Open access publishing for Australian historians After some recent investigations of the availability of open access versions of articles published …

2021-02-11: Who was linking to Trove newspapers in 2014? In 2014 I pulled together a sample of web pages that included links back to digitised newspaper …

2021-02-03: New! DigitalNZ API Query Builder added to GLAM Workbench I’ve added an API Query Builder to the DigitalNZ section of the GLAM Workbench. You can use it to …

2021-01-28: OpenGLAM fireworks! Finding open collections in DigitalNZ Lately I’ve been updating and expanding the notebooks in the DigitalNZ section of the GLAM …

2021-01-25: Easy browsing of Trove newspapers with these keyboard shortcuts! If you like browsing Trove’s digitised newspapers page by page, you might have found that the …

2021-01-18: New dataset and notebooks – twenty years of ABC Radio National There’s a new GLAM Workbench section for working with data from Trove’s Music & Sound zone! …

2021-01-14: Finding non-English newspapers in Trove There are a growing number of non-English newspapers in Trove, but how do you know what’s …

2021-01-14: Open access versions of Australian history articles Last year I did some analysis of the availability of open access versions of research articles …

2021-01-03: A long thread exploring files in the National Archives of Australia with the access status of …

2021-01-03: More updates from The Real Face of White Australia – running facial detection code over NAA: SP42/1. …

2021-01-03: I reharvested NAA: ST84/1 and ended up with 14,545 images from 461 digitised files (about 17% of the …

2020-12-16: GLAM Workbench wins British Library Labs Research Award! Asking questions with web archives – introductory notebooks for historians has won the British …

2020-12-15: Want to relive the early days of digital humanities in Australia? I’ve archived the websites created …

2020-12-15: The Invisible Australians website has been given a much needed overhaul, and we’ve brought all our …

2020-12-15: The GLAM Workbench as research infrastructure (some basic stats) Repositories in the GLAM Workbench have been launched on Binder 3,529 times since the start of this …

2020-11-27: Earlier this year I gave a seminar for the International Internet Preservation Consortium (IIPC) …

2020-11-25: Harvest text from the Australian Women's Weekly! The Trove Newspaper & Gazette Harvester has been updated to version 0.4.0. The major change is …

2020-11-13: Beyond the copyright cliff of death If you’ve done any searching in Trove’s digitised newspapers, you’ve probably …

2020-11-13: Updated! Find Trove newspapers by place of publication by using this simple interface – just click …

2020-10-26: I’ve added a new section to the GLAM Workbench for the ANU Archives. The first set of notebooks …

2020-10-26: Any regular user of RecordSearch, the National Archives of Australia’s online database, will …

2020-10-26: I’ve added more years to my repository of Commonwealth Hansard! The repository now includes …

2020-10-26: It was Open Access Week last week, so I tried a little experiment. How many research articles …

2020-09-24: The Trove Newspaper and Gazette Harvester has been updated to include the snippet field in the …

2020-09-22: Calling users of Australian galleries, libraries, archives, & museums – OzGLAM Help is now live! …

2020-09-21: The Zotero translator for RecordSearch (the National Archives of Australia’s online database) …

2020-09-20: If you try to share or bookmark the url of an item in RecordSearch (the National Archives of …

2020-09-15: The Zotero translator for Trove was failing on newspaper articles with tags. I’ve submitted a fix …

2020-08-14: Another #GLAMWorkbench update! Snip words out of @TroveAustralia newspaper pages and create big …

2020-08-10: Just in time for #GovHack, I’ve given the Trove API Console a major overhaul. It’s been updated for …

2020-07-30: Ok, so do you want to make your own ‘scissors & paste’ messages using words from @TroveAustralia …

2020-07-29: Another #GLAMWorkbench update! The Trove Harvester will now download both newspaper and gazette …

2020-07-28: Interested in using web archives in your research? Join us on 5/6 August for a free @netpreserve …

2020-07-27: Introducing a brand new section of the #GLAMWorkbench, exploring the @MuseumsVictoria collection …

2020-07-27: New additions to the @TroveAustralia books section of the #GLAMWorkbench – word frequency examples …

2020-07-27: With the recent changes to @TroveAustralia, the Australian Women’s Weekly cover browser was retired. …

2020-07-17: The Trove books section of the #GLAMWorkbench has been updated. There’s a fresh harvest of …

2020-07-17: Revisiting my Historic Hansard XML repository & realising how easy it is to load files as needed …

2020-07-14: The Trove Journals section of the #GLAMWorkbench has been updated to work with the new …

2020-07-12: New in #GLAMWorkbench! After you’ve used the @TroveAustralia Newspaper Harvester to download lots …

2020-06-29: Download newspaper articles in bulk! The Trove Newspaper Harvester has been updated to work with the …

2020-06-22: My app for searching in @TroveAustralia’s digitised journals has been updated to work with the new …

2020-06-09: Another db migrated and app updated! Have you ever wondered what interjections in historic hansard …

2020-06-09: Here’s a map of places where @TroveAustralia digitised newspapers were published/circulated. …

2020-05-27: New GLAM Workbench section on web archives! We tend to think of a web archive as a site we go to when links are broken – a useful fallback, …

2020-05-08: Thanks to @NetPreserve, I’ve been spending time lately working on a set of web archive exploration …

2020-04-18: Do you have a CSV file you’d like to make searchable, maybe even share online? New on …

2020-04-13: New on #dhhacks – make your own @TroveAustralia newspaper game! Thanks to @glitch, just edit a …

2020-04-12: I’ve given my #dhhacks site a refresh, and updated my @TroveAustralia Twitter bot tutorial to …

2020-04-04: If you’d ever wished you could get a random(ish) newspaper article from @TroveAustralia’s API, …

2020-04-02: The GLAM CSV Explorer has had a few updates — you can now filter by organisation, and upload your …

2020-03-31: Buildings might be closed, but the data is open – explore hundreds of datasets from Australian GLAM organisations! For a couple of years I’ve been harvesting datasets created or published by Australian GLAM …

2020-03-30: Updated! My notebook to upload digitised newspapers from @TroveAustralia to an @Omeka-S site has …

2020-03-12: My data file of public holidays in NSW from 1900-1950 has been updated – now including variations in …

2020-03-11: My harvest of OCRd text from @TroveAustralia digitised books, ephemera, and parliamentary papers has …

2020-03-09: The simple Trove proxy that you can use to get to download links for PDFs of newspaper articles from …

2020-03-03: I’ve added some more documentation to the Trove Newspaper Harvester page in the #GLAMWorkbench. Get …

2020-02-27: New section added to the #GLAMWorkbench with examples from @Library_Vic! #slvdata #dhhacks …

2020-02-27: More fun with @iiif_io and images from @library_vic – resize, rotate, crop and more! Try it out with …

2020-02-26: New #GLAMWorkbench notebook! Download images from @Library_Vic using IIIF and Handle… #dhhacks

2020-02-22: And for a taste of the recent additions to @TroveAustralia’s digitised journals, check out …

2020-02-22: Explore @TroveAustralia’s digitised journals with this simple app. Now updated with the latest …

2020-02-21: Want to save @TroveAustralia newspaper articles as images (that aren’t sliced up in annoying …

2020-02-20: I’ve updated the repository of data transcribed from White Australia policy records in @naagovau. …

2020-02-20: I’ve added a few updates to my ‘Digital tools and such like…’ list for Australian historians. Hope …

2020-02-17: New ‘Trove images’ section added to the #GLAMWorkbench! Here you’ll find my latest Jupyter …

2020-02-14: Ok, the LODBook Jekyll plugin is cleaned & commented enough for me to put it aside for a while …

2020-02-14: Voting in the 2019 @dhawards is now open! Go and check out all the cool #DigitalHumanities projects …

2020-01-31: Gathering together videos of past presentations. Here they are on YT: …

2019-12-20: Archived via Zenodo – ‘Inigo Jones: the weather prophet’ – exploring our desire for certainty amidst …

2019-12-20: Archived via Zenodo – ‘Civilization versus the giant, winged lizards’ – a thing I wrote about the …

2019-12-17: One of my favourite things this year was finally publishing ‘The People Inside’ with @baibi — …

2019-11-20: New #GLAMWorkbench section with examples of how to get random-ish works and newspaper articles from …

2019-11-19: Did you make a DIY Exhibition from a @TroveAustralia list using my GitHub starter kit? If so, …

2019-11-07: The death and (hopefully) resurrection of Trove Twitter bots Today version 1 of the Trove API was decommissioned. As I explained elsewhere, this meant that a …

2019-10-21: Updated my NSW public holidays data to include a few extras proclaimed by the government: …

2019-10-09: Creators and users of my Trove Twitter Bots, please read and share this update! tl;dr Version 1 of the Trove API will be discontinued soon so Trove Twitter bots need to be …

2019-09-18: Over the last few weeks I’ve been exploring ways of recording dates for 70,000 digitised pages …

2019-09-16: Here’s my attempt to calculate NSW holidays from 1900 to 1950. It’s probably incomplete, …

2019-09-11: A couple of years ago I gave a talk in which I tried to justify what I do as research. I was going …

2019-09-04: The @naagovau RecordSearch section of the #GLAMWorkbench has been updated with more notebooks to …

2019-09-04: Crikey, my notebook for getting useful data out of @naagovau keeps growing! Now with sections on …

2019-09-03: Want to save searches for items in @naagovau’s RecordSearch as CSVs for exploration & …

2019-08-25: I’ve updated my harvest of OCRd text from digitised journals in @TroveAustralia. The complete …

2019-08-25: My app to browse & search @TroveAustralia’s digitised journals has been updated! Since 4 …

2019-08-13: Another WIP notebook in need of additional documentation… This one explores the stats around …

2019-08-13: And this notebook uses TF-IDF to explore the OCRd text of a digitised journal from Trove. Get the …

2019-08-13: This notebook lets you download the OCRd text of a digitised journal from @TroveAustralia (via …

2019-08-13: A new notebook looking at the data about digitised journals on @TroveAustralia. #dhhacks

2019-08-09: There’s a new section of the GLAM Workbench devoted to the National Museum of Australia collection …

2019-07-28: The second new notebook looks at @TroveAustralia’s newspapers as a whole, visualising both by time …

2019-07-28: Some brand new Jupyter notebooks for those interested in #ozhist & digital exploration of …

2019-07-27: I’ve updated the @invisibleaus data repository with latest transcriptions/markings from White …

2019-07-25: According to my last harvest, @TroveAustralia’s digitised journals comprise 31,216 separate issues. …

2019-07-24: Updates to the Trove newspapers section of GLAM Workbench – adding links to app-ified versions of …

2019-07-24: Download & explore 1,499,259 rows of open data from NSW State Archives Online Indexes NSW State Archives publishes a number of detailed indexes containing data manually extracted from …

2019-07-12: Visualising CV-detected column widths across 100 volumes (30,000+ pages) of Sydney Stock Exchange …

2019-07-11: New in GLAM Workbench! Notebooks to harvest, index, analyse, and aggregate transcripts of speeches …

2019-07-11: I’ve updated my harvest of the PM Transcripts site — 22,814 XML files with transcripts of speeches, …

2019-07-11: Reorganising things a little at GLAM Workbench. @statelibrarynsw gets its own section. Hansard and …

2019-07-07: What’s that? You want MORE GLAM data? Well, I’ve started a list of sources for Australian GLAM data. …

2019-07-07: I’ve updated my harvest of GLAM datasets from data.gov.au. Now there’s 584 CSV files available …

2019-07-06: I’ve put a copy of my article on using @TroveAustralia for digital research/play, written for the …

2019-07-05: I’ve updated the list of orgs who have supported the digitisation of @TroveAustralia’s …

2019-07-05: Today I finished updating a harvest of all OCRd text available from Trove’s digitised journals. …

2019-07-05: Update time! Yesterday I updated my Trove digitised journals app to include all the exciting new …

2019-07-03: A quick interactive view of newspaper articles in @TroveAustralia by state and year. Click on the …

2019-07-03: Anyone who’s been to one of my Trove workshops will be pleased to know that the WWI effect is still …

2019-07-03: So there are now almost twice as many newspaper articles in @TroveAustralia from NSW as there are …

2019-06-23: Well look at that! – a selection of my @TroveAustralia related Jupyter notebooks turned into simple …

2019-06-17: Kicked off a new GLAM Workbench repository dedicated to @SLSA with a quick notebook hack to get …

2019-06-10: Search @TroveAustralia newspapers without leaving Twitter using the updated and enhanced …

2019-06-07: Recent additions to the Trove Newspapers section of the GLAM Workbench: getting images from …

2019-06-04: Want to upload @TroveAustralia newspaper articles to @Omeka-S to create an exhibition or populate a …

2019-06-02: Slides ready for tomorrow’s workshop at @unicanberra – Trove as a platform for digital research …

2019-05-31: Ever wanted to save a @TroveAustralia newspaper article as an image? This notebook lets you do just …

2019-05-26: More GLAM Workbench updates! More full text of Australian books! I’ve added the notebook & …

2019-05-25: 2019 has been pretty busy so far! I just compiled a list of tools, updates, and examples from the …

2019-05-24: Here’s how you can get the text of Australian books in @TroveAustralia from the Internet …

2019-05-22: I’ve updated the data that sits behind my Trove Places app and added more than 140 newspaper …

2019-05-21: If you’re researching foreign policy using @naagovau you might find this little tool useful – …

2019-05-19: And what can you do with 400 CSV files? Well, you could explore their contents using my GLAM CSV …

2019-05-19: Some overdue updates to the GLAM Workbench. First here’s details, data, and code from a …

2019-05-09: Over the last week I’ve been downloading editorial cartoons published in The Bulletin from …

2019-05-04: After a number of unsuccessful attempts, I seem to be getting The Bulletin title art fairly reliably …

2019-04-29: Here’s the notebook-ified version of the code I used to harvest all the Australian …

2019-04-28: I’ve reharvested Commonwealth Hansard from 1901 to 1980 and updated my repository of XML …

2019-04-27: And now my GLAM Workbench has a ‘Trove Maps’ section to document examples and …

2019-04-26: The other night @OpenGLAM was sharing collections of high-res images from GLAM orgs that are free to …

2019-04-25: If you’d like to make your own big, composite images from lots of @TroveAustralia newspaper …

2019-04-25: Australian pilots, aviators, airmen, and flyers — 4,950 thumbnails from a search in …

2019-04-23: I’ve been busy lately harvesting LOTS of full text data from @TroveAustralia’s digitised …

2019-04-22: I’ve added a section for the @TroveAustralia ‘book’ zone to the GLAM Workbench.

2019-04-22: Ok, so I’ve downloaded the OCRd text from 27,426 issues of 358 digitised journals/series in …

2019-04-22: All 9,738 OCRd text files harvested from books, pamphlets and leaflets in @TroveAustralia’s …

2019-04-21: So @TroveAustralia includes more than 370,000 press releases, speeches, and interview transcripts …

2019-04-20: Among the OCRd texts I’m currently harvesting from Trove’s journals zone are things like the …

2019-04-20: Wow, there are now over 371,000 press releases, interview transcripts and more from the @ParlLibrary …

2019-04-19: Another collection of OCRd text from @TroveAustralia is on its way…

2019-04-17: Playing with @TroveAustralia newspaper results. Here’s illustrated articles with ‘White Australia …

2019-04-15: The final tally – after much tweaking I’ve downloaded OCRd text from 9,738 works in the …

2019-04-14: I’m looking for books in @TroveAustralia, but there’s lots of ephemera (pamphlets, posters etc) in …

2019-04-12: Text of over 3 thousand digitised books and pamphlets downloaded so far from @TroveAustralia…

2019-04-11: After talking to @PrimahadiWijaya today about work at @MonashLing, I started harvesting metadata …

2019-04-11: What I did at #valatechcamp! Here’s a CSV with basic details of 7,719 digitised books …

2019-04-11: TIL that the web pages for digitised works (like books and journal issues) on @TroveAustralia embed …

2019-04-11: Just posting the link to my ‘Introducing APIs’ slides for #VALATechCamp again, so that …

2019-04-07: Hmm, it occurs to me that the method I used to generate newspaper article thumbnails from Trove, …

2019-04-07: So, I’ve finally figured out a way to automatically generate nice-looking thumbnails from …

2019-04-06: So I put the recent report into Australia’s national cultural institutions into the @TDHASSN …

2019-03-31: Train from Canberra to Melbourne booked for #VALATechCamp. I’ll be hanging around both days, …

2019-03-26: Sneak preview of my GLAM CSV Explorer now live on @MyBinderTeam! Select one of 447 GLAM-related CSVs …

2019-03-26: Having ripped out a lot of code and simplified a mess of conditionals, I think this CSV Explorer …

2019-03-25: Now to load that new CSV of GLAM CSVs into my CSV Explorer…

2019-03-25: Quick notebook to harvest GLAM datasets via the new(ish) @datagovau API. Includes 447 CSVs from 19 …

2019-03-23: This is why we can’t have nice things…

2019-03-22: Still plenty to do, but my CSV Explorer is taking shape… (coming soon to @TDHASSN & elsewhere!) …

2019-03-21: Doesn’t take much to show when there’s a problem with dates in metadata… (Yep, post 1900 …

2019-03-20: Fun day talking to the @dhpanu team at ANU about digital history possibilities. Slides/links are all …

2019-03-16: Currently working on a CSV explorer to give researchers an overview of the contents of GLAM …

2019-03-12: A bit more progress. Having found columns with OpenCV, I can use Tesseract to help me find the rows…

2019-03-11: After much OpenCV fiddling & tweaking, sorry… iteration, I’m pretty pleased with this. Columns …

2019-03-08: So right around now I think I’m talking (via video) about my adventures with #HistoricHansard …

2019-03-07: More updates! Latest data and images from The Real Face of White Australia transcription project are …

2019-03-07: I’ve finally updated the @TroveAustralia API Console to use version 2 of the API & https by …

2019-03-07: Lots of exciting new stuff has been added to @TroveAustralia’s digitised journals in the last few …

2019-03-07: Art & Architecture: the journal of the Institute of Architects NSW, 51 issues from 1905 to 1912 …

2019-03-07: Only 12 issues, but check out the fabulous covers on The New Triad from 1927-8. Now on …

2019-03-07: Want some arts? 130 issues of RealTime from 1994 to 2016 now on @TroveAustralia.

2019-03-07: Hey #ozhist, 295 issues of the journal of the Royal Australian Historical Society from 1918 to 1954 …

2019-03-07: Also amongst the latest batch of digitised journals on @TroveAustralia, 39 issues of Camp Ink from …

2019-03-07: There’s more literary journals digitised in @TroveAustralia as well. Including 18 issues of …

2019-03-07: But wait, there’s more — the KCC Kennel Gazette was renamed, wait for it, Dogs. Another 94 …

2019-03-07: People, you need to know that @TroveAustralia has digitised 360 issues of the KCC Kennel Gazette …

2019-03-07: Updating my list of digitised journals on @TroveAustralia this morning and seeing what’s new. …

2019-03-01: I’ll be running some more @TroveAustralia workshops for @UniCanberraReD this year. On 13 May …

2019-02-26: #dhhacks — Save a page image from the State Archives of NSW's Bubonic Plague Register So NSW State Archives has digitised the Register of Cases of Bubonic Plague 1900-1908. Great work! …

2019-02-24: I’ve updated the notebook for harvesting records from @archivesnz’s Archway database in …

2019-02-24: Uh, ok — so an advanced search for keywords only in Archway gives me a maximum of 1000 results. But …

2019-02-21: Looks like I’ll be heading to the VALA Tech Camp in April to talk APIs. See you there!

2019-02-21: New section added to my GLAM Workbench for the Queensland State Archives (@qsarchives). Includes a …

2019-02-21: So in case you’re wondering, the @qsarchives ‘Naturalisations 1851 to 1904’ index …

2019-02-19: Whoops. Here’s the actual full list of countries of origin from the @nswarchives NSW …

2019-02-19: Here’s the full list of countries of origin from the NSW naturalisations data, 1834-1903.

2019-02-19: NSW naturalisations 1834 to 1903. The sudden rise in Chinese naturalisations followed the …

2019-02-17: Suggestions of new topics and collections for my GLAM workbench are welcome!

2019-02-17: Here’s an example dataset harvested from Library and Archives Canada’s naturalisation …

2019-02-17: I’ve added a section for Library and Archives Canada to my GLAM workbench. The first notebook …

2019-02-15: Current status — extracting data from Library and Archives Canada’s 1915-1946 naturalisation …

2019-02-14: The full text of ‘Who belongs? Reading identity, ownership, and legitimacy’, my talk for …

2019-02-08: My talk for #text2data at the National Library of Sweden looks at occurrence of the words …

2019-02-05: Back to school report — what I did on my holidays…

2019-02-04: Another slide for Sweden — this one comparing words appearing before ‘aliens’ in The …

2019-02-03: Working on my slides for From Text to Data in Stockholm this week…

2019-02-01: I’ve added a ‘save chart’ option to the QueryPic app in my GLAM Workbench. …

2019-01-30: Pleased and proud to see the chapter @baibi & I wrote on the Real Face of White Australia now …

2019-01-27: In a bit over a week I’ll be heading to Stockholm for the ‘From text to data’ …

2019-01-26: Talking about 'immigrants' in Trove's digitised newspapers I’m giving a talk in a week or so (eep!) that looks at some of the changing contexts in which …

2019-01-26: In case you’re wondering, it took about 13 hours to download the metadata and full text of …

2019-01-25: Ok, so let’s see how I go harvesting 2 million newspaper articles from @TroveAustralia …

2019-01-25: 30,000+ occurrences of the word ‘Chinese’ in the OCRd full text of The Bulletin, …

2019-01-23: One more and I’m done for the night… New GLAM Workbench page for the ‘Trove API …

2019-01-23: I’ve finished putting details of all the current GLAM Workbench repositories into the new …

2019-01-23: Added a ‘data’ section to the GLAM Workbench docs, with info on harvests from government …

2019-01-23: And now a GLAM Workbench page for @Te_Papa…

2019-01-23: Added a page for @ArchivesNZ’s Archway to the GLAM Workbench docs…

2019-01-22: So here’s some fun things to do with @TroveAustralia newspapers… (via GLAM Workbench)

2019-01-22: Ok, more documentation for you — page for the @DigitalNZ API in GLAM Workbench updated!

2019-01-22: Slowly working my way through the documentation for my GLAM Workbench. Still lots to do, but I think …

2019-01-22: If there are APIs or other data sources you’d like me to add to my GLAM Workbench, feel free …

2019-01-21: Updated list of the fifty most common words occurring before the word ‘aliens’ in …

2019-01-21: Just updated my harvest of metadata and full text from The Bulletin in @TroveAustralia. …

2019-01-20: Fifty most common words occurring before the word ‘aliens’ in @TroveAustralia newspapers …

2019-01-19: You want big data? I just harvested 213,340 newspaper articles (including full OCRd text) from …

2019-01-19: So now I’ve updated TroveHarvester and built a new interface I can get back to the task I …

2019-01-19: Want an easy way to download @TroveAustralia newspaper articles in bulk? No installation? Point and …

2019-01-19: And version 0.2.2 of TroveHarvester quickly follows 0.2.1 as I squash a bug when downloading …

2019-01-18: TroveHarvester 0.2.1 — updated to work with version 2 of the @TroveAustralia API. Now on pypi! More …

2019-01-18: Ok, that’s more like it. Full text and metadata of 29,203 newspaper articles harvested using …

2019-01-18: Ah ok, I forgot about the new ‘bulkHarvest’ parameter in the @TroveAustralia API. …

2019-01-18: Uh, never come across one of these before from the @TroveAustralia API. Needless to say it causes …

2019-01-18: Testing the updated Trove Newspaper Harvester… Run into a problem with the @TroveAustralia …

2019-01-18: Thanks to the @TroveAustralia API upgrade, the new version of the Trove Newspaper Harvester should …

2019-01-18: Since I’m updating the Trove Newspaper Harvester to work with version 2 of the @TroveAustralia …

2019-01-17: I’m enjoying using micro.blog as a way of capturing what I’m working on: …

2019-01-17: Finally biting the bullet and getting to work on updating the TroveHarvester to work with version 2 …

2019-01-17: That’s cool – just realised I can easily share live versions of Altair charts from …

2019-01-17: And also “coloured alien” which, not surprisingly, peaks in 1901 when the Immigration …

2019-01-17: Exploring some of the adjectives attached to ‘alien’ in @TroveAustralia …

2019-01-17: Just to emphasise my point the other day about the impact of stemming on searches for …

2019-01-17: Nothing like browsing the databases of another country’s national/state archives to make you …

2019-01-16: The Australian version of ‘Who’s responsible?’ is up! Just select a government …

2019-01-16: New notebook added to the #GLAMWorkbench RecordSearch repository — get the basic details of agencies …

2019-01-16: Hmm, wondering why the ‘National Council of Women of the Australian Capital Territory’ …

2019-01-15: As well as cross-posting updates to Twitter and Mastodon, I’ve now set up IFTTT to keep an eye …

2019-01-15: Adventures in stemming, or what happens when you search Trove for 'naturalization' Fun fact — the Porter stemming algorithm treats the words ‘naturalisation’ and …

2019-01-14: I have a brand new updates page powered by micro.blog!